You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
nextflow pipeline for analyzing Mycobacterium bovis genomes
Input/output options
Define where the pipeline should find input data and save output data.
Parameter
Description
Type
Default
Required
Hidden
input
Path to comma-separated file containing information about the samples in the experiment. HelpYou will need to create a design file with information about
the samples in your experiment before running the pipeline. Use this parameter to specify its location. It has to be a comma-separated file with 3 columns, and a header row.
string
reference
Path to reference genome
string
None
kraken2db
Path to Kraken 2 database
string
None
outdir
The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
string
email
Email address for completion summary. HelpSet this parameter to your e-mail address to get a summary e-mail with details of the run sent to you when the
workflow exits. If set in your user config file (~/.nextflow/config) then you don't need to specify this on the command line for every run.
string
multiqc_title
MultiQC report title. Printed as page header, used for filename if not otherwise specified.
string
Quality Control options
Parameter
Description
Type
Default
Required
Hidden
skip_fastp
Skip the fastp trimming step.
boolean
skip_fastqc
Skip the fastQC step.
boolean
save_trimmed_fail
Save failed trimmed reads.
boolean
skip_multiqc
Skip MultiQC.
boolean
adapter_file
Path to file containing adapters in FASTA format.
string
'${baseDir}/assets/adapters.fas'
skip_kraken2
Skip Kraken 2 and Bracken.
boolean
Sub-sampling options
Parameter
Description
Type
Default
Required
Hidden
skip_subsampling
Skip sub-sampling with Rasusa.
boolean
subsampling_depth_cutoff
Desired coverage depth when sub-sampling.
integer
100
genome_size
Genome size for sub-sampling (set to 4.3 Mb for M. bovis).
string
4.3mb
Clustering options
Parameter
Description
Type
Default
Required
Hidden
skip_clusters
Skip APHA clustering
boolean
Alignment options
Parameter
Description
Type
Default
Required
Hidden
non_GATC_threshold
Maximum non GATC bases (i.e - and N) to allow in pseudogenome sequences.
number
0.25
skip_alignment
Skip alignment step.
boolean
Institutional config options
Parameters used to describe centralised config profiles. These should not be edited.
Parameter
Description
Type
Default
Required
Hidden
custom_config_version
Git commit id for Institutional configs.
string
master
True
custom_config_base
Base directory for Institutional configs. HelpIf you're running offline, Nextflow will not be able to fetch the institutional config files
from the internet. If you don't need them, then this is not a problem. If you do need them, you should download the files from the repo and tell Nextflow where to find them with this