nf-core/rnafusion
RNA-seq analysis pipeline for detection of gene-fusions
Skip analysis steps
Skip QC steps
boolean
Skip vcf creation step
boolean
Skip visualisation steps
boolean
Define where the pipeline should find input data and save output data.
Path to samplesheet file containing information about the samples in the experiment.
string
^\S+\.(csv|yaml|yml|json)$
The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
string
Email address for completion summary.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
MultiQC report title. Printed as page header, used for filename if not otherwise specified.
string
COSMIC username
string
COSMIC password
string
Path to reference folder
string
gencode version
integer
Comma-delimited list of tools to run
string
The length of the reads provided to the pipeline. This is used for the ‘—sjdbOverhang’ option of STAR as read_length - 1. Providing 1 to this option will disable overhang handling.
integer
100
Path to arriba reference blacklist
string
\S+\.tsv$
Path to arriba reference cytobands
string
\S+\.tsv$
Path to arriba reference known fusions
string
\S+\.tsv$
Path to arriba reference protein domain
string
\S+\.gff3$
Path to arriba output
string
\S+\.tsv$
Download references instead of building them (for fusioncatcher and starfusion)
boolean
Path to fusioncatcher output
string
\S+\.txt$
Use limitSjdbInsertNsj with int for fusioncatcher
integer
2000000
Path to fusioncatcher references
string
Fusioncatcher download link
string
Use limitSjdbInsertNsj with int for fusioninspector STAR process
integer
1000000
Path to a fusion list file built with format GENE1—GENE2
string
\S+\.tsv$
Path to fusionreport references
string
Path to HGNC database file
string
\S+\.txt$
Path to HGNC timestamp file for database retrieval
string
\S+\.txt$
Use QIAGEN instead of SANGER to download COSMIC database
boolean
Path to salmon index
string
Path to starfusion output
string
\S+\.tsv$
Path to starfusion references
string
Path to the cancer introns CSV file to create the CTAT-SPLICING reference with
string
https://data.broadinstitute.org/Trinity/CTAT_RESOURCE_LIB/CANCER_SPLICING_LIB_SUPPLEMENT/cancer_introns.GRCh38.Jun232020.tsv.gz
\S+\.tsv\.gz$
Path to Dfam HMM database file
string
\S+\.hmm$
Path to Dfam H3F database file
string
\S+\.h3f$
Path to Dfam H3I database file
string
\S+\.h3i$
Path to Dfam H3M database file
string
\S+\.h3m$
Path to Dfam H3P database file
string
\S+\.h3p$
Path to Pfam database file
string
\S+\.hmm(\.gz)?$
URL to annotation filter rule file
string
https://data.broadinstitute.org/Trinity/CTAT_RESOURCE_LIB/AnnotFilterRule.pm
^https?://.*
Path to starindex references
string
Discard fusions identified by less than INT tools
integer
1
Path to fusions to add to the input of fusioninspector
string
\S+\.tsv$
Options to adjust read trimming criteria.
The amount of bases to trim at the tail of each read, none will be trimmed by default
integer
The amount of bases to trim at the tail of each read for fusioncatcher, none will be trimmed by default
integer
Path to adapter fasta file: default: []
string
\S+\.f(n|ast)?a$
Option to compress BAM files to CRAM.
Output CRAM files instead of BAM files.
boolean
Reference genome related files and options required for the workflow.
Skip running the analysis, only builds the references
boolean
Path to FASTA genome file.
string
^\S+\.fn?a(sta)?(\.gz)?$
Path to FASTA genome index file.
string
^\S+\.fn?ai(\.gz)?$
Name of iGenomes reference.
string
Path to GTF genome file.
string
^\S+\.gtf?(\.gz)?$
Path to GTF genome file.
string
^\S+\.refflat?$
Path to ribosomal interval list.
string
^\S+\.interval_list?$
Avoid using Cosmic DB (for example in clinical case applications where a paid license applies.
boolean
Path to Fusion Annotation Library to be used in STARFUSION_BUILD.
string
\S+\.dat\.gz$
Which species dfam should automatically download, default: homo_sapiens.
string
homo_sapiens
Version of dfam to use
number,null
Version of pfam to use
number,null
Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
string
master
Base directory for Institutional configs.
string
https://raw.githubusercontent.com/nf-core/configs/master
Institutional config name.
string
Institutional config description.
string
Institutional config contact information.
string
Institutional config URL link.
string
Less common options for the pipeline, typically set in a config file.
Display version and exit.
boolean
Method used to save pipeline results to output directory.
string
Email address for completion summary, only when pipeline fails.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Send plain-text email instead of HTML.
boolean
File size limit when attaching MultiQC reports to summary emails.
string
25.MB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Do not use coloured log outputs.
boolean
Incoming hook URL for messaging service
string
Custom config file to supply to MultiQC.
string
Custom logo file to supply to MultiQC. File name must also be set in the MultiQC config file
string
Custom MultiQC yaml file containing HTML including a methods description.
string
Boolean whether to validate parameters against the schema at runtime
boolean
true
Base URL or local path to location of pipeline test dataset files
string
https://raw.githubusercontent.com/nf-core/test-datasets/
Suffix to add to the trace report filename. Default is the date and time in the format yyyy-MM-dd_HH-mm-ss.
string
Sequencing center
string
Sequencing platform
string
Whether to ignore the GTF in STAR alignment
boolean
The maximum amount of RAM to use for sorting the BAM file in STAR. Should by in bits. Setting this value to 0
will use the default amount of STAR.
integer