4.23. classify_single¶
Runs raw reads through taxonomic classification (Kraken2), human read depletion (based on Kraken2), de novo assembly (SPAdes), and FASTQC/multiQC of reads.
4.23.1. Inputs¶
4.23.1.1. Required inputs¶
classify_single.download_annotations.emailAddress
String — Default: None
???
classify_single.kraken2_db_tgz
File — Default: None
Pre-built Kraken database tarball containing three files: hash.k2d, opts.k2d, and taxo.k2d.
classify_single.krona_taxonomy_db_kraken2_tgz
File — Default: None
Krona taxonomy database containing a single file: taxonomy.tab, or possibly just a compressed taxonomy.tab
classify_single.ncbi_taxdump_tgz
File — Default: None
An NCBI taxdump.tar.gz file that contains, at the minimum, a nodes.dmp and names.dmp file.
classify_single.reads_bams
Array[File]+ — Default: None
Reads to classify. May be unmapped or mapped or both, paired-end or single-end. Multiple input files will be merged first.
classify_single.spades.spades_n_reads
Int — Default: 10000000
Subsample reads threshold prior to assembly. Default set to 10000000
classify_single.trim_clip_db
File — Default: None
Adapter sequences to remove via trimmomatic prior to SPAdes assembly
4.23.1.2. Other inputs¶
Show/Hide
classify_single.bbnorm_bam.cpu
Int? — Default: None
???
classify_single.bbnorm_bam.docker
String — Default: "ghcr.io/broadinstitute/viral-ngs:3.0.4-core"
???
classify_single.bbnorm_bam.kmer_length
Int? — Default: None
Kmer length for BBNorm analysis. Longer kmers are more specific but require more memory. (default: bbnorm default of 31)
classify_single.bbnorm_bam.machine_mem_gb
Int? — Default: None
???
classify_single.bbnorm_bam.passes
Int? — Default: None
Number of normalization passes. More passes give more accurate normalization but take longer. (default: 2 for inputs < 15GB, 1 for inputs >= 15GB to optimize runtime)
classify_single.bbnorm_bam.target
Int — Default: 1000
BBNorm target normalization depth. Reads are downsampled to achieve approximately this coverage depth. (default: 10000)
classify_single.deplete.docker
String — Default: "ghcr.io/broadinstitute/viral-ngs:3.0.4-classify"
???
classify_single.deplete.machine_mem_gb
Int — Default: 8
???
classify_single.deplete.minimum_hit_groups
Int? — Default: None
???
classify_single.deplete.taxonomic_ids
Array[Int]? — Default: None
???
classify_single.deplete.withoutChildren
Boolean — Default: false
???
classify_single.download_annotations.docker
String — Default: "ghcr.io/broadinstitute/viral-ngs:3.0.4-phylo"
???
classify_single.filter_acellular.docker
String — Default: "ghcr.io/broadinstitute/viral-ngs:3.0.4-classify"
???
classify_single.filter_acellular.machine_mem_gb
Int — Default: 8
???
classify_single.filter_acellular.minimum_hit_groups
Int? — Default: None
???
classify_single.filter_acellular.taxonomic_ids
Array[Int]? — Default: None
???
classify_single.filter_acellular.withoutChildren
Boolean — Default: false
???
classify_single.kraken2.confidence_threshold
Float? — Default: 0.05
Kraken2 confidence score threshold (0.0-1.0). See https://ccb.jhu.edu/software/kraken2/index.shtml?t=manual#confidence-scoring
classify_single.kraken2.docker
String — Default: "ghcr.io/broadinstitute/viral-ngs:3.0.4-classify"
???
classify_single.kraken2.machine_mem_gb
Int — Default: 90
???
classify_single.kraken2.min_base_qual
Int? — Default: None
Minimum base quality used in classification
classify_single.max_reads_for_assembly
Int — Default: 10000000
???
classify_single.merge_raw_reads.disk_size
Int — Default: 750
???
classify_single.merge_raw_reads.docker
String — Default: "ghcr.io/broadinstitute/viral-ngs:3.0.4-core"
???
classify_single.merge_raw_reads.machine_mem_gb
Int — Default: 8
???
classify_single.merge_raw_reads.out_basename
String — Default: basename(in_bams[0],".bam")
???
classify_single.merge_raw_reads.reheader_table
File? — Default: None
???
classify_single.merge_raw_reads.sample_name
String? — Default: None
???
classify_single.min_reads_for_rmdup
Int — Default: 5000000
???
classify_single.report_primary_kraken_taxa.docker
String — Default: "ghcr.io/broadinstitute/viral-ngs:3.0.4-classify"
???
classify_single.report_primary_kraken_taxa.focal_taxon
String — Default: "Viruses"
???
classify_single.select_references.cpu
Int — Default: 2
???
classify_single.select_references.disk_size
Int — Default: 100
???
classify_single.select_references.docker
String — Default: "ghcr.io/broadinstitute/viral-ngs:3.0.4-assemble"
???
classify_single.select_references.machine_mem_gb
Int — Default: 4
???
classify_single.select_references.skani_c
Int? — Default: None
???
classify_single.select_references.skani_m
Int? — Default: None
???
classify_single.select_references.skani_n
Int? — Default: None
???
classify_single.select_references.skani_s
Int? — Default: None
???
classify_single.spades.cpu
Int? — Default: None
???
classify_single.spades.docker
String — Default: "ghcr.io/broadinstitute/viral-ngs:3.0.4-assemble"
???
classify_single.spades.machine_mem_gb
Int? — Default: None
???
classify_single.spades.sample_name
String — Default: basename(basename(reads_unmapped_bam,".bam"),".taxfilt")
???
classify_single.spades.spades_min_contig_len
Int? — Default: None
Minimum length of output contig.
classify_single.spades.spades_options
String? — Default: None
Display additional options to pass the SPAdes assembler.
classify_single.spikein.cpu
Int? — Default: None
???
classify_single.spikein.docker
String — Default: "ghcr.io/broadinstitute/viral-ngs:3.0.4-core"
???
classify_single.spikein.machine_mem_gb
Int? — Default: None
???
classify_single.spikein.topNHits
Int — Default: 3
???
classify_single.spikein_db
File? — Default: None
ERCC spike-in sequences
classify_single.tax_lookup.set_default_keys
Array[String] — Default: []
???
classify_single.taxa_to_avoid_assembly
Array[String] — Default: ["Vertebrata", "other sequences", "Bacteria"]
???
classify_single.taxa_to_dehost
Array[String] — Default: ["Vertebrata"]
???
classify_single.taxid_to_ref_accessions_tsv
File? — Default: None
???
4.23.2. Outputs¶
classify_single.contigs_fasta
File
???
classify_single.kraken2_focal_taxon_name
String
???
classify_single.kraken2_focal_total_reads
Int
???
classify_single.kraken2_krona_plot
File
Visualize the results of the Kraken2 analysis with Krona, which disaplys taxonmic hierarchiral data in multi-layerd pie.
classify_single.kraken2_reads_report
File
???
classify_single.kraken2_summary_report
File
Kraken report output file.
classify_single.kraken2_top_taxa_report
File
???
classify_single.kraken2_top_taxon_id
String
???
classify_single.kraken2_top_taxon_name
String
???
classify_single.kraken2_top_taxon_num_reads
Int
???
classify_single.kraken2_top_taxon_pct_of_focal
Float
???
classify_single.kraken2_top_taxon_rank
String
???
classify_single.read_counts_acellular
Int
???
classify_single.read_counts_assembly_input
Int
???
classify_single.read_counts_dehosted
Int
???
classify_single.read_counts_prespades_subsample
Int
???
classify_single.read_counts_raw
Int
???
classify_single.reads_acellular_ubam
File
???
classify_single.reads_assembly_input_ubam
File
???
classify_single.reads_dehosted_ubam
File
???
classify_single.skani_contigs_to_refs_dist_tsv
File?
???
classify_single.skani_hits_taxids
Array[String]?
???
classify_single.skani_hits_taxnames
Array[String]?
???
classify_single.skani_num_hits
Int
???
classify_single.spikein_pct_lesser_hits
String?
???
classify_single.spikein_pct_of_total_reads
String?
???
classify_single.spikein_report
File?
???
classify_single.spikein_tophit
String?
???
classify_single.viral_assemble_version
String
???
classify_single.viral_classify_version
String
???
Generated using WDL AID (1.0.0)