Skip to main content

bracken

Tags: metagenomics taxonomic-classification abundance-estimation kraken2 bracken sample-scope

Estimate species abundance from metagenomic reads.

This subworkflow performs taxonomic classification and abundance estimation using Kraken2 and Bracken. It processes metagenomic reads, classifies them against a reference database, and generates abundance estimates at different taxonomic levels with optional abundance correction.

Uses explicit positional record fields for reads:

  • Input: record(meta, r1, r2, se, lr) where each read slot is Path?

Take

reads: Channel<Record>
FieldDescription
metaGroovy Record containing sample information
r1Illumina R1 reads (paired-end)
r2Illumina R2 reads (paired-end)
seSingle-end Illumina reads
lrLong reads (ONT/PacBio)
database: Path
NameTypeDescription
databasePathPath to the Kraken2 database for taxonomic classification.

Emit

Published

The sample_outputs and run_outputs emissions are aggregates of output files that will be published in the entry workflow.

sample_outputs

OutputDescription
tsvTab-delimited summary of Bracken primary and secondary species abundances
special_metaA simplified metadata record for internal use
classifiedReads classified to belong to any of the taxa on the Kraken2 database
unclassifiedReads not classified to belong to any of the taxa on the Kraken2 database
kraken2_reportKraken2 report containing stats about classified and not classified reads
kraken2_outputKraken2 output file containing the taxonomic classification of each read
bracken_reportBracken report containing stats about classified and not classified reads
kronaInteractive Krona HTML visualization
abundancesBracken abundance estimates for each taxon
classificationBracken per-read classification details
adjusted_abundancesBracken abundance estimates adjusted for unclassified reads

run_outputs

OutputDescription
csvAggregated results in CSV format

Module Composition

This subworkflow calls the following modules:

  • bracken - Taxonomic classification and abundance estimation.
  • csvtk_concat - Concatenate multiple CSV or TSV files into a single table.
  • csvtk_concat - Concatenate multiple CSV or TSV files into a single table.

Used By

This subworkflow is used by the following workflows:

  • bracken - Estimate taxonomic abundance of metagenomic samples.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub