Skip to main content

gtdbtk_classifywf

Tags: taxonomy classification phylogeny gtdb bacteria archaea marker-genes sample-scope

Taxonomic classification of bacterial and archaeal genomes using GTDB-Tk.

Uses GTDB-Tk to assign objective taxonomic classifications to genome assemblies based on the Genome Taxonomy Database. It identifies marker genes, aligns them, and places the genome into the reference tree to determine taxonomy.

Database Required

Requires the massive GTDB-Tk database (~60GB+) to be available.

Inputs

record (
meta: Record,
fna: Path
)
FieldTypeDescription
metaRecordGroovy Record containing sample information
fnaPathAssembled contigs in FASTA format
db: Path
NameTypeDescription
dbPathPath (or Set of paths) to the GTDB-Tk reference database

Outputs

record (
meta: Record,
bac_tsv: Path?,
ar_tsv: Path?,
results: Set<Path>,
logs: Set<Path?>,
nf_logs: Set<Path>,
versions: Set<Path>
)
FieldTypeDescription
metaRecordSample information record
bac_tsvPath?The bacterial classification summary file containing the taxonomic assignment
ar_tsvPath?The archaeal classification summary file containing the taxonomic assignment
resultsSet<Path>All output files to be published
logsSet<Path?>Optional program specific log files
nf_logsSet<Path>Nextflow-specific log files (e.g. .command.{begin
versionsSet<Path>A YAML formatted file with program versions

Parameters

GTDB Classify Parameters

ParameterTypeDefaultDescription
--gtdb_min_afnumber0.65Minimum alignment fraction to consider closest genome
--gtdb_min_perc_aainteger10Filter genomes with an insufficient percentage of AA in the MSA
--force_gtdbbooleanfalseContinue processing if an error occurs on a single genome

Used By

Subworkflows

  • gtdb - Taxonomic classification with the Genome Taxonomy Database.

Workflows

  • gtdb - Identify marker genes and assign taxonomic classifications using GTDB.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub

Version

GTDBTK_CLASSIFYWF:
- gtdbtk: 2.7.1