Skip to main content

checkm2_predict

Tags: quality-control completeness contamination machine-learning bacteria archaea sample-scope

Assess genome quality using machine learning.

Uses CheckM2 to predict the completeness and contamination of genome assemblies. Unlike the original CheckM, it uses a gradient boost machine learning model to predict quality without relying on lineage-specific marker sets, making it more accurate for novel or reduced genomes.

Database Required

Requires the CheckM2 database (Diamond database file) to be available.

Inputs

record (
meta: Record,
fna: Path
)
FieldTypeDescription
metaRecordGroovy Record containing sample information
fnaPathAssembled contigs in FASTA format
db: Path
NameTypeDescription
dbPathThe CheckM2 database file (*.dmnd)

Outputs

record (
meta: Record,
tsv: Path,
results: Set<Path>,
logs: Set<Path?>,
nf_logs: Set<Path>,
versions: Set<Path>
)
FieldTypeDescription
metaRecordSample information record
tsvPathTab-delimited report of quality metrics (Completeness, Contamination)
resultsSet<Path>All output files to be published
logsSet<Path?>Optional program specific log files
nf_logsSet<Path>Nextflow-specific log files (e.g. .command.{begin
versionsSet<Path>A YAML formatted file with program versions

Parameters

CheckM2 Parameters

ParameterTypeDefaultDescription
--checkm2_lowmembooleanLow memory mode. Reduces DIAMOND blocksize to significantly reduce RAM usage at the expense of longer runtime
--checkm2_generalbooleanForce the use of the general quality prediction model (gradient boost)
--checkm2_specificbooleanForce the use of the specific quality prediction model (neural network)
--checkm2_allmodelsbooleanOutput quality prediction for both models for each genome.
--checkm2_genesbooleanTreat input files as protein files. [Default: False]
--checkm2_optsstringAdditional options to pass to CheckM2

Used By

Subworkflows

  • checkm2 - Assess metagenome bin completeness using CheckM2.

Workflows

  • checkm2 - Machine learning-based assessment of microbial genome assembly quality.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub

Version

CHECKM2_PREDICT:
- checkm2: 1.1.0