Skip to main content

merlin_dist

Tags: merlin mash routing logic genus-specific automation sample-scope

Identify species to trigger genus-specific downstream analyses (Merlin).

This is a specialized process for the Merlin workflow. It runs mash dist against a reference database and parses the results to detect specific genera (e.g., Salmonella, Staphylococcus). Based on the detected genus, it outputs data into specific channels to trigger targeted tools (e.g., finding Salmonella triggers Sistr).

Inputs

record (
meta: Record,
fna: Path,
r1: Path?,
r2: Path?,
se: Path?,
lr: Path?
)
FieldTypeDescription
metaRecordGroovy Record containing sample information
fnaPathAssembled contigs in FASTA format
r1Path?Illumina R1 reads (paired-end)
r2Path?Illumina R2 reads (paired-end)
sePath?Single-end Illumina reads
lrPath?Long reads (ONT/PacBio)
reference: Path
NameTypeDescription
referencePathThe reference Mash database to screen against

Outputs

record (
meta: Record,
fna: Path,
r1: Path,
r2: Path,
se: Path,
lr: Path,
escherichia: Path?,
haemophilus: Path?,
klebsiella: Path?,
legionella: Path?,
listeria: Path?,
mycobacterium: Path?,
neisseria: Path?,
pseudomonas: Path?,
salmonella: Path?,
staphylococcus: Path?,
streptococcus: Path?,
genus: Set<Path?>,
dist: Path,
results: Set<Path>,
logs: Set<Path?>,
nf_logs: Set<Path>,
versions: Set<Path>
)
FieldTypeDescription
metaRecordSample information record
fnaPathPassthrough of assembled contigs
r1PathPassthrough of Illumina R1 reads
r2PathPassthrough of Illumina R2 reads
sePathPassthrough of single-end reads
lrPathPassthrough of long reads
escherichiaPath?Conditional marker file triggering Escherichia analysis tools
haemophilusPath?Conditional marker file triggering Haemophilus analysis tools
klebsiellaPath?Conditional marker file triggering Klebsiella analysis tools
legionellaPath?Conditional marker file triggering Legionella analysis tools
listeriaPath?Conditional marker file triggering Listeria analysis tools
mycobacteriumPath?Conditional marker file triggering Mycobacterium analysis tools
neisseriaPath?Conditional marker file triggering Neisseria analysis tools
pseudomonasPath?Conditional marker file triggering Pseudomonas analysis tools
salmonellaPath?Conditional marker file triggering Salmonella analysis tools
staphylococcusPath?Conditional marker file triggering Staphylococcus analysis tools
streptococcusPath?Conditional marker file triggering Streptococcus analysis tools
genusSet<Path?>Marker file indicating the detected genus
distPathRaw Mash distance results
resultsSet<Path>All output files to be published
logsSet<Path?>Optional program specific log files
nf_logsSet<Path>Nextflow-specific log files (e.g. .command.{begin
versionsSet<Path>A YAML formatted file with program versions

Parameters

mashdist Parameters

ParameterTypeDefaultDescription
--mash_sketchstringThe reference sequence as a Mash Sketch (.msh file)
--full_merlinbooleanfalseGo full Merlin and run all species-specific tools, no matter the Mash distance

Used By

Subworkflows

  • merlindist - Identify species from assembly and read data using Mash distances.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub

Version

MERLIN_DIST:
- mash: 2.3