Skip to main content

snippy_run

Tags: variant-calling snp reference-mapping phylogenetics outbreak sample-scope

Call variants against a reference genome using Snippy.

This subworkflow performs rapid haploid variant calling from bacterial sequence reads using Snippy. It maps reads to a reference genome, identifies SNPs and indels, and generates consensus sequences. The tool produces multiple output formats including VCF, aligned FASTA, and annotated variants for downstream phylogenetic analysis with snippy-core.

Uses explicit positional record fields for reads:

  • Input: record(meta, r1, r2, se, lr) where each read slot is Path?

Take

reads: Channel<Record>
FieldDescription
metaGroovy Record containing sample information
r1Illumina R1 reads (paired-end)
r2Illumina R2 reads (paired-end)
seSingle-end Illumina reads
lrLong reads (ONT/PacBio)
reference: Path
NameTypeDescription
referencePathReference genome in GenBank format (preferred, for annotation) or FASTA format

Emit

Published

The sample_outputs and run_outputs emissions are aggregates of output files that will be published in the entry workflow.

sample_outputs

OutputDescription
aligned_faA version of the reference with - at zero coverage positions
vcfThe final annotated variants in VCF format
aligned_fa_errorAligned FASTA file generated during error state
vcf_errorVCF file generated during error state
errorError log text file
annotated_vcfAnnotated VCF file
bamThe alignments in BAM format (includes unmapped/multimapping)
baiIndex for the BAM file
bedThe variants in BED format
consensus_faReference genome with all variants instantiated
consensus_subs_faReference genome with only substitution variants instantiated
consensus_subs_masked_faReference genome with substitutions instantiated and low coverage masked
coveragePer-base coverage depth information
csvA comma-separated summary of variants
filt_vcfThe filtered variant calls from Freebayes
gffThe variants in GFF3 format
htmlA HTML summary of the variants
raw_vcfThe unfiltered variant calls from Freebayes
subs_vcfVCF containing only substitution variants
tabA simple tab-separated summary of all variants
txtTab-separated columnar list of alignment statistics

run_outputs

No run-scope outputs.

Downstream Inputs

The following emissions are meant to be used as inputs to downstream subworkflows.

variants

Per-sample VCFs and aligned FAs filtered to only samples with variant data

Module Composition

This subworkflow calls the following modules:

  • snippy_run - Rapid haploid variant calling and core genome alignment.

Used By

This subworkflow is used by the following workflows:

  • snippy - Rapid haplotype variant calling and core genome alignment.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub