Skip to main content

gubbins

Tags: recombination phylogeny filter snp core-genome run-scope

Detect and filter recombination regions in bacterial alignments.

This subworkflow uses Gubbins (Globally Optimised Bacterial Phylogenomic analysis) to identify recombination regions in bacterial core-genome alignments. It iteratively filters out recombination to produce a recombination-free phylogeny, then calculates SNP distances from the masked alignment. Gubbins is essential for accurate phylogenetic reconstruction of recombining bacterial species.

Take

alignment: Channel<Record>
FieldDescription
metaGroovy Record containing sample information
alnMultiple sequence alignment in FASTA format

Emit

Published

The sample_outputs and run_outputs emissions are aggregates of output files that will be published in the entry workflow.

sample_outputs

No sample-scope outputs.

run_outputs

OutputDescription
masked_alnRecombination-masked alignment in FASTA format
fastaConcatenated alignment before masking in FASTA format
gffGFF file containing recombination region coordinates
vcfVCF file containing SNPs filtered by Gubbins
statsSummary statistics of the Gubbins analysis
phylipRecombination-masked alignment in PHYLIP format
embl_predictedRecombination predictions in EMBL format
embl_branchBranch-specific recombination in EMBL format
treeMaximum likelihood tree from filtered SNPs in Newick format
tree_labelledAnnotated tree with node labels in Newick format
bootstrap_treeBootstrapped phylogenetic tree in Newick format
tsvPairwise SNP distances from masked alignment in TSV format

Downstream Inputs

The following emissions are meant to be used as inputs to downstream subworkflows.

alignment

OutputDescription
alnRecombination-masked alignment for downstream phylogenetic analysis

Subworkflow Composition

This subworkflow calls the following subworkflows:

  • snpdists - Calculate pairwise SNP distances from sequence alignments.

Module Composition

This subworkflow calls the following modules:

  • gubbins - Detect recombination and construct a recombination-free phylogeny.

Used By

This subworkflow is used by the following workflows:

  • snippy - Rapid haplotype variant calling and core genome alignment.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub