Skip to main content

pangenome

Tags: alignment core-genome pan-genome phylogeny comparative-genomics run-scope

Perform pangenome analysis with optional core-genome phylogeny.

This subworkflow creates a pangenome from GFF3 annotation files using one of three tools: Panaroo (default), PIRATE, or Roary. It generates core-genome alignments and gene presence/absence matrices, followed by SNP distance calculations using snp-dists. The workflow conditionally executes the selected pangenome tool based on Boolean parameters.

Take

gff: Channel<Record>
FieldDescription
metaGroovy Record containing sample information
gffSet of GFF3 annotation files from assembled genomes
use_pirate: Boolean
use_roary: Boolean
NameTypeDescription
use_pirateBooleanBoolean flag to use PIRATE for pangenome analysis
use_roaryBooleanBoolean flag to use Roary for pangenome analysis

Emit

Published

The sample_outputs and run_outputs emissions are aggregates of output files that will be published in the entry workflow.

sample_outputs

No sample-scope outputs.

run_outputs

OutputDescription
alnCore-genome alignment in FASTA format
csvGene presence/absence matrix
supplementalIntermediate files and detailed outputs
tsvPairwise SNP distance matrix from core-genome alignment

Downstream Inputs

The following emissions are meant to be used as inputs to downstream subworkflows.

alignment

OutputDescription
alnCore-genome alignment for downstream analysis (e.g., recombination detection)

phylogeny_input

OutputDescription
alnCore-genome alignment with iqtree-ready meta for phylogeny construction

csv

OutputDescription
csvGene presence/absence matrix for downstream analysis (e.g., pan-GWAS)

Subworkflow Composition

This subworkflow calls the following subworkflows:

  • pirate - Build a pangenome from GFF3 annotations using PIRATE.
  • roary - Build a pangenome from GFF3 annotations using Roary.
  • panaroo - Build a pangenome from GFF3 annotations using Panaroo.
  • snpdists - Calculate pairwise SNP distances from sequence alignments.

Used By

This subworkflow is used by the following workflows:

  • pangenome - Pangenome analysis with optional core-genome phylogeny.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub