Skip to main content

panaroo

Tags: pangenome pan-genome comparative-genomics core-genome alignment run-scope

Build a pangenome from GFF3 annotations using Panaroo.

This subworkflow creates a pangenome from bacterial genome annotations using Panaroo. Panaroo is a pangenome pipeline that produces polished pangenomes by removing errors and contamination from input annotations. It generates gene presence/absence matrices and core-genome alignments suitable for downstream phylogenetic analysis.

Take

gff: Channel<Record>
FieldDescription
metaGroovy Record containing sample information
gffSet of GFF3 annotation files representing the genomic annotations for each sample

Emit

Published

The sample_outputs and run_outputs emissions are aggregates of output files that will be published in the entry workflow.

sample_outputs

No sample-scope outputs.

run_outputs

OutputDescription
alnCore genome alignment in FASTA format (optional)
filtered_alnCore genome alignment with recombinant regions filtered out (optional)
csvGene presence/absence matrix in Roary-compatible CSV format (optional)
panaroo_csvGene presence/absence matrix in Panaroo's native CSV format (optional)
supplementalDirectory containing Panaroo intermediate files and data structures

Module Composition

This subworkflow calls the following modules:

  • panaroo_run - Fast and scalable bacterial pangenome analysis using a graph-based approach.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub