Skip to main content

panaroo_run

Tags: pan-genome orthologs core-genome gene-presence-absence graph-based annotation run-scope

Fast and scalable bacterial pangenome analysis using a graph-based approach.

Uses Panaroo to cluster genes from multiple annotated bacterial genomes into orthologous groups, correcting for gene splitting and merges. The primary outputs are the gene presence/absence matrix (the pan-genome) and a core-genome alignment (for phylogenetics).

Inputs

record (
meta: Record,
gff: Set<Path>
)
FieldTypeDescription
metaRecordGroovy Record containing sample information
gffSet<Path>A list of annotated genome files in GFF3 format (required input)

Outputs

record (
meta: Record,
aln: Path?,
filtered_aln: Path?,
csv: Path?,
panaroo_csv: Path?,
results: Set<Path>,
logs: Set<Path?>,
nf_logs: Set<Path>,
versions: Set<Path>
)
FieldTypeDescription
metaRecordSample information record
alnPath?The core-genome alignment (*core-genome.aln.gz), suitable for phylogenetic tree building
filtered_alnPath?The core-genome alignment with highly recombinant regions filtered out
csvPath?Gene presence/absence matrix in Roary-compatible CSV format
panaroo_csvPath?Gene presence/absence matrix in Panaroo's native CSV format
resultsSet<Path>All output files to be published
logsSet<Path?>Optional program specific log files
nf_logsSet<Path>Nextflow-specific log files (e.g. .command.{begin
versionsSet<Path>A YAML formatted file with program versions

Parameters

Panaroo Run Parameters

ParameterTypeDefaultDescription
--panaroo_merge_paralogsbooleanfalseDo not split paralogs
--panaroo_optsstringAdditional options to pass to panaroo

Used By

Subworkflows

  • panaroo - Build a pangenome from GFF3 annotations using Panaroo.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub

Version

PANAROO_RUN:
- panaroo: 1.6.0