Skip to main content

pirate

Tags: pan-genome orthologs core-genome gene-presence-absence epidemiology annotation run-scope

Pangenome Identification and Reconciliation Analysis Tool for Epidemiology (PIRATE).

Uses PIRATE to construct the pangenome of a collection of bacterial isolates. It clusters orthologous genes and generates the core genome alignment and a gene presence/absence matrix, which is compatible with downstream analysis tools like Scoary for association testing.

Inputs

record (
meta: Record,
gff: Set<Path>
)
FieldTypeDescription
metaRecordGroovy Record containing sample information
gffSet<Path>A list of annotated genome files in GFF3 format

Outputs

record (
meta: Record,
aln: Path?,
csv: Path?,
results: Set<Path>,
logs: Set<Path?>,
nf_logs: Set<Path>,
versions: Set<Path>
)
FieldTypeDescription
metaRecordSample information record
alnPath?The core-genome alignment (*core-genome.aln.gz), suitable for phylogenetic tree building
csvPath?Gene presence/absence matrix in CSV format, compatible with Scoary
resultsSet<Path>All output files to be published
logsSet<Path?>Optional program specific log files
nf_logsSet<Path>Nextflow-specific log files (e.g. .command.{begin
versionsSet<Path>A YAML formatted file with program versions

Parameters

PIRATE Parameters

ParameterTypeDefaultDescription
--use_piratebooleanfalseUse PIRATE instead of panaroo in the 'pangenome' subworkflow
--pirate_stepsstring50,60,70,80,90,95,98Percent identity thresholds to use for pangenome construction
--pirate_featuresstringCDSComma-delimited features to use for pangenome construction
--pirate_para_offbooleanfalseSwitch off paralog identification

Used By

Subworkflows

  • pirate - Build a pangenome from GFF3 annotations using PIRATE.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub

Version

PIRATE:
- pirate: 1.0.5