Skip to main content

csvtk_join

Tags: utility table join merge csv tsv csvtk relational run-scope

Join two CSV or TSV files based on common fields.

Uses csvtk join to merge two tabular files horizontally by matching values in a specified key column (similar to a SQL JOIN). It supports inner, left, right, and outer joins via optional arguments.

Inputs

record (
meta: Record,
csv1: Path,
csv2: Path
)
FieldTypeDescription
metaRecordGroovy Record containing sample information
csv1PathThe first CSV/TSV file (Left table)
csv2PathThe second CSV/TSV file (Right table)
in_format: String
out_format: String
key: String
NameTypeDescription
in_formatStringInput format string ('csv', 'tsv', or a specific delimiter character)
out_formatStringOutput format string ('csv', 'tsv', or a specific delimiter character)
keyStringThe column name(s) or index(es) to use as the join key (e.g., "sample_id" or "1")

Outputs

record (
meta: Record,
csv: Path,
results: Set<Path>,
logs: Set<Path?>,
nf_logs: Set<Path>,
versions: Set<Path>
)
FieldTypeDescription
metaRecordSample information record
csvPathThe joined tabular file (*.csv or *.tsv)
resultsSet<Path>All output files to be published
logsSet<Path?>Optional program specific log files
nf_logsSet<Path>Nextflow-specific log files (e.g. .command.{begin
versionsSet<Path>A YAML formatted file with program versions

Parameters

Used By

Subworkflows

  • teton - Perform taxonomic classification and estimate bacterial genome sizes.

Workflows

  • teton - Taxonomic classification and abundance profiling of metagenomic reads.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub

Version

CSVTK_JOIN:
- csvtk: 0.31.0