srahumanscrubber

Tags: contamination human scrub sra sequencing fastq sample-scope

Remove human contamination from sequencing reads for SRA submission.

This subworkflow uses the SRA Human Scrubber to identify and remove human reads from sequencing data. It first initializes a human reference database and then scrubs the input reads to ensure they meet SRA submission requirements.

The subworkflow uses explicit positional record fields for reads:

  • Input: record(meta, r1, r2, se, lr), where each read slot (r1, r2, se, lr) has type Path? (a nullable path, so unused slots may be empty)

Take

reads: Channel<Record>
Field  Description
meta   Groovy Record containing sample information
r1     Illumina R1 reads (paired-end)
r2     Illumina R2 reads (paired-end)
se     Single-end Illumina reads
lr     Long reads (ONT/PacBio)
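
The shape of the input record can be pictured with a small sketch. This is a Python stand-in for illustration only, not the subworkflow's Nextflow code: the `ReadsRecord` class, the `populated_slots` helper, the `id` key in `meta`, and the file names are all hypothetical. It only shows the idea that each read slot is optional and that a sample fills the slots matching its data type.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import List, Optional


@dataclass(frozen=True)
class ReadsRecord:
    """Hypothetical Python model of record(meta, r1, r2, se, lr)."""
    meta: dict                 # sample information (a Groovy Record in the pipeline)
    r1: Optional[Path] = None  # Illumina R1 reads (paired-end)
    r2: Optional[Path] = None  # Illumina R2 reads (paired-end)
    se: Optional[Path] = None  # single-end Illumina reads
    lr: Optional[Path] = None  # long reads (ONT/PacBio)


def populated_slots(rec: ReadsRecord) -> List[str]:
    """Return the names of the read slots that hold a path, in field order."""
    return [name for name in ("r1", "r2", "se", "lr") if getattr(rec, name) is not None]


# Assumed example samples: one paired-end, one long-read only.
paired = ReadsRecord(
    meta={"id": "sample1"},
    r1=Path("sample1_R1.fastq.gz"),
    r2=Path("sample1_R2.fastq.gz"),
)
long_only = ReadsRecord(meta={"id": "sample2"}, lr=Path("sample2.fastq.gz"))

print(populated_slots(paired))
print(populated_slots(long_only))
```

Keeping every slot in a fixed position, even when empty, means downstream steps can refer to a read type by field name rather than inspecting a list of files.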

Emit

Published

The sample_outputs and run_outputs emissions aggregate the output files that the entry workflow publishes.

sample_outputs

Output          Description
special_meta    Groovy Record with name for downstream aggregation
scrubbed        Scrubbed FASTQ files with human reads removed
scrubbed_extra  Placeholder files for pipeline compatibility
scrub_report    Report of scrubbing statistics

run_outputs

No run-scope outputs.

Module Composition

This subworkflow calls the following modules:

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub