Skip to main content

blastp

Tags: blast protein alignment database sample-scope

Search protein sequences against protein database.

This subworkflow uses BLASTP from the NCBI BLAST+ suite to search protein sequences against a protein database. It processes each assembly individually and aggregates the results into a single consolidated report.

Take

blastdb: Channel<Record>
FieldDescription
metaGroovy Record containing sample information
blastdbA compressed tarball containing the BLAST database
query: Path
NameTypeDescription
queryPathPath to protein database for searching against translated sequences

Emit

Published

The sample_outputs and run_outputs emissions are aggregates of output files that will be published in the entry workflow.

sample_outputs

OutputDescription
tsvA tab-delimited summary of alignments (standard BLAST outfmt 6)

run_outputs

OutputDescription
csvAggregated results in CSV format

Module Composition

This subworkflow calls the following modules:

  • blast_blastp - Search a protein database using a protein query.
  • csvtk_concat - Concatenate multiple CSV or TSV files into a single table.

Used By

This subworkflow is used by the following workflows:

  • blastp - Search against protein BLAST databases using protein queries.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub