Skip to main content

blast_blastp

Tags: blast blastp alignment protein amino-acid search fasta sample-scope

Search a protein database using a protein query.

Uses BLASTP to align amino acid query sequences (FASTA) against a protein BLAST database. It is used to identify homologous proteins.

Inputs

record (
meta: Record,
blastdb: Path
)
FieldTypeDescription
metaRecordGroovy Record containing sample information
blastdbPathA compressed tarball containing the protein BLAST database
query: Path
NameTypeDescription
queryPathFASTA file containing amino acid query sequences

Outputs

record (
meta: Record,
tsv: Path,
results: Set<Path>,
logs: Set<Path?>,
nf_logs: Set<Path>,
versions: Set<Path>
)
FieldTypeDescription
metaRecordSample information record
tsvPathTab-delimited protein alignment results (BLAST outfmt 6)
resultsSet<Path>All output files to be published
logsSet<Path?>Optional program specific log files
nf_logsSet<Path>Nextflow-specific log files (e.g. .command.{begin
versionsSet<Path>A YAML formatted file with program versions

Parameters

BLASTP Parameters

ParameterTypeDefaultDescription
--blastp_querystringA fasta file containing the query sequences to BLAST against the database
--blastp_outfmtstringsseqid qseqid pident qlen slen length nident positive mismatch gapopen gaps qstart qend sstart send evalue bitscoreThe columns to include with -outfmt 6
--blastp_optsstringAdditional options to pass to BLASTN
--blastp_qcov_hsp_percinteger50Percent query coverage per hsp
--blastp_max_target_seqsinteger2000Maximum number of aligned sequences to keep

Used By

Subworkflows

  • blastp - Search protein sequences against protein database.

Workflows

  • blastp - Search against protein BLAST databases using protein queries.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub

Version

BLAST_BLASTP:
- blast: 2.17.0