blast_tblastx
Tags: blast tblastx alignment translation dna search fasta sample-scope
Search a translated nucleotide database using a translated nucleotide query.
Uses TBLASTX to align nucleotide query sequences (translated in all six frames) against a nucleotide BLAST database (also translated in all six frames). This is useful for identifying distant relationships between nucleotide sequences that have significant divergence but conserved protein structure.
Inputs
record (
meta: Record,
blastdb: Path
)
| Field | Type | Description |
|---|---|---|
meta | Record | Groovy Record containing sample information |
blastdb | Path | A compressed tarball containing the nucleotide BLAST database |
query: Path
| Name | Type | Description |
|---|---|---|
query | Path | FASTA file containing nucleotide query sequences |
Outputs
record (
meta: Record,
tsv: Path,
results: Set<Path>,
logs: Set<Path?>,
nf_logs: Set<Path>,
versions: Set<Path>
)
| Field | Type | Description |
|---|---|---|
meta | Record | Sample information record |
tsv | Path | Tab-delimited translated nucleotide-to-translated nucleotide alignment results (BLAST outfmt 6) |
results | Set<Path> | All output files to be published |
logs | Set<Path?> | Optional program specific log files |
nf_logs | Set<Path> | Nextflow-specific log files (e.g. .command.{begin |
versions | Set<Path> | A YAML formatted file with program versions |
Parameters
TBLASTX Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
--tblastx_query | string | A fasta file containing the query sequences to BLAST against the database | |
--tblastx_outfmt | string | sseqid qseqid pident qlen slen length nident positive mismatch gapopen gaps qstart qend sstart send evalue bitscore | The columns to include with -outfmt 6 |
--tblastx_opts | string | Additional options to pass to BLASTN | |
--tblastx_qcov_hsp_perc | integer | 50 | Percent query coverage per hsp |
--tblastx_max_target_seqs | integer | 2000 | Maximum number of aligned sequences to keep |
Used By
Subworkflows
- tblastx - Translate nucleotide query sequences and search nucleotide database.
Workflows
- tblastx - Search against translated nucleotide databases using translated nucleotide queries.
Citations
If you use this in your analysis, please cite the following.
-
Bactopia
Petit III RA, Read TD Bactopia - a flexible pipeline for complete analysis of bacterial genomes. mSystems 5 (2020) -
BLAST
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL BLAST+: architecture and applications. BMC Bioinformatics 10, 421 (2009)
Source
Version
BLAST_TBLASTX:
- blast: 2.17.0