Run BLAST, parse hits and more
Project description
BlasTax
A graphical user interface to run BLAST and parse hits:
- Make BLAST database: Create a BLAST database from a sequence file
- Regular BLAST: Find regions of similarity between sequences in a query file and a BLAST database
- BLAST-Append: Append the aligned part of matching sequences to the original query sequences
- BLAST-Append-X: Like BLAST-Append, but appends nucleotides c orresponding to the protein database
- Decontaminate: Remove contaminants from query sequences based on two ingroup and outgroup databases
- Decontamination by taxonomy: Filter query sequences based on BLAST matches and taxon IDs
- Assign taxonomy: Assign taxonomic information to each query sequence based on the best BLAST hit
- Museoscript: Create sequence files from BLAST matches
- Database operations: Extract sequences from a BLAST database and save them in FASTA format
The program also includes a variety of tools for processing FASTA files:
- Fast prepare: Rename FASTA sequence identifiers in preparation for BLAST analysis
- Fast split: Split large sequences or text files into smaller files
- Fast merge: Merge multiple sequences or text files into a single large file
- Group merge: Merge FASTA files by filename
- Removal of stop codons: Remove stop codons from a dataset
- Codon trimming: Trim coding sequences to start with first codon position
Some extra tools are also available:
- SCaFoSpy: Create chimerical sequences for species
- Protein translator: Generate protein translations for each sequence
- MAFFT alignment: Multiple sequence alignment using MAFFTpy
- Codon-aware alignment: Align nucleotide sequences as proteins without altering codons
- Cutadapt: Remove adapter sequences and quality trimming
Input sequences must be in the FASTA or FASTQ file formats.
Executables
Download and run the standalone executables without installing Python or BLAST+.
Installation
BlasTax is available on PyPI and can be installed using pip:
pip install itaxotools-blastax
After installation, run the program with:
blastax
If the BLAST+ binaries are not found in your system PATH at runtime, the program will prompt you to automatically download them to your system-specific configuration directory (under iTaxoTools/BlasTax, see platformdirs).
To reset the location where the program looks for BLAST+, run:
blastax --reset
Citations
BlasTax was developed in the framework of the iTaxoTools project:
Vences M. et al. (2021): iTaxoTools 0.1: Kickstarting a specimen-based software toolkit for taxonomists. - Megataxa 6: 77-92.
Code by Nikita Kulikov, Anja-Kristina Schulz and Stefanos Patmanidis.
BlasTax integrates the BLAST+ suite from NCBI:
Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., and Madden, T.L. 2009. BLAST+: architecture and applications. BMC Bioinformatics, 10, 421.
Cutadapt is included as a Python module to remove adapter sequences from high-throughput sequencing reads:
Martin, M. (2011). Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal, 17(1), 10-12.
Museoscript was rewritten following the original concept of the Linux bash script:
Rancilhac, L., Bruy, T., Scherz, M. D., Pereira, E. A., Preick, M., Straube, N., Lyra, M. L., Ohler, A., Streicher, J. W., Andreone, F., Crottini, A., Hutter, C. R., Randrianantoandro,J. C., Rokotoarison, A., Glaw, F., Hofreiter, M. & Vences, M. (2020). Target-enriched DNA sequencing from historical type material enables a partial revision of the Madagascar giant stream frogs (genus Mantidactylus). Journal of Natural History, 1-32.
MAFFT is a multiple sequence alignment program that was integrated using MAFFTpy:
Katoh, K., Misawa, K., Kuma, K., & Miyata, T. (2002). MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research, 30(14), 3059-3066.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file itaxotools_blastax-1.0.0.tar.gz.
File metadata
- Download URL: itaxotools_blastax-1.0.0.tar.gz
- Upload date:
- Size: 8.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0639d6a89ef685d220c538b6f9c54aaafde4d398be6450df1b6f5ffec99263b1
|
|
| MD5 |
77220a7489d489645c54b94d1a964054
|
|
| BLAKE2b-256 |
ae67b50a5ef7a028964d7cb376babb4c9fe78dbb4206d7a5a9cc269eb8e2d2a9
|
Provenance
The following attestation bundles were made for itaxotools_blastax-1.0.0.tar.gz:
Publisher:
deploy.yml on iTaxoTools/BlasTax
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
itaxotools_blastax-1.0.0.tar.gz -
Subject digest:
0639d6a89ef685d220c538b6f9c54aaafde4d398be6450df1b6f5ffec99263b1 - Sigstore transparency entry: 1209268309
- Sigstore integration time:
-
Permalink:
iTaxoTools/BlasTax@bcdadbe8b42d2e32f9bc35f5b18a1f241535a2b5 -
Branch / Tag:
refs/tags/v1.0.0 - Owner: https://github.com/iTaxoTools
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
deploy.yml@bcdadbe8b42d2e32f9bc35f5b18a1f241535a2b5 -
Trigger Event:
push
-
Statement type:
File details
Details for the file itaxotools_blastax-1.0.0-py3-none-any.whl.
File metadata
- Download URL: itaxotools_blastax-1.0.0-py3-none-any.whl
- Upload date:
- Size: 274.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dba406f8a4737b35033bddb0f3af0cab5a2895a45f333b35bb571c4079bd60e9
|
|
| MD5 |
46445ef4320d10ecddbdc7b5c6fb7fd1
|
|
| BLAKE2b-256 |
875575e3485f5bea6b165b76baff1fdb30c3717d9ff09d2a2676a2a68d061b92
|
Provenance
The following attestation bundles were made for itaxotools_blastax-1.0.0-py3-none-any.whl:
Publisher:
deploy.yml on iTaxoTools/BlasTax
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
itaxotools_blastax-1.0.0-py3-none-any.whl -
Subject digest:
dba406f8a4737b35033bddb0f3af0cab5a2895a45f333b35bb571c4079bd60e9 - Sigstore transparency entry: 1209268367
- Sigstore integration time:
-
Permalink:
iTaxoTools/BlasTax@bcdadbe8b42d2e32f9bc35f5b18a1f241535a2b5 -
Branch / Tag:
refs/tags/v1.0.0 - Owner: https://github.com/iTaxoTools
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
deploy.yml@bcdadbe8b42d2e32f9bc35f5b18a1f241535a2b5 -
Trigger Event:
push
-
Statement type: