Run BLAST, parse hits and more

Project description

BlasTax

A graphical user interface to run BLAST and parse hits:

  • Make BLAST database: Create a BLAST database from a sequence file
  • Regular BLAST: Find regions of similarity between sequences in a query file and a BLAST database
  • BLAST-Append: Append the aligned part of matching sequences to the original query sequences
  • BLAST-Append-X: Like BLAST-Append, but appends nucleotides corresponding to the protein database
  • Decontaminate: Remove contaminants from query sequences based on two databases, an ingroup and an outgroup
  • Decontamination by taxonomy: Filter query sequences based on BLAST matches and taxon IDs
  • Assign taxonomy: Assign taxonomic information to each query sequence based on the best BLAST hit
  • Museoscript: Create sequence files from BLAST matches
  • Database operations: Extract sequences from a BLAST database and save them in FASTA format
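
To make the "best BLAST hit" idea behind the taxonomy assignment concrete, here is a minimal sketch that keeps the highest-scoring hit per query from BLAST+ tabular output (`-outfmt 6` with its default twelve columns). It does not reproduce BlasTax's own code: the function name `best_hits`, the use of bitscore as the ranking criterion, and the example rows are all assumptions made for illustration.

```python
import csv
import io

# Column order of BLAST+ tabular output (-outfmt 6) with default fields.
OUTFMT6_FIELDS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]


def best_hits(handle):
    """Return {query_id: row}, keeping the highest-bitscore hit per query.

    Hypothetical helper: ranking by bitscore is an assumption, not
    necessarily the criterion BlasTax applies.
    """
    best = {}
    reader = csv.DictReader(handle, fieldnames=OUTFMT6_FIELDS, delimiter="\t")
    for row in reader:
        row["bitscore"] = float(row["bitscore"])
        current = best.get(row["qseqid"])
        if current is None or row["bitscore"] > current["bitscore"]:
            best[row["qseqid"]] = row
    return best


# Made-up example rows, not real BLAST output.
example = io.StringIO(
    "q1\tsubjA\t98.5\t200\t3\t0\t1\t200\t1\t200\t1e-100\t360\n"
    "q1\tsubjB\t91.0\t200\t18\t0\t1\t200\t1\t200\t1e-80\t290\n"
    "q2\tsubjC\t99.0\t150\t1\t0\t1\t150\t5\t154\t1e-70\t270\n"
)
hits = best_hits(example)
for query, row in hits.items():
    print(query, row["sseqid"], row["bitscore"])
```

In a taxonomy workflow, the subject identifier of each best hit would then be looked up in whatever mapping links database sequences to taxa.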

The program also includes a variety of tools for processing FASTA files:

  • Fast prepare: Rename FASTA sequence identifiers in preparation for BLAST analysis
  • Fast split: Split large sequence or text files into smaller files
  • Fast merge: Merge multiple sequence or text files into a single large file
  • Group merge: Merge FASTA files by filename
  • Removal of stop codons: Remove stop codons from a dataset
  • Codon trimming: Trim coding sequences to start at the first codon position
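
As an illustration of what stop codon removal can look like, the sketch below reads FASTA records and drops every in-frame stop codon. It is only one plausible reading of the feature: the helper names `read_fasta` and `strip_stop_codons`, the use of the standard genetic code, and the assumption that each sequence starts in frame are not taken from BlasTax.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumption)


def strip_stop_codons(sequence):
    """Drop every in-frame stop codon, reading codons from position 0.

    The sequence is upper-cased; a trailing partial codon is kept as-is.
    """
    sequence = sequence.upper()
    codons = [sequence[i:i + 3] for i in range(0, len(sequence), 3)]
    return "".join(c for c in codons if c not in STOP_CODONS)


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif line:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


# Made-up records: seq1 ends with a stop codon, seq2 has one in the middle.
records = [">seq1", "ATGGCC", "TAA", ">seq2", "ATGTGAGCC"]
cleaned = {name: strip_stop_codons(seq) for name, seq in read_fasta(records)}
print(cleaned)
```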

Some extra tools are also available:

  • SCaFoSpy: Create chimerical sequences for species
  • Protein translator: Generate protein translations for each sequence
  • MAFFT alignment: Multiple sequence alignment using MAFFTpy
  • Codon-aware alignment: Align nucleotide sequences as proteins without altering codons
  • Cutadapt: Remove adapter sequences and perform quality trimming
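
The protein translation step can be sketched in a few lines. This is an illustrative example only, assuming the standard genetic code (NCBI translation table 1) and reading frame 1; the function name `translate` and the choice to emit `X` for codons containing ambiguous bases are assumptions, and BlasTax may offer other tables and frames.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code (NCBI translation table 1),
# listed in TCAG order for the first, second and third codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a nucleotide sequence in frame 1.

    Codons with unknown bases become 'X'; trailing bases that do not
    form a complete codon are ignored.
    """
    sequence = sequence.upper().replace("U", "T")
    protein = []
    for i in range(0, len(sequence) - 2, 3):
        protein.append(CODON_TABLE.get(sequence[i:i + 3], "X"))
    return "".join(protein)


print(translate("ATGGCCTAA"))  # prints MA*
```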

Input sequences must be in the FASTA or FASTQ file formats.

Executables

Download and run the standalone executables without installing Python or BLAST+.

Installation

BlasTax is available on PyPI and can be installed using pip:

pip install itaxotools-blastax

After installation, run the program with:

blastax

If the BLAST+ binaries are not found in your system PATH at runtime, the program will prompt you to automatically download them to your system-specific configuration directory (under iTaxoTools/BlasTax, see platformdirs).

To reset the location where the program looks for BLAST+, run:

blastax --reset
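
Before launching the program, you can check for yourself whether BLAST+ is visible on your PATH. The following is a minimal sketch using only the Python standard library; the list of tool names is an assumption about which BLAST+ programs matter, and the check is not necessarily the same lookup BlasTax performs internally.

```python
import shutil

# A few BLAST+ program names. Which of these BlasTax actually requires
# is an assumption made for this example.
BLAST_TOOLS = ("makeblastdb", "blastn", "blastp", "blastx", "tblastn", "tblastx")


def find_blast_binaries(tools=BLAST_TOOLS):
    """Map each tool name to its full path on PATH, or None if missing."""
    return {tool: shutil.which(tool) for tool in tools}


found = find_blast_binaries()
missing = [tool for tool, path in found.items() if path is None]
if missing:
    print("Not on PATH:", ", ".join(missing))
else:
    print("All listed BLAST+ tools were found on PATH")
```

If any tool is reported missing, the download prompt described above is the expected fallback.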

Citations

BlasTax was developed in the framework of the iTaxoTools project:

Vences M. et al. (2021): iTaxoTools 0.1: Kickstarting a specimen-based software toolkit for taxonomists. - Megataxa 6: 77-92.

Code by Nikita Kulikov, Anja-Kristina Schulz and Stefanos Patmanidis.


BlasTax integrates the BLAST+ suite from NCBI:

Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., and Madden, T.L. 2009. BLAST+: architecture and applications. BMC Bioinformatics, 10, 421.

Cutadapt is included as a Python module to remove adapter sequences from high-throughput sequencing reads:

Martin, M. (2011). Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal, 17(1), 10-12.

Museoscript was rewritten following the original concept of the Linux bash script:

Rancilhac, L., Bruy, T., Scherz, M. D., Pereira, E. A., Preick, M., Straube, N., Lyra, M. L., Ohler, A., Streicher, J. W., Andreone, F., Crottini, A., Hutter, C. R., Randrianantoandro, J. C., Rakotoarison, A., Glaw, F., Hofreiter, M. & Vences, M. (2020). Target-enriched DNA sequencing from historical type material enables a partial revision of the Madagascar giant stream frogs (genus Mantidactylus). Journal of Natural History, 1-32.

MAFFT is a multiple sequence alignment program that was integrated using MAFFTpy:

Katoh, K., Misawa, K., Kuma, K., & Miyata, T. (2002). MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research, 30(14), 3059-3066.

Download files

Source Distribution

itaxotools_blastax-1.0.0.tar.gz (8.6 MB view details)

Uploaded Source

Built Distribution

itaxotools_blastax-1.0.0-py3-none-any.whl (274.2 kB view details)

Uploaded Python 3

File details

Details for the file itaxotools_blastax-1.0.0.tar.gz.

File metadata

  • Download URL: itaxotools_blastax-1.0.0.tar.gz
  • Upload date:
  • Size: 8.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for itaxotools_blastax-1.0.0.tar.gz
Algorithm Hash digest
SHA256 0639d6a89ef685d220c538b6f9c54aaafde4d398be6450df1b6f5ffec99263b1
MD5 77220a7489d489645c54b94d1a964054
BLAKE2b-256 ae67b50a5ef7a028964d7cb376babb4c9fe78dbb4206d7a5a9cc269eb8e2d2a9
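
A downloaded archive can be checked against the SHA256 digest listed above. The sketch below uses Python's standard `hashlib`; the helper names `sha256_of` and `verify` are illustrative, and the local file path assumes the archive was saved to the current directory under its original name.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected_hex):
    """Return True if the file's SHA256 digest matches the expected value."""
    return sha256_of(path) == expected_hex.strip().lower()


# Digest copied from the listing above; the local path is an assumption.
EXPECTED = "0639d6a89ef685d220c538b6f9c54aaafde4d398be6450df1b6f5ffec99263b1"
archive = Path("itaxotools_blastax-1.0.0.tar.gz")
if archive.exists():
    print("SHA256 matches" if verify(archive, EXPECTED) else "SHA256 MISMATCH")
else:
    print("Archive not found; download it first")
```

pip can perform the same check at install time through its hash-checking mode, which may be preferable for automated setups.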

Provenance

The following attestation bundles were made for itaxotools_blastax-1.0.0.tar.gz:

Publisher: deploy.yml on iTaxoTools/BlasTax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file itaxotools_blastax-1.0.0-py3-none-any.whl.

File hashes

Hashes for itaxotools_blastax-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 dba406f8a4737b35033bddb0f3af0cab5a2895a45f333b35bb571c4079bd60e9
MD5 46445ef4320d10ecddbdc7b5c6fb7fd1
BLAKE2b-256 875575e3485f5bea6b165b76baff1fdb30c3717d9ff09d2a2676a2a68d061b92

Provenance

The following attestation bundles were made for itaxotools_blastax-1.0.0-py3-none-any.whl:

Publisher: deploy.yml on iTaxoTools/BlasTax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
