Skip to main content

Run BLAST, parse hits and more

Project description

BlasTax

PyPI - Version PyPI - Python Version GitHub - Tests GitHub - Windows GitHub - macOS

A graphical user interface to run BLAST and parse hits:

  • Make BLAST database: Create a BLAST database from a sequence file
  • Regular BLAST: Find regions of similarity between sequences in a query file and a BLAST database
  • BLAST-Append: Append the aligned part of matching sequences to the original query sequences
  • BLAST-Append-X: Like BLAST-Append, but appends nucleotides c orresponding to the protein database
  • Decontaminate: Remove contaminants from query sequences based on two ingroup and outgroup databases
  • Museoscript: Create sequence files from BLAST matches

The program also includes a variety of tools for processing FASTA files:

  • Fast prepare: Rename FASTA sequence identifiers in preparation for BLAST analysis
  • Fast split: Split large sequences or text files into smaller files
  • Fast merge: Merge multiple sequences or text files into a single large file
  • Group merge: Merge FASTA files by filename
  • Removal of stop codons: Remove stop codons from a dataset
  • Codon trimming: Trim coding sequences to start with first codon position

Some extra tools are also available:

  • SCaFoSpy: Create chimerical sequences for species
  • Protein translator: Generate protein translations for each sequence
  • MAFFT alignment: Multiple sequence alignment using MAFFTpy
  • Codon-aware alignment: Align nucleotide sequences as proteins without altering codons
  • Cutadapt: Remove adapter sequences and quality trimming

Input sequences must be in the FASTA or FASTQ file formats.

Screenshot

Executables

Download and run the standalone executables without installing Python or BLAST+.

Release Windows MacOS

Installation

BlasTax is available on PyPI and can be installed using pip:

pip install itaxotools-blastax

After installation, run the program with:

blastax

If the BLAST+ binaries are not found in your system PATH at runtime, the program will prompt you to automatically download them to your system-specific configuration directory (under iTaxoTools/BlasTax, see platformdirs).

To reset the location where the program looks for BLAST+, run:

blastax --reset

Citations

BlasTax was developed in the framework of the iTaxoTools project:

Vences M. et al. (2021): iTaxoTools 0.1: Kickstarting a specimen-based software toolkit for taxonomists. - Megataxa 6: 77-92.

Code by Nikita Kulikov, Anja-Kristina Schulz and Stefanos Patmanidis.


BlasTax integrates the BLAST+ suite from NCBI:

Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., and Madden, T.L. 2009. BLAST+: architecture and applications. BMC Bioinformatics, 10, 421.

Cutadapt is included as a Python module to remove adapter sequences from high-throughput sequencing reads:

Martin, M. (2011). Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal, 17(1), 10-12.

Museoscript was rewritten following the original concept of the Linux bash script:

Rancilhac, L., Bruy, T., Scherz, M. D., Pereira, E. A., Preick, M., Straube, N., Lyra, M. L., Ohler, A., Streicher, J. W., Andreone, F., Crottini, A., Hutter, C. R., Randrianantoandro,J. C., Rokotoarison, A., Glaw, F., Hofreiter, M. & Vences, M. (2020). Target-enriched DNA sequencing from historical type material enables a partial revision of the Madagascar giant stream frogs (genus Mantidactylus). Journal of Natural History, 1-32.

MAFFT is a multiple sequence alignment program that was integrated using MAFFTpy:

Katoh, K., Misawa, K., Kuma, K., & Miyata, T. (2002). MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research, 30(14), 3059-3066.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

itaxotools_blastax-0.1.1.tar.gz (8.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

itaxotools_blastax-0.1.1-py3-none-any.whl (237.2 kB view details)

Uploaded Python 3

File details

Details for the file itaxotools_blastax-0.1.1.tar.gz.

File metadata

  • Download URL: itaxotools_blastax-0.1.1.tar.gz
  • Upload date:
  • Size: 8.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for itaxotools_blastax-0.1.1.tar.gz
Algorithm Hash digest
SHA256 1a8a8aef5a5d518a6dde1f7f9061542dc28c450619e1124f5b7b949a6339cdcb
MD5 da4f7efc08b2c87199e00e245da13bcf
BLAKE2b-256 cccb2e67108c1f1197ef4c31d714f88bb8025b565d993893f2be615988502815

See more details on using hashes here.

Provenance

The following attestation bundles were made for itaxotools_blastax-0.1.1.tar.gz:

Publisher: deploy.yml on iTaxoTools/BlasTax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file itaxotools_blastax-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for itaxotools_blastax-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c3eecfa351de62eed1ed77205a925194ff830b28b848086f6f4beaf028f76c46
MD5 21d6556b819afe953229cd3e3842d167
BLAKE2b-256 23865bd74e9a4cc5491a53d9052dd1fed2d88f68fb40adc0358bfc24b9c95fab

See more details on using hashes here.

Provenance

The following attestation bundles were made for itaxotools_blastax-0.1.1-py3-none-any.whl:

Publisher: deploy.yml on iTaxoTools/BlasTax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page