Skip to main content

Run BLAST, parse hits and more

Project description

BlasTax

PyPI - Version PyPI - Python Version GitHub - Tests GitHub - Windows GitHub - macOS

A graphical user interface to run BLAST and parse hits:

  • Make BLAST database: Create a BLAST database from a sequence file
  • Regular BLAST: Find regions of similarity between sequences in a query file and a BLAST database
  • BLAST-Append: Append the aligned part of matching sequences to the original query sequences
  • BLAST-Append-X: Like BLAST-Append, but appends nucleotides c orresponding to the protein database
  • Decontaminate: Remove contaminants from query sequences based on two ingroup and outgroup databases
  • Museoscript: Create sequence files from BLAST matches

The program also includes a variety of tools for processing FASTA files:

  • Fast prepare: Rename FASTA sequence identifiers in preparation for BLAST analysis
  • Fast split: Split large sequences or text files into smaller files
  • Fast merge: Merge multiple sequences or text files into a single large file
  • Group merge: Merge FASTA files by filename
  • Removal of stop codons: Remove stop codons from a dataset
  • Codon trimming: Trim coding sequences to start with first codon position

Some extra tools are also available:

  • SCaFoSpy: Create chimerical sequences for species
  • Protein translator: Generate protein translations for each sequence
  • MAFFT alignment: Multiple sequence alignment using MAFFTpy
  • Codon-aware alignment: Align nucleotide sequences as proteins without altering codons
  • Cutadapt: Remove adapter sequences and quality trimming

Input sequences must be in the FASTA or FASTQ file formats.

Screenshot

Executables

Download and run the standalone executables without installing Python or BLAST+.

Release Windows MacOS

Installation

BlasTax is available on PyPI and can be installed using pip:

pip install itaxotools-blastax

After installation, run the program with:

blastax

If the BLAST+ binaries are not found in your system PATH at runtime, the program will prompt you to automatically download them to your system-specific configuration directory (under iTaxoTools/BlasTax, see platformdirs).

To reset the location where the program looks for BLAST+, run:

blastax --reset

Citations

BlasTax was developed in the framework of the iTaxoTools project:

Vences M. et al. (2021): iTaxoTools 0.1: Kickstarting a specimen-based software toolkit for taxonomists. - Megataxa 6: 77-92.

Code by Nikita Kulikov, Anja-Kristina Schulz and Stefanos Patmanidis.


BlasTax integrates the BLAST+ suite from NCBI:

Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., and Madden, T.L. 2009. BLAST+: architecture and applications. BMC Bioinformatics, 10, 421.

Cutadapt is included as a Python module to remove adapter sequences from high-throughput sequencing reads:

Martin, M. (2011). Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal, 17(1), 10-12.

Museoscript was rewritten following the original concept of the Linux bash script:

Rancilhac, L., Bruy, T., Scherz, M. D., Pereira, E. A., Preick, M., Straube, N., Lyra, M. L., Ohler, A., Streicher, J. W., Andreone, F., Crottini, A., Hutter, C. R., Randrianantoandro,J. C., Rokotoarison, A., Glaw, F., Hofreiter, M. & Vences, M. (2020). Target-enriched DNA sequencing from historical type material enables a partial revision of the Madagascar giant stream frogs (genus Mantidactylus). Journal of Natural History, 1-32.

MAFFT is a multiple sequence alignment program that was integrated using MAFFTpy:

Katoh, K., Misawa, K., Kuma, K., & Miyata, T. (2002). MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research, 30(14), 3059-3066.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

itaxotools_blastax-0.1.0.tar.gz (8.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

itaxotools_blastax-0.1.0-py3-none-any.whl (237.2 kB view details)

Uploaded Python 3

File details

Details for the file itaxotools_blastax-0.1.0.tar.gz.

File metadata

  • Download URL: itaxotools_blastax-0.1.0.tar.gz
  • Upload date:
  • Size: 8.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for itaxotools_blastax-0.1.0.tar.gz
Algorithm Hash digest
SHA256 b8fb023b44d2e4c977a6cc1b1d9c7cd93bcc98dab1ec3cb25b97be3ed0a5e5a6
MD5 029f3c5e43672275ef5e2b63fcbb3180
BLAKE2b-256 3af99d6439ec070d4d7e0f77e62a488467bf5665403de64dc5ab271c8bdceebc

See more details on using hashes here.

Provenance

The following attestation bundles were made for itaxotools_blastax-0.1.0.tar.gz:

Publisher: deploy.yml on iTaxoTools/BlasTax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file itaxotools_blastax-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for itaxotools_blastax-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fed542f1bd17917569e261a61b8094c3adabb1dec4a6ddf60e55c21cce16d035
MD5 f927924c5e531306ad7b87de6e948ff7
BLAKE2b-256 7d3325b98bd7041b0c80fb72a712cb98f148ff5895cdfdc5e1fc56b27d0a6acd

See more details on using hashes here.

Provenance

The following attestation bundles were made for itaxotools_blastax-0.1.0-py3-none-any.whl:

Publisher: deploy.yml on iTaxoTools/BlasTax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page