Skip to main content

Python package that generates consensus sequence from the forward and reverse sequences, performs multiple sequence alignment of the fasta sequences, and generates phylogenetic trees using Bayesian and Maximum Likelihood Methods

Project description

SATO - Sequence Analysis Toolkit

This Python application, built with PyQt6 and integrated with the BioPython library, serves as a Sequence Analysis Toolkit (SATO). SATO offers a user-friendly graphical interface with multiple tabs for various sequence analysis tasks. Users can perform tasks such as generating consensus sequences from two input sequences, aligning sequences using Clustal Omega or MAFFT, and conducting phylogenetic analysis using MrBayes or FastTree. The application also provides features for visualizing alignment results and phylogenetic trees, making it a versatile tool for researchers and scientists working with biological sequences.

Purpose of the Package

The package provides a comprehensive and user-friendly solution for biologists and researchers working with biological sequence data. It aims to streamline and simplify various sequence analysis tasks, including generating consensus sequences, conducting sequence alignments, and performing phylogenetic analysis. By offering a graphical user interface (GUI) and integrating with external tools and libraries like BioPython, Clustal Omega, MAFFT, MrBayes, FastTree, Jalview, and FigTree, the package empowers users to efficiently analyze and visualize biological sequence data, making it a valuable resource for molecular biology and bioinformatics research.

Features

SATO is a powerful Sequence Analysis Toolkit that offers a range of features for working with biological sequences. Whether you need to generate consensus sequences, perform sequence alignment, or conduct phylogenetic analysis, SATO has you covered. Below are some of the key features of this application:

Consensus Sequence Generation

  • Users can provide two sequences in FASTA format.
  • The app generates a consensus sequence by finding the best overlapping window that minimizes mismatches while maximizing sequence length.

Sequence Alignment

  • Users can perform sequence alignment using either Clustal Omega or MAFFT.
  • Input sequences are validated for FASTA format.
  • Aligned sequences are displayed in a user-friendly format.

Phylogenetic Analysis

  • Users can conduct phylogenetic analysis using either MrBayes (Bayesian Phylogeny) or FastTree (Maximum Likelihood).
  • Supports both DNA and Protein sequences.
  • The app handles input alignments in FASTA or Nexus format.
  • Generates a phylogenetic tree and visualizes it using FigTree.

User-Friendly Interface

  • The app offers a tabbed interface for easy navigation between different analysis functions.

Installation Instructions

1. Python Environment - Requires Python 3 environment and so ensure you have installed it on your computer.

pip install SATO

2. SATO uses the following programs:

However, note that Clustal Omega, MAFFT, MrBayes, FastTree, SeaView and FigTree are bundled with the SATO. If you wish to use jalview, please install it on your system

Usage

After installation, open the terminal (linux or macOS) or command prompt (windows) and type sato, then press Enter to launch SATO's GUI

Standalone

There is also a standalone excutable at SATO v0.1.1
After extracting, double-click on the excutable to launch the GUI

Acknowledgment

  1. Huelsenbeck, J. P., & Ronquist, F. (2001). MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics, 17(8), 754-755.
  2. Price, M. N., Dehal, P. S., & Arkin, A. P. (2009). FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Molecular biology and evolution, 26(7), 1641-1650.
  3. Sievers, F., Wilm, A., Dineen, D., Gibson, T. J., Karplus, K., Li, W., Lopez, R., McWilliam, H., Remmert, M., Söding, J., Thompson, J. D., & Higgins, D. G. (2011). Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Molecular systems biology, 7, 539. https://doi.org/10.1038/msb.2011.75
  4. Katoh, K., Misawa, K., Kuma, K., & Miyata, T. (2002). MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic acids research, 30(14), 3059–3066. https://doi.org/10.1093/nar/gkf436
  5. Rambaut, A. (2009). FigTree. Tree figure drawing tool. http://tree. bio. ed. ac. uk/software/figtree/.
  6. Waterhouse, A., Procter, J., Martin, D.A. and Barton, G.J., 2005. Jalview: visualization and analysis of molecular sequences, alignments, and structures. BMC Bioinformatics, 6(3), pp.1-1.
  7. Gouy, M., Guindon, S., & Gascuel, O. (2010). SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Molecular biology and evolution, 27(2), 221-224.

Contribution

Should you notice a bug, please let us know through issues in the, GitHub Issue Tracker

Author

Dr. Clabe Wekesa

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

SATO-0.1.1.tar.gz (1.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

SATO-0.1.1-py3-none-any.whl (1.7 MB view details)

Uploaded Python 3

File details

Details for the file SATO-0.1.1.tar.gz.

File metadata

  • Download URL: SATO-0.1.1.tar.gz
  • Upload date:
  • Size: 1.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for SATO-0.1.1.tar.gz
Algorithm Hash digest
SHA256 3fe0d55567c46d3ed5da8132749238822ed00a0b0a87e92f4861e9388f96a2ce
MD5 445671f4c4b0a8720200e095e5d20129
BLAKE2b-256 a2e7f52571e6d69f815b69c7a3d63df81436fdc40599328a6e331bba437bbc87

See more details on using hashes here.

File details

Details for the file SATO-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: SATO-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 1.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for SATO-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 fdf605d66fd049c6e896afa0ae3cb444a78476f0ca937112196a755638a2baf2
MD5 f0798da610ac18fbb680154ab3e0ca28
BLAKE2b-256 bc1e1c9100601fb921c3c6073bb32e4ee9e73f63e10b92a76a72b06f5c740ba5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page