Skip to main content

Python package that generates consensus sequence from the forward and reverse sequences, performs multiple sequence alignment of the fasta sequences, and generates phylogenetic trees using Bayesian and Maximum Likelihood Methods

Project description

SATO - Sequence Analysis Toolkit

This Python application, built with PyQt6 and integrated with the BioPython library, serves as a Sequence Analysis Toolkit (SATO). SATO offers a user-friendly graphical interface with multiple tabs for various sequence analysis tasks. Users can perform tasks such as generating consensus sequences from two input sequences, aligning sequences using Clustal Omega or MAFFT, and conducting phylogenetic analysis using MrBayes or FastTree. The application also provides features for visualizing alignment results and phylogenetic trees, making it a versatile tool for researchers and scientists working with biological sequences.

Logo

Purpose of the Package

The package provides a comprehensive and user-friendly solution for biologists and researchers working with biological sequence data. It aims to streamline and simplify various sequence analysis tasks, including generating consensus sequences, conducting sequence alignments, and performing phylogenetic analysis. By offering a graphical user interface (GUI) and integrating with external tools and libraries like BioPython, Clustal Omega, MAFFT, MrBayes, FastTree, Jalview, and FigTree, the package empowers users to efficiently analyze and visualize biological sequence data, making it a valuable resource for molecular biology and bioinformatics research.

Features

SATO is a powerful Sequence Analysis Toolkit that offers a range of features for working with biological sequences. Whether you need to generate consensus sequences, perform sequence alignment, or conduct phylogenetic analysis, SATO has you covered. Below are some of the key features of this application:

Consensus Sequence Generation

  • Users can provide two sequences in FASTA format.
  • The app generates a consensus sequence by finding the best overlapping window that minimizes mismatches while maximizing sequence length.

Sequence Alignment

  • Users can perform sequence alignment using either Clustal Omega or MAFFT.
  • Input sequences are validated for FASTA format.
  • Aligned sequences are displayed in a user-friendly format.

Phylogenetic Analysis

  • Users can conduct phylogenetic analysis using either MrBayes (Bayesian Phylogeny) or FastTree (Maximum Likelihood).
  • Supports both DNA and Protein sequences.
  • The app handles input alignments in FASTA or Nexus format.
  • Generates a phylogenetic tree and visualizes it using FigTree.

User-Friendly Interface

  • The app offers a tabbed interface for easy navigation between different analysis functions.

Installation Instructions

1. Python Environment - Requires Python 3 environment and so ensure you have installed it on your computer.

pip install SATO

2. SATO uses the following programs:

Ensure that they are all installed on your computer

Usage

After installation, open the terminal (linux or macOS) or command prompt (windows) and type sato, then press Enter to launch SATO's GUI

Standalone

There is also a standalone executable at SATO v0.1.1
After extracting, double-click on the executable to launch the GUI

Acknowledgment

  1. Huelsenbeck, J. P., & Ronquist, F. (2001). MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics, 17(8), 754-755.
  2. Price, M. N., Dehal, P. S., & Arkin, A. P. (2009). FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Molecular biology and evolution, 26(7), 1641-1650.
  3. Sievers, F., Wilm, A., Dineen, D., Gibson, T. J., Karplus, K., Li, W., Lopez, R., McWilliam, H., Remmert, M., Söding, J., Thompson, J. D., & Higgins, D. G. (2011). Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Molecular systems biology, 7, 539. https://doi.org/10.1038/msb.2011.75
  4. Katoh, K., Misawa, K., Kuma, K., & Miyata, T. (2002). MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic acids research, 30(14), 3059–3066. https://doi.org/10.1093/nar/gkf436
  5. Rambaut, A. (2009). FigTree. Tree figure drawing tool. http://tree. bio. ed. ac. uk/software/figtree/.
  6. Waterhouse, A., Procter, J., Martin, D.A. and Barton, G.J., 2005. Jalview: visualization and analysis of molecular sequences, alignments, and structures. BMC Bioinformatics, 6(3), pp.1-1.
  7. Gouy, M., Guindon, S., & Gascuel, O. (2010). SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Molecular biology and evolution, 27(2), 221-224.

Contribution

Should you notice a bug, please let us know through issues in the, GitHub Issue Tracker

Author

Dr. Clabe Wekesa, Dr. Patrick Okoth and Prof. John Muoma

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

SATO-0.1.7.tar.gz (253.2 kB view hashes)

Uploaded Source

Built Distribution

SATO-0.1.7-py3-none-any.whl (249.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page