Skip to main content

BioSuite Ultra - Comprehensive open-source bioinformatics platform with 48 analysis modules

Project description

BioSuite Ultra

Python License Tests Modules Lines

The most comprehensive open-source bioinformatics platform.

BioSuite Ultra is a full-stack bioinformatics platform with 48 analysis modules, 36+ visualization types, a cyberpunk GUI, and a 99+ option CLI — all in pure Python. No external binaries required.


Features

48 Analysis Modules

Domain Modules Coverage
Sequence Analysis FASTA/FASTQ I/O, GC%, translation, reverse complement, ORF finder, primer design, restriction enzymes, codon usage 80%
Alignment Needleman-Wunsch, Smith-Waterman, BLAST (k-mer), MSA (progressive + Clustal) 70%
Phylogenetics p-distance, UPGMA, NJ, ML (RAxML), Bayesian (MrBayes) 85%
Transcriptomics CPM/TPM normalization, differential expression, GO/KEGG enrichment 60%
NGS/Genomics BAM/VCF parsing, read alignment (BWA/Bowtie2), variant calling, SV/CNV detection 65%
Single-Cell Scanpy-based scRNA-seq pipeline 80%
Proteins PDB analysis, ESMFold structure prediction, molecular docking 50%
Epigenomics Bisulfite methylation, DMR detection 40%
Metagenomics K-mer classifier, 16S rRNA pipeline, alpha/beta diversity 65%
Metabolomics Peak detection, ANOVA, feature alignment 50%
Population Genetics HWE, FST, Tajima's D, LD, PCA 70%
CRISPR Guide RNA design, PAM finding, off-target scoring 70%
Metabolism Flux balance analysis (FBA), knockout simulation 55%
Machine Learning Random Forest, SVM, SHAP, cross-validation 50%
Workflow Pipeline builder, batch processor, HTML report generator 80%
GO/Pathways GO browser, pathway visualization (KEGG-style maps) 60%
GWAS Chi-squared test, Manhattan/QQ plots, lead SNP detection 70%
Epitope Prediction T-cell (MHC binding), B-cell (surface propensity), linear epitopes 70%

36+ Visualization Types

Volcano, PCA, Manhattan, MA, Venn, Barplot, Boxplot, Heatmap, Scatter, Time Series, QQ-plot, Clustered Heatmap, Circos, Alignment Viewer, Violin, Raincloud, Ridge, Dot Plot, GSEA, Motif Logo, Sankey, UMAP, Network (PPI/Regulatory/Metabolic), UpSet, Genome Browser, Interactive (Plotly), Sequence Logo, Conservation, Synteny Dotplot, and more.

Dual-Mode Architecture

Every module follows a consistent pattern:

def analyze(input, ...):
    # Try external tool first (fast)
    if _has_external_tool():
        return _run_external(input, ...)
    # Fall back to pure Python (always works)
    return _run_builtin(input, ...)

Cyberpunk GUI

  • 29 analysis tabs with scrollable sidebar
  • 3 themes: Dark-Green-Cyber, Dark-Purple-Cyber, Light-Blue-Cyber
  • Keyboard shortcuts (Ctrl+S, Ctrl+Q, F1, F5, Escape)
  • Progress bars for long operations
  • Plot history (last 10 plots)
  • API key configuration panel
  • 15 built-in help guides

CLI with 99+ Options

Professional CLI menu with organized sections for every analysis type.


Installation

# Clone the repository
git clone https://github.com/yourusername/BioSuite-Better.git
cd BioSuite-Better

# Install dependencies
pip install -r requirements.txt

# Or install individually
pip install numpy pandas matplotlib seaborn scipy scikit-learn
pip install biopython customtkinter tqdm goatools gseapy
pip install scanpy anndata pysam scikit-bio biotite networkx
pip install plotly ete3 cobra shap statsmodels

Quick Start

CLI Mode

python run.py

GUI Mode

python run.py --gui

Programmatic API

from bioplatter.core.sequence import gc_content, reverse_complement, translate
from bioplatter.core.alignment import needleman_wunsch, smith_waterman
from bioplatter.core.phylogeny import distance_matrix, upgma_tree
from bioplatter.core.workflow.pipeline import Pipeline
from bioplatter.plotting.upset_plots import plot_upset

# Quick analysis
gc = gc_content("ATCGATCG")  # 50.0
rc = reverse_complement("ATCG")  # "CGAT"
protein = translate("ATGAAATTTTAA")  # "MKF"

# Pipeline
p = Pipeline("my_analysis")
p.add_step("gc", gc_content, args=("ATCGATCG",))
p.add_step("revcomp", reverse_complement, args=("ATCGATCG",))
p.run()
print(p.results)

Project Structure

BioSuite-Better/
├── run.py                    # Entry point
├── bioplatter/
│   ├── core/                 # 48 analysis modules
│   │   ├── sequence.py       # Sequence I/O & analysis
│   │   ├── alignment.py      # Pairwise alignment (vectorized)
│   │   ├── blast.py          # Sequence search
│   │   ├── msa.py            # Multiple sequence alignment
│   │   ├── phylogeny.py      # Distance-based trees
│   │   ├── ml_phylogeny.py   # ML trees (NJ + RAxML)
│   │   ├── bayesian_phylogeny.py  # Bayesian trees
│   │   ├── expression.py     # Differential expression
│   │   ├── enrichment.py     # GO/KEGG enrichment
│   │   ├── single_cell.py    # scRNA-seq (scanpy)
│   │   ├── ngs.py            # BAM/VCF utilities
│   │   ├── read_aligner.py   # Read mapping
│   │   ├── variant_calling.py # Variant detection + SV/CNV
│   │   ├── peak_calling.py   # ChIP-seq peaks
│   │   ├── assembly.py       # Genome assembly
│   │   ├── metagenomics.py   # Taxonomic classification + 16S
│   │   ├── trimming.py       # Read QC
│   │   ├── quantification.py # RNA-seq quantification
│   │   ├── structure.py      # PDB analysis
│   │   ├── structure_prediction.py  # Protein structure
│   │   ├── docking.py        # Molecular docking
│   │   ├── crispr.py         # Guide RNA design
│   │   ├── metabolism.py     # Flux balance analysis
│   │   ├── popgen.py         # Population genetics
│   │   ├── epigenomics.py    # Methylation analysis
│   │   ├── metabolomics.py   # Mass spec analysis
│   │   ├── md_simulation.py  # Molecular dynamics
│   │   ├── bio_ml.py         # Machine learning
│   │   ├── orf_finder.py     # ORF, restriction enzymes, primers
│   │   ├── codon_usage.py    # Codon tables, k-mer, complexity
│   │   ├── survival.py       # Kaplan-Meier, Cox PH
│   │   ├── file_formats.py   # BED/GFF/Newick/Stockholm/BigWig
│   │   ├── databases.py      # NCBI/UniProt/PDB/KEGG/Ensembl
│   │   ├── go_browser.py     # Gene Ontology browser
│   │   ├── pathway_viz.py    # Pathway visualization
│   │   ├── gwas.py           # GWAS analysis
│   │   ├── epitope.py        # Epitope prediction
│   │   └── workflow/         # Pipeline, batch, report
│   ├── plotting/             # 36+ visualization types
│   ├── gui/                  # Cyberpunk GUI (29 tabs)
│   ├── cli/                  # CLI with 99+ options
│   └── tests/                # 271 automated tests
├── examples/                 # Jupyter notebooks
└── requirements.txt

Testing

# Run all 271 tests
python -m pytest tests/ -v

# Run specific test file
python -m pytest tests/test_phase4.py -v

# Run with short traceback
python -m pytest tests/ -v --tb=short

Requirements

All pip-installable:

numpy, pandas, matplotlib, seaborn, scipy, scikit-learn
biopython, customtkinter, tqdm, goatools, gseapy
scanpy, anndata, pysam, scikit-bio, biotite, networkx
plotly, ete3, cobra, shap, statsmodels

Platform

  • OS: Windows, macOS, Linux
  • Python: 3.10+
  • GPU: Not required (CPU-only)
  • RAM: 4GB minimum, 8GB recommended
  • External tools: Optional (BLAST+, Clustal Omega, etc. provide speed boosts)

Author

Sahand Touri Molecular Cell Biology Student, Urmia IAU, Iran

Built as a comprehensive bioinformatics portfolio project demonstrating:

  • Full-stack software engineering (48 modules, 21,000+ lines, 271 tests)
  • Domain expertise across 20+ bioinformatics areas
  • Dual-mode architecture (pure Python + optional external tools)
  • Professional GUI and CLI design
  • Automated testing and quality assurance

License

MIT License - Free for academic and commercial use.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

biosuitestkod-3.0.4.tar.gz (192.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

biosuitestkod-3.0.4-py3-none-any.whl (199.1 kB view details)

Uploaded Python 3

File details

Details for the file biosuitestkod-3.0.4.tar.gz.

File metadata

  • Download URL: biosuitestkod-3.0.4.tar.gz
  • Upload date:
  • Size: 192.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for biosuitestkod-3.0.4.tar.gz
Algorithm Hash digest
SHA256 3cc0a4edeb14a90289911a77447e461de6ddb68c910e2d24e00a18a4c25900b4
MD5 148f5d062be56af816f8dc1465eecae8
BLAKE2b-256 a562b6a1f8376b72ac85b46756e7abc50027aec32c6627cdc5d8af1b553bdbe8

See more details on using hashes here.

File details

Details for the file biosuitestkod-3.0.4-py3-none-any.whl.

File metadata

  • Download URL: biosuitestkod-3.0.4-py3-none-any.whl
  • Upload date:
  • Size: 199.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for biosuitestkod-3.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 c4e52976ea7dae8a96e9425ff4e088af97ecbbda2d3a83df4ad88f49b1ea570f
MD5 4b3dcb1768d77c890dd65fc15548d51c
BLAKE2b-256 9fbebbe944b4bfb75049ed2a97c9aac0db53a4591f03c96ca5518c5aa7454acb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page