Skip to main content

Privacy-first command-line tool for biotech devs and researchers to analyse sequence data.

Project description

🧬 bioai-seq

bioai-seq is a lightweight command-line tool for basic biological sequence analysis. It’s part of my journey toward becoming a Bio AI Software Engineer — combining software engineering, biology, and machine learning.


💻 Local Development & Testing

1. Create and activate a virtual environment

python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

2. Install required build tools

pip install --upgrade pip setuptools build

3. Run the CLI locally

Run the CLI directly:

python3 -m bioai_seq.cli

Or install locally and use as a command:

pip install .
bioseq

🚀 Deploying to PyPI (Production)

1. Clean previous builds

rm -rf dist build *.egg-info

2. Build the package

python3 -m build

3. Upload to PyPI

pip install --upgrade twine
twine upload dist/*

📦 Installation (User Guide)

pip install bioai-seq

Then run:

bioseq

🧪 Planned Example Output

✅ Sequence loaded: 1273 amino acids
🧬 Detected: SARS-CoV-2 spike protein (likely variant: Omicron)
🔍 Running ESM-2 embeddings...
🧪 Predicted secondary structure: 40% alpha-helix, 25% beta-sheet
🧬 Mutation sites detected vs reference: 15
📚 Similar sequences:
 - UniProt P0DTC2 (99.7%)
 - UniProt A0A6H2L9T9 (98.9%)
🧠 Summary:
"This sequence appears to be a mutated spike protein, likely from a recent SARS-CoV-2 variant. Multiple substitutions are present in the RBD region."

🌐 Follow the Journey

This project is part of a broader initiative to define and grow the Bio AI Software Engineer role.


🛠️ License

MIT — free to use, share, and improve.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bioai_seq-0.0.1.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bioai_seq-0.0.1-py3-none-any.whl (7.3 kB view details)

Uploaded Python 3

File details

Details for the file bioai_seq-0.0.1.tar.gz.

File metadata

  • Download URL: bioai_seq-0.0.1.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for bioai_seq-0.0.1.tar.gz
Algorithm Hash digest
SHA256 21863acb3c6e1d514eca95051c035f51d478c5d1de7e9a14ffa99e17e110faee
MD5 970ba24deaf64a4890e8e0d3fd3dd132
BLAKE2b-256 82f6a3d645a3e5be920342c009c74765bb0a786d1789ce5a15dff92ffe985c8d

See more details on using hashes here.

File details

Details for the file bioai_seq-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: bioai_seq-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 7.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for bioai_seq-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 486cb851df3bdd180d3ba552592c4a3467a2fb4ca2371412e0af75d138789336
MD5 2f402fa61d66042ed0252e92bca4a490
BLAKE2b-256 a8885efe985380426d865520a6fb8bca528411a53b8f25f69bfe5be8d7435809

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page