Privacy-first command-line tool for biotech devs and researchers to analyse sequence data.

Project description

🧬 bioai-seq

bioai-seq is a lightweight command-line tool for basic biological sequence analysis. It’s part of my journey toward becoming a Bio AI Software Engineer — combining software engineering, biology, and machine learning.


💻 Local Development & Testing

1. Create and activate a virtual environment

python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

2. Install required build tools

pip install --upgrade pip setuptools build

3. Run the CLI locally

Run the module directly from the source tree:

python3 -m bioai_seq.cli

Or install the package locally and run the installed command:

pip install .
bioseq
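
For readers wondering how the two invocations above relate, here is a minimal sketch of what a module such as bioai_seq/cli.py could look like. It is not the project's actual code: the `main` function, the `build_parser` helper, and the positional `sequence` argument are all hypothetical. In general, a console command such as `bioseq` is exposed by pointing a `[project.scripts]` entry in pyproject.toml at a function like `main`; whether this project does so is an assumption.

```python
"""Hypothetical sketch of a module shaped like bioai_seq/cli.py (not the project's code)."""
import argparse


def build_parser() -> argparse.ArgumentParser:
    # "bioseq" is the command name used in this README; the argument is an assumption.
    parser = argparse.ArgumentParser(
        prog="bioseq",
        description="Basic biological sequence analysis.",
    )
    parser.add_argument(
        "sequence",
        nargs="?",
        help="raw sequence string (hypothetical argument)",
    )
    return parser


def main(argv=None) -> int:
    """Parse arguments, report the sequence length, and return an exit code."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if args.sequence is None:
        # No input given: show usage instead of failing.
        parser.print_help()
        return 0
    # Drop whitespace and normalise case before counting residues.
    cleaned = "".join(args.sequence.split()).upper()
    print(f"Sequence loaded: {len(cleaned)} residues")
    return 0


# A real module would usually end with the guard below, which is what lets
# "python3 -m bioai_seq.cli" run it:
#
#     if __name__ == "__main__":
#         raise SystemExit(main())
```

Keeping the logic in a function that accepts `argv` means the same code path serves the module invocation, the installed command, and unit tests.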

🚀 Deploying to PyPI (Production)

1. Clean previous builds

rm -rf dist build *.egg-info

2. Build the package

python3 -m build

3. Upload to PyPI

pip install --upgrade twine
twine upload dist/*

📦 Installation (User Guide)

pip install bioai-seq

Then run:

bioseq

🧪 Planned Example Output

✅ Sequence loaded: 1273 amino acids
🧬 Detected: SARS-CoV-2 spike protein (likely variant: Omicron)
🔍 Running ESM-2 embeddings...
🧪 Predicted secondary structure: 40% alpha-helix, 25% beta-sheet
🧬 Mutation sites detected vs reference: 15
📚 Similar sequences:
 - UniProt P0DTC2 (99.7%)
 - UniProt A0A6H2L9T9 (98.9%)
🧠 Summary:
"This sequence appears to be a mutated spike protein, likely from a recent SARS-CoV-2 variant. Multiple substitutions are present in the RBD region."
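
The simplest numbers in the planned output (sequence length, mutation count against a reference, and percent identity) can be illustrated with plain Python. The sketch below is only an illustration under strong assumptions: the function names are hypothetical, the toy sequences are made up, and both sequences are assumed to be already aligned and of equal length. A real comparison would need a proper alignment step, and the ESM-2, secondary-structure, and UniProt lookups are not covered here.

```python
from typing import List, Tuple


def parse_fasta(text: str) -> str:
    """Return the sequence of the first FASTA record, uppercased, without its header."""
    residues = []
    seen_header = False
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if seen_header:
                break  # stop at the start of a second record
            seen_header = True
            continue
        residues.append(line.upper())
    return "".join(residues)


def substitutions(query: str, reference: str) -> List[Tuple[int, str, str]]:
    """List (1-based position, reference residue, query residue) for every mismatch.

    Assumes the two sequences are pre-aligned and have the same length.
    """
    if len(query) != len(reference):
        raise ValueError("sequences must be pre-aligned to the same length")
    return [
        (pos, ref, qry)
        for pos, (ref, qry) in enumerate(zip(reference, query), start=1)
        if ref != qry
    ]


def percent_identity(query: str, reference: str) -> float:
    """Share of positions where query and reference carry the same residue."""
    if not reference:
        raise ValueError("reference sequence is empty")
    mismatches = len(substitutions(query, reference))
    return 100.0 * (len(reference) - mismatches) / len(reference)


# Toy data for illustration only; these are not real spike protein sequences.
reference = parse_fasta(">toy_reference\nMFVFLVLLPL\nVSSQCVNLTT\n")
query = parse_fasta(">toy_query\nMFVFIVLLPL\nVSSQCVNFTT\n")

print(f"Sequence loaded: {len(query)} amino acids")
print(f"Mutation sites detected vs reference: {len(substitutions(query, reference))}")
print(f"Identity to reference: {percent_identity(query, reference):.1f}%")
```

Position-by-position comparison is deliberately naive: a single insertion or deletion would shift every downstream residue and inflate the mutation count, which is why the equal-length check raises instead of guessing.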

🌐 Follow the Journey

This project is part of a broader initiative to define and grow the Bio AI Software Engineer role.


🛠️ License

MIT — free to use, share, and improve.

Download files

Source Distribution

bioai_seq-0.0.2.tar.gz (6.8 kB)

Uploaded Source

Built Distribution

bioai_seq-0.0.2-py3-none-any.whl (7.3 kB)

Uploaded Python 3

File details

Details for the file bioai_seq-0.0.2.tar.gz.

File metadata

  • Download URL: bioai_seq-0.0.2.tar.gz
  • Upload date:
  • Size: 6.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for bioai_seq-0.0.2.tar.gz
  • SHA256: 6cd7fdcc2a41df0985954063ccd37c06d8b58b96b328d8feafb623204e8dd235
  • MD5: e3f679ae0c66f126e2aad5ebf5d765e4
  • BLAKE2b-256: 328d7458b3a5fed8d4fd4ad1f59582fd4a17f79890bd2babdb2983613b5249c6

File details

Details for the file bioai_seq-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: bioai_seq-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 7.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for bioai_seq-0.0.2-py3-none-any.whl
  • SHA256: b51f6c72530ddeb429d7d07540cc7310378edeb74eb3028d7cde6d7659d6a955
  • MD5: ca758139152f0de6283e906a35c7e3f6
  • BLAKE2b-256: 1952f6b860575c041c4caee018ed0413ef9b018b9cf066596e560e7f4a711461
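
The published digests can be used to check a manually downloaded file before installing it. The following is a minimal sketch using only the standard library; the helper names `sha256_of` and `matches_published` are hypothetical, and the file is assumed to keep its original name after download.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file hashes listed above.
PUBLISHED_SHA256 = {
    "bioai_seq-0.0.2.tar.gz":
        "6cd7fdcc2a41df0985954063ccd37c06d8b58b96b328d8feafb623204e8dd235",
    "bioai_seq-0.0.2-py3-none-any.whl":
        "b51f6c72530ddeb429d7d07540cc7310378edeb74eb3028d7cde6d7659d6a955",
}


def sha256_of(path, chunk_size: int = 65536) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path) -> bool:
    """True only if the file name is known and its digest equals the published one."""
    expected = PUBLISHED_SHA256.get(Path(path).name)
    return expected is not None and sha256_of(path) == expected
```

For example, `matches_published("bioai_seq-0.0.2.tar.gz")` should return True for an intact download of the source distribution and False for a corrupted or altered file.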
