Privacy-first command-line tool for biotech devs and researchers to analyse sequence data.
Project description
🧬 bioai-seq
bioai-seq is a lightweight command-line tool for basic biological sequence analysis. It’s part of my journey toward becoming a Bio AI Software Engineer — combining software engineering, biology, and machine learning.
💻 Local Development & Testing
1. Create and activate a virtual environment
python3 -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
2. Install required build tools
pip install --upgrade pip setuptools build
3. Run the CLI locally
Run the CLI directly:
python3 -m bioai_seq.cli
Or install locally and use as a command:
pip install .
bioseq
🚀 Deploying to PyPI (Production)
1. Clean previous builds
rm -rf dist build *.egg-info
2. Build the package
python3 -m build
3. Upload to PyPI
pip install --upgrade twine
twine upload dist/*
- Username:
__token__ - Password: your API token from https://pypi.org/manage/account/token/
📦 Installation (User Guide)
pip install bioai-seq
Then run:
bioseq
🧪 Planned Example Output
✅ Sequence loaded: 1273 amino acids
🧬 Detected: SARS-CoV-2 spike protein (likely variant: Omicron)
🔍 Running ESM-2 embeddings...
🧪 Predicted secondary structure: 40% alpha-helix, 25% beta-sheet
🧬 Mutation sites detected vs reference: 15
📚 Similar sequences:
- UniProt P0DTC2 (99.7%)
- UniProt A0A6H2L9T9 (98.9%)
🧠 Summary:
"This sequence appears to be a mutated spike protein, likely from a recent SARS-CoV-2 variant. Multiple substitutions are present in the RBD region."
🌐 Follow the Journey
This project is part of a broader initiative to define and grow the Bio AI Software Engineer role.
- 🌍 Blog: https://bioaisoftware.engineer
- 🧑💻 GitHub: https://github.com/babilonczyk
- 💼 LinkedIn: https://www.linkedin.com/in/jan-piotrzkowski/
🛠️ License
MIT — free to use, share, and improve.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bioai_seq-0.0.1.tar.gz.
File metadata
- Download URL: bioai_seq-0.0.1.tar.gz
- Upload date:
- Size: 6.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
21863acb3c6e1d514eca95051c035f51d478c5d1de7e9a14ffa99e17e110faee
|
|
| MD5 |
970ba24deaf64a4890e8e0d3fd3dd132
|
|
| BLAKE2b-256 |
82f6a3d645a3e5be920342c009c74765bb0a786d1789ce5a15dff92ffe985c8d
|
File details
Details for the file bioai_seq-0.0.1-py3-none-any.whl.
File metadata
- Download URL: bioai_seq-0.0.1-py3-none-any.whl
- Upload date:
- Size: 7.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
486cb851df3bdd180d3ba552592c4a3467a2fb4ca2371412e0af75d138789336
|
|
| MD5 |
2f402fa61d66042ed0252e92bca4a490
|
|
| BLAKE2b-256 |
a8885efe985380426d865520a6fb8bca528411a53b8f25f69bfe5be8d7435809
|