Privacy-first command-line tool for biotech devs and researchers to analyse sequence data.
Project description
bioai-seq
bioai-seq is a lightweight command-line tool for basic biological sequence analysis. It’s part of my journey toward becoming a Bio AI Software Engineer - combining software engineering, biology, and machine learning.
How to install
1. Create and activate a virtual environment
python3 -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
2. Install bioai-seq
pip install --upgrade bioai-seq
bioseq
Deploying to PyPI (Production)
1. Clean previous builds
rm -rf dist build *.egg-info
2. Build the package
python3 -m build
3. Upload to PyPI
pip install --upgrade twine
twine upload dist/*
- Username:
__token__ - Password: your API token from https://pypi.org/manage/account/token/
Installation (User Guide)
pip install bioai-seq
Then run:
bioseq
🧪 Planned Example Output
✅ Sequence loaded: 1273 amino acids
🧬 Detected: SARS-CoV-2 spike protein (likely variant: Omicron)
🔍 Running ESM-2 embeddings...
🧪 Predicted secondary structure: 40% alpha-helix, 25% beta-sheet
🧬 Mutation sites detected vs reference: 15
📚 Similar sequences:
- UniProt P0DTC2 (99.7%)
- UniProt A0A6H2L9T9 (98.9%)
🧠 Summary:
"This sequence appears to be a mutated spike protein, likely from a recent SARS-CoV-2 variant. Multiple substitutions are present in the RBD region."
Follow the Journey
- 🌍 Blog: https://bioaisoftware.engineer
- 🧑💻 GitHub: https://github.com/babilonczyk
- 💼 LinkedIn: https://www.linkedin.com/in/jan-piotrzkowski/
License
Apache 2.0 - free to use, and improve.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bioai_seq-0.0.3.tar.gz.
File metadata
- Download URL: bioai_seq-0.0.3.tar.gz
- Upload date:
- Size: 6.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
937041d9dd2eaf2392399d098fa305b9c7966429b6c7a2979ea47c35c169dbd2
|
|
| MD5 |
6167eab72a8c5955b955262779881fb5
|
|
| BLAKE2b-256 |
c7efe8413c75de81838d13a890675216aca5af78c522743a45c8370215741797
|
File details
Details for the file bioai_seq-0.0.3-py3-none-any.whl.
File metadata
- Download URL: bioai_seq-0.0.3-py3-none-any.whl
- Upload date:
- Size: 7.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5c75f0ce67f4c0aded44cf8ad5f078ddfc29ee8413503b9864ace90cf7e6bb65
|
|
| MD5 |
e14ddc7a4ead05fa2d88a11a6ebae01e
|
|
| BLAKE2b-256 |
425492a687effb88f045941df18bfbdfaa58d17ef75084287d0c6eedb629a576
|