A comprehensive package of biological constants, serving as a foundational resource for biology and bioinformatics, complemented by functions to streamline related tasks.
Project description
Biobase
A Python package providing standardized biological constants and scoring matrices for bioinformatics pipelines. Biobase aims to eliminate the need to repeatedly recreate common biological data structures and scoring systems in your code.
Table of Contents
- Quick Start
- Requirements
- Installation
- Running Files
- Data Files
- Project Goals
- Contributing
- Project Status
- License
Quick Start
Access amino acid properties:
from biobase.constants import ONE_LETTER_CODES, MONO_MASS
print(ONE_LETTER_CODES) # 'ACDEFGHIKLMNPQRSTVWY'
print(MONO_MASS['A']) # 71.037113805
Use scoring matrices:
from biobase.matrix import Blosum
blosum62 = Blosum(62)
print(blosum62['A']['A']) # 4
print(blosum62['W']['C']) # -2
Analyze DNA sequences:
from biobase.analysis import Dna
sequence = "ATCGTAGC"
print(Dna.transcribe(sequence)) # 'AUCGUAGC'
print(Dna.complement_dna(sequence)) # 'GCTACGAT'
print(Dna.calculate_gc_content(sequence)) # 50.0
Find protein motifs:
from biobase.analysis import find_motifs
sequence = "ACDEFGHIKLMNPQRSTVWY"
print(find_motifs(sequence, "DEF")) # [3]
Requirements
- Python 3.10+
- pip (for installation)
Installation
Regular Installation
pip install biobase
Development Installation
Clone the repository and install in editable mode:
git clone https://github.com/lignum-vitae/biobase.git
cd biobase
pip install -e .
Running Files
To ensure relative imports work correctly, always run files using the module path from the project root:
Run a specific file
python -m src.biobase.matrix
Data Files
src/biobase/matrices/: Scoring matrix data stored in JSON file format
Project Goals
Biobase aims to provide Python-friendly versions of common biological constants and tools for bioinformatics pipelines. Key objectives:
- Standardize biological data structures
- Provide efficient implementations of common scoring systems
- Ensure type safety and validation
- Maintain comprehensive documentation
- Support modern Python practices
Contributing
We welcome contributions! Please read our:
Project Status
Current Version: 0.4.1-alpha
Core Features
- ✅ BLOSUM and PAM matrix implementations
- ✅ Basic amino acid constants and conversions
- ✅ DNA/RNA sequence analysis tools
- ✅ Protein motif searching
- ✅ Core biological constants
- ✅ Additional scoring matrices
- ✅ Extended amino acid properties
Analysis Tools
- ✅ GC content calculation
- ✅ DNA/RNA transcription
- ✅ DNA complementation
- ✅ Motif finding
- 🚧 File format parsers (FASTA, GenBank, etc.)
- 📋 Statistical analysis tools
Documentation
- ✅ Basic README
- ✅ Code of Conduct
- ✅ Contributing Guidelines
- ✅ Usage Examples
Development
- 🚧 PyPI package deployment
- 🚧 CI/CD Pipeline
- 🚧 Code Coverage
- 📋 Automated Releases
Legend
- ✅ Complete
- 🚧 In Progress
- 📋 Planned
Stability
This project is in the alpha stage. APIs may change without warning until version 1.0.0.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file biobase-0.4.7.tar.gz.
File metadata
- Download URL: biobase-0.4.7.tar.gz
- Upload date:
- Size: 24.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.23
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8174848aba988905a45cc872d2f133725660dbe5b087995c922f881e30e57d7a
|
|
| MD5 |
b22af804401422de2a8f953b77f025c6
|
|
| BLAKE2b-256 |
1a62b32cf8d158bb15349c14a5ff101892ccd3357e71ebb36fbcf07aec7815fd
|
File details
Details for the file biobase-0.4.7-py3-none-any.whl.
File metadata
- Download URL: biobase-0.4.7-py3-none-any.whl
- Upload date:
- Size: 3.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.23
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e28de6f22c99311672c9f27456565da7fe5b8de7ca53998bf1670c6f2588661a
|
|
| MD5 |
46693e74e96f4e9ab1af43668f044dda
|
|
| BLAKE2b-256 |
8702f3ef8f2782a2a7246e867fc7e2d8dd8df414390757d34d2fbfd3e25caf8c
|