Package for structural analysis of proteins
Project description
bioiain
By Iain Visa (iainvisa@gmail.com)
Toolbox for structural analysis of proteins.
[!WARNING] WIP EVERYTHING IS UNDER DEVELOPMENT!
Many features are still not commented/documented or even mentioned. Feel free to explore and/or use any functions.
Can be downloaded from the PiPy repository but note the used version as any function might change during development.
pip install bioiain
If you were to use this and find any issue I'll be happy to fix it :D
INFO
Relevant python code can be found in the src folder within their relevant folders.
The test is for development use, and it's contents will be probably deleted/modified at some point, and are not included in the package.
[!NOTE] Preset workflows are being developed, including the projectDimer workflow. (WIP)
Protein Framework
Originally based on Biopython's hierarchy, but no longer dependant on it. Classes for structures and chains are included for manipulation and analysis of protein models. Designed to be expandable, custom classes are encouraged to match each purpose.
Unlike Biopython, residues and atoms do not share the base entity framework as they behave in significantly different ways. Also respective classes for nucleotides, ligands and water are included.
Includes general-purpose tools and pipelines for importing, processing,saving, and exporting structures in mmCIF format (but PDB is still slightly supported)
[!IMPORTANT] PDB Parsing is not supported yet, so the input so far must be mmCIF
[!NOTE] WIP: Allow PDB parsing, dealing with structures with several models, cast data from respective Biopython objects.
Symmetries
This framework is designed to work with all the information available in crystallographic structures, therefore symmetry is considered when available.
Machine Learning
Still at a very early stage, Bioiain includes a PyTorch-based ML framework to simplify the development and training of ML models, focused on structural data.
This includes a base model with all the utilities commonly used during train/test/eval/inference of models.
Also a dataset/embedding framework is also set up with integrations with the Protein Framework.
This includes integrated logging using Tensorboard.
ALEPH
Characteristic vectors are a powerful abstraction of protein structure, and can be calculated with ALEPH, through direct integration within the FragmentedStructure and Fragment classes included in the Protein Framework.
Tools
Utility functions to use and parse some external tools are included. For now this includes:
Utilities
Additionally, a large set of utilities is included, from logging, to common mathematical operations.
Visualization
For structural visualisation, a custom PyMol scripting framework is included, replacing heavy sessions with generative commands.
Some common matplotlib utilities are also included.
Databases
Utility functions download, parse, and query some online databases are included. For now this includes:
- Plinder (protein ligand interactions dataset)
[!NOTE] Currently under development (separate repo): UniProt and COSMIC databases
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bioiain-0.0.10.1.3.tar.gz.
File metadata
- Download URL: bioiain-0.0.10.1.3.tar.gz
- Upload date:
- Size: 287.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bcee7e425f48a271de81b1c45ecfaec13e482d33cce9063b526a1a6897cf6d31
|
|
| MD5 |
c5b27661b5b341748901749577904dc2
|
|
| BLAKE2b-256 |
bc763bc33a56f43080ec08c96c8dd4570e8693338eb4d5ec83e6e3905a9e9ccb
|
File details
Details for the file bioiain-0.0.10.1.3-py3-none-any.whl.
File metadata
- Download URL: bioiain-0.0.10.1.3-py3-none-any.whl
- Upload date:
- Size: 304.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b04c34f41fc4de90b2113a31f695f527f2ae949e49aeb94751a3569289e49525
|
|
| MD5 |
dcf35a07eeb68e9b0eee477e78128b47
|
|
| BLAKE2b-256 |
67d9708edd1c1fe28ba4d7bf6b85e44fc2a0a1d9ca477821d78d568b4361d96d
|