
Package for structural analysis of proteins


bioiain


By Iain Visa (iainvisa@gmail.com)


Toolbox for structural analysis of proteins.

[!WARNING] WIP EVERYTHING IS UNDER DEVELOPMENT!

Many features are still not commented/documented or even mentioned. Feel free to explore and/or use any functions.

It can be installed from the PyPI repository, but note which version you use, as any function might change during development.

pip install bioiain
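Since any function might change between releases, it can help to record the installed version alongside your results. The snippet below is a minimal sketch using only the Python standard library; `installed_version` is a hypothetical helper name and is not part of bioiain.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package: str = "bioiain"):
    """Return the installed version string of `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Store this next to your outputs so an analysis can be reproduced later.
    print(f"bioiain version: {installed_version()}")
```

The reported version can then be pinned in a requirements file with the usual `bioiain==<version>` syntax.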

If you use this and find any issues, I'll be happy to fix them :D

INFO

The Python code can be found in the src folder, organised into subfolders by topic. The test folder is for development use only: its contents will probably be deleted or modified at some point, and they are not included in the package.

[!NOTE] Preset workflows are being developed, including the projectDimer workflow. (WIP)

Protein Framework

Originally based on Biopython's hierarchy, but no longer dependent on it. Classes for structures and chains are included for manipulation and analysis of protein models. The framework is designed to be expandable, and custom classes are encouraged to match each purpose.

Unlike Biopython, residues and atoms do not share the base entity framework, as they behave in significantly different ways. Dedicated classes for nucleotides, ligands, and water are also included.

Includes general-purpose tools and pipelines for importing, processing, saving, and exporting structures in mmCIF format (PDB support is still very limited).

[!IMPORTANT] PDB parsing is not supported yet, so for now the input must be mmCIF.

[!NOTE] WIP: PDB parsing, handling structures with several models, and casting data from the corresponding Biopython objects.
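To illustrate the kind of mmCIF input the framework works from, here is a minimal standard-library sketch that reads the `_atom_site` loop of an mmCIF file into dictionaries. It is not bioiain's parser: `read_atom_site` and `EXAMPLE` are hypothetical names, and the sketch assumes one atom per line and ignores multi-line values.

```python
import shlex

# Tiny hand-written mmCIF fragment, for illustration only.
EXAMPLE = """
data_demo
loop_
_atom_site.group_PDB
_atom_site.id
_atom_site.label_atom_id
_atom_site.label_comp_id
_atom_site.label_asym_id
_atom_site.label_seq_id
_atom_site.Cartn_x
_atom_site.Cartn_y
_atom_site.Cartn_z
ATOM 1 N  MET A 1 27.340 24.430 2.614
ATOM 2 CA MET A 1 26.266 25.413 2.842
#
"""


def read_atom_site(cif_text: str) -> list[dict[str, str]]:
    """Return one dict per row of the first `_atom_site` loop in `cif_text`."""
    columns: list[str] = []
    rows: list[dict[str, str]] = []
    for raw in cif_text.splitlines():
        line = raw.strip()
        if line.startswith("_atom_site."):
            # Column header: keep the part after the category name.
            columns.append(line.split(".", 1)[1])
            continue
        if columns:
            # A blank line, comment, new tag, or new loop ends the atom table.
            if not line or line.startswith(("#", "_", "loop_", "data_")):
                break
            values = shlex.split(line)  # also handles quoted atom names
            if len(values) == len(columns):
                rows.append(dict(zip(columns, values)))
    return rows


atoms = read_atom_site(EXAMPLE)
```

Each entry in `atoms` maps column names such as `label_atom_id` or `Cartn_x` to their string values, which can then be converted and grouped into chains and residues.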

Symmetries

This framework is designed to work with all the information available in crystallographic structures, so symmetry is taken into account when available.
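As a rough illustration of what taking symmetry into account involves, a crystallographic symmetry operation maps fractional coordinates x to R·x + t. The sketch below applies one such operator in plain Python; `apply_symmetry` and the constants are hypothetical names for this example and do not reflect bioiain's API.

```python
def apply_symmetry(frac, rotation, translation):
    """Apply x' = R.x + t to fractional coordinates and wrap into the unit cell."""
    return tuple(
        (sum(r * c for r, c in zip(row, frac)) + t) % 1.0
        for row, t in zip(rotation, translation)
    )


# 2_1 screw axis along b, written as the operator (-x, y+1/2, -z).
SCREW_ROTATION = ((-1, 0, 0), (0, 1, 0), (0, 0, -1))
SCREW_TRANSLATION = (0.0, 0.5, 0.0)

# Symmetry mate of an atom at fractional position (0.1, 0.2, 0.3).
mate = apply_symmetry((0.1, 0.2, 0.3), SCREW_ROTATION, SCREW_TRANSLATION)
```

Applying the same screw operation twice translates by a full cell along b, so after wrapping the atom returns to its original fractional position.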

Machine Learning

Still at a very early stage, Bioiain includes a PyTorch-based ML framework to simplify the development and training of ML models, focused on structural data.

This includes a base model with all the utilities commonly used during train/test/eval/inference of models.

A dataset/embedding framework is also set up, with integrations with the Protein Framework.

Integrated logging with TensorBoard is included as well.

ALEPH

Characteristic vectors are a powerful abstraction of protein structure, and can be calculated with ALEPH, through direct integration within the FragmentedStructure and Fragment classes included in the Protein Framework.
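As a sketch of the idea, and assuming the usual ALEPH definition in which a characteristic vector runs from the centroid of the Cα atoms to the centroid of the carbonyl oxygen atoms of a short peptide fragment, the calculation looks like this. The function names are hypothetical and independent of the FragmentedStructure and Fragment classes.

```python
def centroid(points):
    """Arithmetic mean of a list of (x, y, z) coordinates."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(3))


def characteristic_vector(ca_coords, o_coords):
    """Vector from the CA centroid to the carbonyl-O centroid of one fragment.

    Assumption: this follows the ALEPH-style definition described in the
    lead-in; it is not taken from bioiain's source code.
    """
    start = centroid(ca_coords)
    end = centroid(o_coords)
    return tuple(e - s for s, e in zip(start, end))


# Idealised three-residue fragment with made-up coordinates.
ca = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0), (4.0, 0.0, 0.0)]
o = [(0.0, 1.0, 0.0), (2.0, 1.0, 0.0), (4.0, 1.0, 0.0)]
cv = characteristic_vector(ca, o)
```

Sliding such a window along a chain yields one vector per fragment, which reduces a structure to a compact set of vectors that are easy to compare.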

Tools

Utility functions to use and parse some external tools are included. For now this includes:

Utilities

Additionally, a large set of utilities is included, from logging to common mathematical operations.

Visualization

For structural visualisation, a custom PyMOL scripting framework is included, which replaces heavy session files with commands that regenerate the view.

Some common matplotlib utilities are also included.
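The idea behind the PyMOL scripting approach can be sketched as follows: instead of saving a large session file, a short script of standard PyMOL commands (`load`, `hide`, `show`, `color`, `orient`) is generated and replayed. `pymol_script` is a hypothetical helper for this example, not the framework's actual interface, and the file name is a placeholder.

```python
def pymol_script(structure_path, object_name, chain_colors):
    """Build a PyMOL command script that loads a structure and colours its chains."""
    lines = [
        f"load {structure_path}, {object_name}",
        "hide everything",
        f"show cartoon, {object_name}",
    ]
    for chain, color in chain_colors.items():
        # One colour command per chain, restricted to this object.
        lines.append(f"color {color}, {object_name} and chain {chain}")
    lines.append(f"orient {object_name}")
    return "\n".join(lines) + "\n"


script = pymol_script("model.cif", "model", {"A": "marine", "B": "salmon"})
```

Writing `script` to a `.pml` file and opening it with PyMOL rebuilds the view from the original structure file, so only a few lines of text need to be stored.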

Databases

Utility functions to download, parse, and query some online databases are included. For now this includes:

  • Plinder (protein ligand interactions dataset)

[!NOTE] Currently under development (separate repo): UniProt and COSMIC databases

