Package for structural analysis of proteins

bioiain

By Iain Visa (iainvisa@gmail.com)

Toolbox for structural analysis of proteins.

[!WARNING] WIP EVERYTHING IS UNDER DEVELOPMENT!

Many features are still not commented/documented or even mentioned. Feel free to explore and/or use any functions.

It can be installed from the PyPI repository, but note the version you use, as any function might change during development.

pip install bioiain
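
Since functions may change between releases, it can help to record which version is installed. The following is a minimal sketch using only the standard library (`importlib.metadata`); the helper name `installed_version` is a hypothetical name chosen for illustration and is not part of bioiain.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str = "bioiain") -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


found = installed_version()
if found is None:
    print("bioiain is not installed; run: pip install bioiain")
else:
    # This line can be copied into requirements.txt to pin the exact version.
    print(f"bioiain=={found}")
```

Pinning the printed version in a requirements file keeps an analysis reproducible while the package is still changing.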

If you use this and find any issues, I'll be happy to fix them :D

INFO

Relevant Python code can be found in the src folder, organised into subfolders by topic. The test folder is for development use; its contents will probably be deleted or modified at some point, and are not included in the package.

[!NOTE] Preset workflows are being developed, including the projectDimer workflow. (WIP)

Protein Framework

Originally based on Biopython's hierarchy, but no longer dependent on it. Classes for structures and chains are included for manipulation and analysis of protein models. The framework is designed to be expandable, and custom classes are encouraged to match each purpose.

Unlike Biopython, residues and atoms do not share the base entity framework, as they behave in significantly different ways. Dedicated classes for nucleotides, ligands and water are also included.

Includes general-purpose tools and pipelines for importing, processing, saving, and exporting structures in mmCIF format (PDB has only limited support).

[!IMPORTANT] PDB parsing is not supported yet, so for now the input must be mmCIF.

[!NOTE] WIP: PDB parsing, handling structures with several models, and casting data from the corresponding Biopython objects.
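
Because only mmCIF input is accepted for now, it may be useful to check files before passing them on. The sketch below is a simple heuristic and not part of bioiain: it relies on the fact that a CIF file opens with a `data_` block header after any blank or `#` comment lines. The function name `looks_like_mmcif` is hypothetical.

```python
def looks_like_mmcif(text: str) -> bool:
    """Heuristic check: the first non-blank, non-comment line opens a CIF data block."""
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue  # skip blank lines and CIF comments
        return stripped.lower().startswith("data_")
    return False  # empty input or comments only


cif_text = "# example\ndata_1ABC\n_entry.id 1ABC\n"
pdb_text = "HEADER    HYDROLASE\nATOM      1  N   MET A   1\n"
print(looks_like_mmcif(cif_text), looks_like_mmcif(pdb_text))
```

This only inspects the header line, so it does not validate the rest of the file.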

Symmetries

This framework is designed to work with all the information available in crystallographic structures, so symmetry is taken into account when available.
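
To illustrate what taking symmetry into account involves, here is a minimal sketch, independent of bioiain's own implementation, that applies a crystallographic symmetry operation written in the usual `x,y,z` notation to fractional coordinates. The function names `parse_symop` and `apply_symop` are hypothetical, and the parser only handles simple terms such as `-x`, `x-y` or `y+1/2`.

```python
import re
from fractions import Fraction

# One signed term: a fraction, a number, or an axis letter.
_TERM = re.compile(r"([+-]?)(\d+/\d+|\d+(?:\.\d+)?|[xyz])")


def parse_symop(op: str):
    """Parse e.g. '-x,y+1/2,-z' into three rows of (cx, cy, cz, shift)."""
    rows = []
    for expr in op.lower().replace(" ", "").split(","):
        row = {"x": Fraction(0), "y": Fraction(0), "z": Fraction(0), "t": Fraction(0)}
        for sign, token in _TERM.findall(expr):
            factor = Fraction(-1 if sign == "-" else 1)
            if token in ("x", "y", "z"):
                row[token] += factor  # rotation part
            else:
                row["t"] += factor * Fraction(token)  # translation part
        rows.append((row["x"], row["y"], row["z"], row["t"]))
    if len(rows) != 3:
        raise ValueError(f"expected three comma-separated expressions, got: {op!r}")
    return rows


def apply_symop(rows, frac_xyz):
    """Apply the parsed operation to fractional coordinates (x, y, z)."""
    x, y, z = frac_xyz
    return tuple(float(a * x + b * y + c * z + t) for a, b, c, t in rows)


# A two-fold screw axis along b, as found in space group P 21.
screw = parse_symop("-x,y+1/2,-z")
print(apply_symop(screw, (0.1, 0.2, 0.3)))
```

The result is still in fractional coordinates; converting to Cartesian coordinates additionally requires the unit cell parameters.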

Machine Learning

Bioiain includes a PyTorch-based ML framework, still at a very early stage, to simplify the development and training of ML models focused on structural data.

This includes a base model with all the utilities commonly used during train/test/eval/inference of models.

A dataset/embedding framework is also set up, with integrations with the Protein Framework.

This includes integrated logging using TensorBoard.

ALEPH

Characteristic vectors are a powerful abstraction of protein structure, and can be calculated with ALEPH, through direct integration within the FragmentedStructure and Fragment classes included in the Protein Framework.

Tools

Utility functions to use and parse some external tools are included. For now this includes:

Utilities

Additionally, a large set of utilities is included, from logging to common mathematical operations.

Visualization

For structural visualisation, a custom PyMOL scripting framework is included, replacing heavy sessions with generative commands.
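
The idea of generating commands instead of storing sessions can be sketched as follows. This is an illustration only and does not use bioiain's scripting framework: the function `build_pml` and its parameters are hypothetical, while `load`, `hide`, `show`, `color` and `orient` are standard PyMOL commands.

```python
from typing import Mapping


def build_pml(structure_path: str, name: str, chain_colors: Mapping[str, str]) -> str:
    """Build a small PyMOL script that loads a structure and colours its chains."""
    lines = [
        f"load {structure_path}, {name}",
        "hide everything",
        f"show cartoon, {name}",
    ]
    # Sort chains so the generated script is deterministic.
    for chain, color in sorted(chain_colors.items()):
        lines.append(f"color {color}, {name} and chain {chain}")
    lines.append(f"orient {name}")
    return "\n".join(lines) + "\n"


script = build_pml("1abc.cif", "model", {"A": "red", "B": "blue"})
print(script)
```

A script like this is a few lines of text that can be regenerated at any time, and written to a `.pml` file it can be opened directly in PyMOL.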

Some common matplotlib utilities are also included.

Databases

Utility functions to download, parse, and query some online databases are included. For now this includes:

  • Plinder (protein ligand interactions dataset)

[!NOTE] Currently under development (separate repo): UniProt and COSMIC databases
