
Package for structural analysis of proteins


bioiain


By Iain Visa (iainvisa@gmail.com)


Toolbox for structural analysis of proteins.

[!WARNING] WIP EVERYTHING IS UNDER DEVELOPMENT!

Many features are still not commented/documented or even mentioned. Feel free to explore and/or use any functions.

It can be installed from the PyPI repository, but note which version you use, as any function might change during development.

pip install bioiain
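Since functions may change between releases, it can help to record or check the installed version from your own scripts. The following is a minimal sketch using only the standard library; `installed_version` and `check_pin` are hypothetical helper names, not part of bioiain. A specific release can be pinned at install time with the usual `pip install bioiain==<version>` syntax.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def check_pin(package: str, expected: str) -> bool:
    """Return True only when `package` is installed at exactly `expected`."""
    return installed_version(package) == expected


# Hypothetical usage: report whichever bioiain release is present, if any.
print("bioiain version:", installed_version("bioiain"))
```

Calling `check_pin("bioiain", "<version>")` at the top of an analysis script is one way to fail early if the environment has drifted from the release the script was written against.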

If you were to use this and find any issue I'll be happy to fix it :D

INFO

The Python code can be found in the src folder, organised into subfolders by topic. The test folder is for development use only: its contents will probably be deleted or modified at some point, and they are not included in the package.

[!NOTE] Preset workflows are being developed, including the projectDimer workflow. (WIP)

Protein Framework

Originally based on Biopython's hierarchy, but no longer dependent on it. Classes for structures and chains are included for manipulation and analysis of protein models. The framework is designed to be expandable, and custom classes are encouraged to match each purpose.

Unlike Biopython, residues and atoms do not share the base entity framework, as they behave in significantly different ways. Dedicated classes for nucleotides, ligands and water are also included.

Includes general-purpose tools and pipelines for importing, processing, saving, and exporting structures in mmCIF format (PDB is still only slightly supported).

[!IMPORTANT] PDB parsing is not supported yet, so for now the input must be mmCIF.

[!NOTE] WIP: PDB parsing, handling structures with several models, and casting data from the corresponding Biopython objects.
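For readers unfamiliar with mmCIF, the sketch below shows how the atom records are laid out: a `loop_` block lists the `_atom_site.*` column tags, followed by one whitespace-separated row per atom. This is not bioiain's parser; `read_atom_site` is a hypothetical, deliberately minimal reader that assumes a single `_atom_site` loop with one row per line and does not cover the full mmCIF quoting rules.

```python
import shlex


def read_atom_site(cif_text):
    """Return one dict per row of the first _atom_site loop in `cif_text`."""
    tags = []
    rows = []
    for raw in cif_text.splitlines():
        line = raw.strip()
        if line.startswith("_atom_site."):
            # Column header: keep the part after the category name.
            tags.append(line.split(".", 1)[1].split()[0])
        elif tags:
            # After the header, rows continue until the loop is closed.
            if not line or line.startswith(("#", "_", "loop_", "data_")):
                break
            values = shlex.split(line)  # handles simple quoted values
            if len(values) == len(tags):
                rows.append(dict(zip(tags, values)))
    return rows


# Tiny hand-written example, not taken from a real entry.
EXAMPLE_CIF = """\
data_demo
loop_
_atom_site.group_PDB
_atom_site.id
_atom_site.label_atom_id
_atom_site.label_comp_id
_atom_site.label_asym_id
_atom_site.Cartn_x
_atom_site.Cartn_y
_atom_site.Cartn_z
ATOM 1 N MET A 1.000 2.000 3.000
ATOM 2 CA MET A 2.500 2.000 3.000
#
"""

atoms = read_atom_site(EXAMPLE_CIF)
print(len(atoms), "atoms read")
```

All values are returned as strings; coordinates need an explicit `float(...)` conversion before any geometry is computed.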

Symmetries

This framework is designed to work with all the information available in crystallographic structures, therefore symmetry is considered when available.
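As an illustration of what considering symmetry involves, the sketch below applies one crystallographic symmetry operation (a rotation matrix plus a translation) to fractional coordinates. It is a generic example and not bioiain's implementation; `apply_symop` and `SCREW_B` are hypothetical names. The operation used is the 2_1 screw axis along b, written as (-x, y+1/2, -z).

```python
from fractions import Fraction


def apply_symop(rotation, translation, frac_xyz, wrap=True):
    """Apply x' = R x + t to fractional coordinates.

    With wrap=True the result is brought back into the unit cell [0, 1).
    """
    new = []
    for row, shift in zip(rotation, translation):
        value = sum(r * x for r, x in zip(row, frac_xyz)) + shift
        new.append(value % 1 if wrap else value)
    return tuple(new)


# 2_1 screw axis along b: (-x, y + 1/2, -z)
SCREW_B = (
    [[-1, 0, 0], [0, 1, 0], [0, 0, -1]],
    [0, Fraction(1, 2), 0],
)

point = (Fraction(1, 10), Fraction(1, 5), Fraction(3, 10))
image = apply_symop(SCREW_B[0], SCREW_B[1], point)
print(image)
```

Exact fractions are used here only to keep the arithmetic easy to check; with floating-point coordinates the same function works unchanged. Applying the screw operation twice moves the point by one full cell along b, so with wrapping it returns to the starting position.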

Machine Learning

Bioiain includes a PyTorch-based ML framework, still at a very early stage, to simplify the development and training of ML models focused on structural data.

This includes a base model with all the utilities commonly used during train/test/eval/inference of models.

A dataset/embedding framework is also set up, with integrations with the Protein Framework.

This includes integrated logging using TensorBoard.

ALEPH

Characteristic vectors are a powerful abstraction of protein structure, and can be calculated with ALEPH, through direct integration within the FragmentedStructure and Fragment classes included in the Protein Framework.

Tools

Utility functions to use and parse some external tools are included. For now this includes:

Utilities

Additionally, a large set of utilities is included, from logging, to common mathematical operations.

Visualization

For structural visualisation, a custom PyMOL scripting framework is included, replacing heavy sessions with generative commands.
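The idea of replacing a saved session with generated commands can be sketched as follows. This is not bioiain's API; `pymol_script` is a hypothetical helper that only builds the text of a PyMOL command script from standard PyMOL commands (`load`, `hide`, `show`, `color`, `orient`), assuming one object and a colour per chain.

```python
def pymol_script(structure_path, object_name, chain_colors):
    """Build the text of a PyMOL command script for one structure."""
    lines = [
        f"load {structure_path}, {object_name}",
        "hide everything",
        f"show cartoon, {object_name}",
    ]
    # One colour command per chain, in the order given.
    for chain, color in chain_colors.items():
        lines.append(f"color {color}, {object_name} and chain {chain}")
    lines.append(f"orient {object_name}")
    return "\n".join(lines) + "\n"


# Hypothetical file and object names, for illustration only.
script = pymol_script("model.cif", "model", {"A": "red", "B": "blue"})
print(script)
```

The resulting text can be written to a `.pml` file and opened with PyMOL. A few lines of commands are far smaller than a session file and regenerate the same view from the original structure.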

Some common matplotlib utilities are also included.

Databases

Utility functions to download, parse, and query some online databases are included. For now this includes:

  • PLINDER (protein-ligand interactions dataset)

[!NOTE] Currently under development (separate repo): UniProt and COSMIC databases
