Protein structure datasets for machine learning.

ProteinShake provides one-liner imports of large-scale, preprocessed protein structure datasets and tasks for various model types and frameworks.

We provide a collection of preprocessed and cleaned protein 3D structure datasets from RCSB and AlphaFoldDB, including annotations. Structures are easily converted to graphs, voxels, or point clouds and loaded natively into PyTorch, TensorFlow, NumPy, JAX, PyTorch Geometric, DGL and NetworkX. The task API enables standardized benchmarking on a variety of tasks at the protein and residue level.
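To make the representations mentioned above concrete, here is a minimal, standard-library-only sketch of what converting a structure to a graph or to voxels can mean. It is an illustration of the general idea, not ProteinShake's implementation or API: the function names `eps_graph` and `voxelize`, the distance cutoff, the voxel size, and the coordinates are all hypothetical choices made for this example.

```python
import math

def eps_graph(coords, eps=8.0):
    """Return undirected edges (i, j), i < j, for residue pairs closer than eps."""
    edges = []
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            if math.dist(coords[i], coords[j]) < eps:
                edges.append((i, j))
    return edges

def voxelize(coords, voxel_size=10.0):
    """Count residues per cubic grid cell with side length voxel_size."""
    grid = {}
    for x, y, z in coords:
        cell = (
            math.floor(x / voxel_size),
            math.floor(y / voxel_size),
            math.floor(z / voxel_size),
        )
        grid[cell] = grid.get(cell, 0) + 1
    return grid

# Made-up residue coordinates in angstroms (assumption, not real data).
# The raw coordinate list itself is already a point cloud.
ca_coords = [
    (0.0, 0.0, 0.0),
    (3.8, 0.0, 0.0),
    (7.6, 0.0, 0.0),
    (20.0, 0.0, 0.0),
]

edges = eps_graph(ca_coords, eps=8.0)
grid = voxelize(ca_coords, voxel_size=10.0)
print(edges)
print(grid)
```

In this toy example the first three residues are mutually within the cutoff and share a voxel, while the fourth is isolated. The library handles these conversions and the framework-specific loading for you; see the Documentation for the actual calls.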

Find more information on the Website and the Documentation, or check out the Tutorials. The results of the paper and the baseline models can be found in the Evaluation Repository. If you would like to create your own release, see the Release Repository.


Installation:

This is a pre-release version. There may be unannounced changes to the API and datasets. We expect some bugs as well; please open an issue if you find one.

pip install proteinshake

Code in this repository is licensed under BSD-3; the dataset files on Zenodo are licensed under CC-BY-4.0.

To build ProteinShake, we obtained and modified data from various sources. Please see the documentation of the respective dataset classes for a reference to the original data, license, and paper.

Download files

Source Distribution

proteinshake-0.3.14.tar.gz (48.5 kB)

Built Distribution

proteinshake-0.3.14-py3-none-any.whl (70.0 kB, Python 3)

File details

Details for the file proteinshake-0.3.14.tar.gz.

File metadata

  • Download URL: proteinshake-0.3.14.tar.gz
  • Size: 48.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.6

File hashes

Hashes for proteinshake-0.3.14.tar.gz
Algorithm Hash digest
SHA256 26425e53f8030c692a795bab33a90b05ea7e7c4249d30a882f04cb60e04e3092
MD5 3e69791a88b859b222c67e51d0f35b41
BLAKE2b-256 bf9b340417a7993be9497c1b669ee6f2dbc8820fc0b7731e0361ae71ac73345b

See more details on using hashes here.
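As a rough sketch of how a published digest can be checked after a manual download, the snippet below streams a file through SHA-256 and compares the result with an expected hex string. The helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the local file path in the commented usage is an assumption about where the archive was saved.

```python
import hashlib

def sha256_of_file(path, chunk_size=65536):
    """Stream a file through SHA-256 and return the hex digest."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so large archives do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def matches_published_hash(path, expected_hex):
    """Return True if the file's SHA-256 digest equals the published one."""
    return sha256_of_file(path) == expected_hex.strip().lower()

# SHA256 digest listed above for proteinshake-0.3.14.tar.gz.
EXPECTED_SDIST_SHA256 = (
    "26425e53f8030c692a795bab33a90b05ea7e7c4249d30a882f04cb60e04e3092"
)

# Usage, assuming the archive sits in the current directory:
# matches_published_hash("proteinshake-0.3.14.tar.gz", EXPECTED_SDIST_SHA256)
```

A mismatch means the file differs from the one that was uploaded, for example because of a truncated download, and the archive should be fetched again.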

File details

Details for the file proteinshake-0.3.14-py3-none-any.whl.

File hashes

Hashes for proteinshake-0.3.14-py3-none-any.whl
Algorithm Hash digest
SHA256 2bf9f732228e5c2f944f481c1c58fc1d4aebba1e1af61cbf70e2e52a9bba460f
MD5 4e5a6fe8459d24e3a2d73a76c300b953
BLAKE2b-256 8797d79033bc4af0e0af6b99e3407a5e2f19b39d4c9c26e025e25fea51b72190
