Macromolecular training sets that emphasize protein/non-protein interactions.
Project description
Macromolecular Gymnasium
This package builds datasets of macromolecular structures, for machine learning. Some properties of these datasets:
-
Capable of processing the entire PDB.[^1]
-
Each data point is a particular coordinate in a particular structure.
-
Only data points with sufficient density, and that feature unique combinations of molecules, are included.
-
The resulting database is self-contained; all of the necessary coordinates are copied into the database.
-
Coordinates are taken from biological assemblies, not asymmetric units.
-
Train/validation/test splits are made such that all structures which share an InterPro domain/family end up in the same split.
[^1]: With the only exceptions being 8h2i and 5zz8. Both of these
structures have errors in their biological assemblies, which cause the
assemblies to have huge volumes of empty space. Because of the way some
internal algorithms work, these particular errors end up requiring prohibitive
amounts of memory to process.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file macromol_gym-1.1.0.tar.gz.
File metadata
- Download URL: macromol_gym-1.1.0.tar.gz
- Upload date:
- Size: 22.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
57087bcb84c6d199209aa84edafae06d96911f537b255aa72ec905fe6a7a8d2c
|
|
| MD5 |
feafac4c6f368b0405869e0365530298
|
|
| BLAKE2b-256 |
f1829398c7ec44d5f9fd93add9377397844aae1b8e5eb7f092ea22df98e674b4
|
File details
Details for the file macromol_gym-1.1.0-py3-none-any.whl.
File metadata
- Download URL: macromol_gym-1.1.0-py3-none-any.whl
- Upload date:
- Size: 25.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b9b9a3aad3fc02ed56d144119a1d7a5d2a2299a902349dc996961755371634fc
|
|
| MD5 |
88807cc438d5113dc79b5b78b3d326d3
|
|
| BLAKE2b-256 |
e3e0d4e249728b5fbe9c663d3713892819add1960215019d46bebbdd4299e676
|