Skip to main content

Tools for creating machine-learning datasets from macromolecular structure

Project description

Last release Python version Documentation Test status Test coverage Last commit

Macromolecule Census is a set of tools for creating machine-learning datasets from macromolecular structure data, especially those made available by the protein data bank (PDB). The purpose of these tools is to account for the following:

  • Filter for high-quality (e.g. high resolution, low R-factor), low-redundancy (i.e. sequence identity cutoffs) structures.

  • Make robust training/validation/test splits by accounting for domain-level structural similarities.

  • Store atomic coordinates in a compact, portable, standard format (SQLite).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

macromol_census-0.2.0.tar.gz (26.3 kB view details)

Uploaded Source

Built Distribution

macromol_census-0.2.0-py3-none-any.whl (33.6 kB view details)

Uploaded Python 3

File details

Details for the file macromol_census-0.2.0.tar.gz.

File metadata

  • Download URL: macromol_census-0.2.0.tar.gz
  • Upload date:
  • Size: 26.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for macromol_census-0.2.0.tar.gz
Algorithm Hash digest
SHA256 efee53e73fb9e91e6083ed238953488dab6b193dcd85330f96bd6ac237d1f53c
MD5 932e49b06ef08ce6780bff754d24ed3f
BLAKE2b-256 f79756b84e8b392aa1e11815cc9c969bf37cb15ef6695795ba8c930a1c3292cc

See more details on using hashes here.

File details

Details for the file macromol_census-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for macromol_census-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0e3bc37571803dd89efb764a3051f2f5939df8fb92b9b285f588e5d2705210d5
MD5 29367fd0b196e0f9f64847e5f65bb918
BLAKE2b-256 3ba483d580c9dce03d352b15be945598b7f3463b2be3eede4d597a03991db81b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page