A tool for batch-calculating protein scores from PDB files.

PDB-score

Click here for the Chinese documentation.

Features

Computes GDT scores for large batches of predicted protein models against their experimental structures, and calculates the RMSD after coordinate alignment. Both metrics can be used to evaluate prediction models.

Installation

pip install PDB-score

Usage

psc [-h] -c /path/to/control -t /path/to/treatment -o /path/to/output [-T THREAT] [-B BATCH]
  • -c Directory where the experimental PDB files are stored.
  • -t Directory where the predicted PDB files are stored.
  • -o Directory for saving the output scores.
  • -T Number of CPU cores to use (default: 4).
  • -B Batch size (default: 5000).

Output

/path/to/output/protein_scores.csv

name      RMSD          1A     2A     ...  128A   Average
Protein1  rmsd (float)  Score  Score  ...  Score  Score
Protein2  rmsd (float)  Score  Score  ...  Score  Score
...       ...           ...    ...    ...  ...    ...
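
As a rough illustration of how the output might be consumed, the sketch below loads the score table with Python's standard csv module and picks the protein with the highest Average. It assumes the file is comma-delimited with a header row matching the columns shown above. The sample rows are made up for the example and list only two threshold columns, and load_scores is a hypothetical helper, not part of PDB-score.

```python
import csv
import io

# Made-up sample standing in for /path/to/output/protein_scores.csv.
# Assumption: comma-delimited, header row as in the table above.
sample = io.StringIO(
    "name,RMSD,1A,2A,Average\n"
    "Protein1,1.25,0.50,0.75,0.625\n"
    "Protein2,3.50,0.25,0.50,0.375\n"
)


def load_scores(handle):
    """Return a list of (name, {column: float}) pairs from an open CSV handle."""
    rows = []
    for row in csv.DictReader(handle):
        name = row.pop("name")
        rows.append((name, {column: float(value) for column, value in row.items()}))
    return rows


scores = load_scores(sample)

# Pick the entry with the highest value in the "Average" column.
best_name, best = max(scores, key=lambda item: item[1]["Average"])
print(best_name, best["RMSD"], best["Average"])
```

To read a real result file, pass open("/path/to/output/protein_scores.csv", newline="") in place of the in-memory sample.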

Calculation Method

  • Perform coordinate alignment using Biopython.
  • Scores are calculated using the GDT (Global Distance Test) algorithm.
  • All ligands are removed, and each residue is represented only by the coordinates of its alpha carbon atom.
  • When the two structures contain different numbers of alpha carbons, the extra or missing residues are counted as failing the accuracy criterion at every accuracy setting.
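
The scoring rule described above can be sketched in plain Python. This is a minimal illustration, not the tool's actual implementation: it assumes the two sets of alpha-carbon coordinates have already been superimposed (the tool uses Biopython for that step), that residues are paired by position, and that each score is the fraction of residues within a distance cutoff. The function name gdt_fractions and the cutoff values are hypothetical.

```python
import math


def gdt_fractions(ref_ca, pred_ca, cutoffs=(1.0, 2.0, 4.0, 8.0)):
    """Fraction of residues whose alpha carbons lie within each cutoff (in angstroms).

    Both inputs are lists of (x, y, z) tuples that are assumed to be
    superimposed already and paired by index (an assumption of this sketch).
    """
    paired = min(len(ref_ca), len(pred_ca))
    # Dividing by the longer length makes every extra or missing residue
    # count as a failure at every cutoff.
    total = max(len(ref_ca), len(pred_ca))
    if total == 0:
        raise ValueError("no alpha carbons to compare")
    distances = [math.dist(ref_ca[i], pred_ca[i]) for i in range(paired)]
    return {
        cutoff: sum(d <= cutoff for d in distances) / total
        for cutoff in cutoffs
    }


# Toy coordinates: the reference has four residues, the prediction only three.
ref = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
pred = [(0.5, 0.0, 0.0), (3.8, 1.5, 0.0), (7.6, 0.0, 3.0)]

scores = gdt_fractions(ref, pred)
print(scores)
```

In the toy data the paired distances are 0.5, 1.5 and 3.0 angstroms. Because the missing fourth residue stays in the denominator, no cutoff can score above 0.75.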

Performance

  • Test Environment:
    • Default parameters: -T 4 -B 5000
    • Test machine: Windows 11 PC with an Intel Core i5-12600K CPU
    • Size of a single sample: 146 KB, 154 residues
  • Comparing 50,000 samples took 387,061 ms (about 6.5 minutes, or roughly 7.7 ms per sample).
  • Memory usage is less than 6GB.

Additional Notes

  • Only .pdb and .ent files that have the same name (excluding the extension) in both input directories are analyzed.
  • The -o option specifies only the output directory, not the file name.
  • The output is a .csv file with a fixed name, so take care not to overwrite an earlier result in the same directory.
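
The name-matching rule in the notes above can be illustrated with pathlib. This is a sketch under assumptions rather than the tool's own code: paired_structures is a hypothetical helper, extensions are compared case-insensitively, and a file in one directory may pair with a file of a different extension (.pdb or .ent) in the other.

```python
import tempfile
from pathlib import Path

PDB_SUFFIXES = {".pdb", ".ent"}


def paired_structures(control_dir, treatment_dir):
    """Map each shared base name to its (control, treatment) file paths."""

    def index(directory):
        # Keep only structure files, keyed by file name without extension.
        return {
            path.stem: path
            for path in Path(directory).iterdir()
            if path.is_file() and path.suffix.lower() in PDB_SUFFIXES
        }

    control = index(control_dir)
    treatment = index(treatment_dir)
    shared = sorted(control.keys() & treatment.keys())
    return {name: (control[name], treatment[name]) for name in shared}


# Small demonstration with empty placeholder files in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    control_dir = Path(tmp, "control")
    treatment_dir = Path(tmp, "treatment")
    control_dir.mkdir()
    treatment_dir.mkdir()
    for name in ("1abc.pdb", "2xyz.ent", "notes.txt"):
        (control_dir / name).touch()
    for name in ("1abc.ent", "3def.pdb", "notes.txt"):
        (treatment_dir / name).touch()

    pairs = paired_structures(control_dir, treatment_dir)
    matched = list(pairs)

print(matched)
```

Files without a counterpart in the other directory, and files with other extensions, are skipped.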

Acknowledgments

@SiriNatsume
Wishing you happiness :)
