A tool to massively calculate protein scores using PDB files.
Project description
PDB-score
Features
Implemented GDT scoring for a large number of predicted and actual protein models, along with calculating RMSD after coordinate alignment, which can be used to evaluate the prediction models.
Installation
pip install PDB-score
Usage
psc [-h] -c /path/to/control -t /path/to/treatment -o /path/to/output [-T THREAT] [-B BATCH]
-cDirectory where the experimental PDB files are stored.-tDirectory where the predicted PDB files are stored.-oDirectory for saving the output scores.-TSpecify the number of cores, default is 4.-BSpecify the Batch size, default is 5000.
Output
/path/to/output/protein_scores.csv
| name | RMSD | 1Å | 2Å | ... | 128Å | Average |
|---|---|---|---|---|---|---|
| Protein1 | rmsd (float) | Score | Score | ... | Score | Score |
| Protein2 | rmsd (float) | Score | Score | ... | Score | Score |
| ... | ... | ... | ... | ... | ... | ... |
Calculation Method
- Perform coordinate alignment using Biopython.
- Scores are calculated using the GDT (Global Distance Test) algorithm.
- All ligands are removed, and only the alpha carbon atoms are used to represent the residue coordinates.
- When the count of alpha carbons differs, the extra or missing residues are directly considered as failing to meet the accuracy criteria (regardless of the accuracy setting).
Performance
- Test Environment:
- Default parameters:
-T 4 -B 5000 - Test Machine:
Windows 11 PC, CPU Intel 12600k - Single sample size: 146KB with 154 residues
- Default parameters:
- Comparing 50,000 samples took 387061ms.
- Memory usage is less than 6GB.
Additional Notes
- Only analyze
.pdband.entfiles with the same name (excluding extensions) in the two input directories. - The
-ooption only specifies the output directory, not the file name. - Outputs a
.csvfile with a fixed file name, so be careful not to overwrite it.
Acknowledgments
@SiriNatsume
Wishing you happiness :)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pdb_score-1.0.5.tar.gz.
File metadata
- Download URL: pdb_score-1.0.5.tar.gz
- Upload date:
- Size: 6.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.10.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
09ea24a1827e86510dfead644e2d2b5ce980acbe637409c5b6798f266cb85575
|
|
| MD5 |
114e9e64ca503f85721fb061bf57c7c2
|
|
| BLAKE2b-256 |
2dbf6e893e3e4fef00c8308b51f668e82fc033503cf4f3cdbcad638f8212920d
|
File details
Details for the file PDB_score-1.0.5-py3-none-any.whl.
File metadata
- Download URL: PDB_score-1.0.5-py3-none-any.whl
- Upload date:
- Size: 7.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.10.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
88595085b0cb7121539d1886ed34f1673432bdea115f32704eade210b640ce3c
|
|
| MD5 |
97965309739e091ee1ee662dc1948c62
|
|
| BLAKE2b-256 |
44d8a841abc3226551fba19365bdda6bc86492222375248dbd414574a1a1fbef
|