Skip to main content

Protein chemical shift prediction based on Protein Language Model

Project description

PLM-CS

Predict protein chemical shifts from sequence

image

Train your model

If you want to train your own PLM-CS model, this repository provides all the tools and data. Just follow these steps.

Requirement

'torch == 2.5.0',
'torchaudio == 2.5.0',
'torchvision == 0.20.0',
'fair-esm == 2.0.0',
'numpy == 2.1.2',
'biopython == 1.84',
'pandas == 2.2.3'

Training set

We provide the complete training set data in RefDB training dataset. Each file in this folder is in nmrstar format, and each file corresponds to a protein. All proteins contained in the SHIFTX test are removed from it.

Training set processing

For convenience, the reasoning process of the ESM model is separate from the training process of our regression model. Therefore, we first use ESM-650M to process the data. Change the "save_path" in esm_process.py to your own path. A tensordataset containing the training data will be generated.

Train

Modify the path in the train.py to your own parh. Also, be aware that this can only train a model of one type of atom at a time.

Training parameters

Different atom types correspond to different optimizer strategies.You can modify the corresponding parameters in the train.py according to your trained model. The default number of steps for an iteration is 20,000, but you can change it to 5,000 to achieve very close performance while reducing training time

parameters C H N
learning rate 0.02 5e-4 0.002 0.01 5e-4 5e-4
optimizer SGD Adam Adam SGD Adam Adam

Evaluate

Use PLM-CS through python SDK

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

plm_cs-1.4.tar.gz (7.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

plm_cs-1.4-py3-none-any.whl (7.5 kB view details)

Uploaded Python 3

File details

Details for the file plm_cs-1.4.tar.gz.

File metadata

  • Download URL: plm_cs-1.4.tar.gz
  • Upload date:
  • Size: 7.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.10.14

File hashes

Hashes for plm_cs-1.4.tar.gz
Algorithm Hash digest
SHA256 d175941f2669d2943780d03b0b37df8413c77b37e71a6b1a49e337aabbbc90a5
MD5 5db5be5659515555aed6a8311f4eef51
BLAKE2b-256 d19ce8439dfab12a56850ee814fee7740634e2c602c387fcf5e63e42590ef4e1

See more details on using hashes here.

File details

Details for the file plm_cs-1.4-py3-none-any.whl.

File metadata

  • Download URL: plm_cs-1.4-py3-none-any.whl
  • Upload date:
  • Size: 7.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.10.14

File hashes

Hashes for plm_cs-1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 d5c6dd0e021dfe8b52ff668c9d27e044b274c7c53892e75a9b5c5fc4d904875a
MD5 c99df5e6d8a36b4971e57a68c2ef8747
BLAKE2b-256 8cf58f986b15793dcbbd5084c6acba3dfb9c12f040f753608b6d5f3328911b63

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page