D-SCRIPT: protein-protein interaction prediction
Project description
D-SCRIPT
D-SCRIPT is a deep learning method for predicting a physical interaction between two proteins given just their sequences. It generalizes well to new species and is robust to limitations in training data size. Its design reflects the intuition that for two proteins to physically interact, a subset of amino acids from each protein should be in contact with the other. The intermediate stages of D-SCRIPT directly implement this intuition, with the penultimate stage in D-SCRIPT being a rough estimate of the inter-protein contact map of the protein dimer. This structurally-motivated design enhances the interpretability of the results and, since structure is more conserved evolutionarily than sequence, improves generalizability across species.
You can now make predictions with D-SCRIPT via the interface on HuggingFace!
Installation
pip install dscript
Usage
Protein sequences need to first be embedded using the Bepler+Berger protein language model; this requires a .fasta file as input. Everything before the first space will be used as the key.
dscript embed --seqs [sequences] --outfile [embedding file]
Candidate pairs should be in tab-separated (.tsv) format with no header, and columns for [protein key 1], [protein key 2]. Optionally, a third column with [label] can be provided, so predictions can be made using training or test data files (but the label will not affect the predictions only the first two columns will be read).
While pre-trained model files can be downloaded directly, we recommend instead passing the name of a pre-trained model that will be automatically downloaded from HuggingFace. Available models include:
- samsl/dscript_human_v1
- samsl/topsy_turvy_human_v1 (recommended)
- samsl/tt3d_human_v1
dscript predict --pairs [input data] --embeddings [embedding file] --model [model file] --outfile [predictions file]
References
- The original D-SCRIPT model is described in the paper “Sequence-based prediction of protein-protein interactions: a structure-aware interpretable deep learning model”.
- We have updated D-SCRIPT to incorporate network information (Topsy Turvy) and structure information (TT3D)
- Documentation
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dscript-0.3.1.tar.gz.
File metadata
- Download URL: dscript-0.3.1.tar.gz
- Upload date:
- Size: 57.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c8617b2fe742b69916b000559917f777546c4e0e9b0ab17023382e6c65ddc050
|
|
| MD5 |
9747cb99ff6393e94a04f7d695e789d8
|
|
| BLAKE2b-256 |
f624afa8df98e20639d2e1e087882b2610fd03cd15c6869f4798bb662eab8494
|
Provenance
The following attestation bundles were made for dscript-0.3.1.tar.gz:
Publisher:
python-publish.yml on samsledje/D-SCRIPT
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
dscript-0.3.1.tar.gz -
Subject digest:
c8617b2fe742b69916b000559917f777546c4e0e9b0ab17023382e6c65ddc050 - Sigstore transparency entry: 384878297
- Sigstore integration time:
-
Permalink:
samsledje/D-SCRIPT@0b3f7363b7d62fb99f5c8bfc6780833f088b8d84 -
Branch / Tag:
refs/tags/v0.3.1 - Owner: https://github.com/samsledje
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@0b3f7363b7d62fb99f5c8bfc6780833f088b8d84 -
Trigger Event:
release
-
Statement type:
File details
Details for the file dscript-0.3.1-py3-none-any.whl.
File metadata
- Download URL: dscript-0.3.1-py3-none-any.whl
- Upload date:
- Size: 71.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c1182f68da2af165dffeae3da062745d8bfd88a19a10cb011f9fdcdd1a2b320b
|
|
| MD5 |
f37b27796b6dacb9d92c17d803413a18
|
|
| BLAKE2b-256 |
00f85238d770333a49f1f7b953517e8e97d51dd8735d6d50ccfa8e8da0047ffa
|
Provenance
The following attestation bundles were made for dscript-0.3.1-py3-none-any.whl:
Publisher:
python-publish.yml on samsledje/D-SCRIPT
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
dscript-0.3.1-py3-none-any.whl -
Subject digest:
c1182f68da2af165dffeae3da062745d8bfd88a19a10cb011f9fdcdd1a2b320b - Sigstore transparency entry: 384878300
- Sigstore integration time:
-
Permalink:
samsledje/D-SCRIPT@0b3f7363b7d62fb99f5c8bfc6780833f088b8d84 -
Branch / Tag:
refs/tags/v0.3.1 - Owner: https://github.com/samsledje
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@0b3f7363b7d62fb99f5c8bfc6780833f088b8d84 -
Trigger Event:
release
-
Statement type: