Skip to main content

D-SCRIPT: protein-protein interaction prediction

Project description

D-SCRIPT

PyPI DOI License Pytest Ruff

D-SCRIPT Architecture

D-SCRIPT is a deep learning method for predicting a physical interaction between two proteins given just their sequences. It generalizes well to new species and is robust to limitations in training data size. Its design reflects the intuition that for two proteins to physically interact, a subset of amino acids from each protein should be in contact with the other. The intermediate stages of D-SCRIPT directly implement this intuition, with the penultimate stage in D-SCRIPT being a rough estimate of the inter-protein contact map of the protein dimer. This structurally-motivated design enhances the interpretability of the results and, since structure is more conserved evolutionarily than sequence, improves generalizability across species.

You can now make predictions with D-SCRIPT via the interface on HuggingFace!

Installation

pip install dscript

Usage

Protein sequences need to first be embedded using the Bepler+Berger protein language model; this requires a .fasta file as input. Everything before the first space will be used as the key.

dscript embed --seqs [sequences] --outfile [embedding file]

Candidate pairs should be in tab-separated (.tsv) format with no header, and columns for [protein key 1], [protein key 2]. Optionally, a third column with [label] can be provided, so predictions can be made using training or test data files (but the label will not affect the predictions only the first two columns will be read).

While pre-trained model files can be downloaded directly, we recommend instead passing the name of a pre-trained model that will be automatically downloaded from HuggingFace. Available models include:

  • samsl/dscript_human_v1
  • samsl/topsy_turvy_human_v1 (recommended)
  • samsl/tt3d_human_v1
dscript predict --pairs [input data] --embeddings [embedding file] --model [model file] --outfile [predictions file]

References

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dscript-0.3.1.tar.gz (57.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dscript-0.3.1-py3-none-any.whl (71.4 kB view details)

Uploaded Python 3

File details

Details for the file dscript-0.3.1.tar.gz.

File metadata

  • Download URL: dscript-0.3.1.tar.gz
  • Upload date:
  • Size: 57.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for dscript-0.3.1.tar.gz
Algorithm Hash digest
SHA256 c8617b2fe742b69916b000559917f777546c4e0e9b0ab17023382e6c65ddc050
MD5 9747cb99ff6393e94a04f7d695e789d8
BLAKE2b-256 f624afa8df98e20639d2e1e087882b2610fd03cd15c6869f4798bb662eab8494

See more details on using hashes here.

Provenance

The following attestation bundles were made for dscript-0.3.1.tar.gz:

Publisher: python-publish.yml on samsledje/D-SCRIPT

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file dscript-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: dscript-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 71.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for dscript-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c1182f68da2af165dffeae3da062745d8bfd88a19a10cb011f9fdcdd1a2b320b
MD5 f37b27796b6dacb9d92c17d803413a18
BLAKE2b-256 00f85238d770333a49f1f7b953517e8e97d51dd8735d6d50ccfa8e8da0047ffa

See more details on using hashes here.

Provenance

The following attestation bundles were made for dscript-0.3.1-py3-none-any.whl:

Publisher: python-publish.yml on samsledje/D-SCRIPT

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page