Skip to main content

Neural network model for predicting amino-acid sequence from a protein backbone structure

Project description

GCNdesign

A neural network model for prediction of amino-acid probability from a protein backbone structure.

Built with

  • pytorch
  • numpy
  • pandas
  • tqdm

Getting Started

Install

To install gcndesgn through pip

pip install gcndesign

Usage

Quick usage as a python module

from gcndesign.predictor import Predictor

gcndes = Predictor(device='cpu') # 'cuda' can also be applied
gcndes.predict(pdb='pdb-file-path') # returns list of amino-acid probabilities

Usage of scripts

gcndesign_predict.py

To predict amino-acid probabilities for each residue-site

gcndesign_predict.py  YOUR_BACKBONE_STR.pdb

gcndesign_autodesign.py

To design 20 sequences in a completely automatic fashion

gcndesign_autodesign.py  YOUR_BACKBONE_STR.pdb  -n 20

For more detailed usage, please run the following command

gcndesign_autodesign.py -h

Note

The gcndesign_autodesign script requires pyrosetta software. Installation & use of pyrosetta must be in accordance with their license.

External Packages

Documents

Note

A critical issue has fixed and the parameters were re-trained on a new dataset (CATH v4.3 S40 dataset). This change has stabilized the prediction, but has not been reflected in the document above. So there are inaccuracies in the description and figures.

Dataset

The dataset used for training GCNdesign is available here

  • dataset.tar.gz: Training/T500/TS50 dataset
  • dataset_cath40.tar.bz2: CATH-v4.3 S40 dataset (used for the latest parameter training)

Lisence

Distributed under MIT license.

Acknowledgments

The author was supported by Grant-in-Aid for JSPS Research Fellows (PD, 17J02339). Koga Laboratory of Institutes for Molecular Science (NINS, Japan) has provided a part of the computational resources. Koya Sakuma (yakomaxa) gave a critical idea for neural net architecture design in a lot of deep discussions. Naoya Kobayashi (naokob) created excellent applications to help broader needs, ColabGCNdesign and FolditStandalone_Sequence_Design.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

gcndesign-2.3.2-py3-none-any.whl (37.2 MB view details)

Uploaded Python 3

File details

Details for the file gcndesign-2.3.2-py3-none-any.whl.

File metadata

  • Download URL: gcndesign-2.3.2-py3-none-any.whl
  • Upload date:
  • Size: 37.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.10

File hashes

Hashes for gcndesign-2.3.2-py3-none-any.whl
Algorithm Hash digest
SHA256 4a8967fc52b7bf3c8b4b6c988fff0fa7a5a961c466606b8ee0c31be132ecd380
MD5 8402a1af906949884f581136778c610c
BLAKE2b-256 542301bd83e6db020712eb926b8b617405bd4d208fbda8f27026236b2fdc2ff2

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page