No project description provided
Project description
DeepTMInter
tags: transmembrane protein predicting interaction sites v1.0
:information_source: News: The DeepTMInter Docker software has become available at deeptminter/releases. Please see below.
Overview
This repository is a software package of DeepTMInter. DeepTMInter is a deep-learning-based approach and it was developed using stacked generalization ensembles of ultradeep residual neural networks. The approach shows a substantial improvement for predicting interaction sites in transmembrane proteins compared to existing methods. All training and benchmarked data are available here and other data are made available upon requests of users via email.
System Requirement
We tested our software on a Linux operation system due to a number of Linux-dependent software packages generating input features. If you have the feature files as shown in ./input/, you are able to run our program on multiple platforms, e.g. Windows and Mac. Please be sure of python (version>3.5) installed before using. We highly recommend Anaconda, an integrated development environment of python, which eases the use and management of python packages.
Installation
-
install dependencies.
- HHblits - generating multiple sequence alignments
- Gaussian DCA - predictor of residue contacts
- Freecontact - predictor of residue contacts
- Phobius - predictor of transmembrane topologies
-
install protein sequence database.
- Uniclust30 database - a curated protein sequence database based on UniProt for HHblits
-
install DeepTMInter
-
To download the prediction models here. Please put the models in folder deeptminter/model/.
-
To download a stable version of DeepTMInter here.
-
To obtain the latest version of DeepTMInter do
git clone https://github.com/2003100127/deeptminter.git
-
-
install DeepTMInter of a Docker version (optional)
-
To install Docker here.
-
To download the five partitioned Docker packages here.
-
To use 7z to decompress the 5 partitioned Docker packages. This step will result in a file named
deeptminter_10.docker. -
to import
deeptminter_10.dockerbydocker load < deeptminter_10.docker -
to use
deeptminter_10.dockerbydocker exec -it deeptminter_10.docker bash
-
-
install python dependencies
pip install -r requirements.txt
Usage
-
src/troll.sh-
description troll.sh is used to generate multiple sequence alignments, transmembrane topologies, and all of evolutionary coupling features including EVfold (generated using FreeContact) and Gaussian DCA.
-
shell commands
- general (please specify the installed location of the executables or the database in
Installations 1 and 2and put your fasta sequence in the input path before running the following command.)./troll.sh -n NAME -c CHAIN -i /YOUR/INPUT/PATH/ - example
./troll.sh -n 3jcu -c H -i ./input/
- general (please specify the installed location of the executables or the database in
-
parameters
- required
-n --name -> a sequence name. -c --chain -> a chain name -i --input -> input path
- required
-
-
src/gdca.julia-
description
gdca.juliais used to generate Gaussian DCA file. You'd better run it cf. https://github.com/carlobaldassi/GaussDCA.jl.
-
-
run_deeptminter.py-
description
If you have the feature files shown in the
./input/directory, you can skip over steps 1-2 to step 3. We tested this step in a rigorous way. Be sure of every feature file already in the./input/or your preferred input file path. Finally, it works easily like this. -
python commands
- general
python run_deeptminter.py -n NAME -c CHAIN -i /YOUR/INPUT/PATH/ -o /YOUR/OUTPUT/PATH/ -r REGION
- example
python run_deeptminter.py -n 3jcu -c H -i ./input/ -o ./output/ -r transmembrane
- general
-
parameters
- required
-n --name -> a sequence name. For example, '3jcu'. -c --chain -> a chain name. For example, 'H'. This can be empty if you prefer a sequnce name like '3jcuH' or '0868'. -i --input -> input path -o --output --> prediction results -r, --region --> region of transmembrane protein. It can take 'transmembrane', 'cytoplasmic', 'extracellular', 'combined', 'all where 'combined' means accumulation of 'transmembrane', 'cytoplasmic', 'extracellular'. 'all' means the whole fasta sequence.
- required
-
-
description of output file
It finally returns an output file with the suffix of
.deeptminter.- The predictions of interaction sites in tansmembrane proteins are shown in the output file, with three columns: 1). positions of animo acids in the input sequence; 2) animo acids; 3) probabilities of being interaction sites.
- Please note that if you have a sequence sharing a high sequence identity to the proteins in the TrainData dataset, we recommend that any of the three output files with the suffix '.mexpand1;.mexpand2;.mexpand3' would be the best option for you.
- If you want to get the results in the context of no ideally preferred regions predicted by Phobius. You can set
-rascombinedto run the program. This will return the predictions of the whole fasta sequence. Then, you can tailor the whole predictions to whatever you want.
How to cite
J. Sun. D. Frishman. Improved sequence-based prediction of interaction sites in α-helical transmembrane proteins by deep learning. Comput. Struct. Biotechnol. J., 19:1512-1530, 2021. DOI: 10.1016/j.csbj.2021.03.005.
or
@article{DeepTMInter2021,
title = {Improved sequence-based prediction of interaction sites in α-helical transmembrane proteins by deep learning},
author = {Jianfeng Sun and Dmitrij Frishman},
journal = {Computational and Structural Biotechnology Journal},
volume = {19},
pages = {1512-1530},
year = {2021},
issn = {2001-0370},
doi = {https://doi.org/10.1016/j.csbj.2021.03.005},
url = {https://www.sciencedirect.com/science/article/pii/S2001037021000775},
}
Contact
If you have any question, please contact Jianfeng Sun. We highly recommend creating issue pages when you have problems. Your issues will subsequently be responded.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file deeptminter-0.0.1.tar.gz.
File metadata
- Download URL: deeptminter-0.0.1.tar.gz
- Upload date:
- Size: 33.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.2 CPython/3.12.9 Linux/6.8.0-1021-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2542107c1d262e6d9813ab6bc98da8a586ca0ace3612f8af90e5546297cb1ba7
|
|
| MD5 |
14babba797cbeeef67f9172ff979f995
|
|
| BLAKE2b-256 |
a33efefefc2f3241fa0591335da55312725a00b4cd334417794e6524e5c3eece
|
File details
Details for the file deeptminter-0.0.1-py3-none-any.whl.
File metadata
- Download URL: deeptminter-0.0.1-py3-none-any.whl
- Upload date:
- Size: 41.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.2 CPython/3.12.9 Linux/6.8.0-1021-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d41909bde9e722ed78f31473c4d8848f16831bf2f8fff6ec0e493824c1df5867
|
|
| MD5 |
a5e1ba32ed929a80a93ea455c77a3c64
|
|
| BLAKE2b-256 |
cd425592df80c537a8a650c22cfbae951f41c8f01e3631e801d6bef79e5ab2ab
|