Fast and accurate prediction of coiled coil domains in protein sequences
Project description
DeepCoil
Fast and accurate prediction of coiled coil domains in protein sequences
New in version 2.0
- Retrained with the updated dataset based on SamCC-Turbo labels.
- Faster inference time by applying SeqVec embeddings instead of psiblast profiles.
- Heptad register prediction (a and d core positions).
- No maximum sequence length limit.
- Convenient interface for using DeepCoil within python scripts.
- Automated peak detection for improved output readability.
- Simplified installation with pip.
Older DeepCoil versions are available here.
Requirements and installation
DeepCoil requires python>=3.6.1
and pip>=19.0
. Other requirements are specified in the requirements.txt
file.
The most convenient way to install DeepCoil is to use pip:
$ pip3 install deepcoil
Usage
Running DeepCoil standalone version:
deepcoil [-h] -i FILE [-out_path DIR] [-n_cpu NCPU] [--gpu] [--plot]
[--dpi DPI]
Argument | Description |
---|---|
-i |
Input file in FASTA format. Can contain multiple entries. |
-out_path |
Directory where the predictions are saved. For each entry in the input file one file will be saved. Defaults to the current directory if not specified. |
-n_cpu |
Number of CPUs to use in the prediction. By the default all cores will be used. |
--gpu |
Flag for turning on the GPU usage. Allows faster inference on large datasets. Overrides -n_cpu option. |
--plot |
Turns on the additional visual output of the predictions for each entry in the input. Plot files are saved in the -out_path directory. |
--dpi |
DPI of the saved plots, active only with --plot option. |
In a rare case of deepcoil
being not available in your PATH
after installation please look in the $HOME/.local/bin/
or other system specific pip
directory.
Description of columns in output file:
aa
- amino acid in the input protein sequencecc
- sharpened coiled coil propensityraw_cc
- raw coiled coil propensityprob_a
- probability of a core positionprob_d
- probability of d core position
Running DeepCoil within script:
from deepcoil import DeepCoil
from deepcoil.utils import plot_preds
from Bio import SeqIO
dc = DeepCoil(use_gpu=True)
inp = {str(entry.id): str(entry.seq) for entry in SeqIO.parse('example/example.fas', 'fasta')}
results = dc.predict(inp)
plot_preds(results['3WPA_1'], out_file='example/example.png')
results[entry]
for an entry of sequence length N
contains two keys:
['cc']
- per residue coiled coil propensity ([N, 1]
shape)['hept']
- per residue core positions ([N, 3]
shape, order in the second axis is: no/other position, a position, d position)
Peak detection can be performed with the deepcoil.utils.sharpen_preds
helper function.
Example graphical output:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file deepcoil-2.0.1.tar.gz
.
File metadata
- Download URL: deepcoil-2.0.1.tar.gz
- Upload date:
- Size: 100.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.24.0 setuptools/47.3.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.6.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d7371e00f7168e880c685791c63b14afdfbf07c36023f5b316190e19215701ce |
|
MD5 | ff790bdaa92bf061f0655610d98dfe44 |
|
BLAKE2b-256 | 98ff410b212fab5116b1003b031ddfef23af88abbb31c804ea6999fd483e4527 |
File details
Details for the file deepcoil-2.0.1-py3-none-any.whl
.
File metadata
- Download URL: deepcoil-2.0.1-py3-none-any.whl
- Upload date:
- Size: 100.4 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.24.0 setuptools/47.3.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.6.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2f373be5e072b1800305e486ab146232d57bc5accd7c16c403059f979bbef50a |
|
MD5 | a28ac4a6ae34975977cf88d61065fe27 |
|
BLAKE2b-256 | 6300788a82f5efa487ca34f765c573556bd3cde280186e7d00371a51370cd3d5 |