Python tool for automated, multiserver query for disordered regions in protein sequences.
Project description
D-score
dscore
is a meta-server tool for the prediction of disordered regions in protein sequences. It works by querying several webservers and gathering the results in an easy-to-use format. All the work is automated using simple web requests where possible, falling back to webscraping with selenium
for servers without a public API.
The servers currently used are the following (follow the links for more information and references to papers): disembl, disopred, disprot, espritz, globplot, iupred, jpred, pondr, prdos, foldindex, metapredict (used for comparison, but not included in the final dscore), seg (for complexity prediction)
Multiple algorithms are used from all servers, but not all the available ones. Check the output for details.
Features
dscore
then aggregates the data in a single table in a text file for further processing. It will also automatically generate a few plots for quickly visualising the data.
Server summary plot: disorder predictions from each algorithm available and consensus D-score.
D-score plot: line plot of the raw averaged value used for D-score prediction.
Consensus plot: an indication of how close to the consensus each server is, useful to find outliers.
Text outputs: can easily be parsed for more advanced analysis.
# 0. resn
# 1. residue
# 2. disprot_VSL2
# 3. disprot_VSL3
# 4. disprot_VLXT
# 5. disprot_PONDR-FIT
# 6. dscore_raw
# 7. dscore
0 1 2 3 4 5 6 7
--- --- --- --- --- --- ---- ---
1 M D D D D 0 D
2 T D D D D 0 D
3 K D D D D 0 D
4 K D D D D 0 D
[...]
Installation
dscore
is available on PyPi and easily installable with pip
:
pip install dscore
Firefox webdriver
To run, dscore
also requires the Firefox
webdriver to be installed. Download the latest version of the driver from the official github release: scroll down and choose the appropriate one (for example, the ...linux64.tar.gz
for a 64bit linux installation). Make sure to unzip/untar the driver and make it accessible on the PATH
enviroment variable. Here's a detailed guide on how to do this.
Usage
Note that this program works by opening a ton of firefox windows and issuing automated commands. Let it do its thing and don't get scared! :)
DISCLAIMER: dscore
relies on other webservers for its results. Some of these servers have limitations on the amount of requests per user, so you may get temporarily blocked if you submit too many sequences. Spread them out!
dscore
can be used either as a python library or from the command line. The latter has a simple interface:
dscore <sequence_or_fasta_file>
Use the --help
option for more information.
You can also generate a new dscore and new plots from a subset of columns of an existing .dscore
file:
dscore_plot <dscore_file> <column> <other_column>
References
This tool implements a simple automated version of the D-score calculation and analysis performed in the paper Modular organization of rabies virus phosphoprotein by F. Gerard et Al.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dscore-0.4.5.tar.gz
.
File metadata
- Download URL: dscore-0.4.5.tar.gz
- Upload date:
- Size: 30.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e6df2640aac73e657885e41e1200adf6f9ff75c340255170102302ed1c5d798e |
|
MD5 | ebf6276a7c31c81d3575adaa9cce3f9e |
|
BLAKE2b-256 | 0749b85e9256db1fb54f4a97b90da451eddf3f5fe4a4aa26332da82202369fbf |
File details
Details for the file dscore-0.4.5-py3-none-any.whl
.
File metadata
- Download URL: dscore-0.4.5-py3-none-any.whl
- Upload date:
- Size: 36.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 53e7f5338e48c6bcb0f78dcfb1f875a64c6312ce67b2851c19d6108efe41f130 |
|
MD5 | 17b25135083aad57f501eedcf8921eb5 |
|
BLAKE2b-256 | c2b56998d566be3fa2709533c89a90bd55c4619f8aaf157471895bce59517822 |