A tool to predict mRNA subcellular localization
Project description
MRSLpred
A computational tool for multilabel mRNA subcellular localization prediction
Introduction
MRSLpred is a tool for multilabel mRNA subcellular localization prediction using a XGBoost classifier. It uses only composition based features for predicting the subcellular locations. The final model also deploys a motif-based module which has been implemented using MERCI. MRSLpred is also available as web-server at https://webs.iiitd.edu.in/raghava/mrslpred. Please read/cite the content about MRSLpred for complete information including algorithm behind the approach.
Standalone
The Standalone version of mrslpred is written in python3 and following libraries are necessary for the successful run:
- scikit-learn
- xgboost=0.90
- Pandas
- Numpy
Minimum USAGE
To know about the available option for the standlone, type the following command:
mrslpred -h
To run the example, type the following command:
mrslpred -i example_input.fa
This will predict where the submitted sequences are going to localize. It will use other parameters by default. It will save the output in "final_prediction.csv" and "final_prob_prediction.csv" in CSV format (comma separated variables).
Full Usage
usage: mrslpred [-h] --file FILE [--th1 TH1] [--th2 TH2] [--th3 TH3]
[--th4 TH4] [--th5 TH5] [--th6 TH6] --output OUTPUT
Please provide following arguments for successful run
optional arguments:
-h, --help show this help message and exit
--file FILE, -f FILE Path to fasta file
--th1 TH1, -t1 TH1 Threshold for Ribosome
--th2 TH2, -t2 TH2 Threshold for Cytosol
--th3 TH3, -t3 TH3 Threshold for Endoplasmic Reticulum
--th4 TH4, -t4 TH4 Threshold for Membrane
--th5 TH5, -t5 TH5 Threshold for Nucleus
--th6 TH6, -t6 TH6 Threshold for Exosome
--output OUTPUT, -o OUTPUT Path to output
Input File: It allow users to provide input in FASTA format.
Output File: Program will save the results to this folder
Threshold 1: User should provide threshold for Ribosome between 0 and 1, by default it is 0.3079.
Threshold 2: User should provide threshold for Cytosol between 0 and 1, by default it is 0.1468.
Threshold 3: User should provide threshold for Endoplasmic Reticulum between 0 and 1, by default it is 0.1156.
Threshold 4: User should provide threshold for Membrane between 0 and 1, by default it is 0.1956.
Threshold 5: User should provide threshold for Nucleus between 0 and 1, by default it is 0.7028.
Threshold 6: User should provide threshold for Exosome between 0 and 1, by default it is 0.9961.
MRSLpred Package Files
It contantain following files, brief description of these files given below
INSTALLATION : Installations instructions
LICENSE : License information
README.md : This file provide information about this package
xgboost_final.pkl : This file contains the pickled version of model
mrslpred_motif.py : Main python program
example_input.fa : Example file contain nucleotide sequences in FASTA format
example_predict_prob_output.csv : Example output file containing probabilities for each location
example_predict_output.csv : Example output file containing labels for each location
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file mrslpred-1.1.tar.gz
.
File metadata
- Download URL: mrslpred-1.1.tar.gz
- Upload date:
- Size: 1.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.17
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 647cca3e7964664f32500ad47201106b9e407cde0dbea11ecd4bd524ae583922 |
|
MD5 | f844d62b073382ff38443e632715fd47 |
|
BLAKE2b-256 | 7a707a859bea3681a45d5498553b7917c90b9ece0bb7ad0793581bae60566543 |
File details
Details for the file mrslpred-1.1-py3-none-any.whl
.
File metadata
- Download URL: mrslpred-1.1-py3-none-any.whl
- Upload date:
- Size: 1.3 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.17
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | cd08fc9bdc41fa81fce7d4e32502e0b419377883f1c93c33cd5893e0541f679f |
|
MD5 | ce091c53a238d0268d4fc3a72181d892 |
|
BLAKE2b-256 | 047d5ea3b72ac6b99e7ff956ad4d38ef733c34de02aea4ba9e767b923d0399df |