A tool to predict probability of lncRNA localizing to Cytoplasm
Project description
CytoLNCpred
A computational method to predict the probability of lncRNA localizing to cytoplasm
Introduction
CytoLNCpred is a tool developed by Raghva-Lab in 2024. It is designed to predict the probability of lncRNA localizing to the cytoplasm. It utilizes a large language model - DNABERT-2 to make predictions. CytoLNCpred is also available as web-server at https://webs.iiitd.edu.in/raghava/cytolncpred. Please read/cite the content about the CytoLNCpred for complete information including algorithm behind the approach.
PIP Installation
PIP version is also available for easy installation and usage of this tool. The following command is required to install the package
pip install cytolncpred
To know about the available option for the pip package, type the following command:
cytolncpred -h
Standalone
The Standalone version of CytoLNCpred is written in python3 and following libraries are necessary for the successful run:
-transformers==4.29 -argparse -biopython -torch -numpy -pandas
Minimum USAGE
To know about the available option for the stanadlone, type the following command:
python cytolncpred.py -h
To run the example, type the following command:
python cytolncpred.py -i example_input.fa
This will predict the probability whether a submitted sequence will localize to the cytoplasm or nucleus. It will use other parameters by default. It will save the output in "outfile.csv" in CSV (comma separated variables). During the first run, the python tool will download the model (approx size = 1.3GB) in your local system and this process takes time. Please be patient.
Full Usage
usage: cytolncpred.py [-h] -i INPUT [-o OUTPUT] [-m MODEL] [-t THRESHOLD] [-w WORKDIR]
[-d {1,2,3}]
Provide the following inputs for a successful run
optional arguments:
-h, --help show this help message and exit
-i INPUT, --input INPUT
Input: nucleotide sequence in FASTA format
-o OUTPUT, --output OUTPUT
Output: File for saving results; by default outfile.csv
-m MODEL, --model MODEL
Model path: Folder containing the fine-tuned DNABERT-2 model; by
default - dnabert2_10k
-t THRESHOLD, --threshold THRESHOLD
Threshold: Value between 0 to 1; by default 0.5
-w WORKDIR, --workdir WORKDIR
Working directory: Directory where all intermediate and final files
will be created; by default .
-d {1,2,3}, --display {1,2,3}
Display: 1:Cytoplasm-localized, 2: Nucleus-localized, 3: All; by
default 3
Input File: It allow users to provide input in the FASTA format.
Output File: Program will save the results in the CSV format, in case user does not provide output file name, it will be stored in "outfile.csv".
Threshold: User should provide threshold between 0 and 1, by default its 0.16.
Window length: User can choose any pattern length between 9 and 20 in long sequences. This option is available for only scanning module.
Display type: This option allow users to fetch either only HLA-DRB1-04:01 binding peptides by choosing option 1 or prediction against all peptides by choosing option 2.
CytoLNCpred Package Files
It contains the following files, brief description of these files given below
INSTALLATION : Installations instructions
LICENSE : License information
README.md : This file provide information about this package
cytolncpred.py : Main python program
example.fasta : Example file contain peptide sequences in FASTA format
sample_output.csv : Example output file for the program
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file cytolncpred-1.0.tar.gz
.
File metadata
- Download URL: cytolncpred-1.0.tar.gz
- Upload date:
- Size: 16.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.18
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b0ff3cf4545e810f0690c89440218493b9961d0dcac8a9ed9422d7142c221322 |
|
MD5 | 0cb80ce7478adb7e4f4409a1e8531e3d |
|
BLAKE2b-256 | d25f0c336bc9681eaa810359e49c373dd2f01b9476e37e1288790ea3b95899f7 |
File details
Details for the file cytolncpred-1.0-py3-none-any.whl
.
File metadata
- Download URL: cytolncpred-1.0-py3-none-any.whl
- Upload date:
- Size: 17.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.18
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b4cace440b0b8451019a324f9e238954580587abb6ea7e2706820b8fb101dab8 |
|
MD5 | 20d834ec70ab9ade7ca6f4e01d3cfffd |
|
BLAKE2b-256 | ac50589331174852beed2901da80d93c8a92976e16eb19790a26734b3eac555b |