Skip to main content

A tool to predict probability of lncRNA localizing to Cytoplasm

Project description

CytoLNCpred

A computational method to predict the probability of lncRNA localizing to cytoplasm

Introduction

CytoLNCpred is a tool developed by Raghva-Lab in 2024. It is designed to predict the probability of lncRNA localizing to the cytoplasm. It utilizes a large language model - DNABERT-2 to make predictions. CytoLNCpred is also available as web-server at https://webs.iiitd.edu.in/raghava/cytolncpred. Please read/cite the content about the CytoLNCpred for complete information including algorithm behind the approach.

PIP Installation

PIP version is also available for easy installation and usage of this tool. The following command is required to install the package

pip install cytolncpred

To know about the available option for the pip package, type the following command:

cytolncpred -h

Standalone

The Standalone version of CytoLNCpred is written in python3 and following libraries are necessary for the successful run:

-transformers==4.29 -argparse -biopython -torch -numpy -pandas

Minimum USAGE

To know about the available option for the stanadlone, type the following command:

python cytolncpred.py -h

To run the example, type the following command:

python cytolncpred.py -i example_input.fa

This will predict the probability whether a submitted sequence will localize to the cytoplasm or nucleus. It will use other parameters by default. It will save the output in "outfile.csv" in CSV (comma separated variables). During the first run, the python tool will download the model (approx size = 1.3GB) in your local system and this process takes time. Please be patient.

Full Usage

usage: cytolncpred.py [-h] -i INPUT [-o OUTPUT] [-m MODEL] [-t THRESHOLD] [-w WORKDIR]
                      [-d {1,2,3}]

Provide the following inputs for a successful run

optional arguments:
  -h, --help            show this help message and exit
  -i INPUT, --input INPUT
                        Input: nucleotide sequence in FASTA format
  -o OUTPUT, --output OUTPUT
                        Output: File for saving results; by default outfile.csv
  -m MODEL, --model MODEL
                        Model path: Folder containing the fine-tuned DNABERT-2 model; by
                        default - dnabert2_10k
  -t THRESHOLD, --threshold THRESHOLD
                        Threshold: Value between 0 to 1; by default 0.5
  -w WORKDIR, --workdir WORKDIR
                        Working directory: Directory where all intermediate and final files
                        will be created; by default .
  -d {1,2,3}, --display {1,2,3}
                        Display: 1:Cytoplasm-localized, 2: Nucleus-localized, 3: All; by
                        default 3

Input File: It allow users to provide input in the FASTA format.

Output File: Program will save the results in the CSV format, in case user does not provide output file name, it will be stored in "outfile.csv".

Threshold: User should provide threshold between 0 and 1, by default its 0.16.

Window length: User can choose any pattern length between 9 and 20 in long sequences. This option is available for only scanning module.

Display type: This option allow users to fetch either only HLA-DRB1-04:01 binding peptides by choosing option 1 or prediction against all peptides by choosing option 2.

CytoLNCpred Package Files

It contains the following files, brief description of these files given below

INSTALLATION : Installations instructions

LICENSE : License information

README.md : This file provide information about this package

cytolncpred.py : Main python program

example.fasta : Example file contain peptide sequences in FASTA format

sample_output.csv : Example output file for the program

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cytolncpred-1.0.tar.gz (16.8 kB view details)

Uploaded Source

Built Distribution

cytolncpred-1.0-py3-none-any.whl (17.3 kB view details)

Uploaded Python 3

File details

Details for the file cytolncpred-1.0.tar.gz.

File metadata

  • Download URL: cytolncpred-1.0.tar.gz
  • Upload date:
  • Size: 16.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for cytolncpred-1.0.tar.gz
Algorithm Hash digest
SHA256 b0ff3cf4545e810f0690c89440218493b9961d0dcac8a9ed9422d7142c221322
MD5 0cb80ce7478adb7e4f4409a1e8531e3d
BLAKE2b-256 d25f0c336bc9681eaa810359e49c373dd2f01b9476e37e1288790ea3b95899f7

See more details on using hashes here.

File details

Details for the file cytolncpred-1.0-py3-none-any.whl.

File metadata

  • Download URL: cytolncpred-1.0-py3-none-any.whl
  • Upload date:
  • Size: 17.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for cytolncpred-1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b4cace440b0b8451019a324f9e238954580587abb6ea7e2706820b8fb101dab8
MD5 20d834ec70ab9ade7ca6f4e01d3cfffd
BLAKE2b-256 ac50589331174852beed2901da80d93c8a92976e16eb19790a26734b3eac555b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page