Skip to main content

Package for predicting 5EU in nanopore reads and predicting RNA halflives

Project description

RNAkinet

RNAkinet is a project dedicated to detecting 5eu-modified reads directly from the raw nanopore sequencing signal. Furthermore, it offers tools to calculate transcript halflives.

Usage

Installation

pip install rnakinet

Predict 5EU in fast5/pod5 files

rnakinet-inference --path <path_to_folder_containing_fast5s> --output <predictions_name.csv>

This creates a csv file with columns read_id - the read id, 5eu_mod_score - the raw prediction score from 0 to 1, 5eu_modified_prediction - Boolean column, True if the read is predicted to be modified by 5EU, False otherwise

Nvidia GPU is recommended to run this command. If you want to run inference on a CPU-only machine, use the --use-cpu option. This will substantially increase runtime.

If you want to use pod5 files instead, add the --format pod5 flag.

Example

rnakinet-inference --path data/experiment/fast5_folder --output preds.csv

Selecting flow-cell chemistry

RNAkinet has been extensively tested on flow-cells with the R9 chemistry. Experimental support is offered for R10. You can specify the flow-cell chemistry with the --kit option.

rnakinet-inference --path data/experiment/fast5_folder --kit r10 --output preds.csv

Calculate transcript halflives

rnakinet-predict-halflives --transcriptome-bam <path_to_transcriptome_alignment.bam> --predictions <predictions_name.csv> --tl <experiment_tl> --output <halflives_name.csv>

The --tl parameter is the duration for which the cells were exposed to 5EU in hours

The --predictions parameter is the output file of the 5EU prediction step described above

This creates a csv file with columns transcript - the transcript identifier from your BAM file, reads - the amount of reads available for the given transcript, percentage_modified - the percentage of reads of the given transcript that were predicted to contain 5EU, pred_t5 - the predicted halflife of the given transcript

Example

rnakinet-predict-halflives --transcriptome-bam alignments/experiment/transcriptome_alignment.bam --predictions preds.csv --tl 2.0 --output halflives.csv

Note that the calculated halflives pred_t5 are the most reliable for transcripts with high read count. The following plots show correlation of halflives computed from RNAkinet predictions with experimentaly measured halflives [1] as we increase read count requirement. We recommend users to acknowledge this and put more confidence in halflife predictions for transcripts with high read count, and less confidence for transcripts with low read count.

[1] Eisen,T.J., Eichhorn,S.W., Subtelny,A.O., Lin,K.S., McGeary,S.E., Gupta,S. and Bartel,D.P. (2020) The Dynamics of Cytoplasmic mRNA Metabolism. Mol. Cell, 77, 786-799.e10.

Cite

Vlastimil Martinek, Jessica Martin, Cedric Belair, Matthew J Payea, Sulochan Malla, Panagiotis Alexiou, Manolis Maragkakis, Deep learning and direct sequencing of labeled RNA captures transcriptome dynamics, NAR Genomics and Bioinformatics, Volume 6, Issue 3, September 2024, lqae116, https://doi.org/10.1093/nargab/lqae116

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rnakinet-1.0.1.tar.gz (1.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rnakinet-1.0.1-py3-none-any.whl (1.3 MB view details)

Uploaded Python 3

File details

Details for the file rnakinet-1.0.1.tar.gz.

File metadata

  • Download URL: rnakinet-1.0.1.tar.gz
  • Upload date:
  • Size: 1.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.13

File hashes

Hashes for rnakinet-1.0.1.tar.gz
Algorithm Hash digest
SHA256 0790002f598217d8a4aec21596de95bb4d13aa239c8abf2ed47b4be47e7d5231
MD5 e352c3e8e93a78593f9fb546b52624e5
BLAKE2b-256 ddab983aeaee381fa912ab6d626bb429140d22e8a2dc941c25078f1670daed32

See more details on using hashes here.

File details

Details for the file rnakinet-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: rnakinet-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.13

File hashes

Hashes for rnakinet-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 79424527dfeff24211d9366be07776283d21e049a480216e3d685f209ffc5e16
MD5 d4615a374ba5a73e393898d5e72f11f6
BLAKE2b-256 b247caa0ae3e56459d72072eda8099a095cd3ce3024739a3b08040ca98fda77f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page