Skip to main content

PTM-POSE: PTM Projection onto Splice Events

Project description

PTM-POSE (PTM Projection Onto Splice Events)

PTM-POSE is an easily implementable tool to project PTM sites onto splice event data generated from RNA sequencing data and is compatible with any splice event quantification tool that outputs genomic coordinates of different splice events (MATS, SpliceSeq, etc.). PTM-POSE harnesses PTMs that have been mapped to their genomic location by a sister package, ExonPTMapper. It also contains functions for annotating these PTMs with information from various databases, like PhosphoSitePlus and ELM.

Running PTM-POSE

To run PTM-POSE, you first need to process your data such that each row corresponds to a unique splice event with the genomic location of that splice event (chromosome, strand, and the bounds of the spliced region). Strand can be indicated using either '+'/'-' or 1/-1. If desired, you can also provide a delta PSI and significance value which will be included in the final PTM dataframe. Any additional columns will be kept. At a minimum, the dataframe should look something like this (optional but recommended parameters indicated):

event_id (optional) chromosome strand region_start region_end dPSI (optional) significance (optional)
first_event 1 - 9797555 9797612 0.362 0.032

Once the data is in the correct format, simply run the project_ptms_onto_splice_events() function. By default, PTM-POSE assumes the provided coordinates are in hg38 coordinates, but you can use older coordinate systems with the coordinate_type parameter.

from ptm-pose import project

my_splice_data_annotated, spliced_ptms = project.project_ptms_onto_splice_events(my_splice_data, ptm_coordinates,
                                                                                  chromosome_col = 'chromosome',
                                                                                  strand_col = 'strand',
                                                                                  region_start_col = 'region_start',
                                                                                  region_end_col =  'region_end',
                                                                                  event_id_col = 'event_id',
                                                                                  dPSI_col='dPSI',
                                                                                  coordinate_type = 'hg19')

This will produce two dataframes:

  1. Original splice data with additional columns indicating the number and which PTMs were found associated with that splice event. 'PTMs column denotes the UniProtKB accession, residue, site number, and modification type for PTM identified.
event_id (if provided) chromosome strand region_start region_end dPSI (if provided) significance (if provided) PTMs Number of PTMs Affected
first_event 1 - 9797555 9797612 0.362 0.032 O94985_N515 (N-Glycosylation) 1
  1. New dataframe where each row is a unique event-PTM pair. This is useful for downstream analyses of the important PTM changes that are occuring in your dataset, and many functions provided for further annotation and analyses of these PTMs (see rest of documentation for examples)
event_id (if provided) UniProtKB Accession Residue Modifications PTM Info dPSI (if provided) significance (if provided)
first_event O94985 N515 N-Glycosylation O94985_N515 (N-Glycosylation) 0.362 0.032

Alternatively, we have also provided tool-specific functions for MATS and SpliceSeq outputs, which are tailored to the specific output of these tools. These functions are project_ptms_onto_MATS() and project_ptms_onto_SpliceSeq(), respectively.

Annotating and Analyzing PTMs

In addition to projection of PTMs, PTM-POSE offers a variety of functions for annotating PTMs with functional information from various databases including PhosphoSitePlus, RegPhos, PTMcode, PTMsigDB, and ELM. These functions are useful for understanding the functional implications of PTMs in your dataset. We also provide functions for identifying PTMs and functions that are enriched in your dataset, including enrichment of kinase-substrate relationships.

For more information on this analysis, see the full documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ptm_pose-0.2.0.tar.gz (19.7 MB view details)

Uploaded Source

Built Distribution

ptm_pose-0.2.0-py3-none-any.whl (20.0 MB view details)

Uploaded Python 3

File details

Details for the file ptm_pose-0.2.0.tar.gz.

File metadata

  • Download URL: ptm_pose-0.2.0.tar.gz
  • Upload date:
  • Size: 19.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.10.13

File hashes

Hashes for ptm_pose-0.2.0.tar.gz
Algorithm Hash digest
SHA256 7912aaf0b2bf1b6253f5979732ec1486d1ff0ebf503f4bf70024e1fbe4c982c0
MD5 8e4f3ecf2411337fe99b3d0bb6652dbe
BLAKE2b-256 653cbd9f145e32cfb40015c7fe794054f525ed0e5ecd0f23a1501bf8c84327c9

See more details on using hashes here.

File details

Details for the file ptm_pose-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: ptm_pose-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 20.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.10.13

File hashes

Hashes for ptm_pose-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 985eae1423dc61504b79482df01e8bbb84dc1657ff97dc238867f1fb65cb3168
MD5 b8fab97b7b2461dd18a716143ebdd8ff
BLAKE2b-256 b1935439cf33e5b8efa9799fcf0e02ece9d7173510659625440f843bde002fe7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page