NEASE: A network-based approach for the enrichment of alternative splicing events
Project description
NEASE
NEASE (Network Enrichment method for Alternative Splicing Events) a tool for functional enrichment of alternative splicing exons/events.
General info
The python package NEASE (Network-based Enrichment method for Alternative Splicing Events) first detects protein features affected by AS such as domains, motifs and residues. Next, NEASE uses a protein-protein interactions integrated with domain-domain interactions, residue-level and domain-motif interactions to identify interaction partners likely affected by AS
Next, NEASE performs gene set overrepresentation analysis and identifies enriched pathways based on affected edges. Furthermore, since the statistical approach is network-based, it also prioritizes (differentially) spliced genes and finds new disease biomarkers candidates in case of aberrant splicing.
Installation
To install the package from PyPI please run:
pip install nease
To install the package from git:
git clone https://github.com/louadi/NEASE.git && cd NEASE
pip install .
Enjoy your instance of NEASE
Data input
The standard input of the package (also the recommended) is a DataFrame object with Ensembl IDs of the genes and the exon coordinates from human genome build hg38 (GRCh38).
- First column - genes IDs (Only Ensembl gene IDs can be used).
- Second column - start of the exon coordinate.
- Third column - end of the exon coordinate.
- Fourth column - dPSI (optionally)
Gene | Start | End | dpsi |
---|---|---|---|
ENSG00000154263 | 69314431 | 69315425 | -0.10 |
ENSG00000154265 | 87411893 | 87412033 | 0.13 |
In the case of differential splicing analysis, please make sure to filter events by significance beforehand, for example using a p-value cutoff.
The package also supports the output of multiple AS differential detection tools such as rMATs, Whippet and MAJIQ. For the newer version of MAJIQ, pleasee check the issue.
If you need help with your data or need to add support for another tools, open an issue or contact us.
Main functions and examples
Please note, that all functions are annotated with dockstrings for more details.
Import NEASE package and pandas:
import nease
import pandas as pd
Run NEASE
table: Data input as DataFrame object as explained in "Data input".
organism: Either 'Human' or 'Mouse'.
input_type: Either 'Standard', 'Whippet', 'rmats'or "MAJIQ".
remove_non_in_frame: Remove exons that are predicted to disturb the ORF (Prediction source).
only_divisible_by_3: Remove exons not divisible by 3.
only_DDIs: Only use DDI annotations (No PDB and ELM).
p_value_cutoff: The enrichment p-value cutoff.
events=nease.run(table, organism='Human',input_type='Standard',
remove_non_in_frame=True, only_divisible_by_3=False)
General functions
Get statistics of your data.
events.get_stats()
Get a list of all affected domains.
events.get_domains()
Get a list of all affected linear motifs.
events.get_elm()
Get a list of all affected residues and their interactions from the PDB.
events.get_pdb()
List of affected interactions from domains and linear motif binding.
events.get_edges()
NEASE enrichment
The main function of NEASE
database: a list of pathway databases to run enrichment on it.
# Supported databases:
database= ['PharmGKB', 'HumanCyc', 'Wikipathways', 'Reactome','KEGG', 'SMPDB',
'Signalink','NetPath', 'EHMN', 'INOH','BioCarta','PID']
# Run enrichment on Reactome only
events.enrich(database=['Reactome'])
Pathway specific analysis
Get list of genes affecting pathways and their statistics
path_id: a string representing the Pathway ID. You can find pathways id in the enrichment table results.
events.path_analysis('R-HSA-112314')
Visualize a pathway in the PPI:
Generate an HTML file for visualizing the network example.
path_id: a string representing the Pathway ID.
file: a string representing a local file path for the HTML file.
save_pdf: If True, also save the figure as PDF.
k: a Float for the algorithm Fruchterman-Reingold force-directed for nodes positions, to be tuned by the user. You might need to run the following function multiple times for an optimal visualizations. more details.
events.Vis_path("R-HSA-5674135",file='AS data/enrichment/',save_pdf=False,k=0.8)
Classic gene set enrichment (Gene level)
gseapy_databases: gseapy pathways databases
non_symmetrical_only: Run classical gene set enrichment for only non-symmetrical alternative exons, that are likely to cause NMD
# Run on KEGG gene set
events.classic_enrich(gseapy_databases=['KEGG_2019_Human'],non_symmetrical_only=True)
Tutorials
A step-by-step guide to use NEASE is available here.
A simple example for running NEASE on a standard input: (Notebook/Google Colab)
Cite
If you use NEASE, please cite:
Louadi, Z., Elkjaer, M.L., Klug, M. et al. Functional enrichment of alternative splicing events with NEASE reveals insights into tissue identity and diseases. Genome Biol 22, 327 (2021). https://doi.org/10.1186/s13059-021-02538-1
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file nease-1.3.1.tar.gz
.
File metadata
- Download URL: nease-1.3.1.tar.gz
- Upload date:
- Size: 23.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 712934409f5e18b3a256d3721c20cc863659bc37754e9fc7e0dbfe5247903cb6 |
|
MD5 | a993b7ee19b20403e458061997fb48fc |
|
BLAKE2b-256 | 48a20a01122bd6acf398e607e59fa8342acdfdbdc8514b55adcbc8238627166a |
File details
Details for the file nease-1.3.1-py3-none-any.whl
.
File metadata
- Download URL: nease-1.3.1-py3-none-any.whl
- Upload date:
- Size: 23.3 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5f0d68df654e9ea8e981105024b3ef07109e8d993b857dc34f35462b99d3b1d5 |
|
MD5 | 1fb83de62dfe0ff09a30324bcdf5d450 |
|
BLAKE2b-256 | 3c996c2b9d266cac4b5f44a8d092ac3c9881f41863e1bcc223435fc6c1efb7fa |