GSEA-InContext Gene Set Enrichment Analysis in Python

These details have not been verified by PyPI

Project links

Project description

GSEA-InContext: Gene Set Enrichment Analysis In Context

Gene Set Enrichment Analysis (GSEA) is routinely used to analyze and interpret coordinate changes in transcriptomics experiments. For an experiment where less than seven samples per condition are compared, GSEA employs a competitive null hypothesis to test significance. A gene set enrichment score is tested against a null distribution of enrichment scores generated from permuted gene sets, where genes are randomly selected from the input experiment. Looking across a variety of biological conditions, however, genes are not randomly distributed with many showing consistent patterns of up- or down-regulation. As a result, common patterns of positively and negatively enriched gene sets are observed across experiments. Placing a single experiment into the context of a relevant set of background experiments allows us to identify both the common and experiment-specific patterns of gene set enrichment. We developed the GSEA-InContext method to allow a user to account for gene expression patterns within a defined background set of experiments to identify statistically significantly enriched gene sets in their own experiment.

See below for examples on running the GSEA-InContext algorithm.

This repo is a fork of GSEApy (original documentation here. We have added a new tool GSEA_InContext which runs the GSEAPreranked algorithm but uses a background set of ranked lists to calculate an empirical null distribution for informing the permutation procedure. For examples using the original GSEApy library, visit this page.

About GSEA-InContext

Currently, there are no methods available for a user to easily compare their GSEA results to GSEA results obtained in other experiments to discern similar and/or distinct patterns affected across experiments. GSEA-InContext accounts for gene-specific variation estimated from an experimental background. Whereas GSEA identifies all signiificantly enriched gene sets in an experiment, this method allows the user to ask a complementary question; namely, which gene sets are uniquely enriched in a single experiment compared to many other, independent experiments.

Our method applies the same approach as GSEA to calculate the nominal p-value. However, in contrast to GSEAPreranked, GSEA-InContext employs an alternative significance testing procedure to generate the null distribution, in which permuted gene sets are generated using the density of gene ranks estimated from a set of user-defined background experiments. We estimate a gene's probability density using a Gaussian kernel over the experiments in the background set.

The GSEA-InContext algorithm can be run using the incontext subcommand. Additional subcommands can be run as in the original GSEApy, including: gsea, prerank, ssgsea, replot enrichr. See the original GSEApy <https://github.com/BioNinja/GSEApy>_ repository.

The full GSEA is described in: GSEA documentation. All file formats for GSEApy are identical to GSEA desktop version.

Data & Availability

The data, results and analysis described in our preprint are hosted in a Synapse project available here(doi:10.7303/syn11804693).

Dependencies & Requirements

Python 3.4+
Numpy >= 1.13.0
Pandas
Matplotlib
Beautifulsoup4
Requests (for enrichr API)

You may also need to install lxml and html5lib to parse xml files.

Running GSEApy and GSEA-InContext

Before you start:

Convert all gene symbol names to uppercase. The ranked lists input to prerank or incontext can be supplied as file paths (.rnk) or a two-column Pandas DataFrame (columns gene_name and fold_change). The background ranked lists input to incontext is supplied as a text file containing the list of .rnk files to use in permutation, or as a .csv file containing pre-permuted gene lists created with the make_background_dist() function.

Run GSEAPY inside Python console:

| Running GSEAPreranked and GSEA-InContext in Python using file paths as input


    import gseapy

    # Run GSEA Prerank
    gseapy.prerank(rnk='gsea_data.rnk', gene_sets='gene_sets.gmt', outdir='out')

    # Run GSEA-InContext
    gseapy.incontext(rnk='gsea_data.rnk', gene_sets='gene_sets.gmt', backround_rnks = 'permuted_background.csv', outdir='out')

A full example can be seen in run_example.py. The full analysis of Kegg and Hallmarks gene sets was run with run_all_442.py.

Bug Reports

If you would like to report any bugs when you running the incontext module, please create an issue on GitHub here. For issues relating to other modules, you may wish to visit theoriginal GSEAPY repo.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.9.6

Jun 13, 2019

0.9.5

Jun 4, 2019

0.9.3

Jun 4, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gsea_incontext-0.9.6.tar.gz (39.3 kB view details)

Uploaded Jun 13, 2019 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gsea_incontext-0.9.6-py3-none-any.whl (50.7 kB view details)

Uploaded Jun 13, 2019 Python 3

File details

Details for the file gsea_incontext-0.9.6.tar.gz.

File metadata

Download URL: gsea_incontext-0.9.6.tar.gz
Upload date: Jun 13, 2019
Size: 39.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/18.5 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/2.7.10

File hashes

Hashes for gsea_incontext-0.9.6.tar.gz
Algorithm	Hash digest
SHA256	`2d8f78119c442446287951bdd5799d27b2f5851b9abdeda9835b4387d850c716`
MD5	`3c64e001884ccce65f1787fa4699f19e`
BLAKE2b-256	`9723bc8cd87ec2ff62e4dce24b1c8e8ad139164bc2d2afdba1c7b44f4c30e023`

See more details on using hashes here.

File details

Details for the file gsea_incontext-0.9.6-py3-none-any.whl.

File metadata

Download URL: gsea_incontext-0.9.6-py3-none-any.whl
Upload date: Jun 13, 2019
Size: 50.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/18.5 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/2.7.10

File hashes

Hashes for gsea_incontext-0.9.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6cf939dee3bb4210b55301f1e5441908d29e7c1d0e0de3e83d21e1758c7c2b73`
MD5	`d018b0747f8877f31e8e0f4de7299382`
BLAKE2b-256	`d9c829d124919be9c081295894aa8318b7ba6729eafb3b3c156af78a5e53631f`

See more details on using hashes here.

gsea-incontext 0.9.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

GSEA-InContext: Gene Set Enrichment Analysis In Context

About GSEA-InContext

Data & Availability

Dependencies & Requirements

Running GSEApy and GSEA-InContext

Before you start:

Run GSEAPY inside Python console:

Bug Reports

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes