Skip to main content

PathIntegrate: multivariate modelling approaches for pathway-based muti-omics integration

Project description

PathIntegrate

PathIntegrate Python package for pathway-based multi-omics data integration

PathIntegrate graphical abstract

Abstract

As terabytes of multi-omics data are being generated, there is an ever-increasing need for methods facilitating the integration and interpretation of such data. Current multi-omics integration methods typically output lists, clusters, or subnetworks of molecules related to an outcome. Even with expert domain knowledge, discerning the biological processes involved is a time-consuming activity. Here we propose PathIntegrate, a method for integrating multi-omics datasets based on pathways, designed to exploit knowledge of biological systems and thus provide interpretable models for such studies. PathIntegrate employs single-sample pathway analysis to transform multi-omics datasets from the molecular to the pathway-level, and applies a predictive single-view or multi-view model to integrate the data. Model outputs include multi-omics pathways ranked by their contribution to the outcome prediction, the contribution of each omics layer, and the importance of each molecule in a pathway.

Features

  • Pathway-based multi-omics data integration using PathIntegrate Multi-View and Single-View models
    • Multi-View model: Integrates multiple omics datasets using a shared pathway-based latent space
    • Single-View model: Integrates multi-omics data into one set of multi-omics pathway scores and applies an SKlearn-compatible predictive model
    • Pathway importance
    • Sample prediction
    • NEW unsupervised SingleView models (dimensionality reduction and clustering in the pathway space)
  • SKlearn-like API for easy integration into existing pipelines
  • Support for multiple pathway databases, including KEGG, Reactome, PathBank, and custom GMT files
  • Support for multiple pathway scoring methods available via the sspa package
  • Cytoscape Network Viewer app for visualizing pathway-based multi-omics data integration results

PathIntegrate Cytoscape app

System requirements

Hardware

At least 8BG RAM recommended. PathIntegrate models can run on a Google Colab notebook server (see walkthrough tutorial below with example data).

Software

PathIntegrate has been tested on MacOs, Windows 10 and Linux. Python 3.10 or higher is required. Python dependencies are listed in the requirements.txt file.

Installation

pip install PathIntegrate

Tutorials and documentation

Please see our Quickstart guide on Google Colab

Full documentation and function reference for PathIntegrate can be found via our ReadTheDocs page

Citing PathIntegrate

If you use PathIntegrate in your research, please consider citing our paper:

@article{Wieder2024,
   author = {Cecilia Wieder and Juliette Cooke and Clement Frainay and Nathalie Poupin and Russell Bowler and Fabien Jourdan and Katerina J. Kechris and Rachel P.J. Lai and Timothy Ebbels},
   doi = {10.1371/JOURNAL.PCBI.1011814},
   issue = {3},
   journal = {PLOS Computational Biology},
   month = {3},
   pages = {e1011814},
   pmid = {38527092},
   publisher = {Public Library of Science},
   title = {PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration},
   volume = {20},
   url = {https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1011814},
   year = {2024},
}

PathIntegrate applications

Check out the following papers to see how PathIntegrate has been used in research:

License

GNU GPL v3

Contributors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pathintegrate-1.0.2.tar.gz (1.7 MB view details)

Uploaded Source

Built Distribution

PathIntegrate-1.0.2-py3-none-any.whl (1.7 MB view details)

Uploaded Python 3

File details

Details for the file pathintegrate-1.0.2.tar.gz.

File metadata

  • Download URL: pathintegrate-1.0.2.tar.gz
  • Upload date:
  • Size: 1.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.5

File hashes

Hashes for pathintegrate-1.0.2.tar.gz
Algorithm Hash digest
SHA256 5324bdae0f1f230b9b13b7675f4878c395aa16610d37489e8bfdfbe99fdd63d4
MD5 6d8988375c5c49b420c59b5b6ac9caac
BLAKE2b-256 b56b46f2dd829af70c3782c48d93be4dccfcb8b3be6b1187498511de2c402c17

See more details on using hashes here.

File details

Details for the file PathIntegrate-1.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for PathIntegrate-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 64e2ed2693afbce866f9676fba4fe9ad73fa012d7cad6e6a900b59d102fab90d
MD5 ec36e5e9d0bf1f0fd9caa39a6248b649
BLAKE2b-256 412f17d05bbac6da4503d4cd2951a952937bc26df2a8cfc0fec45ad9e0a6dae0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page