Skip to main content

Compute diffusion scores over networks

Project description

DiffuPath is an analytic tool for biological networks that connects the generic label propagation algorithms from DiffuPy to biological networks encoded in several formats such as Simple Interaction Format (SIF) or Biological Expression Language (BEL). For example, in the application scenario presented in the paper, we use three pathway databases (i.e., KEGG, Reactome and WikiPathways) and their integrated network retrieved from PathMe [1] to analyze three multi-omics datasets. However, other biological networks can be imported from the Bio2BEL ecosystem [2].

Installation

  1. diffupath can be installed with the following commands:

$ python3 -m pip install git+https://github.com/multipaths/DiffuPath.git@master
  1. or in editable mode with:

$ git clone https://github.com/multipaths/DiffuPath.git
$ cd diffupath
$ python3 -m pip install -e .

Requirements

diffupath requires the following libraries:

networkx (>=2.1)
pybel (0.13.2)
biokeen (0.0.14)
click (7.0)
tqdm (4.31.1)
numpy (1.16.3)
scipy (1.2.1)
scikit-learn (0.21.3)
pandas (0.24.2)
openpyxl (3.0.2)
plotly (4.5.3)
matplotlib (3.1.2)
matplotlib_venn (0.11.5)
bio2bel (0.2.1)
pathme
diffupy

Command Line Interface

The following commands can be used directly use from your terminal:

  1. Download a database for network analysis.

The following command generates a BEL file representing the network of the given database.

$ python3 -m diffupath database network --database=<database-name>

To check the available databases, run the following command:

$ python3 -m diffupath database ls
  1. Run a diffusion analysis

The following command will run a diffusion method on a given network with the given data

$ python3 -m diffupath diffusion run --network=<path-to-network-file> --input=<path-to-data-file> --method=<method>

Networks

You can choose networks to run diffusion methods in the following ways:

  • Select a network representing an individual biological database

  • Select multiple databases to generate a harmonized network

  • Select from one of four predefined collections of biological databases representing a harmonized network

  • Submit your own network from one of the accepted formats

Network Dumps

Because of the high computational cost of generating the kernel, we provide links to precalculated kernels for a set of networks representing biological databases:

Database

Description

Reference

DrugBank

Interactions between drugs and drug targets with over 10,000 drugs

[3]

Gene Ontology

Flexible hierarchy of tens of thousands of biological processes

[4]

HSDN

Associations between thousands of diseases with hundreds of symptoms

[5]

KEGG

Multi-omics interactions present in hundreds of biological pathways

[6]

miRTarBase

Experimentally validated interactions between miRNA and their targets

[7]

Reactome

Multi-omics interactions present in thousands of biological pathways

[8]

SIDER

Associations between over a thousand drugs and side effects

[9]

WikiPathways

Multi-omics interactions present in hundreds of biological pathways

[10]

Disclaimer

DiffuPath is a scientific software that has been developed in an academic capacity, and thus comes with no warranty or guarantee of maintenance, support, or back-up of data.

References

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffupath-0.0.1.tar.gz (32.0 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page