Skip to main content

A tool for ranking potential targets for a given disease

Project description

This is a tool for therapeutic target prioritization using network representation learning.

Installation

Download this repository, go to the directory it resides and run:

$ git clone https://github.com/phanein/deepwalk.git
$ cd deepwalk
$ pip install .
$ cd ..
$ # Install GAT2VEC, which depends on DeepWalk
$ git clone https://github.com/ozlemmuslu/GAT2VEC.git gat2vec
$ cd gat2vec
$ pip install .
$ cd ..
$ # Actually install GuiltyTargets
$ git clone https://github.com/guiltytargets/guiltytargets.git
$ cd guiltytargets
$ pip install -e .

Usage

After that, you can use it as a library in Python

import guiltytargets

guiltytargets.run(
    input_directory,
    targets_path,
    ppi_graph_path,
    dge_path,
    auc_output_path,
    probs_output_path,
    max_adj_p=max_padj,
    max_log2_fold_change=lfc_cutoff * -1,
    min_log2_fold_change=lfc_cutoff,
    entrez_id_header=entrez_id_name,
    log2_fold_change_header=log_fold_change_name,
    adj_p_header=adjusted_p_value_name,
    base_mean_header=base_mean_name,
    entrez_delimiter=split_char,
    ppi_edge_min_confidence=confidence_cutoff,
 )

This will create files in paths auc_output_path and probs_output_path, where the former shows the AUC values of cross validation and the latter shows the predicted targets.

The parameters are explained below. A use case can be found under https://github.com/GuiltyTargets/reproduction

INPUT FILES

There are 3 files which are necessary to run this program. All input files should be found under input_directory

  1. ppi_graph_path: A path to a file containing a protein-protein interaction network in the format of:

    source_entrez_id

    target_entrez_id

    confidence

    216

    216

    0.76

    3679

    1134

    0.73

    55607

    71

    0.65

    5552

    960

    0.63

    2886

    2064

    0.90

    5058

    2064

    0.73

    1742

    2064

    0.87

    An example of such a network can be found [here](http://cbdm-01.zdv.uni-mainz.de/~mschaefer/hippie/download.php)

  2. dge_path: A path to a file containing an experiment, in tsv format. Rows show individual entries, columns are the values of the following properties:

    • Base mean

    • Log fold change

    • Adjusted p value

    • Entrez id

The file may contain other columns too, but the indices and names of the above columns must be entered to the configuration file.

  1. targets_path: A path to a file containing a list of Entrez ids of known targets, in the format of

    … code-block:: sh

    1742 3996 150 152 151

OPTIONS

The options that should be set are:

  • max_adj_p: Maximum value for adjusted p-value for a gene to be considered differentially expressed.

  • max_log2_fold_change: Maximum value for log2 fold change for a gene to be considered differentially expressed

  • min_log2_fold_change: Minimum value for log2 fold change for a gene to be considered differentially expressed

  • ppi_edge_min_confidence: Minimum confidence score for the edges in PPI network.

  • entrez_id_header: The column name for the Entrez id in the differential expression file.

  • log2_fold_change_header: The column name for the log2 fold change in the differential expression file.

  • adj_p_header: The column name for the adjusted p-value in the differential expression file.

  • base_mean_header: The column name for the base mean in the differential expression file.

  • entrez_delimiter: If there is more than one Entrez id per row in the diff. expr. file, the separator betweem them.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

guiltytargets-0.0.2.tar.gz (20.5 kB view details)

Uploaded Source

Built Distribution

guiltytargets-0.0.2-py3-none-any.whl (21.3 kB view details)

Uploaded Python 3

File details

Details for the file guiltytargets-0.0.2.tar.gz.

File metadata

  • Download URL: guiltytargets-0.0.2.tar.gz
  • Upload date:
  • Size: 20.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.8.2

File hashes

Hashes for guiltytargets-0.0.2.tar.gz
Algorithm Hash digest
SHA256 aeec8e5d7a800ce0d97dbc930fb62b5f9dddf061dede3bb53a25e4e53aea7374
MD5 1714b34b54ccdd3df34aa04a4412cb06
BLAKE2b-256 63dfa33cb829c425c5d10cb8970b5ddfde070233ba1efdc0939de9b89238f783

See more details on using hashes here.

File details

Details for the file guiltytargets-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: guiltytargets-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 21.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.8.2

File hashes

Hashes for guiltytargets-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 4bd769a48588161ffe7dbf5075bded980046c47574bb9e5accd0a757e2cf2485
MD5 0022b04bb83a73297e3e7d8a6a41d485
BLAKE2b-256 d78b4aab20b154d8acbe3f5e3568b7dde5db3b19edd9e27c6cda180b4a88c69a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page