Skip to main content

Ab initio pathway inference

Project description

PathModel is a tool to infer reaction between metabolites and new metabolites.

There is no guarantee that this script will work, it is a Work In Progress in early state.

Installation

Requirements

You must have an environment where Clingo is installed. Clingo can be obtained here. Also Clingo must be installed in an environment with Python compatibilty (a good way to have it is with conda). Python environment for Clingo must be Python 3, if it is Python 2 the script will crash.

For the wrapping script, Python3 and clyngor package are needed.

To create pathway picture, the script uses networkx (with graphviz and pygraphviz) and matplotlib packages.

To create molecule picture, Pathmodel uses the rdkit package.

Using conda environment (to install all dependencies)

Due to all the dependencies required by all the script of Pathmodel, we create a conda environment file that contains all dependencies.

First you need Conda. To avoid conflict between the conda python and your system python, you could use a conda environment and Miniconda.

If you want to test this, the first thing is to install miniconda:

# Download Miniconda
wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh

# Give the permission to the installer.
chmod +x Miniconda3-latest-Linux-x86_64.sh

# Install it at the path that you choose.
./Miniconda3-latest-Linux-x86_64.sh -p /path/where/miniconda/will/be/installed/ -b

# Delete installer.
rm Miniconda3-latest-Linux-x86_64.sh

# Add conda path to you bash settings.
echo '. /path/where/miniconda/is/installed/etc/profile.d/conda.sh' >> ~/.bashrc
# Will activate the environment.
# For more information: https://github.com/conda/conda/blob/master/CHANGELOG.md#440-2017-12-20
echo 'conda activate base' >> ~/.bashrc

After this you need to restart your terminal or use: source ~/.bashrc

Then you will get our conda environment file:

# Download our conda environment file from Pathmodel gitlab page.
wget https://gitlab.inria.fr/DYLISS/PathModel/raw/master/conda/pathmodel_env.yaml

# Use the file to create the environment and install all dependencies.
conda env create -f pathmodel.yaml

If no error occurs, you can now access a conda environment with pathmodel:

# Activate the environment.
conda activate pathmodel

# Launch the help of Pathmodel.
(pathmodel) pathmodel -h

You can exit the environment with:

# Deactivate the environment.
conda deactivate

Using conda package

It will be possible to install pathmodel (and its dependencies) with a conda install. But you have to add some channels.

# Install pathmodel
conda install pathmodel -c dyliss -c anaconda -c conda-forge -c rdkit -c potassco

Using git

At this moment, the package can be installed only using python setup. But when the git will become public, a pip package would be created.

git clone https://gitlab.inria.fr/DYLISS/PathModel

cd PathModel

python setup.py install

Using pip

If you have all the depedencies on your system, you can just download Pathmodel using pip.

pip install pathmodel

Using docker

A container will be usable in a few days.

docker run -ti -v /path/shared/container:/shared dyliss/pathmodel --name="mycontainer" bash

Description

PathModel is developed in ASP. It is divided in two scripts.

The first one, ReactionSiteExtraction.lp creates reaction site.

When a reaction is described between two molecules, the script will compare atoms and bonds of the two molecules of the reaction and will extract a reaction site before the reaction (composed of atoms and bonds that are present in the reactant but absent in the product) and a reaction site after the reaction (composed of atoms and bonds present in the product but absent in the reactant).

ReactionSiteExtraction produces two sites for each reaction (one before and one after the reaction). These sites will be used by the second script: PathModel.lp.

PathModel will use two inference methods: one creating new metabolites and one infering a reaction between two metabolites.

Input data

Molecules are modelled with atoms (hydrogen excluded) and bonds (single and double).

atom("Molecule1",1,carb). atom("Molecule1",2,carb).
bond("Molecule1",single,1,2).

atom("Molecule2",1,carb). atom("Molecule2",2,carb). atom("Molecule2",3,carb).
bond("Molecule2",single,1,2). bond("Molecule2",single,2,3).

Reaction between molecules are represented as link between two molecules with a name:

reaction(reaction1,"Molecule1","Molecule2").

A common domain is needed to find which molecules share structure with others:

atomDomain(commonDomainName,1,carb). atomDomain(commonDomainName,2,carb).
bondDomain(commonDomainName,single,1,2).

A molecule source is defined:

source("Molecule1").

Initiation and goal of the incremental grounding must be defined:

init(pathway("Molecule1","Molecule2")).
goal(pathway("Molecule1","Molecule3")).

M/Z ratio can be added to check whether there is a metabolite that can be predict with this ratio. M/Z ratio must be multiplied by 10 000 because Clingo doesn’t use decimals.

mzfiltering(2702720).

Molecules that are not in the organism of study can be added. They will not be targeted of the inference methods.

absentmolecules("Molecule1").

Use

Command-line:

pathmodel -d data.lp

In python:

import pathmodel

pathmodel.pathmodel_analysis('data.lp')

Output data

Using networkx, inferred pathways are represented as png picture. Also a result.lp file is created containing all the inferred reactions.

Example

data/data.lp contains example for sterols and mycosporine-like amino-acids pathways.

test/test_data/pathmodel_test_data.lp contains an example with fictional molecules to test PathModel.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pathmodel-0.1.2a1.tar.gz (16.9 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page