Skip to main content

Learning of Protein Signaling Logic Models powered by BioASP and CellNOptR

Project description

==========================
caspo :- BioASP, CellNOpt.
==========================

caspo combines BioASP_ and CellNOpt_ to provide an easy to use software for learning Protein Signaling Logic Models from a Prior Knowledge Network in `.sif format`_ and a phospho-proteomics dataset in `MIDAS format`_.

.. _BioASP: http://www.cs.uni-potsdam.de/~sthiele/bioasp/
.. _CellNOpt: http://www.ebi.ac.uk/saezrodriguez/cno/
.. _`.sif format`: http://wiki.cytoscape.org/Cytoscape_User_Manual/Network_Formats
.. _`MIDAS format`: http://www.ebi.ac.uk/saezrodriguez/cno/doc/cnodocs/midas.html

Installation
============

You can install caspo by running::

$ pip install caspo

caspo will try to install the R_ package CellNOpt_ (R_ must be already installed). Note that you may need root access for this. Otherwise, you can install CellNOpt_ manually from the R_ console and use a virtualenv_ to install caspo.

.. _R: http://www.r-project.org/
.. _virtualenv: http://pypi.python.org/pypi/virtualenv

Usage
=====

Typical usage is::

$ caspo.py pkn.sif midas.csv

For more options you can ask for help as follows::

$ caspo.py --help
Usage: caspo.py [options] pkn.sif midas.csv

Options:
-h, --help show this help message and exit
-t T, --tolerance=T Suboptimal enumeration tolerance (Default to 0)
-p P, --discrete=P Discretization range exponent: 10^P (Default to 2)
-q Q, --alpha=Q Size penalty exponent 1/10^Q (Default to 5)
-g, --gtts Compute Global Truth Tables (Default to False). This
could take some time for many models.
-o O, --outdir=O Output directory path (Default to current directory)

Samples
=======

Sample files are available here:
ExtLiverPCB.sif_ ExtLiverPCB.csv_

.. _ExtLiverPCB.sif: http://www.cs.uni-potsdam.de/~sthiele/bioasp/downloads/samples/liverdata/ExtLiverPCB.sif
.. _ExtLiverPCB.csv: http://www.cs.uni-potsdam.de/~sthiele/bioasp/downloads/samples/liverdata/ExtLiverPCB.csv

Output
======

By default, the output of caspo will be 4 comma-separated-values files::
- models.csv: Matrix representation of logic models
- frequencies.csv: Frequencies of hyperedges occurrence
- exclusives.csv: Mutual exclusives hyperedges with their corresponding frequencies
- inclusives.csv: Mutual inclusives hyperedges with their corresponding frequencies

When using the -g option, caspo will also output::
- gtt_stats.csv: Basic cluster analysis.
- gtt-%i.csv: Explicit computation of each Global Truth Table

Project details


Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page