Skip to main content

BBGLab interface for an OncoTree

Project description

BGOncoTree is a Python package to standardize the use of the OncoTree within the BBGLab.

Installation

BGOncoTree works with Python 3.5+ and can be installed with pip:

pip install bgoncotree

Usage

Any use of BGOncoTree requires to initialize the tree:

from bgoncotree.main import BGOncoTree

tree = BGOncoTree()

BGOncoTree accepts a file as argument if you want to provide your own tree.

Each node is composed by:

  • id: short identifier of the node (e.g. ALL)

  • parent: ancestor node

  • children: list of descendent nodes

  • synonyms: list of alternative names that can be used to search for this node

  • name: first provided synonym or node id

Python

The first thing to do is initialize the BGOncoTree object:

tree = BGOncoTree()

The BGOncoTree object is a dictionary that you can query to get each node:

node = tree['CANCER']

Using the ID of the node or any synonym you can find the node:

node = tree.find('clear cell sarcoma')

To explore the tree, you can use the iter_from method:

for node, level in tree.iter_from(tree['ALL'], descending=True):
        ...

The level is simply an integer indicating the level of nested with respect to the starting_node. If you want to explore the full tree, you can get the root node as:

root_node = tree.root

The methods descendants and ancestors do the same as iter_from but you do not need to indicate the descending flag, the level is not returned and node can also be a string. E.g.:

for node in tree.descendants('ALL'):
        ...

There are two other methods that you can make use of. search can be used to search for all nodes containing a word in their names. E.g.:

for node in tree.search('acute'):
        ...

The get method is useful if you expect only one result from a search. E.g.:

node = tree.get('Cholangiocarcinoma')

This method will raise an exception if none or more than 1 items are found.

Command line

The command line provides two utilities to explore the tree bgoncotree ancestors and bgoncotree descendants. Output can be formatted in json, and you can also set a recursion limit.

See first one level of childs of HEMATO node:

bgoncotree descendants --node HEMATO --json --limit 2

In addition, the same search function as in Python can be called from the command line as:

bgoncotree search acute

Tab completion can be enabled adding eval "$(_BGONCOTREE_COMPLETE=source bgoncotree)" to your .bashrc file as explained in the bash completion section of Click

Defining your own tree

BGOncoTree uses bgdata to get the most recent version of the oncotree used in the lab. However, you can pass a file with your own oncotree:

from bgoncotree.main import BGOncoTree

tree = BGOncoTree(file)

If you are interested in creating your own oncotree, you need a tsv file with 2 columns: ID and PARENT. Each node can only have 1 parent, and there must but one node without a parent: the root node.

You can provide an additional extra column NAMES with a comma separated list of synonyms for each node. The first name will be used as the name to display.

You can validate your tree using the command line:

bgoncotree validate my_tree.tsv

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bgoncotree-0.5.tar.gz (8.7 kB view hashes)

Uploaded Source

Built Distribution

bgoncotree-0.5-py3-none-any.whl (7.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page