Skip to main content

OntoloViz drug- and phenotype-ontology visualization GUI

Project description

Pypi version Pypi python version Python package pylint-badge

OntoloViz

OntoloViz is a graphical user interface for the creation of interactive sunburst plots of phenotype and drug ontologies. You might find it useful to quickly visualize your data for reports or to share the generated plots with collaborators. Take a look at the screenshot and demos section or explore examples in .html format from the provided templates.zip archive to get a better understanding of the scope of the package.

Quickstart

The GUI can be run by downloading the latest release or by installing the package via PyPi (OS independent, requires Python 3.7+):

pip install ontoloviz

After the installation you can run the GUI from the command line with the following command:

ontoloviz

Empty templates or pre-populated examples can be found in the templates.zip file in the release section as well. Alternatively, you can clone this repository, install the required dependencies and launch the GUI:

git clone https://github.com/Mnikley/OntoloViz.git

# optional: install and activate venv
python -m venv  # python3 on linux
venv\scripts\activate  # source venv/bin/activate on linux

# install dependencies
pip install plotly>=5 openpyxl>=3

# launch GUI
cd OntoloViz/src
python -c "from ontoloviz import run_app; run_app()"

Usage

The application allows importing .tsv and .xlsx files, but the use of .tsv and tab as a separator is recommended. The GUI can create two types of sunburst diagrams to represent either phenotype or drug ontologies, which is determined by the structure of the loaded files. Any numbers entered in the file will be converted to integers.

GUI Options

gui_small
  • Load File: load an .tsv or .xlsx file containing drug- or phenotype-ontology data
  • General
    • Set Color Scale: define a custom color scale for the sunburst color scaling when color propagation is active
    • Set Border: configures the border properties drawn around sunburst wedges
  • Display
    • Drop empty nodes (phenotype sunburst only): drops nodes who have no further children and 0 counts
    • Wedge Width (drug sunburst only): switch from full outer circle (total) to count-based wedge widths (remainder)
    • Display Labels: controls display of labels inside sunburst wedges, available options:
      • all
      • propagation
      • drugs (drug sunburst only)
      • none
  • Propagation
    • Enable: enables count- and color propagation from child to parent nodes
    • Color: controls color propagation by the options:
      • off: color scale is based on 'Color' column from imported file
      • specific: color scale is based on the maximum values of the corresponding subtree
      • global: color scale is based on the maximum values of the entire tree ontology
      • phenotype (phenotype sunburst only): Only the most outer phenotype in a branch is colored
    • Counts: controls count propagation by the options:
      • off: no counts are propagated, counts equal imported values
      • level: counts are propagated up to defined level, values above threshold remain unchanged
      • all: counts are propagated up to central node, imported values are corrected and overwritten
    • Level: controls color- and count-propagation from outer to inner levels up to defined level
      • affects color propagation when Color is set to specific or global
      • affects count propagation when Counts is set to level
      • drug sunburst: 1 corresponds to the central node, 5 to the outermost node (=drug)
      • phenotype sunburst: 0 corresponds to the central node, 13 to the outermost node
  • Summary Plot
    • Enable: displays all available subtrees in a single view (resource intensive, set Labels to none for faster loading)
    • Columns: defines the amount of columns when summary plot is enabled
  • Save: when enabled, an interactive .html file is generated for later use
  • Plot: Process and generate plot, opens in a Browser window

Phenotype Sunbursts

The phenotype sunburst structure follows the principles of the MeSH tree.

  • A Tree ID is defined by a combination of three numbers or letters, for example C01.
  • Levels are separated by a dot ., for example C01.001.
  • Ontologies up to thirteen hierarchical levels are supported.
  • A single phenotype end-node can be assigned to multiple parent-nodes by specifying the parents tree ids as pipe separated string in the column Tree ID.
  • When defining a child element which has no valid parent, the GUI will automatically generate the parent with the default color and a 0 value. This will happen recursively. For example, if the input file defines a node with the id 123.001.001, but the nodes 123 and 123.001 are non-existent, they will be created.
  • Counts entered in the file will be converted to integers. If a node should be displayed without counts, use 0.
  • The loaded file must contain 7 columns and follow the below structure to be correctly recognized:

Phenotype Ontology File Structure

Column Index Header Text Description
0 MeSH ID Required primary identifier of a node in format C01.001
1 Tree ID Required pipe delimited list of Tree IDs of a node (allows 1:N mappings)
2 Name Optional label to be displayed inside the sunburst wedges
3 Description Optional description displayed in the sunburst wedge tooltip
4 Comment Optional comment displayed in the sunburst wedge tooltip
5 Counts [Name] Required count for wedge weights, Name will be used as figure title
6 Color Optional color for the sunburst wedges, must be hex-string in format #FFFFFF

Drug Sunbursts

The drug sunburst structure follows the principles of the ATC tree.

  • ATC codes are divided into five levels, which must follow the following naming conventions:
    • 1st level: letter
    • 2nd level: two numbers
    • 3rd level: letter
    • 4th level: letter
    • 5th level: two numbers
  • Example ATC code: A10BA02
  • The hierarchy is built based on the above-mentioned format and does only allow 1:1 child-parent relationships (contrary to the phenotype structure). For example, if the drug deltatonin should be assigned to the parent nodes A01AA and B01BB, it must be defined twice with the ids A01AA01 and B01BB01.
  • The loaded file must contain 6 columns and follow the below structure to be correctly recognized as a phenotype ontology:

Drug Ontology File Structure

Column Index Header Text Description
0 ATC code Required primary identifier of a node in format A10BA02
1 Level Optional level as number, not used for building tree
2 Label Optional label to be displayed inside the sunburst wedges
3 Comment Optional comment displayed in the sunburst wedge tooltip
4 Counts [Name] Required count for wedge weights, Name will be used as figure title
5 Color Optional color for the sunburst wedges, must be hex-string in format #FFFFFF

Templates and Examples

Templates and examples can be found in the provided templates.zip archive.

  • pubmed_documents_mapped_to_mesh.tsv: based on the MeSH subtree C from 2022. Disease-related MeSH terms were extracted from the publicly available PubMed database (title + abstract) and further mapped to the nodes.

  • mesh_tree_template.tsv: empty template of the MeSH tree C and F03]. Terms are unique and mapped to all related parent nodes.

  • covid_drugs_trial_summary.tsv: based on publicly available clinical trial data related to COVID-19. One count represents one clinical trial.

  • atc_tree_template.tsv: empty template of the ATC tree based on the manually curated chemical database of bioactive molecules ChEMBL v29.

  • drug_sunburst_example.html: sample plot generated with the provided covid_drugs_trial_summary.tsv file.

  • phenotype_sunburst_example.html: sample plot generated with the provided covid_drugs_trial_summary.tsv file.

Screenshots and Demos

demo_creation_of_template

Demo 1: Minimal example for creating a phenotype based ontology

demo_creation_of_template

Demo 2: Showcase of some of the features available in OntoloViz (used template: test_tree.zip)

drug_single

Screenshot 1: Drug sunburst plot with enabled labels, counts propagated up to level 3

phenotype_overview

Screenshot 2: Summary phenotype sunburst plot with tooltip, counts propagated up to the central node, color coded

color_scale color_scale

Screenshot 3 & 4: Left: define automatic color scales based on defined counts with thresholds and hex color codes, Right: define border properties (width, opacity, colors) or disable them entirely

Special Thanks to

  • Paul Perco, who had the initial idea for this package and provided support throughout the entire process
  • Andreas Heinzel, for inspiration regarding architectural- and software-related topics
  • The Delta4 GmbH team for providing helpful inputs

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ontoloviz-1.5.6.tar.gz (43.9 kB view details)

Uploaded Source

Built Distribution

ontoloviz-1.5.6-py3-none-any.whl (40.2 kB view details)

Uploaded Python 3

File details

Details for the file ontoloviz-1.5.6.tar.gz.

File metadata

  • Download URL: ontoloviz-1.5.6.tar.gz
  • Upload date:
  • Size: 43.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.11.0 Windows/10

File hashes

Hashes for ontoloviz-1.5.6.tar.gz
Algorithm Hash digest
SHA256 60ed466c512c925ba5d85e96a8297f0b4d71b183f7f4e2893fdd4051fcdc1139
MD5 075de6d728769133f09b9f5834383fee
BLAKE2b-256 3528109006ba2a521182c4d7fcef7d7eadae96d07cf46fabcdb84fd43defce2c

See more details on using hashes here.

File details

Details for the file ontoloviz-1.5.6-py3-none-any.whl.

File metadata

  • Download URL: ontoloviz-1.5.6-py3-none-any.whl
  • Upload date:
  • Size: 40.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.11.0 Windows/10

File hashes

Hashes for ontoloviz-1.5.6-py3-none-any.whl
Algorithm Hash digest
SHA256 7d4c74f484a29d8f65ed00e02fe495d376d6fd7e44fa175be47dbf57a24042ca
MD5 b7a80c279f70c3d7799162d3038dc5bf
BLAKE2b-256 634f78a799677206892e7de6ab557381640c13a50b582ebb67769ea272a7df9d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page