pyhpo·PyPI

A Python package to work with the HPO Ontology

Project description

A Python library to work with, analyze, filter and inspect the Human Phenotype Ontology

Visit the PyHPO Documentation for a more detailed overview of all the functionality.

Main features

PyHPO lets you work with individual terms (HPOTerm), sets of terms (HPOSet) and the full Ontology.

Internally, the ontology is represented as a branched linked list: every term contains pointers to its parent and child terms. This allows fast tree traversal.
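To illustrate the idea of terms that point to their parents and children, here is a minimal sketch. The Term class, its method names and the three-level hierarchy are hypothetical and simplified; this is not PyHPO's HPOTerm implementation and not the real HPO hierarchy.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass(eq=False)
class Term:
    """Toy stand-in for an ontology term (hypothetical, not PyHPO's HPOTerm)."""
    name: str
    parents: List["Term"] = field(default_factory=list)
    children: List["Term"] = field(default_factory=list)

    def add_child(self, child: "Term") -> None:
        # Store the pointer in both directions so traversal works up and down.
        self.children.append(child)
        child.parents.append(self)

    def ancestors(self) -> Set[str]:
        # Follow the parent pointers recursively up to the root.
        found: Set[str] = set()
        for parent in self.parents:
            found.add(parent.name)
            found |= parent.ancestors()
        return found


# Simplified example hierarchy (assumption, for illustration only)
root = Term("All")
abnormality = Term("Phenotypic abnormality")
skeletal = Term("Abnormality of the skeletal system")
scoliosis = Term("Scoliosis")

root.add_child(abnormality)
abnormality.add_child(skeletal)
skeletal.add_child(scoliosis)

print(sorted(scoliosis.ancestors()))
```

Because every term holds direct references to its neighbours, walking up or down the tree needs no lookups in a central table.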

The library is helpful for discovering novel gene-disease associations and for GWAS data analysis. It can also be used to organize clinical information of patients in research or diagnostic settings.

It provides an interface to create pandas DataFrames from its data, allowing integration into existing data analysis tools.

HPOTerm

An individual HPOTerm contains all information about itself as well as pointers to its parents and children. You can access its information content, calculate similarity scores to other terms, find the shortest or longest connection between two terms, list all associated genes or diseases, etc.
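As a rough illustration of information content and similarity scores, the sketch below uses the common definitions: information content as the negative log of the annotation frequency, and Resnik similarity as the information content of the most informative common ancestor. The term names, gene counts and function names are made up for this example; how PyHPO computes these values internally is not shown here.

```python
import math

# Hypothetical toy data: for each term, the set of its ancestors (including
# itself) and the number of genes annotated to it, out of TOTAL_GENES.
TOTAL_GENES = 1000
ANCESTORS = {
    "root": {"root"},
    "skeletal": {"root", "skeletal"},
    "spine": {"root", "skeletal", "spine"},
    "limbs": {"root", "skeletal", "limbs"},
}
ANNOTATED_GENES = {"root": 1000, "skeletal": 200, "spine": 50, "limbs": 80}


def information_content(term):
    # IC = -log(p), where p is the fraction of genes annotated to the term.
    return -math.log(ANNOTATED_GENES[term] / TOTAL_GENES)


def resnik(term_a, term_b):
    # Resnik similarity: IC of the most informative common ancestor.
    common = ANCESTORS[term_a] & ANCESTORS[term_b]
    return max(information_content(term) for term in common)


print(round(resnik("spine", "limbs"), 3))
```

Rare terms get a high information content, so two terms that share a rare ancestor score as more similar than two terms that only share the root.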

HPOSet

An HPOSet can be used to represent e.g. a patient’s clinical information. It allows some basic filtering and comparisons to other HPOSets.

Ontology

The Ontology represents all HPO terms and their connections and associations. It also contains pointers to associated genes and diseases.

Installation / Setup

The easiest way to install PyHPO is via pip:

pip install pyhpo

Note

Some features of PyHPO require pandas. The standard installation via pip does not include pandas, and PyHPO works fine without it (you will get a warning on the initial import, though). As long as you don’t try to create a pandas.DataFrame, everything should work without pandas. If you want to use all features, install pandas yourself:

pip install pandas
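The general pattern behind such an optional dependency can be sketched as follows. This is a generic illustration, not PyHPO's actual code; HAS_PANDAS and to_dataframe are hypothetical names.

```python
import warnings

try:
    import pandas as pd
    HAS_PANDAS = True
except ImportError:
    pd = None
    HAS_PANDAS = False
    # Warn once at import time, but keep everything else usable.
    warnings.warn("pandas is not installed; DataFrame export is unavailable")


def to_dataframe(rows):
    # rows is a list of dicts, one dict per record.
    if not HAS_PANDAS:
        raise RuntimeError("Install pandas to use this feature: pip install pandas")
    return pd.DataFrame(rows)
```

Only the function that actually builds a DataFrame fails without pandas; the rest of the module keeps working.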

Usage

For a detailed description of how to use PyHPO, visit the PyHPO Documentation.

Getting started

from pyhpo.ontology import Ontology

# initialize the Ontology (you can specify config parameters if needed here)
ontology = Ontology()

# Iterate through all HPO terms
for term in ontology:
    # do something, e.g.
    print(term.name)

There are multiple ways to retrieve a single term out of an ontology:

# Retrieve a term via its HPO-ID
term = ontology.get_hpo_object('HP:0002650')

# ...or via the Integer representation of the ID
term = ontology.get_hpo_object(2650)

# ...or via shortcut
term = ontology[2650]

# ...or by term name
term = ontology.get_hpo_object('Scoliosis')
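The HPO-ID and its integer representation carry the same information. A small sketch of the conversion, using hypothetical helper names that are not part of PyHPO:

```python
def hpo_id_to_int(hpo_id):
    # 'HP:0002650' -> 2650: drop the 'HP:' prefix and parse the digits.
    prefix, _, digits = hpo_id.partition(":")
    if prefix != "HP" or not digits.isdigit():
        raise ValueError(f"Not a valid HPO-ID: {hpo_id!r}")
    return int(digits)


def int_to_hpo_id(index):
    # 2650 -> 'HP:0002650': HPO-IDs are zero-padded to seven digits.
    return f"HP:{index:07d}"


print(hpo_id_to_int("HP:0002650"))
print(int_to_hpo_id(2650))
```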

You can also do substring search on term names and synonyms:

# ontology.search returns an Iterator over all matches
for term in ontology.search('Abn'):
    print(term.name)

Find the shortest path between two terms:

ontology.path(
    'Abnormality of the nervous system',
    'HP:0002650'
)
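Conceptually, a shortest path between two terms can be found with a breadth-first search over the parent and child links. The sketch below runs on a small hypothetical hierarchy with made-up term keys; it is not PyHPO's implementation and makes no claim about the return format of ontology.path.

```python
from collections import deque

# Hypothetical, simplified child -> parents mapping (not the real HPO hierarchy).
PARENTS = {
    "root": [],
    "nervous": ["root"],
    "skeletal": ["root"],
    "spine": ["skeletal"],
    "scoliosis": ["spine"],
}


def shortest_path(start, goal):
    # Build an undirected adjacency map from the parent links.
    neighbours = {term: set(parents) for term, parents in PARENTS.items()}
    for term, parents in PARENTS.items():
        for parent in parents:
            neighbours[parent].add(term)

    # Breadth-first search: the first path that reaches the goal is a shortest one.
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for nxt in sorted(neighbours[path[-1]] - seen):
            seen.add(nxt)
            queue.append(path + [nxt])
    return []


print(shortest_path("nervous", "scoliosis"))
```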

Working with terms

# check the relationship of two terms
term.path_to_other(ontology[11])

# get the information content for OMIM diseases
term.information_content['omim']

# ...or for genes
term.information_content['genes']

# compare two terms (term2 is another HPOTerm, e.g. ontology[11])
term.similarity_score(term2, method='resnik', kind='gene')

Working with sets

from pyhpo.set import HPOSet

# Create a clinical information set of HPO Terms
clinical_info = HPOSet([
    ontology[12],
    ontology[14],
    ontology.get_hpo_object(2650)
])

# Extract only child nodes and leave out all parent terms
children = clinical_info.child_nodes()

# Remove HPO modifier terms
new_ci = clinical_info.remove_modifier()

# Calculate the similarity of two Sets
sim_score = clinical_info.similarity(other_set)
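The idea of keeping only child nodes can be sketched as follows: a term is dropped when it is an ancestor of another term in the same set. The data and the function below are a hypothetical toy model, not the code behind HPOSet.child_nodes.

```python
# Hypothetical toy data: each term mapped to the set of all its ancestors.
TOY_ANCESTORS = {
    "skeletal": set(),
    "spine": {"skeletal"},
    "scoliosis": {"skeletal", "spine"},
    "nervous": set(),
}


def child_nodes(terms):
    # Collect every ancestor of every term in the set ...
    covered = set()
    for term in terms:
        covered |= TOY_ANCESTORS[term]
    # ... and keep only the terms that are not an ancestor of another term.
    return {term for term in terms if term not in covered}


patient = {"skeletal", "spine", "scoliosis", "nervous"}
print(sorted(child_nodes(patient)))  # prints ['nervous', 'scoliosis']
```

The most specific terms already imply their parents, so the reduced set describes the same phenotype with less redundancy.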

Statistics

PyHPO includes some basic statistical methods for gene, disease and HPO-Term enrichment analysis.

# Let's say you have a patient with a couple of symptoms and
# you want to find out the most likely affected genes
# or most likely diseases

from pyhpo import stats
from pyhpo.ontology import Ontology
from pyhpo.set import HPOSet, BasicHPOSet
_ = Ontology()

hpo_terms = [
    'Decreased circulating antibody level',
    'Abnormal immunoglobulin level',
    'Abnormality of B cell physiology',
    'Abnormal lymphocyte physiology',
    'Abnormality of humoral immunity',
    'Lymphoma',
    'Lymphopenia',
    'Autoimmunity',
    'Increased circulating IgG level',
    'Abnormal lymphocyte count'
]

# you can either use an HPOSet for this
hposet = HPOSet.from_queries(hpo_terms)

# or just a plain list of HPO Terms
hposet = [Ontology.match(q) for q in hpo_terms]

# Initialize an Enrichment model for genes
gene_model = stats.EnrichmentModel('gene')

# You can also do enrichment for diseases
disease_model = stats.EnrichmentModel('omim')

# Calculate the Hypergeometric distribution test enrichment
gene_results = gene_model.enrichment(
    'hypergeom',
    hposet
)
disease_results = disease_model.enrichment(
    'hypergeom',
    hposet
)

# and print the Top-10 results
for x in gene_results[0:10]:
    print(x)
for x in disease_results[0:10]:
    print(x)
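For intuition about what a hypergeometric enrichment test measures, here is a self-contained sketch of the underlying probability, written with the standard library only. The function name, argument names and numbers are assumptions for illustration; this is not how PyHPO's EnrichmentModel is implemented.

```python
from math import comb


def hypergeom_sf(k, population, successes, draws):
    """P(X >= k) when drawing `draws` items without replacement from
    `population` items, of which `successes` carry the feature of interest."""
    upper = min(successes, draws)
    favourable = sum(
        comb(successes, i) * comb(population - successes, draws - i)
        for i in range(k, upper + 1)
    )
    return favourable / comb(population, draws)


# Toy numbers: 20 terms overall, 5 of them linked to a gene; the patient
# set contains 4 terms, 2 of which are linked to that gene.
p_value = hypergeom_sf(2, 20, 5, 4)
print(round(p_value, 4))
```

A small value means that seeing this many gene-associated terms in the set would be unlikely by chance, which is what ranks a gene or disease near the top of the results.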

Many more examples are available in the PyHPO Documentation.

Contributing

Yes, please do so. I would appreciate any help, suggestions for improvement or other feedback. Just create a pull-request or open an issue.

License

PyHPO is released under the MIT license.

PyHPO uses the Human Phenotype Ontology. Find out more at http://www.human-phenotype-ontology.org

Sebastian Köhler, Leigh Carmody, Nicole Vasilevsky, Julius O B Jacobsen, et al. Expansion of the Human Phenotype Ontology (HPO) knowledge base and resources. Nucleic Acids Research. (2018) doi: 10.1093/nar/gky1105

Project details

Release history

4.0.0 (Mar 9, 2025)
3.3.2 (Feb 23, 2025)
3.3.1 (Jun 16, 2024)
3.3.0 (Mar 23, 2024)
3.2.6 (Mar 10, 2024)
3.2.5 (Oct 23, 2023)
3.2.4 (Aug 11, 2023)
3.2.3 (Aug 11, 2023), yanked. Reason: The build process had a bug and didn't include the submodules. Please use 3.2.4
3.2.2 (Jul 28, 2023)
3.2.1 (Jul 27, 2023)
3.2.0 (Jul 6, 2023)
3.1.5 (Apr 19, 2023)
3.1.4 (Mar 15, 2023)
3.1.3 (Nov 8, 2022)
3.1.2 (May 23, 2022)
3.1.0 (May 23, 2022)
3.0.0 (Nov 20, 2021)
2.7.3 (Feb 18, 2021)
2.7.2 (Feb 15, 2021)
2.7.1 (Feb 12, 2021)
2.6.1 (Jan 31, 2021)
2.6.0 (Jan 31, 2021)
2.5.0 (Nov 7, 2020), this version

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyhpo-2.5.0.tar.gz (12.6 MB)

Uploaded Nov 7, 2020 Source

Built Distribution

pyhpo-2.5.0-py3-none-any.whl (13.1 MB)

Uploaded Nov 7, 2020 Python 3

File details

Details for the file pyhpo-2.5.0.tar.gz.

File metadata

Download URL: pyhpo-2.5.0.tar.gz
Upload date: Nov 7, 2020
Size: 12.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.12.4 setuptools/20.3 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.5.4

File hashes

Hashes for pyhpo-2.5.0.tar.gz
Algorithm	Hash digest
SHA256	`4385927befefc950c150cf7b1fd8be7d8b59763dc11726b0de8a299316a8178a`
MD5	`50676c65c37359a230a9c96716fefaa5`
BLAKE2b-256	`a8396066b2e864a2fabe53d5c2e90d5a4a468f28bc5574817a04e6d0f0cf045f`


File details

Details for the file pyhpo-2.5.0-py3-none-any.whl.

File metadata

Download URL: pyhpo-2.5.0-py3-none-any.whl
Upload date: Nov 7, 2020
Size: 13.1 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.12.4 setuptools/20.3 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.5.4

File hashes

Hashes for pyhpo-2.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`005ef352c4e10886feed97c7af6c2620d0feb9c5bf49187799ad1da1ded7d335`
MD5	`632885d1afc5b05e63b79759bcd463e2`
BLAKE2b-256	`bf1e12da46f40260a8802f840ab5984072f7e4db54d7df8abcf68f82c27e8e6e`

