Skip to main content

Testing similarity of HPO terms between probands sharing variants in genes.

Project description

Build Status

Similarity of phenotypes in patient groups

This estimates how likely it is for groups of individuals to have similar phenotypes. To estimate this probability, we need three things:

  • a way to quantify phenotypic similarity of two individuals. We use the maximum information content of the most informative common ancestor for each pair of HPO terms from two probands.
  • a way to quantify similarity across more than two probands. We sum phenotypic similarity scores from all pairs of probands.
  • a null distribution of similarity scores for those probands, generated by randomly sampled groups of probands

The P value is calculated as the proportion of simulated scores greater than the observed probands' score.

Install

Install the package with:

pip install hpo_similarity
Run
hpo_similarity --genes genes.json --phenotypes phenotypes.json

The data directory on github includes example files, one with proband IDs per gene (data/example_genes.json), and the other with HPO terms per proband (data/example_phenotypes.json).

Additional options:

  • --output PATH to send output gene and P-values to a file.
  • --ontology PATH to use a HPO ontology file other than the default.
  • --iterations INTEGER to change the number of iterations (default=100000)

You can also explore the HPO graph using the hpo_similarity package within python, for example:

from hpo_similarity import Ontology

ontology = Ontology()
graph = ontology.get_graph()

# find all descendant terms
graph.get_descendants('HP:0001249')

# get the text for the phenotypic abnormality
graph.nodes['HP:0001249']['name']
Acknowledgments

This code incorporates the following code and datasets:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

hpo_similarity-0.4.3-py2.py3-none-any.whl (766.9 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file hpo_similarity-0.4.3-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for hpo_similarity-0.4.3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 15810bc2e3fd3cf4c5375658f7f738cb07af749e5bfbd88c31d35874a953403d
MD5 fa61e1ab3161609824249f6c425cd398
BLAKE2b-256 89641b85f87f5c25329b86983ab82df5c8947b3a87c63829ed52db4b7f65a6a7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page