Skip to main content

Collection of GA4GH Phenopackets

Project description

Phenopacket Store Toolkit

Phenopacket Store Toolkit is a Python package and CLI for managing Phenopacket Store, a collection of GA4GH Phenopacket cohorts that represent individuals with Mendelian diseases.

The toolkit aids the release and Q/C processes, and simplifies access to the Phenopacket Store data from the downstream applications.

Availability

Phenopacket Store Toolkit can be installed from Python Package Index (PyPi):

python3 -m pip install phenopacket-store-toolkit

Examples

Access Phenopacket Store

The toolkit simplifies download and loading the cohort data. The PhenopacketStoreRegistry API caches the release ZIP files locally (in $HOME/.phenopacket-store by default) and simplifies the loading:

from ppktstore.registry import configure_phenopacket_registry

registry = configure_phenopacket_registry()

with registry.open_phenopacket_store(release="0.1.18") as ps:
   phenopackets = list(ps.iter_cohort_phenopackets("SUOX"))

assert len(phenopackets) == 35

The code checks if the release ZIP of Phenopacket Store version 0.1.18 is already available locally, and downloads the release ZIP file if necessary. This is followed by opening the store as ps and loading all phenopackets of the SUOX cohort.

We use Python context manager to ensure proper closing of the ZIP file handle. ps cannot be used outside of the context manager block.

As an alternative to using a specific Phenopacket Store release, the latest release will be used if release argument is omitted.

Make Phenopacket Store release

The release is handled by the Command Line Interface (CLI) of the toolkit.

The release functionality requires additional dependencies, which are installed automatically by adding release profile:

python3 -m pip install phenopacket-store-toolkit[release]

Now, we can Q/C the phenopackets in the notebooks directory. The Q/C uses HPO hierarchy, hence HPO must be provided either as path to a hp.json file via --hpo option or as a release tag via --hpo-release:

python3 -m ppktstore qc --hpo-release v2024-04-26 --notebook-dir notebooks

and we can create the release archive by running:

python3 -m ppktstore package --notebook-dir notebooks --release-tag 0.1.18 --output all_phenopackets

This will find all phenopackets in the notebooks folder, copy them into a top-level directory called 0.1.18, and ZIP the directory into all_phenopackets.zip.

Learn more

Find more info in our detailed documentation:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phenopacket_store_toolkit-0.1.6.tar.gz (26.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

phenopacket_store_toolkit-0.1.6-py3-none-any.whl (32.0 kB view details)

Uploaded Python 3

File details

Details for the file phenopacket_store_toolkit-0.1.6.tar.gz.

File metadata

File hashes

Hashes for phenopacket_store_toolkit-0.1.6.tar.gz
Algorithm Hash digest
SHA256 0c3ecdabb43de6e22318fefeef2816ddec079c1547bc0b7572c0fd038976167b
MD5 542468f3ae6d4b7fc006b8d10e6c2dd9
BLAKE2b-256 4ea40bb8e3c143d9f4dfea405fc0d1b94b65a2bb9bc00653c0f0191dd123ca0a

See more details on using hashes here.

File details

Details for the file phenopacket_store_toolkit-0.1.6-py3-none-any.whl.

File metadata

File hashes

Hashes for phenopacket_store_toolkit-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 ccdf36b219c28f2c2d92f79f9ed9f5eb85369c59c624ca6ed92ac1d175ca0adc
MD5 8bd910f68bea47d8eb8aa800efc5f001
BLAKE2b-256 da79e7a1de6cfd559ce6ac1598e921843b315c28df7a53108a248e6550c82aff

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page