auto-ehrmonize-flowsheet

Clinical flowsheet label harmonization using sentence embeddings and vector search.

Install

pip install auto-ehrmonize-flowsheet

One-time setup

Download the vector database (ChromaDB + HDF5 value embeddings) from HuggingFace into ~/.auto_ehrmonize_flowsheet/:

ehrmonize setup --token <HF_TOKEN> --repo-id <user/repo>

Alternatively, set HUGGING_FACE_KEY and HF_REPO_ID as environment variables (or in a .env file in your working directory), then run ehrmonize setup with no arguments.
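For scripted or CI environments, the environment-variable route can be driven from Python. The sketch below is an illustration rather than part of the package: the helper name setup_command_and_env is hypothetical, and it only assembles the ehrmonize setup command together with the two variables named above. It assumes the ehrmonize executable is on your PATH when you actually run the command.

```python
import os


def setup_command_and_env(hf_token, repo_id, base_env=None):
    """Return (command, env) for running `ehrmonize setup` via environment variables.

    Hypothetical helper: it does not call the package, it only prepares
    the inputs for subprocess.run.
    """
    # Copy the environment so the caller's os.environ is never mutated.
    env = dict(os.environ if base_env is None else base_env)
    env["HUGGING_FACE_KEY"] = hf_token
    env["HF_REPO_ID"] = repo_id
    return ["ehrmonize", "setup"], env


# Usage (runs the real CLI, so the package must be installed):
#   import subprocess
#   cmd, env = setup_command_and_env("<HF_TOKEN>", "<user/repo>")
#   subprocess.run(cmd, env=env, check=True)
```

Passing the token through the environment rather than through --token keeps it out of shell history and process listings.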

Quick start (Python)

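from auto_ehrmonize_flowsheet import AutoEHRmonizeFlowsheet

# Point the harmonizer at your flowsheet dataset (CSV)
harmonizer = AutoEHRmonizeFlowsheet("data/test_data.csv")
# Look up harmonized matches for a single label
results = harmonizer.harmonize("Heart Rate")
print(results)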
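To harmonize several labels in one pass, the single-label call can be wrapped in a loop. This is a minimal sketch that relies only on the harmonize(label) method shown above; harmonize_many is a hypothetical helper, not a package function, and it makes no assumption about what harmonize returns.

```python
def harmonize_many(harmonizer, labels):
    """Map each distinct, non-empty label to the result of harmonizer.harmonize(label).

    Hypothetical helper: `harmonizer` is any object exposing harmonize(label).
    """
    results = {}
    for label in labels:
        key = label.strip()
        # Skip blanks and avoid repeating a lookup for duplicate labels.
        if key and key not in results:
            results[key] = harmonizer.harmonize(key)
    return results


# Usage (the second label is illustrative):
#   from auto_ehrmonize_flowsheet import AutoEHRmonizeFlowsheet
#   harmonizer = AutoEHRmonizeFlowsheet("data/test_data.csv")
#   print(harmonize_many(harmonizer, ["Heart Rate", "Resp Rate"]))
```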

CLI

# Look up a single label
ehrmonize lookup "Heart Rate" --data data/test_data.csv

# With options
ehrmonize lookup "Heart Rate" --data data/test_data.csv --source mimic --threshold 0.6 --top-k 5

# List unique labels in your dataset
ehrmonize labels --data data/test_data.csv
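The --threshold and --top-k options suggest a similarity cutoff followed by a cap on the number of matches. The sketch below illustrates that general pattern with cosine similarity over toy vectors. It is an assumption about how such options typically behave, not the package's implementation: the function names, the use of cosine similarity, and the example vectors are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def top_matches(query_vec, candidates, threshold=0.6, top_k=5):
    """Return up to top_k (name, score) pairs with score >= threshold, best first."""
    scored = [(name, cosine(query_vec, vec)) for name, vec in candidates.items()]
    kept = [(name, score) for name, score in scored if score >= threshold]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return kept[:top_k]


# Toy 2-D "embeddings"; real sentence embeddings have hundreds of dimensions.
candidates = {
    "heart_rate": [1.0, 0.0],
    "pulse": [0.8, 0.6],
    "resp_rate": [0.0, 1.0],
}
matches = top_matches([1.0, 0.0], candidates, threshold=0.6, top_k=5)
print([name for name, _ in matches])
```

Under this reading, raising the threshold trades recall for precision, while top-k only limits how many of the surviving candidates are reported.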

License

MIT
