Clinical flowsheet label harmonization using sentence embeddings and vector search.
Project description
auto-ehrmonize-flowsheet
Clinical flowsheet label harmonization using sentence embeddings and vector search.
Install
pip install auto-ehrmonize-flowsheet
One-time setup
Download the vector database (ChromaDB + HDF5 value embeddings) from HuggingFace
into ~/.auto_ehrmonize_flowsheet/:
ehrmonize setup --token <HF_TOKEN> --repo-id <user/repo>
Alternatively, set HUGGING_FACE_KEY and HF_REPO_ID as environment variables
(or in a .env file in your working directory) and just run ehrmonize setup.
Quick start (Python)
from auto_ehrmonize_flowsheet import AutoEHRmonizeFlowsheet
harmonizer = AutoEHRmonizeFlowsheet("data/test_data.csv")
results = harmonizer.harmonize("Heart Rate")
print(results)
CLI
# Look up a single label
ehrmonize lookup "Heart Rate" --data data/test_data.csv
# With options
ehrmonize lookup "Heart Rate" --data data/test_data.csv --source mimic --threshold 0.6 --top-k 5
# List unique labels in your dataset
ehrmonize labels --data data/test_data.csv
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file auto_ehrmonize_flowsheet-0.1.0.tar.gz.
File metadata
- Download URL: auto_ehrmonize_flowsheet-0.1.0.tar.gz
- Upload date:
- Size: 9.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2964471707a2cd75428508a30ca41ce84a41547d458baf354dae9f723117b45a
|
|
| MD5 |
d8535ba1d1bc8faaf59f117ed675113d
|
|
| BLAKE2b-256 |
865ac071eda2594e43c48f95d5e815af7c28e4494d7b689cbd76a6cc8e3e9e79
|
File details
Details for the file auto_ehrmonize_flowsheet-0.1.0-py3-none-any.whl.
File metadata
- Download URL: auto_ehrmonize_flowsheet-0.1.0-py3-none-any.whl
- Upload date:
- Size: 11.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8d2e9e0f16d7ec2f76afbf8964d698d8211ac92b954d06f1f13b38e5b0b54b91
|
|
| MD5 |
030480e46a932c8a4f5152678ff6a6ea
|
|
| BLAKE2b-256 |
a41ca5f251fddfcdba9cfe9f8b4a36f31a40f2cda9ce0287bf3d462fdd31d1cf
|