An ETL pipeline to extract HiRID data into the MEDS format.

Extract your custom dataset via MEDS-Transforms

Warning: This ETL currently needs a lot of resources to run.

This package provides a MEDS-Transforms-powered extraction pipeline for the HiRID dataset, built from the MEDS-Transforms extraction template repository. Once it is installed, you can run the extraction pipeline with a few simple command-line commands, such as:

pip install HIRID_MEDS # you can do this locally or via PyPI
# Download your data or set download credentials
MEDS_extract-HIRID root_output_dir=$ROOT_OUTPUT_DIR do_download=true raw_input_dir=$RAW_INPUT_DIR
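
If you prefer to launch the extraction from a Python script, the command above can be assembled programmatically. The following is a minimal sketch, not part of the package: the helper name `build_extract_command` is hypothetical, and it only reuses the three overrides shown above (`root_output_dir`, `do_download`, `raw_input_dir`).

```python
import shlex


def build_extract_command(root_output_dir: str, raw_input_dir: str, do_download: bool = True) -> list[str]:
    """Build the argument list for the MEDS_extract-HIRID command shown above.

    Hypothetical helper: it only formats the overrides from the README example.
    """
    return [
        "MEDS_extract-HIRID",
        f"root_output_dir={root_output_dir}",
        # Booleans are written in lowercase, as in the README example.
        f"do_download={'true' if do_download else 'false'}",
        f"raw_input_dir={raw_input_dir}",
    ]


if __name__ == "__main__":
    # Example paths are placeholders; replace them with your own directories.
    cmd = build_extract_command("/data/hirid_meds", "/data/hirid_raw")
    # Print a shell-safe version of the command for inspection.
    print(shlex.join(cmd))
    # To actually run it, pass the list to subprocess.run(cmd, check=True).
```

Passing the arguments as a list (rather than one string) avoids shell quoting problems if a path contains spaces.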

MEDS-Transforms settings

If you want to convert a large dataset, you can parallelize the MEDS-Transforms stage, which is the step of the pipeline that takes the longest.

For local parallelization, install the hydra-joblib-launcher package:

pip install hydra-joblib-launcher --upgrade

Then set the number of workers as an environment variable:

export N_WORKERS=8

Moreover, you can set the number of subjects per shard to balance the parallelization overhead based on how many subjects you have in your dataset:

export N_SUBJECTS_PER_SHARD=100000
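
To see how the two settings interact, the sketch below estimates how many shards a given shard size produces and how many workers can be busy at once. It assumes that each shard is processed by one worker at a time, which is an assumption about the pipeline and not something stated above. The function name `shard_plan` and the subject count of 250,000 are hypothetical and chosen only for illustration.

```python
import os


def shard_plan(n_subjects: int, n_subjects_per_shard: int, n_workers: int) -> dict:
    """Estimate the shard count and the number of workers that can run in parallel.

    Assumption (not confirmed by the README): one shard is one unit of parallel work.
    """
    if n_subjects < 0 or n_subjects_per_shard <= 0 or n_workers <= 0:
        raise ValueError("n_subjects must be >= 0; shard size and worker count must be > 0")
    # Integer ceiling division: the last shard may be only partially filled.
    n_shards = -(-n_subjects // n_subjects_per_shard)
    # Workers beyond the number of shards would have nothing to do.
    busy_workers = min(n_shards, n_workers)
    return {"n_shards": n_shards, "busy_workers": busy_workers}


if __name__ == "__main__":
    # Read the same environment variables as above, with the README values as defaults.
    n_workers = int(os.environ.get("N_WORKERS", "8"))
    per_shard = int(os.environ.get("N_SUBJECTS_PER_SHARD", "100000"))
    # 250_000 is a made-up subject count, not the size of HiRID.
    print(shard_plan(250_000, per_shard, n_workers))
```

Under this assumption, a shard size larger than the dataset yields a single shard, so additional workers would sit idle; a smaller shard size gives more shards at the cost of more per-shard overhead.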

Citation

If you use this dataset, please cite the original publications below and the ETL (see "Cite this repository"):

Faltys, M., Zimmermann, M., Lyu, X., Hüser, M., Hyland, S., Rätsch, G., & Merz, T. (2021). HiRID, a high time-resolution ICU dataset (version 1.1.1). PhysioNet. https://doi.org/10.13026/nkwc-js72.

Hyland, S.L., Faltys, M., Hüser, M. et al. Early prediction of circulatory failure in the intensive care unit using machine learning. Nat Med 26, 364–373 (2020). https://doi.org/10.1038/s41591-020-0789-4
