Skip to main content

An ETL pipeline to extract NWICU data into the MEDS format.

Project description

NWICU MEDS Extraction ETL

PyPI - Version Documentation Status codecov tests code-quality python license PRs contributors DOI

This pipeline extracts the NWICU dataset (from physionet, https://physionet.org/content/nwicu-northwestern-icu/0.1.0/) into the MEDS format.

Usage:

pip install NWICU_MEDS
export DATASET_DOWNLOAD_USERNAME=$PHYSIONET_USERNAME
export DATASET_DOWNLOAD_PASSWORD=$PHYSIONET_PASSWORD
MEDS_extract-NWICU root_output_dir=$ROOT_OUTPUT_DIR

When you run this, the program will:

  1. Download the needed raw NWICU files for the currently supported version into $ROOT_OUTPUT_DIR/raw_input.
  2. Perform initial, pre-MEDS processing on the raw NWICU files, saving the results in $ROOT_OUTPUT_DIR/pre_MEDS.
  3. Construct the final MEDS cohort, and save it to $ROOT_OUTPUT_DIR/MEDS_cohort.

You can also specify the target directories more directly, with

export DATASET_DOWNLOAD_USERNAME=$PHYSIONET_USERNAME
export DATASET_DOWNLOAD_PASSWORD=$PHYSIONET_PASSWORD
MEDS_extract-NWICU raw_input_dir=$RAW_INPUT_DIR pre_MEDS_dir=$PRE_MEDS_DIR MEDS_cohort_dir=$MEDS_COHORT_DIR

Examples and More Info:

You can run MEDS_extract-NWICU --help for more information on the arguments and options. You can also run

MEDS_extract-NWICU root_output_dir=$ROOT_OUTPUT_DIR

to run the entire pipeline.

Citation

we provide an ETL for the following resource: Moukheiber, D., Temps, W., Molgi, B., Li, Y., Lu, A., Nannapaneni, P., Chahin, A., Hao, S., Torres Fabregas, F., Celi, L. A., Wong, A., Lloyd, M., Borrat Frigola, X., Lee, H., Schneider, D., Pollard, T., Luo, Y., Kho, A., & Mark, R. (2024). Northwestern ICU (NWICU) database (version 0.1.0). PhysioNet. https://doi.org/10.13026/s84w-1829.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nwicu_meds-0.0.11.tar.gz (21.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nwicu_meds-0.0.11-py3-none-any.whl (17.3 kB view details)

Uploaded Python 3

File details

Details for the file nwicu_meds-0.0.11.tar.gz.

File metadata

  • Download URL: nwicu_meds-0.0.11.tar.gz
  • Upload date:
  • Size: 21.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for nwicu_meds-0.0.11.tar.gz
Algorithm Hash digest
SHA256 148326997861068ee49594c5b906c09ba58d85ab344be7d3effbcc71f3942c48
MD5 009bf3faace882235500af53eaf10ee2
BLAKE2b-256 36923b006490c8711e4cd323501ca74d4d04d4555a907cdb7e676e4573a27a01

See more details on using hashes here.

Provenance

The following attestation bundles were made for nwicu_meds-0.0.11.tar.gz:

Publisher: python-build.yaml on rvandewater/NWICU_MEDS

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file nwicu_meds-0.0.11-py3-none-any.whl.

File metadata

  • Download URL: nwicu_meds-0.0.11-py3-none-any.whl
  • Upload date:
  • Size: 17.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for nwicu_meds-0.0.11-py3-none-any.whl
Algorithm Hash digest
SHA256 2392f34369e82c7ea744fad9f58952952a0517191f32047e44972b739e0aa8d0
MD5 e2749a47a5bab3e3e04c96bfa3937f34
BLAKE2b-256 540eba0146baeef02fd3bd68318c1e890751abffa2e28f44d0bc679e2851a739

See more details on using hashes here.

Provenance

The following attestation bundles were made for nwicu_meds-0.0.11-py3-none-any.whl:

Publisher: python-build.yaml on rvandewater/NWICU_MEDS

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page