Skip to main content

An ETL pipeline to extract INSPIRE data into the MEDS format.

Project description

Extract your custom dataset via MEDS-Transforms

codecov tests code-quality python license PRs contributors

This pipeline extracts the INSPIRE dataset (from physionet, https://physionet.org/content/inspire/) into the MEDS format.

Usage:

pip install INSPIRE_MEDS
export DATASET_DOWNLOAD_USERNAME=$PHYSIONET_USERNAME
export DATASET_DOWNLOAD_PASSWORD=$PHYSIONET_PASSWORD
MEDS_extract-INSPIRE root_output_dir=$ROOT_OUTPUT_DIR

When you run this, the program will:

  1. Download the needed raw INSPIRE files for the currently supported version into $ROOT_OUTPUT_DIR/raw_input.
  2. Perform initial, pre-MEDS processing on the raw INSPIRE files, saving the results in $ROOT_OUTPUT_DIR/pre_MEDS.
  3. Construct the final MEDS cohort, and save it to $ROOT_OUTPUT_DIR/MEDS_cohort.

You can also specify the target directories more directly, with

export DATASET_DOWNLOAD_USERNAME=$PHYSIONET_USERNAME
export DATASET_DOWNLOAD_PASSWORD=$PHYSIONET_PASSWORD
MEDS_extract-INSPIRE raw_input_dir=$RAW_INPUT_DIR pre_MEDS_dir=$PRE_MEDS_DIR MEDS_cohort_dir=$MEDS_COHORT_DIR

Examples and More Info:

You can run MEDS_extract-INSPIRE --help for more information on the arguments and options. You can also run

MEDS_extract-INSPIRE root_output_dir=$ROOT_OUTPUT_DIR

to run the entire pipeline.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inspire_meds-0.0.4.tar.gz (131.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

INSPIRE_MEDS-0.0.4-py3-none-any.whl (18.5 kB view details)

Uploaded Python 3

File details

Details for the file inspire_meds-0.0.4.tar.gz.

File metadata

  • Download URL: inspire_meds-0.0.4.tar.gz
  • Upload date:
  • Size: 131.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for inspire_meds-0.0.4.tar.gz
Algorithm Hash digest
SHA256 f4d00f118e4c2277492eafa5fee7a3f43454466e999eb5c249f6012d079c027f
MD5 8b36d59bb143fe7f16525ec9dadea04c
BLAKE2b-256 27e8bd6a82106e1be9ed2dcec844d9dc004f3a475bb0d9fc55858a7131d0e2ea

See more details on using hashes here.

Provenance

The following attestation bundles were made for inspire_meds-0.0.4.tar.gz:

Publisher: python-build.yaml on rvandewater/INSPIRE_MEDS

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file INSPIRE_MEDS-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: INSPIRE_MEDS-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 18.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for INSPIRE_MEDS-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 f38b4251f89d48ed5f6358f390fa11b3fbbcd90c893ecbeca6f2a62bd7845239
MD5 0d0904a981f55f95f070d1ce73192ab3
BLAKE2b-256 ee0546b471437b86c8fc8e5cbb0c13f30138dddc3f878bb98ebd6dc9ed97adad

See more details on using hashes here.

Provenance

The following attestation bundles were made for INSPIRE_MEDS-0.0.4-py3-none-any.whl:

Publisher: python-build.yaml on rvandewater/INSPIRE_MEDS

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page