Skip to main content

An ETL pipeline to extract INSPIRE data into the MEDS format.

Project description

Extract your custom dataset via MEDS-Transforms

codecov tests code-quality python license PRs contributors This pipeline extracts the INSPIRE dataset (from physionet, https://physionet.org/content/inspire/) into the MEDS format.

Usage:

pip install INSPIRE_MEDS
export DATASET_DOWNLOAD_USERNAME=$PHYSIONET_USERNAME
export DATASET_DOWNLOAD_PASSWORD=$PHYSIONET_PASSWORD
MEDS_extract-INSPIRE root_output_dir=$ROOT_OUTPUT_DIR

When you run this, the program will:

  1. Download the needed raw INSPIRE files for the currently supported version into $ROOT_OUTPUT_DIR/raw_input.
  2. Perform initial, pre-MEDS processing on the raw INSPIRE files, saving the results in $ROOT_OUTPUT_DIR/pre_MEDS.
  3. Construct the final MEDS cohort, and save it to $ROOT_OUTPUT_DIR/MEDS_cohort.

You can also specify the target directories more directly, with

export DATASET_DOWNLOAD_USERNAME=$PHYSIONET_USERNAME
export DATASET_DOWNLOAD_PASSWORD=$PHYSIONET_PASSWORD
MEDS_extract-INSPIRE raw_input_dir=$RAW_INPUT_DIR pre_MEDS_dir=$PRE_MEDS_DIR MEDS_cohort_dir=$MEDS_COHORT_DIR

Examples and More Info:

You can run MEDS_extract-INSPIRE --help for more information on the arguments and options. You can also run

MEDS_extract-INSPIRE root_output_dir=$ROOT_OUTPUT_DIR

to run the entire pipeline.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inspire_meds-0.0.3.tar.gz (132.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

INSPIRE_MEDS-0.0.3-py3-none-any.whl (19.4 kB view details)

Uploaded Python 3

File details

Details for the file inspire_meds-0.0.3.tar.gz.

File metadata

  • Download URL: inspire_meds-0.0.3.tar.gz
  • Upload date:
  • Size: 132.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for inspire_meds-0.0.3.tar.gz
Algorithm Hash digest
SHA256 191a4660632cb9e9b2496ee060917bffa6c716a5d44c66f716f7cb2bd541ee08
MD5 94ae58626347bc519713332bf0a0cca7
BLAKE2b-256 cdd0909ee3ae17439c32f3454141fcc7abe3ec114e2fdc3473d6d2c631f8b91e

See more details on using hashes here.

Provenance

The following attestation bundles were made for inspire_meds-0.0.3.tar.gz:

Publisher: python-build.yaml on rvandewater/INSPIRE_MEDS

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file INSPIRE_MEDS-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: INSPIRE_MEDS-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 19.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for INSPIRE_MEDS-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 541f30ff7a9d461f797405198b1649f52f99b7ec5bcb3f67f84ca103ac3e9a9c
MD5 cdf22135a7f1afc430978350d8077086
BLAKE2b-256 b4b0a59c65f813ba5e3a4a9ea0f3c79de2d2613f115f8782e7ae9f11ad3987e9

See more details on using hashes here.

Provenance

The following attestation bundles were made for INSPIRE_MEDS-0.0.3-py3-none-any.whl:

Publisher: python-build.yaml on rvandewater/INSPIRE_MEDS

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page