An ETL pipeline to extract INSPIRE data into the MEDS format.
Project description
INSPIRE-MEDS
This pipeline extracts the INSPIRE dataset (from physionet, https://physionet.org/content/inspire/) into the MEDS format.
Usage:
pip install INSPIRE_MEDS
export DATASET_DOWNLOAD_USERNAME=$PHYSIONET_USERNAME
export DATASET_DOWNLOAD_PASSWORD=$PHYSIONET_PASSWORD
MEDS_extract-INSPIRE root_output_dir=$ROOT_OUTPUT_DIR
When you run this, the program will:
- Download the needed raw INSPIRE files for the currently supported version into
$ROOT_OUTPUT_DIR/raw_input. - Perform initial, pre-MEDS processing on the raw INSPIRE files, saving the results in
$ROOT_OUTPUT_DIR/pre_MEDS. - Construct the final MEDS cohort, and save it to
$ROOT_OUTPUT_DIR/MEDS_cohort.
You can also specify the target directories more directly, with
export DATASET_DOWNLOAD_USERNAME=$PHYSIONET_USERNAME
export DATASET_DOWNLOAD_PASSWORD=$PHYSIONET_PASSWORD
MEDS_extract-INSPIRE raw_input_dir=$RAW_INPUT_DIR pre_MEDS_dir=$PRE_MEDS_DIR MEDS_cohort_dir=$MEDS_COHORT_DIR
Examples and More Info:
You can run MEDS_extract-INSPIRE --help for more information on the arguments and options. You can also run
MEDS_extract-INSPIRE root_output_dir=$ROOT_OUTPUT_DIR
to run the entire pipeline.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file inspire_meds-0.0.5.tar.gz.
File metadata
- Download URL: inspire_meds-0.0.5.tar.gz
- Upload date:
- Size: 131.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4b9c73b74714a7d31f232d2d1dae73246805db984e257df9a2d1e08d8301bf5f
|
|
| MD5 |
a0db51e32c0190c5f8260f0f2231893c
|
|
| BLAKE2b-256 |
502d36d29110cc4d1bdacecbf873615e1a73683840b08b13abb14bc0a27e4820
|
Provenance
The following attestation bundles were made for inspire_meds-0.0.5.tar.gz:
Publisher:
python-build.yaml on rvandewater/INSPIRE_MEDS
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
inspire_meds-0.0.5.tar.gz -
Subject digest:
4b9c73b74714a7d31f232d2d1dae73246805db984e257df9a2d1e08d8301bf5f - Sigstore transparency entry: 191826975
- Sigstore integration time:
-
Permalink:
rvandewater/INSPIRE_MEDS@d296416b0b8eda87dee4dd2d48e941fef697b2a9 -
Branch / Tag:
refs/tags/0.0.5 - Owner: https://github.com/rvandewater
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-build.yaml@d296416b0b8eda87dee4dd2d48e941fef697b2a9 -
Trigger Event:
push
-
Statement type:
File details
Details for the file inspire_meds-0.0.5-py3-none-any.whl.
File metadata
- Download URL: inspire_meds-0.0.5-py3-none-any.whl
- Upload date:
- Size: 18.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
05c9e6b21fc16736680dbc097406758f17e4aff9b49dc138c2d6cdc72c889303
|
|
| MD5 |
602952566d40b4b6ec0229c791010883
|
|
| BLAKE2b-256 |
1f46656f17ed0f41f842f480c7509ae9aaad669a5d52a9688fcc08a6c368fd72
|
Provenance
The following attestation bundles were made for inspire_meds-0.0.5-py3-none-any.whl:
Publisher:
python-build.yaml on rvandewater/INSPIRE_MEDS
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
inspire_meds-0.0.5-py3-none-any.whl -
Subject digest:
05c9e6b21fc16736680dbc097406758f17e4aff9b49dc138c2d6cdc72c889303 - Sigstore transparency entry: 191826976
- Sigstore integration time:
-
Permalink:
rvandewater/INSPIRE_MEDS@d296416b0b8eda87dee4dd2d48e941fef697b2a9 -
Branch / Tag:
refs/tags/0.0.5 - Owner: https://github.com/rvandewater
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-build.yaml@d296416b0b8eda87dee4dd2d48e941fef697b2a9 -
Trigger Event:
push
-
Statement type: