Utilities for evaluating autoregressive generated trajectories over MEDS datasets, including temporal AUC computation and zero-shot ACES task labeling.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

mmd_pypi

These details have not been verified by PyPI

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

MEDS Logo

MEDS Trajectory Evaluation

python

This package contains utilities for evaluating the quality of autoregressive, generated trajectories produced by foundation models over MEDS datasets. In particular, the following are supported:

A formal schema for representing generated trajectories over MEDS datasets (See 2.A for quickstart).
Utilities for producing simple but high-granularity AUROCs for prediction of whether or not a specified simple ACES predicate will occur within simple windows in the patient future (See 2.B for quickstart).
Utilities for converting generated trajectories into empirical probabilities given a full ACES task definition, with optional support for "relaxations" that can be applied to the ACES task definition to support more flexible labeling. (See 2.C for quickstart).

[!WARNING] This package is a work in progress and is not yet stable. The API may change in future releases.

Quickstart

1. Install

pip install MEDS_trajectory_evaluation

2. Run

2.A Generated Trajectory Schema

The included GeneratedTrajectorySchema provides a format to capture generated trajectories in a consistent manner; in particular, it asserts that generated trajectories should look identical to real MEDS data, with the addition of a prediction_time column that indicates the latest time that input data was used to generate the trajectory.

You can use this schema through the normal usage options offered by flexible_schema class objects. For example, assuming you have a DataFrame trajectories_df containing generated trajectories with the appropriate columns:

>>> trajectories_df = pl.DataFrame({
...     "subject_id": [1, 1, 1, 2, 2],
...     "time": [
...         datetime(1993, 1, 1, 12, 0),
...         datetime(1993, 1, 1, 13, 0),
...         datetime(1993, 1, 1, 14, 0),
...         datetime(1999, 1, 1, 13, 0),
...         datetime(1999, 1, 1, 14, 0),
...     ],
...     "code": ["LAB_1", "LAB_2", "ICU_DISCHARGE", "LAB_3", "LAB_4"],
...     "numeric_value": [1.0, None, None, None, 1.1],
...     "prediction_time": [
...         datetime(1993, 1, 1, 0, 0),
...         datetime(1993, 1, 1, 0, 0),
...         datetime(1993, 1, 1, 0, 0),
...         datetime(1999, 1, 1, 0, 0),
...         datetime(1999, 1, 1, 0, 0),
...     ],
... })
>>> trajectories_df
shape: (5, 5)
┌────────────┬─────────────────────┬───────────────┬───────────────┬─────────────────────┐
│ subject_id ┆ time                ┆ code          ┆ numeric_value ┆ prediction_time     │
│ ---        ┆ ---                 ┆ ---           ┆ ---           ┆ ---                 │
│ i64        ┆ datetime[μs]        ┆ str           ┆ f64           ┆ datetime[μs]        │
╞════════════╪═════════════════════╪═══════════════╪═══════════════╪═════════════════════╡
│ 1          ┆ 1993-01-01 12:00:00 ┆ LAB_1         ┆ 1.0           ┆ 1993-01-01 00:00:00 │
│ 1          ┆ 1993-01-01 13:00:00 ┆ LAB_2         ┆ null          ┆ 1993-01-01 00:00:00 │
│ 1          ┆ 1993-01-01 14:00:00 ┆ ICU_DISCHARGE ┆ null          ┆ 1993-01-01 00:00:00 │
│ 2          ┆ 1999-01-01 13:00:00 ┆ LAB_3         ┆ null          ┆ 1999-01-01 00:00:00 │
│ 2          ┆ 1999-01-01 14:00:00 ┆ LAB_4         ┆ 1.1           ┆ 1999-01-01 00:00:00 │
└────────────┴─────────────────────┴───────────────┴───────────────┴─────────────────────┘

... then you can align it to the GeneratedTrajectorySchema and write it out as a Parquet file as follows:

>>> from MEDS_trajectory_evaluation.schema import GeneratedTrajectorySchema
>>> pa_table = GeneratedTrajectorySchema.align(trajectories_df.to_arrow())
>>> pa_table
pyarrow.Table
subject_id: int64
time: timestamp[us]
code: string
numeric_value: float
prediction_time: timestamp[us]
----
subject_id: [[1,1,1,2,2]]
time: [[1993-01-01 12:00:00.000000,...,1999-01-01 14:00:00.000000]]
code: [["LAB_1",...,"LAB_4"]]
numeric_value: [[1,...,1.1]]
prediction_time: [[1993-01-01 00:00:00.000000,...,1999-01-01 00:00:00.000000]]

2.B Simple Predicate Labeling

If you have a simple definition of an "event" (defined as something expressible as an ACES "plain predicate" ) and you want to understand how well a set of trajectories are able to predict whether or not the first incidence of that event after the prediction time occurs within a given (set of) time horizon(s), potentially with an offset, you can use this package to efficiently compute AUROCs for all predicates and horizons as follows:

temporal_aucs(true_tte_df, pred_tte_df, [timedelta(days=1), timedelta(days=7)])
shape: (2, 3)
┌──────────────┬────────┬────────┐
│ duration     ┆ AUC/A  ┆ AUC/B  │
│ ---          ┆ ---    ┆ ---    │
│ duration[μs] ┆ f64    ┆ f64    │
╞══════════════╪════════╪════════╡
│ 1d           ┆ 0.65   ┆ 0.72   │
│ 7d           ┆ 0.71   ┆ 0.80   │
└──────────────┴────────┴────────┘

2.C Full ACES Task Labeling

ZSACES_label task.criteria_fp="$TASK_CRITERIA" task.predicates_fp="$PREDICATES_FP" \
    output_dir=$OUTPUT_DIR trajectories_dir=$TRAJECTORIES_DIR

Optionally, you can add relaxations to the zero-shot labeling config via labeler.remove_all_criteria=True, labeler.collapse_temporal_gap_windows=True, or labeler.remove_post_label_windows=true. See below for examples of these in action.

Full Documentation

Generated Trajectory Schema

The GeneratedTrajectorySchema class is a flexible_schema instance that extends the core MEDS data schema to include the prediction_time element from the MEDS label schema that defines the latest permitted time for the input data used to generate the source trajectory. It is expected that you store different trajectories generated from the same prediction time per subject (e.g., if you generate 100 sample trajectories per subject-prediction-time) as different data frames. This ensures that data can be properly sorted by subject_id, prediction_time, and time without ambiguity across trajectory samples.

Temporal AUC Evaluation

The temporal_AUC_evaluation package contains helpers for turning time-to-first-event observations into AUC summaries across multiple prediction horizons.

Helper functions

get_raw_tte and get_trajectory_tte extract time-to-event values for each predicate from real datasets or generated trajectories.
merge_pred_ttes stacks multiple predicted TTE tables into list columns so probability distributions can be derived per subject.
add_labels_from_true_tte converts true durations into binary labels for a given horizon and add_probs_from_pred_ttes turns predicted durations into probabilities of observing the event within that window.

Computing AUCs

temporal_aucs wires these pieces together and returns a DataFrame indexed by duration with AUC/<predicate> columns detailing discrimination for each predicate at every horizon.

>>> temporal_aucs(true_tte_df, pred_tte_df, [timedelta(days=1), timedelta(days=7)])  # doctest: +SKIP
shape: (2, 3)
┌──────────────┬────────┬────────┐
│ duration     ┆ AUC/A  ┆ AUC/B  │
│ ---          ┆ ---    ┆ ---    │
│ duration[μs] ┆ f64    ┆ f64    │
╞══════════════╪════════╪════════╡
│ 1d           ┆ 0.65   ┆ 0.72   │
│ 7d           ┆ 0.71   ┆ 0.80   │
└──────────────┴────────┴────────┘

Full ACES Task Labeling

[!IMPORTANT] This library only works with a subset of ACES configs; namely, those that have a tree-based set of dependencies between the end of the input window (the prediction time) and the end of the target window (the label window).

Terminology

Term	Description
ACES	ACES is a domain specific language for describing task cohorts and a tool to automatically extract them from EHR datasets. It is the "source of truth" for task definitions in this work.
Task Config	The (original/raw) ACES configuration file that describes the task cohort.
Input Window	The window in the ACES config defining the "prediction time". This is indicated via the `index_timestamp` marker in the ACES config.
Target Window	The window in the ACES config over which the label is extracted. This is indicated via the `label` marker in the ACES config.
Normal-form / Normalized Config	When in "normal-form" or "normalized", a config has an input window that ends with the prediction time and the prediction time node in the task config tree is an ancestor of both ends of the target window.
Relaxations	A configuration relaxation is a modification to the task config that removes constraints or simplifies the relationships between window endpoints. These are used to simplify or broaden the set of identified empirical labels during zero-shot prediction vs. task label extraction.
valid	A trajectory is "valid" under a config when it does not indicate a sequence of measurements that would violate any inclusion/exclusion criteria in the zero-shot task config.
determinable	A trajectory is "determinable" under a config if and only if it is both valid and contains valid realizations of all relevant windows in the config (e.g., we don't need to generate more).

Supported Config Relaxations

We support a few different relaxations that can help make zero-shot label extraction simpler and more accommodating. These relaxations are not always appropriate for all tasks, but they can be useful in some cases. To understand them deeply, we'll use several examples, which we'll set up first.

Example Configurations

To explore these relaxations, we'll use a few simple example task configs. To construct them, we first need to import the relevant ACES config classes:

>>> from aces.config import (
...     PlainPredicateConfig, EventConfig, TaskExtractorConfig, WindowConfig, DerivedPredicateConfig,
... )

We'll also import the print_ACES helper function to visualize the task configs:

>>> from MEDS_trajectory_evaluation.aces_utils import print_ACES

Example 1: In-hospital mortality prediction

>>> in_hosp_mortality_cfg = TaskExtractorConfig(
...     predicates={
...         "admission": PlainPredicateConfig("ADMISSION"),
...         "discharge": PlainPredicateConfig("DISCHARGE"),
...         "death": PlainPredicateConfig("MEDS_DEATH"),
...         "discharge_or_death": DerivedPredicateConfig("or(discharge, death)"),
...     },
...     trigger=EventConfig("admission"),
...     windows={
...         "sufficient_history": WindowConfig(None, "trigger", True, False, has={"_ANY_EVENT": "(5, None)"}),
...         "input": WindowConfig(
...             "trigger", "start + 24h", False, True, index_timestamp="end",
...             has={"admission": "(None, 0)", "discharge_or_death": "(None, 0)"},
...         ),
...         "gap": WindowConfig(
...             "input.end", "start + 24h", False, True,
...             has={"admission": "(None, 0)", "discharge_or_death": "(None, 0)"},
...         ),
...         "target": WindowConfig("gap.end", "start -> discharge_or_death", False, True, label="death"),
...     }
... )
>>> print_ACES(in_hosp_mortality_cfg)
trigger
├── (start of record) sufficient_history.start (at least 5 event(s))
└── (+1 day, 0:00:00) input.end (no admission, discharge_or_death); **Prediction Time**
    └── (+1 day, 0:00:00) gap.end (no admission, discharge_or_death)
        └── (next discharge_or_death) target.end; **Label: Presence of death**

Example 2: 30-day post discharge mortality prediction

Given a hospital admission, we'll use the first 24 hours of data to predict whether or not the patient will die within 30 days of discharge (with a 1-day gap window post discharge to avoid future leakage). We'll also impose another gap window after the admission to ensure that the hospitalization itself lasts at least 48 hours.

>>> post_discharge_mortality_cfg = TaskExtractorConfig(
...     predicates={
...         "admission": PlainPredicateConfig("ADMISSION"),
...         "discharge": PlainPredicateConfig("DISCHARGE"),
...         "death": PlainPredicateConfig("MEDS_DEATH"),
...         "discharge_or_death": DerivedPredicateConfig("or(discharge, death)"),
...     },
...     trigger=EventConfig("admission"),
...     windows={
...         "sufficient_history": WindowConfig(None, "trigger", True, False, has={"_ANY_EVENT": "(5, None)"}),
...         "input": WindowConfig(
...             "trigger", "start + 24h", False, True, index_timestamp="end",
...             has={"admission": "(None, 0)", "discharge_or_death": "(None, 0)"},
...         ),
...         "post_input": WindowConfig(
...             "input.end", "start + 1d", False, True,
...             has={"admission": "(None, 0)", "discharge_or_death": "(None, 0)"},
...         ),
...         "hospitalization": WindowConfig(
...             "input.end", "start -> discharge", False, True, has={"death": "(None, 0)"},
...         ),
...         "gap": WindowConfig(
...             "hospitalization.end", "start + 1d", False, True,
...             has={"admission": "(None, 0)", "death": "(None, 0)"},
...         ),
...         "target": WindowConfig("gap.end", "start + 29d", False, True, label="death"),
...     }
... )
>>> print_ACES(post_discharge_mortality_cfg)
trigger
├── (start of record) sufficient_history.start (at least 5 event(s))
└── (+1 day, 0:00:00) input.end (no admission, discharge_or_death); **Prediction Time**
    ├── (+1 day, 0:00:00) post_input.end (no admission, discharge_or_death)
    └── (next discharge) hospitalization.end (no death)
        └── (+1 day, 0:00:00) gap.end (no admission, death)
            └── (+29 days, 0:00:00) target.end; **Label: Presence of death**

Example 3: 30-day readmission prediction with censoring

This example features a 30-day readmission risk prediction task, but with a post-target censoring protection window.

>>> readmission_cfg = TaskExtractorConfig(
...     predicates={
...         "admission": PlainPredicateConfig("ADMISSION"),
...         "discharge": PlainPredicateConfig("DISCHARGE"),
...         "death": PlainPredicateConfig("MEDS_DEATH"),
...         "discharge_or_death": DerivedPredicateConfig("or(discharge, death)"),
...     },
...     trigger=EventConfig("discharge"),
...     windows={
...         "sufficient_history": WindowConfig(
...             None, "hospitalization.start", True, False, has={"_ANY_EVENT": "(5, None)"}
...         ),
...         "hospitalization": WindowConfig(
...             "end <- admission", "trigger", True, True, has={"_ANY_EVENT": "(10, None)"},
...             index_timestamp="end"
...         ),
...         "gap": WindowConfig(
...             "hospitalization.end", "start + 1d", False, True,
...             has={"admission": "(None, 0)", "death": "(None, 0)"},
...         ),
...         "target": WindowConfig("gap.end", "start + 29d", False, True, label="admission"),
...         "censoring_protection": WindowConfig(
...             "target.end", None, True, True, has={"_ANY_EVENT": "(1, None)"},
...         ),
...     }
... )
>>> print_ACES(readmission_cfg)
trigger; **Prediction Time**
├── (prior admission) hospitalization.start (at least 10 event(s))
│   └── (start of record) sufficient_history.start (at least 5 event(s))
└── (+1 day, 0:00:00) gap.end (no admission, death)
    └── (+29 days, 0:00:00) target.end; **Label: Presence of admission**
        └── (end of record) censoring_protection.end (at least 1 event(s))

Example 4: Two-stage Infusion

In this hypothetical example, we are examining a cohort of patients who are given an infusion, then given a drug, then (within 10 minutes) have their infusion stopped temporarily, then resumed. We are interested in predicting, at the time of the drug being given, about an adverse event within their second infusion stage. The reason to have such a task is to explore when relaxations are or aren't appropriate in more complex set-ups.

>>> two_stage_cfg = TaskExtractorConfig(
...     predicates={
...         "infusion_start": PlainPredicateConfig("INFUSION//START"),
...         "infusion_end": PlainPredicateConfig("INFUSION//END"),
...         "drug_given": PlainPredicateConfig("special_drug"),
...         "adverse_event": PlainPredicateConfig("special_adverse_event"),
...     },
...     trigger=EventConfig("drug_given"),
...     windows={
...         "1st_infusion": WindowConfig(
...             "trigger", "start -> infusion_end", True, True, has={"adverse_event": "(None, 0)"},
...             index_timestamp="start",
...         ),
...         "2nd_infusion": WindowConfig(
...             "1st_infusion.end -> infusion_start", "start -> infusion_end", True, True,
...             label="adverse_event"
...         ),
...     }
... )
>>> print_ACES(two_stage_cfg)
trigger; **Prediction Time**
└── (next infusion_end) 1st_infusion.end (no adverse_event)
    └── (next infusion_start) 2nd_infusion.start
        └── (next infusion_end) 2nd_infusion.end; **Label: Presence of adverse_event**

Other examples we can't reflect:

What if we only want to count something as a readmission only if the next admission has a discharge associated with a particular diagnosis code? We can't reflect this in ACES currently, but it would pose additional challenges.

Relaxations

We can perform any of the relaxations with the convert_to_zero_shot function in task_config and an appropriate labeler config. Let's import that now for use with our examples:

>>> from MEDS_trajectory_evaluation.ACES_config_evaluation.task_config import convert_to_zero_shot

Even without any relaxations, the zero-shot conversion naturally prunes the tree to include only those nodes between the prediction time window and the label window or after the label window.

>>> print_ACES(convert_to_zero_shot(in_hosp_mortality_cfg))
input.end; **Prediction Time**
└── (+1 day, 0:00:00) gap.end (no admission, discharge_or_death)
    └── (next discharge_or_death) target.end; **Label: Presence of death**

[!WARNING] This can remove some criteria that you may still want to leverage. See, for example, how the post discharge config has lost the window asserting the hospitalization is at least 48 hours. This could be corrected by having the hospitalization window depend directly on the post input window, rather than the input window.

>>> print_ACES(convert_to_zero_shot(post_discharge_mortality_cfg))
input.end; **Prediction Time**
└── (next discharge) hospitalization.end (no death)
    └── (+1 day, 0:00:00) gap.end (no admission, death)
        └── (+29 days, 0:00:00) target.end; **Label: Presence of death**

We still retain the prediction time, label, and relevant criteria in this view.

>>> print_ACES(convert_to_zero_shot(readmission_cfg))
hospitalization.end; **Prediction Time**
└── (+1 day, 0:00:00) gap.end (no admission, death)
    └── (+29 days, 0:00:00) target.end; **Label: Presence of admission**
        └── (end of record) censoring_protection.end (at least 1 event(s))
>>> print_ACES(convert_to_zero_shot(two_stage_cfg))
1st_infusion.start; **Prediction Time**
└── (next infusion_end) 1st_infusion.end (no adverse_event)
    └── (next infusion_start) 2nd_infusion.start
        └── (next infusion_end) 2nd_infusion.end; **Label: Presence of adverse_event**

1. `remove_all_criteria`: Remove inclusion/exclusion criteria

This relaxation removes all inclusion/exclusion criteria from the task config, but does not change the window boundaries that are used to compile the task cohort.

[!NOTE] Using this relaxation does not mean that predictions are made over task samples that failed to meet the task criteria (with respect to their real data). Rather, it just means that generated trajectories will not be discarded on the basis of failing to meet post-input window inclusion/exclusion criteria.

On Example 1: In Hospital Mortality

>>> print_ACES(convert_to_zero_shot(in_hosp_mortality_cfg, {"remove_all_criteria": True}))
input.end; **Prediction Time**
└── (+1 day, 0:00:00) gap.end
    └── (next discharge_or_death) target.end; **Label: Presence of death**

Here, this may be a mistake, as it will classify trajectories as true if they die after discharge, provided discharge is within 1 day. However, using this in conjunction with absorbing gap windows is likely suitable.

On Example 2: Post-discharge Mortality

>>> print_ACES(convert_to_zero_shot(post_discharge_mortality_cfg, {"remove_all_criteria": True}))
input.end; **Prediction Time**
└── (next discharge) hospitalization.end
    └── (+1 day, 0:00:00) gap.end
        └── (+29 days, 0:00:00) target.end; **Label: Presence of death**

Here, this may be a mistake, as it will classify as negative trajectories who die within 1 day after discharge (whereas previously such trajectories would be excluded). However, in concert with gap window absorption, this may be suitable.

On Example 3: Readmission

>>> print_ACES(convert_to_zero_shot(readmission_cfg, {"remove_all_criteria": True}))
hospitalization.end; **Prediction Time**
└── (+1 day, 0:00:00) gap.end
    └── (+29 days, 0:00:00) target.end; **Label: Presence of admission**
        └── (end of record) censoring_protection.end

In this example, there are both good and bad aspects of these changes. First, this will now label trajectories as negative if they are admitted within 1 day (previously, they would have been excluded), which is likely problematic. But it also renders the censoring window moot, which may improve the efficiency.

On Example 4: 2nd infusion stage adverse event

>>> print_ACES(convert_to_zero_shot(two_stage_cfg, {"remove_all_criteria": True}))
1st_infusion.start; **Prediction Time**
└── (next infusion_end) 1st_infusion.end
    └── (next infusion_start) 2nd_infusion.start
        └── (next infusion_end) 2nd_infusion.end; **Label: Presence of adverse_event**

This may be suitable here; it still tracks the right target (adverse events within the 2nd infusion period), but now will include labels for patients who have adverse events in both, which may improve the predictive quality or efficiency of the trajectory-driven predictor.

2. `collapse_temporal_gap_windows`: Absorb temporal gap windows into target

This relaxation absorbs any chain of temporal windows between the input and target window terminating at the target window into the target window. This can only be used if the constraints of these windows are all removed (or if the remove all criteria relaxation is applied as well). This relaxation allows you to make predictions with fewer generated tokens and simpler early stopping criteria.

[!NOTE] This does not remove event bounded windows, though it does remove temporal windows directly before event bound windows or absorb adjacent temporal windows together.

>>> labeler_cfg = {"remove_all_criteria": True, "collapse_temporal_gap_windows": True}

On Example 1: In Hospital Mortality

>>> print_ACES(convert_to_zero_shot(in_hosp_mortality_cfg, labeler_cfg))
input.end; **Prediction Time**
└── (next discharge_or_death) target.end; **Label: Presence of death**

This is likely appropriate, as we will now simply classify if there is any death observed before the next discharge.

On Example 2: Post-discharge Mortality

>>> print_ACES(convert_to_zero_shot(post_discharge_mortality_cfg, labeler_cfg))
input.end; **Prediction Time**
└── (next discharge) hospitalization.end
    └── (+30 days, 0:00:00) target.end; **Label: Presence of death**

This is likely suitable; we have simply streamlined the prediction target to be anytime within the 30 days post discharge, giving the trajectory labeler a more flexible target.

On Example 3: Readmission

>>> print_ACES(convert_to_zero_shot(readmission_cfg, labeler_cfg))
hospitalization.end; **Prediction Time**
└── (+30 days, 0:00:00) target.end; **Label: Presence of admission**
    └── (end of record) censoring_protection.end

This is likely an improvement over the basic config, because it is more accommodating to the target, but it still has a censoring prediction window we may want to remove.

On Example 4: 2nd infusion stage adverse event

>>> print_ACES(convert_to_zero_shot(two_stage_cfg, labeler_cfg))
1st_infusion.start; **Prediction Time**
└── (next infusion_end) 1st_infusion.end
    └── (next infusion_start) 2nd_infusion.start
        └── (next infusion_end) 2nd_infusion.end; **Label: Presence of adverse_event**

This makes no difference as there are no temporal gap windows in this example.

3. `remove_post_label_windows`: Removes all post-label windows from the task config

This relaxation removes all windows that are after the label window. This is useful for removing censoring protection windows which expand the generation scope necessary to resolve a window.

On Example 1: In Hospital Mortality

>>> print_ACES(convert_to_zero_shot(in_hosp_mortality_cfg, {"remove_post_label_windows": True}))
input.end; **Prediction Time**
└── (+1 day, 0:00:00) gap.end (no admission, discharge_or_death)
    └── (next discharge_or_death) target.end; **Label: Presence of death**

This makes no difference as there are no post-label windows in this example.

On Example 2: Post-discharge Mortality

>>> print_ACES(convert_to_zero_shot(post_discharge_mortality_cfg, {"remove_post_label_windows": True}))
input.end; **Prediction Time**
└── (next discharge) hospitalization.end (no death)
    └── (+1 day, 0:00:00) gap.end (no admission, death)
        └── (+29 days, 0:00:00) target.end; **Label: Presence of death**

This makes no difference as there are no post-label windows in this example.

On Example 3: Readmission

>>> print_ACES(convert_to_zero_shot(readmission_cfg, {"remove_post_label_windows": True}))
hospitalization.end; **Prediction Time**
└── (+1 day, 0:00:00) gap.end (no admission, death)
    └── (+29 days, 0:00:00) target.end; **Label: Presence of admission**

This is likely an improvement, as the censoring protection may complicate generation and reduce the efficiency.

On Example 4: 2nd infusion stage adverse event

>>> print_ACES(convert_to_zero_shot(two_stage_cfg, {"remove_post_label_windows": True}))
1st_infusion.start; **Prediction Time**
└── (next infusion_end) 1st_infusion.end (no adverse_event)
    └── (next infusion_start) 2nd_infusion.start
        └── (next infusion_end) 2nd_infusion.end; **Label: Presence of adverse_event**

This makes no difference as there are no post-label windows in this example.

Examples of Labeling

To see labeling in action, we'll work with the following configuration:

>>> print_ACES(sample_ACES_cfg)
trigger
└── (+1 day, 0:00:00) input.end (no icu_admission, discharge_or_death); **Prediction Time**
    └── (+1 day, 0:00:00) gap.end (no icu_admission, discharge_or_death)
        └── (next discharge_or_death) target.end; **Label: Presence of death**

We'll also use the following generated trajectories:

>>> for fn, df in sample_labeled_trajectories_dfs.items():
...     print(f"Generated trajectory: {fn}")
...     print(df)
Generated trajectory: trajectory_0.parquet
shape: (9, 5)
┌─────────────────────────┬───────────────┬───────────────┬────────────┬─────────────────────────┐
│ time                    ┆ code          ┆ numeric_value ┆ subject_id ┆ prediction_time         │
│ ---                     ┆ ---           ┆ ---           ┆ ---        ┆ ---                     │
│ datetime[μs, UTC]       ┆ str           ┆ f64           ┆ i32        ┆ datetime[μs, UTC]       │
╞═════════════════════════╪═══════════════╪═══════════════╪════════════╪═════════════════════════╡
│ 1993-01-01 12:00:00 UTC ┆ LAB_1         ┆ 1.0           ┆ 1          ┆ 1993-01-01 00:00:00 UTC │
│ 1993-01-01 13:00:00 UTC ┆ LAB_2         ┆ null          ┆ 1          ┆ 1993-01-01 00:00:00 UTC │
│ 1993-01-01 14:00:00 UTC ┆ ICU_DISCHARGE ┆ null          ┆ 1          ┆ 1993-01-01 00:00:00 UTC │
│ 1993-01-22 00:00:00 UTC ┆ MEDS_DEATH    ┆ null          ┆ 1          ┆ 1993-01-01 00:00:00 UTC │
│ 1993-02-20 00:00:00 UTC ┆ ICU_DISCHARGE ┆ null          ┆ 1          ┆ 1993-01-20 00:00:00 UTC │
│ 1995-01-01 00:00:00 UTC ┆ LAB_23        ┆ 1.2           ┆ 1          ┆ 1993-01-20 00:00:00 UTC │
│ 1999-01-01 13:00:00 UTC ┆ LAB_3         ┆ null          ┆ 2          ┆ 1999-01-01 00:00:00 UTC │
│ 1999-01-01 14:00:00 UTC ┆ ICU_DISCHARGE ┆ null          ┆ 2          ┆ 1999-01-01 00:00:00 UTC │
│ 1999-01-04 14:00:00 UTC ┆ LAB_4         ┆ 1.1           ┆ 2          ┆ 1999-01-01 00:00:00 UTC │
└─────────────────────────┴───────────────┴───────────────┴────────────┴─────────────────────────┘
Generated trajectory: trajectory_1.parquet
shape: (6, 5)
┌─────────────────────────┬───────────────┬───────────────┬────────────┬─────────────────────────┐
│ time                    ┆ code          ┆ numeric_value ┆ subject_id ┆ prediction_time         │
│ ---                     ┆ ---           ┆ ---           ┆ ---        ┆ ---                     │
│ datetime[μs, UTC]       ┆ str           ┆ f64           ┆ i32        ┆ datetime[μs, UTC]       │
╞═════════════════════════╪═══════════════╪═══════════════╪════════════╪═════════════════════════╡
│ 1993-01-01 12:00:00 UTC ┆ LAB_1         ┆ 1.0           ┆ 1          ┆ 1993-01-01 00:00:00 UTC │
│ 1993-01-04 00:00:00 UTC ┆ MEDS_DEATH    ┆ null          ┆ 1          ┆ 1993-01-01 00:00:00 UTC │
│ 1998-01-01 00:00:00 UTC ┆ LAB_1         ┆ 1.1           ┆ 1          ┆ 1993-01-20 00:00:00 UTC │
│ 2000-01-01 00:00:00 UTC ┆ LAB_3         ┆ 1.2           ┆ 1          ┆ 1993-01-20 00:00:00 UTC │
│ 1999-01-01 12:00:00 UTC ┆ ICU_ADMISSION ┆ null          ┆ 2          ┆ 1999-01-01 00:00:00 UTC │
│ 1999-02-01 00:00:00 UTC ┆ MEDS_DEATH    ┆ null          ┆ 2          ┆ 1999-01-01 00:00:00 UTC │
└─────────────────────────┴───────────────┴───────────────┴────────────┴─────────────────────────┘
Generated trajectory: trajectory_2.parquet
shape: (3, 5)
┌─────────────────────────┬───────────────┬───────────────┬────────────┬─────────────────────────┐
│ time                    ┆ code          ┆ numeric_value ┆ subject_id ┆ prediction_time         │
│ ---                     ┆ ---           ┆ ---           ┆ ---        ┆ ---                     │
│ datetime[μs, UTC]       ┆ str           ┆ null          ┆ i32        ┆ datetime[μs, UTC]       │
╞═════════════════════════╪═══════════════╪═══════════════╪════════════╪═════════════════════════╡
│ 1993-01-01 12:00:00 UTC ┆ ICU_DISCHARGE ┆ null          ┆ 1          ┆ 1993-01-01 00:00:00 UTC │
│ 1993-01-01 13:00:00 UTC ┆ ICU_ADMISSION ┆ null          ┆ 1          ┆ 1993-01-01 00:00:00 UTC │
│ 2005-01-01 00:00:00 UTC ┆ MEDS_DEATH    ┆ null          ┆ 2          ┆ 1999-01-01 00:00:00 UTC │
└─────────────────────────┴───────────────┴───────────────┴────────────┴─────────────────────────┘

What labels do we get if we run the labeling function on these with various relaxations of our config? To see, first we need to import the label function:

>>> from MEDS_trajectory_evaluation.ACES_config_evaluation.label import label_trajectories

1. No Relaxations

>>> print_ACES(convert_to_zero_shot(sample_ACES_cfg))
input.end; **Prediction Time**
└── (+1 day, 0:00:00) gap.end (no icu_admission, discharge_or_death)
    └── (next discharge_or_death) target.end; **Label: Presence of death**
>>> for fn, df in sample_labeled_trajectories_dfs.items():
...     print(f"Labels for {fn}:")
...     print(label_trajectories(df, convert_to_zero_shot(sample_ACES_cfg)))
Labels for trajectory_0.parquet:
shape: (3, 5)
┌────────────┬─────────────────────────┬───────┬──────────────┬───────┐
│ subject_id ┆ prediction_time         ┆ valid ┆ determinable ┆ label │
│ ---        ┆ ---                     ┆ ---   ┆ ---          ┆ ---   │
│ i32        ┆ datetime[μs, UTC]       ┆ bool  ┆ bool         ┆ bool  │
╞════════════╪═════════════════════════╪═══════╪══════════════╪═══════╡
│ 1          ┆ 1993-01-01 00:00:00 UTC ┆ false ┆ null         ┆ null  │
│ 1          ┆ 1993-01-20 00:00:00 UTC ┆ true  ┆ true         ┆ false │
│ 2          ┆ 1999-01-01 00:00:00 UTC ┆ false ┆ null         ┆ null  │
└────────────┴─────────────────────────┴───────┴──────────────┴───────┘
Labels for trajectory_1.parquet:
shape: (3, 5)
┌────────────┬─────────────────────────┬───────┬──────────────┬───────┐
│ subject_id ┆ prediction_time         ┆ valid ┆ determinable ┆ label │
│ ---        ┆ ---                     ┆ ---   ┆ ---          ┆ ---   │
│ i32        ┆ datetime[μs, UTC]       ┆ bool  ┆ bool         ┆ bool  │
╞════════════╪═════════════════════════╪═══════╪══════════════╪═══════╡
│ 1          ┆ 1993-01-01 00:00:00 UTC ┆ true  ┆ true         ┆ true  │
│ 1          ┆ 1993-01-20 00:00:00 UTC ┆ true  ┆ false        ┆ null  │
│ 2          ┆ 1999-01-01 00:00:00 UTC ┆ false ┆ null         ┆ null  │
└────────────┴─────────────────────────┴───────┴──────────────┴───────┘
Labels for trajectory_2.parquet:
shape: (2, 5)
┌────────────┬─────────────────────────┬───────┬──────────────┬───────┐
│ subject_id ┆ prediction_time         ┆ valid ┆ determinable ┆ label │
│ ---        ┆ ---                     ┆ ---   ┆ ---          ┆ ---   │
│ i32        ┆ datetime[μs, UTC]       ┆ bool  ┆ bool         ┆ bool  │
╞════════════╪═════════════════════════╪═══════╪══════════════╪═══════╡
│ 1          ┆ 1993-01-01 00:00:00 UTC ┆ false ┆ null         ┆ null  │
│ 2          ┆ 1999-01-01 00:00:00 UTC ┆ true  ┆ true         ┆ true  │
└────────────┴─────────────────────────┴───────┴──────────────┴───────┘

2. Without gap windows or criteria

>>> labeler_cfg = {"remove_all_criteria": True, "collapse_temporal_gap_windows": True}
>>> print(f"Under labeler_cfg={labeler_cfg}")
Under labeler_cfg={'remove_all_criteria': True, 'collapse_temporal_gap_windows': True}
>>> print_ACES(convert_to_zero_shot(sample_ACES_cfg, labeler_cfg))
input.end; **Prediction Time**
└── (next discharge_or_death) target.end; **Label: Presence of death**
>>> for fn, df in sample_labeled_trajectories_dfs.items():
...     print(f"Labels for {fn}:")
...     print(label_trajectories(df, convert_to_zero_shot(sample_ACES_cfg, labeler_cfg)))
Labels for trajectory_0.parquet:
shape: (3, 5)
┌────────────┬─────────────────────────┬───────┬──────────────┬───────┐
│ subject_id ┆ prediction_time         ┆ valid ┆ determinable ┆ label │
│ ---        ┆ ---                     ┆ ---   ┆ ---          ┆ ---   │
│ i32        ┆ datetime[μs, UTC]       ┆ bool  ┆ bool         ┆ bool  │
╞════════════╪═════════════════════════╪═══════╪══════════════╪═══════╡
│ 1          ┆ 1993-01-01 00:00:00 UTC ┆ true  ┆ true         ┆ false │
│ 1          ┆ 1993-01-20 00:00:00 UTC ┆ true  ┆ true         ┆ false │
│ 2          ┆ 1999-01-01 00:00:00 UTC ┆ true  ┆ true         ┆ false │
└────────────┴─────────────────────────┴───────┴──────────────┴───────┘
Labels for trajectory_1.parquet:
shape: (3, 5)
┌────────────┬─────────────────────────┬───────┬──────────────┬───────┐
│ subject_id ┆ prediction_time         ┆ valid ┆ determinable ┆ label │
│ ---        ┆ ---                     ┆ ---   ┆ ---          ┆ ---   │
│ i32        ┆ datetime[μs, UTC]       ┆ bool  ┆ bool         ┆ bool  │
╞════════════╪═════════════════════════╪═══════╪══════════════╪═══════╡
│ 1          ┆ 1993-01-01 00:00:00 UTC ┆ true  ┆ true         ┆ true  │
│ 1          ┆ 1993-01-20 00:00:00 UTC ┆ true  ┆ false        ┆ null  │
│ 2          ┆ 1999-01-01 00:00:00 UTC ┆ true  ┆ true         ┆ true  │
└────────────┴─────────────────────────┴───────┴──────────────┴───────┘
Labels for trajectory_2.parquet:
shape: (2, 5)
┌────────────┬─────────────────────────┬───────┬──────────────┬───────┐
│ subject_id ┆ prediction_time         ┆ valid ┆ determinable ┆ label │
│ ---        ┆ ---                     ┆ ---   ┆ ---          ┆ ---   │
│ i32        ┆ datetime[μs, UTC]       ┆ bool  ┆ bool         ┆ bool  │
╞════════════╪═════════════════════════╪═══════╪══════════════╪═══════╡
│ 1          ┆ 1993-01-01 00:00:00 UTC ┆ true  ┆ true         ┆ false │
│ 2          ┆ 1999-01-01 00:00:00 UTC ┆ true  ┆ true         ┆ true  │
└────────────┴─────────────────────────┴───────┴──────────────┴───────┘

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

mmd_pypi

These details have not been verified by PyPI

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.0.6

Apr 21, 2026

0.0.5

Nov 5, 2025

0.0.4

Jun 6, 2025

0.0.3

Jun 6, 2025

0.0.2

Jun 6, 2025

0.0.1

Jun 2, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

meds_trajectory_evaluation-0.0.6.tar.gz (129.3 kB view details)

Uploaded Apr 21, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

meds_trajectory_evaluation-0.0.6-py3-none-any.whl (44.3 kB view details)

Uploaded Apr 21, 2026 Python 3

File details

Details for the file meds_trajectory_evaluation-0.0.6.tar.gz.

File metadata

Download URL: meds_trajectory_evaluation-0.0.6.tar.gz
Upload date: Apr 21, 2026
Size: 129.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for meds_trajectory_evaluation-0.0.6.tar.gz
Algorithm	Hash digest
SHA256	`6c1ccbffd4665bfeed8136ff10b842ff51197a32775c0acde44afad4c4bb49f6`
MD5	`1a57567ce689be8a540d5408b3f6aaae`
BLAKE2b-256	`83e13483917dec07f5a88947fe43a226799cba91a3027b84484695e1af7c7f00`

See more details on using hashes here.

Provenance

The following attestation bundles were made for meds_trajectory_evaluation-0.0.6.tar.gz:

Publisher: python-build.yaml on mmcdermott/MEDS_trajectory_evaluation

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: meds_trajectory_evaluation-0.0.6.tar.gz
- Subject digest: 6c1ccbffd4665bfeed8136ff10b842ff51197a32775c0acde44afad4c4bb49f6
- Sigstore transparency entry: 1354821182
- Sigstore integration time: Apr 21, 2026
Source repository:
- Permalink: mmcdermott/MEDS_trajectory_evaluation@c9887cfd1034860ca54de4f14d72cee17099deb2
- Branch / Tag: refs/tags/0.0.6
- Owner: https://github.com/mmcdermott
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-build.yaml@c9887cfd1034860ca54de4f14d72cee17099deb2
- Trigger Event: push

File details

Details for the file meds_trajectory_evaluation-0.0.6-py3-none-any.whl.

File metadata

Download URL: meds_trajectory_evaluation-0.0.6-py3-none-any.whl
Upload date: Apr 21, 2026
Size: 44.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for meds_trajectory_evaluation-0.0.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8a394df0a46dd544737dce9b397bcb0c356f290df98c5d679521245c7614becb`
MD5	`5e3e3851fd8ac2d4b4ef59ab1148a68f`
BLAKE2b-256	`9d8674b6300211faff2f2213a46dcaab6e7052cd44fff97e188a6b36b84348a0`

See more details on using hashes here.

Provenance

The following attestation bundles were made for meds_trajectory_evaluation-0.0.6-py3-none-any.whl:

Publisher: python-build.yaml on mmcdermott/MEDS_trajectory_evaluation

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: meds_trajectory_evaluation-0.0.6-py3-none-any.whl
- Subject digest: 8a394df0a46dd544737dce9b397bcb0c356f290df98c5d679521245c7614becb
- Sigstore transparency entry: 1354821231
- Sigstore integration time: Apr 21, 2026
Source repository:
- Permalink: mmcdermott/MEDS_trajectory_evaluation@c9887cfd1034860ca54de4f14d72cee17099deb2
- Branch / Tag: refs/tags/0.0.6
- Owner: https://github.com/mmcdermott
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-build.yaml@c9887cfd1034860ca54de4f14d72cee17099deb2
- Trigger Event: push

MEDS-trajectory-evaluation 0.0.6

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

MEDS Trajectory Evaluation

Quickstart

1. Install

2. Run

2.A Generated Trajectory Schema

2.B Simple Predicate Labeling

2.C Full ACES Task Labeling

Full Documentation

Generated Trajectory Schema

Temporal AUC Evaluation

Helper functions

Computing AUCs

Full ACES Task Labeling

Terminology

Supported Config Relaxations

Example Configurations

Example 1: In-hospital mortality prediction

Example 2: 30-day post discharge mortality prediction

Example 3: 30-day readmission prediction with censoring

Example 4: Two-stage Infusion

Other examples we can't reflect:

Relaxations

1. remove_all_criteria: Remove inclusion/exclusion criteria

On Example 1: In Hospital Mortality

On Example 2: Post-discharge Mortality

On Example 3: Readmission

On Example 4: 2nd infusion stage adverse event

2. collapse_temporal_gap_windows: Absorb temporal gap windows into target

On Example 1: In Hospital Mortality

On Example 2: Post-discharge Mortality

On Example 3: Readmission

On Example 4: 2nd infusion stage adverse event

3. remove_post_label_windows: Removes all post-label windows from the task config

On Example 1: In Hospital Mortality

On Example 2: Post-discharge Mortality

On Example 3: Readmission

On Example 4: 2nd infusion stage adverse event

Examples of Labeling

1. No Relaxations

2. Without gap windows or criteria

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

1. `remove_all_criteria`: Remove inclusion/exclusion criteria

2. `collapse_temporal_gap_windows`: Absorb temporal gap windows into target

3. `remove_post_label_windows`: Removes all post-label windows from the task config