18 projects
MEDS-transforms
A framework for compiling simple, mapreduce style pipelines over MEDS datasets.
dftly
dftly (pronounced deftly) is a simple library for a safe, expressive, config-file friendly, and readable DSL for encoding simple dataframe operations.
nested-ragged-tensors
Utilities for efficiently working with, saving, and loading, collections of connected nested ragged tensors in PyTorch
yaml-to-disk
A simple utility to pretty-print a directory tree, suitable for use in pytest test cases.
MEDS-extract
MEDS ETL building support leveraging MEDS-Transforms.
MIMIC-IV-MEDS
An ETL pipeline to extract MIMIC-IV data into the MEDS format.
MEDS-visualizations
A framework for compiling simple, mapreduce style pipelines over MEDS datasets.
MEDS-trajectory-evaluation
A framework for extracting labels from generated trajectories for arbitrary ACES configs.
meds-testing-helpers
Builds sample MEDS datasets for testing.
flexible-schema
A simple class to aid in defining flexible schemas for PyArrow datasets.
meds-torch-data
An efficient, flexible PyTorch dataset class for MEDS data.
eICU-MEDS
An ETL pipeline to extract the eICU dataset into the MEDS format.
MEDS-EIC-AR
A simple auto-regressive, 'everything-is-code' style model for MEDS datasets
pretty-print-directory
A simple utility to pretty-print a directory tree, suitable for use in pytest test cases.
MEDS-DEV
Task configuration and helper files for the MEDS-DEV Benchmark
ml-mixins
A collection of useful mixins for machine learning development code.
hydra-profiler
A simple hydra profiler to track and record memory usage and runtime information of jobs.
pytorch-lognormal-mixture
A pip installable version of the lognormal mixture distribution from https://github.com/shchur/ifl-tpp/tree/master/code