7 projects
pandera
A light-weight and flexible data validation and testing tool for statistical data objects.
aigym
Reinforcement learning environments for fine-tuning language models for reasoning tasks.
webgymnasium
A set of RL environments for training LMs on the live web.
webworld
An RL environment game engine for the web.
diff-llm
LLM that predicts text diffs.
meta-ml
MetaRL-based Estimator using Task-encodings for AutoML
themis-ml
Fairness-aware Machine Learning