Last released May 21, 2026
FlexEval is a tool for designing custom metrics, completion functions, and LLM-graded rubrics for evaluating the behavior of LLM-powered systems.
Last released Aug 25, 2023
Retrieval-backed LLMs for math education
Supported by