Last released Dec 19, 2025
A toolkit for evaluating the culture of MLX large language models (LLMs) on the CD Eval benchmark.
Supported by