Last released Jan 15, 2026
Evaluate language models using multiple-choice items
Last released Aug 22, 2025
Computational research data management tool