Last released Apr 8, 2026
The open-source evaluation framework for AI agents — test, compare, ship with confidence