Last released May 11, 2026
Evaluate multi step AI agent traces, not just single LLM responses.
Supported by