Last released Feb 6, 2026
Statistical evaluation framework for AI agents - pytest for agent trajectories
Supported by