Last released Mar 30, 2026
None
Last released Mar 25, 2026
v0.66.0: sycophancy_score() direction corrected — AUROC 0.9296 [0.8754, 0.9703] on Perez 2022 (was 0.0704, wrong direction). AgentShield judge_risk field fix.
Supported by