Last released Apr 4, 2026
Production RL environment for training LLMs on hallucination avoidance — 1M+ examples across 38 datasets