Last released Aug 1, 2025
Python SDK for evaluating multiple model outputs using configurable LLM-based jurors
Supported by