4 projects
honeyhive
HoneyHive Python SDK - LLM Observability and Evaluation Platform
konfigure
A YAML-based configuration management tool for separating code from prompts in LLMs
realign
Realign is a simulation-based evaluation framework for multi-step agents.
agentsim
AgentSim is a simulation-based evaluation framework for multi-step agents.