4 projects
agenteval-py
Lightweight evaluation and observability toolkit for LLM agents
smartmemo
Semantic memory for LLM agent calls with an equivalence-first cache architecture.
guardloop
A production runtime guardrail for AI agents: budget caps, timeouts, tool limits, circuit breakers, verifier retries, and OpenTelemetry traces.
orchflow
A lightweight Python framework for readable multi-agent pipelines.