2 projects
proofagent-harness
The open-source, domain-aware test harness for AI agents. Run multi-turn adversarial evaluations with jury-based scoring across production-critical metrics: hallucination, policy compliance, drift, tool use, and manipulation resistance. BYO LLM. BYO traps.
proofagent-sdk
Official Python SDK for ProofAgent™