7 projects
atp-method
ATP plugin: run agent-eval-case methodology cases through the platform
atp-platform
Framework-agnostic platform for testing and evaluating AI agents
spec-runner
Task automation from markdown specs via Claude CLI
atp-platform-sdk
Python SDK for the ATP benchmark platform: run benchmarks, submit results, view leaderboards
atp-games
ATP plugin for game-theoretic agent evaluation
game-environments
Standalone game theory environments for agent evaluation
mdpres
Markdown to HTML Presentation Generator