Last released Jun 16, 2026
Evidence-preserving LLM agent benchmark harness with a live mobile-first SSH TUI.
Supported by