Last released Apr 29, 2026
SWE-bench for your codebase. Turn merged PRs into reproducible coding-agent benchmarks.
Supported by