6 projects
swebench
The official SWE-bench package - a benchmark for evaluating LMs on software engineering
swe-rex
Sandboxed code execution for AI agents, locally or on the cloud.
sweer
Agent-computer interface (ACI) for websites
swe-rex-preview
Sandboxed code execution for AI agents, locally or on the cloud.
sb-cli
Submit predictions to the SWE-bench API and manage your runs
godule
__god__ule