Skip to main content

Evaluate whether LM-based SWE-agents can reverse-engineer black-box software systems

Project description

ProgramBench logo
ProgramBench

Can Language Models Rebuild Programs From Scratch?

Given only a compiled binary and its documentation, AI agents must architect and implement a complete codebase that reproduces the original program's behavior.

Links

Quickstart

We recommend uv for managing Python environments.

# Run without installing
uvx programbench --help

# Or install into a project
uv pip install programbench

# Or with pip
pip install programbench

For development:

git clone https://github.com/facebookresearch/programbench.git
cd programbench
uv sync  # installs editable + dev dependencies

[!NOTE] For more details, please refer to the Usage Guide.

Citation

If our work was useful for you, please cite it:

@preprint{yang2026programbench,
  title={ProgramBench: Can Language Models Rebuild Programs From Scratch?},
  author={John Yang and Kilian Lieret and Jeffrey Ma and Parth Thakkar and Dmitrii Pedchenko and Sten Sootla and Emily McMilin and Pengcheng Yin and Rui Hou and Gabriel Synnaeve and Diyi Yang and Ofir Press},
  year={2026},
}

License

ProgramBench is licensed under the terms of the license found in LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

programbench-1.0.0.tar.gz (3.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

programbench-1.0.0-py3-none-any.whl (3.4 MB view details)

Uploaded Python 3

File details

Details for the file programbench-1.0.0.tar.gz.

File metadata

  • Download URL: programbench-1.0.0.tar.gz
  • Upload date:
  • Size: 3.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for programbench-1.0.0.tar.gz
Algorithm Hash digest
SHA256 7cb3fadaf81008abf7bb8f9f033054105f01366ffce879265669cb1b9a001116
MD5 195bf093e8b697de1e608863e59f0873
BLAKE2b-256 dbcb49105629c3902993a8264ffe18aed9241404c47e8e696cd4493969ec20ef

See more details on using hashes here.

File details

Details for the file programbench-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: programbench-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 3.4 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for programbench-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c556bc483175a7997699357248f44aaf5bd437b7cf3a117019e3a79a88630ce4
MD5 8916b01e011521c22ca75a0198e45024
BLAKE2b-256 d5da4fb04e77bce98229db72795ae2c39193ca2f4a565e39bbf30ec35d405467

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page