Programming with Pixels (PwP) - A framework for computer-use software engineering agents
Project description
Programming with Pixels (PwP)
Overview
Programming with Pixels (PwP) is a modern framework for evaluating and developing Software Engineering (SWE) agents that interact with computers as humans do - through visual perception and basic actions like typing and clicking.
Our motivating hypothesis is that achieving general-purpose Software Engineering (SWE) agents requires a shift to computer-use agents that can interact with any IDE interface through screenshots and primitive actions, rather than through specialized tool APIs.
Installation
Prerequisites
- Python 3.6+
- Docker
- (Optional) NVIDIA GPU with CUDA support
Using pip (Recommended)
pip install programming-with-pixels
Development Installation
git clone https://github.com/ProgrammingWithPixels/pwp.git
cd pwp
pip install -e .
Quick Start
from pwp import PwP
from pwp import PwPBench
# Create a basic environment
env = PwP(image_name='pwp_env')
# Take a screenshot
observation = env.render()
observation.save('screenshot.png')
# Execute a command
result = env.step("echo 'Hello, World!'")
print(result['output'])
# Try a benchmark task
bench = PwPBench('humaneval')
dataset = bench.get_dataset()
task_env = bench.get_env(dataset[0])
Examples
Check out the examples directory for demonstration scripts:
- Quickstart: Complete walkthrough of PwP's capabilities, including environment interaction, benchmarks, and advanced features
- Basic Demo: Simple environment setup and interaction showcase
- Demo2: Additional demonstration of PwP features
Benchmark Tasks
PwP-Bench comes with a wide range of benchmark tasks for evaluating agents:
- HumanEval: Python coding problems
- Design2Code: Converting design mockups to code
- ChartMimic: Recreating charts from visual references
- SWE-bench: Software engineering tasks
- And many more!
You will first need to setup benchmarks for evaluating agents. See the benchmark documentation for more details.
Evaluating Agents
For detailed examples, check out the agent implementations in the src/pwp/agents directory. Each agent type can be customized with different LLM backends and system prompts to optimize for various tasks.
Command Line Interface
For quicker testing, PwP also comes with a convenient command-line interface:
# Start an environment
pwp env --image pwp_env --vnc
# List available benchmark tasks
pwp list
# Run a benchmark
pwp bench humaneval
Building Custom Environments
Build the Base Environment
# Build the base PWP environment
cd src/pwp/docker/
docker build -t pwp_env .
Custom Environment
You can create custom Docker environments by extending the base image:
FROM pwp_env
# Install additional dependencies
RUN apt-get update && apt-get install -y \
your-package-here \
&& rm -rf /var/lib/apt/lists/*
# Add custom files
COPY your-files /home/devuser/your-files
Package Structure
The PwP package consists of several modules:
pwp.env: Core environment module for managing Docker containerspwp.bench: Benchmark module with various programming taskspwp.agents: Agent implementations for solving taskspwp.utils: Utility functions for image processing and other helperspwp.tools: Tools for agent interaction with environmentspwp.functions: Function implementations for toolspwp.prompts: Prompt templates for different agent types
See the package documentation for more details on each module.
Contributing
We welcome contributions to the PwP project! Please see our contribution guidelines for more information.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Citation
If you use PwP in your research, please cite our paper:
@article{pwp2025,
title={Programming with Pixels: Computer-Use Meets Software Engineering},
author={Aggarwal, Pranjal and Welleck, Sean},
journal={arXiv preprint arXiv:2502.00000},
year={2025}
}
Acknowledgments
- This project builds on various open-source tools and libraries
- Thanks to all contributors who have helped shape the project
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file programming_with_pixels-0.1.0.tar.gz.
File metadata
- Download URL: programming_with_pixels-0.1.0.tar.gz
- Upload date:
- Size: 52.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9ac4964ea8d34daef2514d6426d97d7f9449e015c7aa2bb289cd32b16d015d14
|
|
| MD5 |
e538f985ee0210e4df0f41331bdc7ca0
|
|
| BLAKE2b-256 |
9b1645d5b96e8a7e4bd2aabe77a549ff86b8fb6dd156b7d62c541e8b4eb6b9bb
|
File details
Details for the file programming_with_pixels-0.1.0-py3-none-any.whl.
File metadata
- Download URL: programming_with_pixels-0.1.0-py3-none-any.whl
- Upload date:
- Size: 64.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fce5d93a1711248ea780ece8f4f1b74faab8faadf2c5e9e6246abeefe80018cf
|
|
| MD5 |
32d9c015e839434d5502430cc8af6fa2
|
|
| BLAKE2b-256 |
345be8ad8c6f919df2600a2364bc1e40c55c3151e5ceb0455f50ae805b0c77ae
|