Skip to main content

A Python package for managing programming tasks.

Project description

programmer

programmer is a command-line based AI programmer, that will help you get stuff done.

WARNING: programmer has direct access to your machine, it can run commands, and read and write files, without safety checks. You have been warned!

Demo

Quickstart

pip install programmer
programmer 

Switching to o1-based agents

Programmer works with OpenAI's just released o1 models.

GIANT WARNING: Remember, programmer directly runs agent commands on your machine, without prompting you first. The o1 models are brand new and should not be trusted to do this! You have been GIANT WARNING warned.

programmer settings set agent o1-preview-2024-09-12_o1harness
# OR
programmer settings set agent o1-mini-2024-09-12_o1harness

The o1 agents currently don't work very well, and might do dangerous things. But they do work!

Examples

  • "What processes are listening on port 4512?" ... "ok, please kill them"
  • "What's in frog.jpg?"
  • "Write a function to determine if a tic-tac-toe game is won in a file called tictactoe.py. also write unit tests, and iterate until they pass."
  • "Fix all the type errors in this project"

Usage

Just

programmer

Alternatively:

programmer prompt <initial_prompt>

To resume from an earlier state:

programmer --state <state_ref>

Tracking

Programmer is designed to get better over time. For that we need to track trajectories, identify good and bad ones to add to Evaluations (like unit tests for AI), and then iterate on programmer's prompts and architecture to improve against the Evaluations.

By default all trajectories are logged to .programmer/weave.db. You can turn on cloud logging with programmer settings set weave_logging cloud. Trajectories will be saved to Weave at wandb.ai

You can turn on git tracking with programmer settings set git_tracking on to get programmer to track all of its work in "programmer-*" branches. Each git state will be associated with the Weave trajectories, and you can browse the diffs with programmer ui

UI

Run

programmer ui

to run the local streamlit UI. This should work with either weave_logging:cloud or weave_logging:local, but there are some bugs with local mode at the moment.

Programmer UI screenshot

Weave UI

When weave_logging is set to "cloud" you can use the Weave UI at wandb.ai to browse traces.

Settings

Settings are stored in .programmer/settings

programmer settings set weave_logging

  • off: no logging
  • local: log to local sqlite db
  • cloud: log to weave cloud at wandb.ai

programmer settings set git_tracking

  • off: no git tracking
  • on: programmer with make programmer-* branches and track changes

Improving programmer

programmer is designed to be improved using weave, our toolkit for AI application development. What does this mean?

  • you can browse traces and evals in the Weave UI at https://wandb.ai
  • programmer can resume from earlier states, with the --state argument
  • programmer will log all of your interactions to a local sqlite database, or the central Weave service.
  • This data can be used to improve programmer over time, by building Evaluations, fine-tuning, and other techniques.

To run the evaluation:

python evaluate.py

roadmap

  • weave server tracking
  • git state tracking
  • basic trajectory UI
  • user-annotation of good and bad behaviors
  • eval generation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

programmer-0.1.9.tar.gz (48.7 kB view details)

Uploaded Source

Built Distribution

programmer-0.1.9-py3-none-any.whl (60.5 kB view details)

Uploaded Python 3

File details

Details for the file programmer-0.1.9.tar.gz.

File metadata

  • Download URL: programmer-0.1.9.tar.gz
  • Upload date:
  • Size: 48.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for programmer-0.1.9.tar.gz
Algorithm Hash digest
SHA256 fbd08bf6336309235907b8d67aea8b4127efb80449218a4fed3a509e7aa3bae7
MD5 b29f4701387d4b58791fe3ac27ad296c
BLAKE2b-256 f1a06e25fb657d6051be096c11fdd49178f3c67182dfc8c5a2447db019e22bd7

See more details on using hashes here.

File details

Details for the file programmer-0.1.9-py3-none-any.whl.

File metadata

  • Download URL: programmer-0.1.9-py3-none-any.whl
  • Upload date:
  • Size: 60.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for programmer-0.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 e9961d39ee0e2a497cb1e1119d2b1d2207406473c1e320d78bb9694012c72e89
MD5 970cd9675deccb5b8b7ec613a2b9b037
BLAKE2b-256 d7672124a32e5f0a5ee8d40a7bb490ed05fb32a8b4229b6eaf1193b5621e186c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page