Skip to main content

Engineering notebook for AI-assisted development

Project description

buildlog

The Only Agent Learning System You Can Prove Works

PyPI Python CI License: MIT Docs

Falsifiable claims. Measurable outcomes. No vibes.

buildlog - The Only Agent Learning System You Can Prove Works

RE: The art — Yes, it's AI-generated. Yes, that's hypocritical for a project about rigor over vibes. Looking for an actual artist to pay for a real logo. If you know someone good, open an issue or DM me. Budget exists.

Read the full documentation


Everyone's building "agent memory." Blog posts announce breakthroughs. Products ship with "learning" in the tagline. Ask them one question: How do you know it works?

buildlog gives you the infrastructure to answer with data. It captures engineering knowledge from work sessions, extracts rules, selects which rules to surface using a Thompson Sampling bandit, and measures impact via Repeated Mistake Rate (RMR) across tracked experiments.

Features

  • Structured capture — Document work sessions as entries with mistakes, decisions, and outcomes
  • Rule extraction — Distill and deduplicate patterns into actionable rules
  • Thompson Sampling bandit — Automatic rule selection that balances exploration and exploitation
  • Experiment tracking — Sessions, mistakes, RMR calculation with statistical rigor
  • Review gauntlet — Curated reviewer personas (Security Karen, Test Terrorist) with HITL checkpoints
  • Multi-agent support — Render rules to Claude Code, Cursor, GitHub Copilot, Windsurf, Continue.dev
  • MCP server — Full Claude Code integration via buildlog-mcp

Quick Start

uv pip install buildlog   # or: pip install buildlog (inside a venv)
buildlog init
buildlog new my-feature
buildlog distill && buildlog skills
buildlog experiment start
# ... work ...
buildlog experiment end
buildlog experiment report

Documentation

Section Description
Installation Setup, extras, and initialization
Quick Start Full pipeline walkthrough
Core Concepts The problem, the claim, and the metric
CLI Reference Every command documented
MCP Integration Claude Code setup and available tools
Experiments Running and measuring experiments
Review Gauntlet Reviewer personas and the gauntlet loop
Multi-Agent Setup Render rules to any AI coding agent
Theory The math behind Thompson Sampling
Philosophy Principles and honest limitations

Contributing

git clone https://github.com/Peleke/buildlog-template
cd buildlog-template
uv venv && source .venv/bin/activate
uv pip install -e ".[dev]"
pytest

We're especially interested in better context representations, credit assignment approaches, statistical methodology improvements, and real-world experiment results (positive or negative).

License

MIT License — see LICENSE


"Agent learning" without measurement is just prompt engineering with extra steps.

buildlog is measurement.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

buildlog-0.8.0.tar.gz (103.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

buildlog-0.8.0-py3-none-any.whl (129.5 kB view details)

Uploaded Python 3

File details

Details for the file buildlog-0.8.0.tar.gz.

File metadata

  • Download URL: buildlog-0.8.0.tar.gz
  • Upload date:
  • Size: 103.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for buildlog-0.8.0.tar.gz
Algorithm Hash digest
SHA256 e2f838689115fb9bf9322386faaf463bfb51d14ed31237b17248aa75548c2fd3
MD5 1480e8e0d4e74fc3a6c4e721e3606ab6
BLAKE2b-256 877488129f3b4081094b011f9e5140043c495b9f7ca7716f5cf7a25ed7c1e74f

See more details on using hashes here.

File details

Details for the file buildlog-0.8.0-py3-none-any.whl.

File metadata

  • Download URL: buildlog-0.8.0-py3-none-any.whl
  • Upload date:
  • Size: 129.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for buildlog-0.8.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9758869818e9d5be3fc4aa6d0a1ba5a662252de36b77add0191b8d8f9ead7910
MD5 7c7af467ae40ecfa2a59f9c2bc09e365
BLAKE2b-256 323ac8f49e66b934d5a449685e0306054969fac4f860a66ddb4ffa0ed41aa486

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page