Engineering notebook for AI-assisted development
Project description
buildlog
The Only Agent Learning System You Can Prove Works
Falsifiable claims. Measurable outcomes. No vibes.
RE: The art — Yes, it's AI-generated. Yes, that's hypocritical for a project about rigor over vibes. Looking for an actual artist to pay for a real logo. If you know someone good, open an issue or DM me. Budget exists.
Everyone's building "agent memory." Blog posts announce breakthroughs. Products ship with "learning" in the tagline. Ask them one question: How do you know it works?
buildlog gives you the infrastructure to answer with data. It captures engineering knowledge from work sessions, extracts rules, selects which rules to surface using a Thompson Sampling bandit, and measures impact via Repeated Mistake Rate (RMR) across tracked experiments.
Features
- Structured capture — Document work sessions as entries with mistakes, decisions, and outcomes
- Rule extraction — Distill and deduplicate patterns into actionable rules
- Thompson Sampling bandit — Automatic rule selection that balances exploration and exploitation
- Experiment tracking — Sessions, mistakes, RMR calculation with statistical rigor
- Review gauntlet — Curated reviewer personas (Security Karen, Test Terrorist) with HITL checkpoints
- Multi-agent support — Render rules to Claude Code, Cursor, GitHub Copilot, Windsurf, Continue.dev
- MCP server — Full Claude Code integration via
buildlog-mcp
Quick Start
uv pip install buildlog # or: pip install buildlog (inside a venv)
buildlog init
buildlog new my-feature
buildlog distill && buildlog skills
buildlog experiment start
# ... work ...
buildlog experiment end
buildlog experiment report
Documentation
| Section | Description |
|---|---|
| Installation | Setup, extras, and initialization |
| Quick Start | Full pipeline walkthrough |
| Core Concepts | The problem, the claim, and the metric |
| CLI Reference | Every command documented |
| MCP Integration | Claude Code setup and available tools |
| Experiments | Running and measuring experiments |
| Review Gauntlet | Reviewer personas and the gauntlet loop |
| Multi-Agent Setup | Render rules to any AI coding agent |
| Theory | The math behind Thompson Sampling |
| Philosophy | Principles and honest limitations |
Contributing
git clone https://github.com/Peleke/buildlog-template
cd buildlog-template
uv venv && source .venv/bin/activate
uv pip install -e ".[dev]"
pytest
We're especially interested in better context representations, credit assignment approaches, statistical methodology improvements, and real-world experiment results (positive or negative).
License
MIT License — see LICENSE
"Agent learning" without measurement is just prompt engineering with extra steps.
buildlog is measurement.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file buildlog-0.8.0.tar.gz.
File metadata
- Download URL: buildlog-0.8.0.tar.gz
- Upload date:
- Size: 103.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e2f838689115fb9bf9322386faaf463bfb51d14ed31237b17248aa75548c2fd3
|
|
| MD5 |
1480e8e0d4e74fc3a6c4e721e3606ab6
|
|
| BLAKE2b-256 |
877488129f3b4081094b011f9e5140043c495b9f7ca7716f5cf7a25ed7c1e74f
|
File details
Details for the file buildlog-0.8.0-py3-none-any.whl.
File metadata
- Download URL: buildlog-0.8.0-py3-none-any.whl
- Upload date:
- Size: 129.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9758869818e9d5be3fc4aa6d0a1ba5a662252de36b77add0191b8d8f9ead7910
|
|
| MD5 |
7c7af467ae40ecfa2a59f9c2bc09e365
|
|
| BLAKE2b-256 |
323ac8f49e66b934d5a449685e0306054969fac4f860a66ddb4ffa0ed41aa486
|