Engineering notebook for AI-assisted development
Project description
buildlog
Engineering Notebook for AI-Assisted Development
Capture your work as publishable content. Include the fuckups.
Quick Start · The Pipeline · Commands · Philosophy
The Problem
You're pairing with AI on real work. Hours of debugging, wrong turns, "oh shit" moments, and hard-won insights—all vanishing into chat history the moment you close the tab.
Meanwhile, your AI agent makes the same mistakes on similar problems because it has no memory of what you learned together.
The Solution
buildlog captures the signal from AI-assisted development sessions and transforms it into:
- Publishable content - Each entry is a $500+ tutorial draft
- Structured insights - Categorized learnings ready for analysis
- Agent rules - Deduplicated, confidence-scored rules that improve AI behavior
flowchart LR
A["Raw Sessions<br/>(ephemeral)"] --> B["Buildlog Entries<br/>(structured markdown)"]
B --> C["Distilled Insights<br/>(categorized patterns)"]
C --> D["Agent Rules<br/>(deduplicated + scored)"]
D --> E["CLAUDE.md<br/>settings.json<br/>Agent Skills"]
style A fill:#ff6b6b,color:#fff
style B fill:#4ecdc4,color:#fff
style C fill:#45b7d1,color:#fff
style D fill:#96ceb4,color:#fff
style E fill:#dda0dd,color:#fff
Key Concepts
| Term | What it means |
|---|---|
| Entry | A structured markdown file documenting one work session |
| Insight | A single learning extracted from an entry's Improvements section |
| Pattern | Raw insights grouped by category (architectural, workflow, etc.) |
| Rule | Deduplicated insight with stable ID, confidence score, and source tracking |
| Agent Skill | Rules promoted to .claude/skills/ for on-demand loading by Claude |
Features
Structured Capture
Templates with six required sections ensure you never forget to document the mistakes that teach the most.
Pattern Distillation
Extract categorized insights from all your entries into structured JSON/YAML for analysis.
Semantic Deduplication
"Run tests before commit" and "Always execute the test suite prior to committing" are the same insight. buildlog merges them.
Confidence Scoring
Rules are scored based on frequency and recency. High-confidence rules have been reinforced multiple times recently.
Multiple Promotion Targets
Promote rules to CLAUDE.md, settings.json, or Anthropic Agent Skills (.claude/skills/) for on-demand loading.
Pluggable Embeddings
Token-based similarity by default. Upgrade to sentence-transformers or OpenAI for semantic understanding.
Quick Start
# Install
pip install buildlog
# Initialize in your project
buildlog init
# Create an entry for today's work
buildlog new auth-api
# After a few entries, extract patterns
buildlog distill
# Generate deduplicated rules
buildlog skills
The Pipeline
buildlog is a three-stage pipeline that transforms ephemeral work into durable knowledge:
flowchart TB
subgraph Stage1["Stage 1: Capture"]
A1["buildlog new slug"] --> A2["Edit markdown entry"]
A2 --> A3["Document: Goal, Journey,<br/>Tests, Code, Improvements"]
end
subgraph Stage2["Stage 2: Distill"]
B1["buildlog distill"] --> B2["Parse all entries"]
B2 --> B3["Extract Improvements sections"]
B3 --> B4["Group by category"]
end
subgraph Stage3["Stage 3: Promote"]
C1["buildlog skills"] --> C2["Deduplicate similar insights"]
C2 --> C3["Calculate confidence scores"]
C3 --> C4["Generate stable IDs"]
C4 --> C5["Promote to target"]
end
Stage1 --> Stage2 --> Stage3
C5 --> D1["CLAUDE.md"]
C5 --> D2["settings.json"]
C5 --> D3[".claude/skills/"]
Stage 1: Capture (buildlog new)
Create structured entries as you work. Each entry has six sections:
| Section | Purpose |
|---|---|
| The Goal | What you're building and why |
| What We Built | Architecture diagram, components |
| The Journey | Chronological narrative including mistakes |
| Test Results | Actual commands, actual outputs |
| Code Samples | Key snippets with context |
| Improvements | Categorized learnings for next time |
The Improvements section is structured for machine extraction:
### Architectural
- Always validate inputs at the boundary, not conditionally
- Use frozen dataclasses for immutable data containers
### Workflow
- Run the test suite after EVERY code change, not just at the end
- Write the integration test first to clarify the API contract
### Tool Usage
- The `patch` context manager for date mocking is cleaner than fixtures
- Use `jwt.io` to decode tokens instead of console.log
### Domain Knowledge
- `datetime.utcnow()` is deprecated in Python 3.12+
- Supabase storage returns 400, not 404, for missing files
Stage 2: Distill (buildlog distill)
Extract all insights across entries into structured data:
buildlog distill # JSON to stdout
buildlog distill -o patterns.yaml # Write to file
buildlog distill --since 2026-01-01 # Filter by date
buildlog distill --category workflow # Filter by category
Output:
{
"patterns": {
"architectural": [
{"insight": "Always validate inputs at boundary...", "source": "2026-01-16-auth.md"}
],
"workflow": [...],
"tool_usage": [...],
"domain_knowledge": [...]
},
"statistics": {
"total_patterns": 47,
"by_category": {"architectural": 12, "workflow": 15, ...}
}
}
Stage 3: Generate Rules (buildlog skills)
Transform raw patterns into deduplicated, scored rules:
buildlog skills # YAML to stdout
buildlog skills -o rules.yml # Write to file
buildlog skills --format markdown # For CLAUDE.md injection
buildlog skills --min-frequency 2 # Only repeated patterns
buildlog skills --embeddings openai # Semantic deduplication
Output:
generated_at: '2026-01-16T12:00:00Z'
source_entries: 23
total_skills: 31
skills:
architectural:
- id: arch-b0fcb62a1e
rule: Always validate inputs at the boundary, not conditionally
frequency: 4
confidence: high
sources: [auth.md, api.md, validation.md, forms.md]
tags: [api, error]
- id: arch-0cda924aeb
rule: Frozen dataclasses should be the default for data containers
frequency: 2
confidence: medium
sources: [models.md, dto.md]
tags: [python]
Patterns vs Rules
Patterns are raw extractions—every insight from every entry, exactly as written.
Rules are processed patterns with:
| Property | Description |
|---|---|
| Stable ID | Same rule always gets same ID (SHA-256 based) |
| Deduplication | Similar insights merged, frequency tracked |
| Confidence | high/medium/low based on frequency + recency |
| Sources | Which entries contributed to this rule |
| Tags | Auto-extracted technology/concept keywords |
Deduplication in Action
Raw patterns from different entries:
- "Run tests before committing"
- "Always run the test suite before commit"
- "Execute tests prior to committing code"
After deduplication → 1 rule with frequency: 3:
- id: wf-96f12966f1
rule: Run tests before committing
frequency: 3
confidence: high
Promotion Targets
Rules can be promoted to different targets for agent consumption:
flowchart LR
R["Rules<br/>(buildlog_status)"] --> T1["CLAUDE.md<br/>(append)"]
R --> T2["settings.json<br/>(merge)"]
R --> T3[".claude/skills/<br/>(Agent Skill)"]
T1 --> A1["Always loaded<br/>in context"]
T2 --> A2["Project settings<br/>for Claude Code"]
T3 --> A3["On-demand loading<br/>saves context"]
style T3 fill:#96ceb4,color:#fff
| Target | File | When to Use |
|---|---|---|
claude_md |
CLAUDE.md |
Rules always in context (default) |
settings_json |
.claude/settings.json |
Project-level Claude Code settings |
skill |
.claude/skills/buildlog-learned/SKILL.md |
On-demand loading - rules load only when relevant |
Anthropic Agent Skills (New!)
The skill target creates an Anthropic Agent Skill that Claude loads on-demand:
# Via MCP tool
buildlog_promote(skill_ids=["arch-123", "wf-456"], target="skill")
Creates .claude/skills/buildlog-learned/SKILL.md:
---
name: buildlog-learned
description: Project-specific patterns learned from development history.
Use when writing code, making architectural decisions, reviewing PRs,
or ensuring consistency. Contains 12 rules across Architectural, Workflow.
---
# Learned Patterns
*12 rules extracted from buildlog entries on 2026-01-16*
## Must Follow (High Confidence)
These patterns have been reinforced multiple times.
### Architectural
- Always validate inputs at the boundary
- Use dependency injection for testability
### Workflow
- Run tests after EVERY code change
## Should Consider (Medium Confidence)
These patterns appear frequently but may have exceptions.
### Tool Usage
- Prefer `patch` context manager for date mocking
Why Agent Skills?
- On-demand loading - Rules only load when Claude determines they're relevant
- Saves context - Not always in context like CLAUDE.md
- Progressive disclosure - Claude asks before loading the full skill
Live Usage Scenario
Here's how your learned rules actually get used:
┌─────────────────────────────────────────────────────────────────┐
│ You: "Review this authentication endpoint I wrote" │
└─────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────┐
│ Claude sees: "authentication" + "review" + "endpoint" │
│ │
│ Checks skill description: │
│ "Use when writing code, making architectural decisions, │
│ reviewing PRs, or ensuring consistency..." │
│ │
│ Match! Loads .claude/skills/buildlog-learned/SKILL.md │
└─────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────┐
│ Claude now has access to YOUR learned rules: │
│ │
│ Must Follow: │
│ - Password hashing belongs in User model, not route handler │
│ - Always validate inputs at the boundary │
│ │
│ Worth Knowing: │
│ - bcrypt.compare() arg order is (plaintext, hash) │
│ - JWT expiry is in seconds, not milliseconds │
└─────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────┐
│ Claude: "I notice the password hashing is in your route │
│ handler. Based on patterns from your buildlog, this should │
│ be a pre-save hook in the User model instead. │
│ │
│ Also, I see bcrypt.compare(hash, password) - the argument │
│ order should be (plaintext, hash). This has tripped you up │
│ before." │
└─────────────────────────────────────────────────────────────────┘
Your past mistakes now prevent future ones—automatically.
Embedding Backends
Deduplication uses text similarity. Choose your backend:
| Backend | Install | Use Case |
|---|---|---|
token (default) |
Built-in | Fast, free, good for obvious duplicates |
sentence-transformers |
pip install buildlog[embeddings] |
Local semantic similarity, no API calls |
openai |
pip install buildlog[openai] |
Best quality, requires API key |
# Token-based (default) - catches "run tests" ≈ "run testing"
buildlog skills
# Semantic - catches "use Redis for caching" ≈ "cache data in Redis"
buildlog skills --embeddings sentence-transformers
# OpenAI - best quality semantic matching
export OPENAI_API_KEY=...
buildlog skills --embeddings openai
Comparison
| Input | Token | OpenAI |
|---|---|---|
| "Run tests before commit" ≈ "Run testing before committing" | Merged | Merged |
| "Use Redis for caching" ≈ "Cache data in Redis" | Separate | Merged |
Practical Usage
1. Inject Rules into CLAUDE.md
buildlog skills --format markdown >> CLAUDE.md
Your AI agent now has access to every lesson you've learned:
## Learned Rules
Based on 23 buildlog entries, 31 actionable rules have emerged:
### Architectural (8 rules)
- Always validate inputs at the boundary (seen 4x)
- Use frozen dataclasses for data containers (seen 2x)
### Workflow (12 rules)
- Run tests after EVERY code change (seen 5x)
...
2. Create an Agent Skill
For on-demand loading instead of always-in-context:
# Via CLI (coming soon)
buildlog promote --target skill
# Via MCP
buildlog_promote(skill_ids=["arch-123"], target="skill")
3. Track Rule Evolution
Rules have stable IDs. Track which are reinforced over time:
# This week's new rules
buildlog skills --since 2026-01-10 -o this-week.yml
# Compare to baseline
diff baseline.yml this-week.yml
4. Find Your Blind Spots
buildlog stats --detailed
Buildlog Statistics
Entries: 23 total
Coverage: 87% with improvements
Category Breakdown:
architectural: 12 insights (26%)
workflow: 15 insights (33%)
tool_usage: 8 insights (17%)
domain_knowledge: 11 insights (24%)
Warnings:
- 3 entries have empty Improvements sections
Commands
| Command | Description |
|---|---|
buildlog init |
Initialize in current directory |
buildlog new <slug> |
Create entry for today |
buildlog new <slug> --date 2026-01-15 |
Create entry for specific date |
buildlog list |
List all entries |
buildlog distill |
Extract patterns from all entries |
buildlog stats |
Show statistics and analytics |
buildlog skills |
Generate deduplicated rules |
buildlog update |
Update templates to latest |
Skills Options
--output, -o PATH # Write to file instead of stdout
--format [yaml|json|markdown] # Output format (default: yaml)
--min-frequency N # Only rules seen N+ times
--since YYYY-MM-DD # Only entries from this date
--embeddings [token|sentence-transformers|openai] # Similarity backend
Architecture
flowchart TB
subgraph CLI["CLI Layer"]
cli["cli.py"]
end
subgraph Core["Core Logic"]
distill["distill.py<br/>Pattern extraction"]
skills["skills.py<br/>Deduplication + scoring"]
stats["stats.py<br/>Analytics"]
embeddings["embeddings.py<br/>Similarity backends"]
ops["core/operations.py<br/>status, promote, reject"]
end
subgraph Render["Render Adapters"]
claude_md["claude_md.py"]
settings_json["settings_json.py"]
skill_render["skill.py"]
end
subgraph MCP["MCP Server"]
server["server.py"]
tools["tools.py"]
end
cli --> distill
cli --> skills
cli --> stats
skills --> embeddings
skills --> distill
ops --> skills
ops --> Render
tools --> ops
server --> tools
Data Flow
flowchart LR
MD["buildlog/*.md"] --> Parse["Parse markdown"]
Parse --> Extract["Extract Improvements"]
Extract --> Distill["distill_all()"]
Distill --> Patterns["Patterns by category"]
Patterns --> Dedup["Deduplicate"]
Dedup --> Score["Calculate confidence"]
Score --> Rules["Rules with IDs"]
Rules --> Format["Format output"]
Format --> Out["YAML / JSON / Markdown / Agent Skill"]
Installation Options
# Basic install
pip install buildlog
# With local semantic embeddings (offline)
pip install buildlog[embeddings]
# With OpenAI embeddings
pip install buildlog[openai]
# Everything
pip install buildlog[all]
# Development
pip install buildlog[dev]
# With MCP server for Claude Code integration
pip install buildlog[mcp]
MCP Server (Claude Code Integration)
The MCP server lets Claude Code interact with your buildlog rules directly. Your agent can review learned patterns, promote them to rules, or reject false positives—all through natural conversation.
Setup for Claude Code CLI
-
Install with MCP support:
pip install buildlog[mcp] # or with uv uv pip install buildlog[mcp]
-
Add to your Claude Code settings (
~/.claude/settings.json):{ "mcpServers": { "buildlog": { "command": "buildlog-mcp", "args": [] } } }
-
Start a new Claude Code session. The buildlog tools will be available.
Available Tools
| Tool | Description |
|---|---|
buildlog_status |
Get rules grouped by category with confidence scores |
buildlog_promote |
Write rules to CLAUDE.md, settings.json, or Agent Skills |
buildlog_reject |
Mark rules to exclude from future suggestions |
buildlog_diff |
Show rules pending review (not yet promoted/rejected) |
Promotion Targets via MCP
# Append to CLAUDE.md (default)
buildlog_promote(skill_ids=["arch-123"], target="claude_md")
# Merge into settings.json
buildlog_promote(skill_ids=["arch-123"], target="settings_json")
# Create Anthropic Agent Skill (NEW!)
buildlog_promote(skill_ids=["arch-123"], target="skill")
Example Conversation
You: What patterns have I learned?
Claude: [calls buildlog_status]
Based on 23 buildlog entries, you have 31 rules:
High confidence (ready to promote):
- arch-b0fcb62a1e: "Always validate inputs at the boundary"
- wf-96f12966f1: "Run tests before committing"
Would you like me to add these to your CLAUDE.md or create an Agent Skill?
You: Create an Agent Skill so they load on-demand.
Claude: [calls buildlog_promote with target="skill"]
Created skill at .claude/skills/buildlog-learned/SKILL.md
This skill will load on-demand when relevant to your work,
saving context for when you need it most.
Philosophy
1. Write Fast, Not Pretty
Refrigerator to-do list energy. Get it down before you forget.
2. Never Delete Mistakes
Wrong turns are the most valuable content. They're what makes tutorials actually useful.
3. Include the Journey
"We tried X, it failed because Y, so we did Z" > "We did Z"
4. Capture Improvements
Concrete learnings, not vague observations. "Always validate at boundary" > "validation is important"
5. Quality Bar
Each entry should be publishable as a $500+ tutorial. Real error messages. Honest about what didn't work. Code that runs.
Contributing
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing) - Commit your changes (
git commit -m 'Add amazing feature') - Push to the branch (
git push origin feature/amazing) - Open a Pull Request
License
MIT License — see LICENSE for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file buildlog-0.2.0.tar.gz.
File metadata
- Download URL: buildlog-0.2.0.tar.gz
- Upload date:
- Size: 40.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
20687c8affb69dfd92544088b6e11264839c6fd22a77bc4a5ff5ed3982969393
|
|
| MD5 |
df1d57e60430d7cc1fc5e2dff3e9a19b
|
|
| BLAKE2b-256 |
c6b3d64cd494259f8d27c58054dbbaf1720f58b227d6004172186fc9c057674b
|
File details
Details for the file buildlog-0.2.0-py3-none-any.whl.
File metadata
- Download URL: buildlog-0.2.0-py3-none-any.whl
- Upload date:
- Size: 53.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f01a8babd5382415ac95406426ce097c65212cf82ecde2ef3c8cedb049b7da54
|
|
| MD5 |
b9349e76a2a3024d752e3f9dd5148e9d
|
|
| BLAKE2b-256 |
559165c2931282007708cbce4304b752a49372d5ece0bce341236802e16708e0
|