Engineering notebook for AI-assisted development

These details have not been verified by PyPI

Project links

Project description

buildlog

Engineering Notebook for AI-Assisted Development

Capture your work as publishable content. Include the fuckups.

Chaos to Order - buildlog transforms messy development sessions into structured knowledge

Quick Start · The Pipeline · Commands · Philosophy

The Problem

You're pairing with AI on real work. Hours of debugging, wrong turns, "oh shit" moments, and hard-won insights—all vanishing into chat history the moment you close the tab.

Meanwhile, your AI agent makes the same mistakes on similar problems because it has no memory of what you learned together.

The Solution

buildlog captures the signal from AI-assisted development sessions and transforms it into:

Publishable content - Each entry is a $500+ tutorial draft
Structured insights - Categorized learnings ready for analysis
Agent rules - Deduplicated, confidence-scored rules that improve AI behavior

flowchart LR
    A["Raw Sessions<br/>(ephemeral)"] --> B["Buildlog Entries<br/>(structured markdown)"]
    B --> C["Distilled Insights<br/>(categorized patterns)"]
    C --> D["Agent Rules<br/>(deduplicated + scored)"]
    D --> E["CLAUDE.md<br/>settings.json<br/>Agent Skills"]

    style A fill:#ff6b6b,color:#fff
    style B fill:#4ecdc4,color:#fff
    style C fill:#45b7d1,color:#fff
    style D fill:#96ceb4,color:#fff
    style E fill:#dda0dd,color:#fff

Key Concepts

Term	What it means
Entry	A structured markdown file documenting one work session
Insight	A single learning extracted from an entry's Improvements section
Pattern	Raw insights grouped by category (architectural, workflow, etc.)
Rule	Deduplicated insight with stable ID, confidence score, and source tracking
Agent Skill	Rules promoted to `.claude/skills/` for on-demand loading by Claude

Features

Structured Capture

Templates with six required sections ensure you never forget to document the mistakes that teach the most.

Pattern Distillation

Extract categorized insights from all your entries into structured JSON/YAML for analysis.

Semantic Deduplication

"Run tests before commit" and "Always execute the test suite prior to committing" are the same insight. buildlog merges them.

Confidence Scoring

Rules are scored based on frequency and recency. High-confidence rules have been reinforced multiple times recently.

Multiple Promotion Targets

Promote rules to CLAUDE.md, settings.json, or Anthropic Agent Skills (.claude/skills/) for on-demand loading.

Pluggable Embeddings

Token-based similarity by default. Upgrade to sentence-transformers or OpenAI for semantic understanding.

Quick Start

# Install
pip install buildlog

# Initialize in your project
buildlog init

# Create an entry for today's work
buildlog new auth-api

# After a few entries, extract patterns
buildlog distill

# Generate deduplicated rules
buildlog skills

The Pipeline

buildlog is a three-stage pipeline that transforms ephemeral work into durable knowledge:

flowchart TB
    subgraph Stage1["Stage 1: Capture"]
        A1["buildlog new slug"] --> A2["Edit markdown entry"]
        A2 --> A3["Document: Goal, Journey,<br/>Tests, Code, Improvements"]
    end

    subgraph Stage2["Stage 2: Distill"]
        B1["buildlog distill"] --> B2["Parse all entries"]
        B2 --> B3["Extract Improvements sections"]
        B3 --> B4["Group by category"]
    end

    subgraph Stage3["Stage 3: Promote"]
        C1["buildlog skills"] --> C2["Deduplicate similar insights"]
        C2 --> C3["Calculate confidence scores"]
        C3 --> C4["Generate stable IDs"]
        C4 --> C5["Promote to target"]
    end

    Stage1 --> Stage2 --> Stage3

    C5 --> D1["CLAUDE.md"]
    C5 --> D2["settings.json"]
    C5 --> D3[".claude/skills/"]

Stage 1: Capture (`buildlog new`)

Create structured entries as you work. Each entry has six sections:

Section	Purpose
The Goal	What you're building and why
What We Built	Architecture diagram, components
The Journey	Chronological narrative including mistakes
Test Results	Actual commands, actual outputs
Code Samples	Key snippets with context
Improvements	Categorized learnings for next time

The Improvements section is structured for machine extraction:

### Architectural
- Always validate inputs at the boundary, not conditionally
- Use frozen dataclasses for immutable data containers

### Workflow
- Run the test suite after EVERY code change, not just at the end
- Write the integration test first to clarify the API contract

### Tool Usage
- The `patch` context manager for date mocking is cleaner than fixtures
- Use `jwt.io` to decode tokens instead of console.log

### Domain Knowledge
- `datetime.utcnow()` is deprecated in Python 3.12+
- Supabase storage returns 400, not 404, for missing files

Stage 2: Distill (`buildlog distill`)

Extract all insights across entries into structured data:

buildlog distill                    # JSON to stdout
buildlog distill -o patterns.yaml   # Write to file
buildlog distill --since 2026-01-01 # Filter by date
buildlog distill --category workflow # Filter by category

Output:

{
  "patterns": {
    "architectural": [
      {"insight": "Always validate inputs at boundary...", "source": "2026-01-16-auth.md"}
    ],
    "workflow": [...],
    "tool_usage": [...],
    "domain_knowledge": [...]
  },
  "statistics": {
    "total_patterns": 47,
    "by_category": {"architectural": 12, "workflow": 15, ...}
  }
}

Stage 3: Generate Rules (`buildlog skills`)

Transform raw patterns into deduplicated, scored rules:

buildlog skills                           # YAML to stdout
buildlog skills -o rules.yml              # Write to file
buildlog skills --format markdown         # For CLAUDE.md injection
buildlog skills --min-frequency 2         # Only repeated patterns
buildlog skills --embeddings openai       # Semantic deduplication

Output:

generated_at: '2026-01-16T12:00:00Z'
source_entries: 23
total_skills: 31
skills:
  architectural:
    - id: arch-b0fcb62a1e
      rule: Always validate inputs at the boundary, not conditionally
      frequency: 4
      confidence: high
      sources: [auth.md, api.md, validation.md, forms.md]
      tags: [api, error]
    - id: arch-0cda924aeb
      rule: Frozen dataclasses should be the default for data containers
      frequency: 2
      confidence: medium
      sources: [models.md, dto.md]
      tags: [python]

Patterns vs Rules

Patterns are raw extractions—every insight from every entry, exactly as written.

Rules are processed patterns with:

Property	Description
Stable ID	Same rule always gets same ID (SHA-256 based)
Deduplication	Similar insights merged, frequency tracked
Confidence	high/medium/low based on frequency + recency
Sources	Which entries contributed to this rule
Tags	Auto-extracted technology/concept keywords

Deduplication in Action

Raw patterns from different entries:

- "Run tests before committing"
- "Always run the test suite before commit"
- "Execute tests prior to committing code"

After deduplication → 1 rule with frequency: 3:

- id: wf-96f12966f1
  rule: Run tests before committing
  frequency: 3
  confidence: high

Promotion Targets

Rules can be promoted to different targets for agent consumption:

flowchart LR
    R["Rules<br/>(buildlog_status)"] --> T1["CLAUDE.md<br/>(append)"]
    R --> T2["settings.json<br/>(merge)"]
    R --> T3[".claude/skills/<br/>(Agent Skill)"]

    T1 --> A1["Always loaded<br/>in context"]
    T2 --> A2["Project settings<br/>for Claude Code"]
    T3 --> A3["On-demand loading<br/>saves context"]

    style T3 fill:#96ceb4,color:#fff

Target	File	When to Use
`claude_md`	`CLAUDE.md`	Rules always in context (default)
`settings_json`	`.claude/settings.json`	Project-level Claude Code settings
`skill`	`.claude/skills/buildlog-learned/SKILL.md`	On-demand loading - rules load only when relevant

Anthropic Agent Skills (New!)

The skill target creates an Anthropic Agent Skill that Claude loads on-demand:

# Via MCP tool
buildlog_promote(skill_ids=["arch-123", "wf-456"], target="skill")

Creates .claude/skills/buildlog-learned/SKILL.md:

---
name: buildlog-learned
description: Project-specific patterns learned from development history.
  Use when writing code, making architectural decisions, reviewing PRs,
  or ensuring consistency. Contains 12 rules across Architectural, Workflow.
---

# Learned Patterns

*12 rules extracted from buildlog entries on 2026-01-16*

## Must Follow (High Confidence)

These patterns have been reinforced multiple times.

### Architectural
- Always validate inputs at the boundary
- Use dependency injection for testability

### Workflow
- Run tests after EVERY code change

## Should Consider (Medium Confidence)

These patterns appear frequently but may have exceptions.

### Tool Usage
- Prefer `patch` context manager for date mocking

Why Agent Skills?

On-demand loading - Rules only load when Claude determines they're relevant
Saves context - Not always in context like CLAUDE.md
Progressive disclosure - Claude asks before loading the full skill

Live Usage Scenario

Here's how your learned rules actually get used:

┌─────────────────────────────────────────────────────────────────┐
│ You: "Review this authentication endpoint I wrote"              │
└─────────────────────────────────────────────────────────────────┘
                              │
                              ▼
┌─────────────────────────────────────────────────────────────────┐
│ Claude sees: "authentication" + "review" + "endpoint"           │
│                                                                 │
│ Checks skill description:                                       │
│   "Use when writing code, making architectural decisions,       │
│    reviewing PRs, or ensuring consistency..."                   │
│                                                                 │
│ Match! Loads .claude/skills/buildlog-learned/SKILL.md           │
└─────────────────────────────────────────────────────────────────┘
                              │
                              ▼
┌─────────────────────────────────────────────────────────────────┐
│ Claude now has access to YOUR learned rules:                    │
│                                                                 │
│   Must Follow:                                                  │
│   - Password hashing belongs in User model, not route handler   │
│   - Always validate inputs at the boundary                      │
│                                                                 │
│   Worth Knowing:                                                │
│   - bcrypt.compare() arg order is (plaintext, hash)             │
│   - JWT expiry is in seconds, not milliseconds                  │
└─────────────────────────────────────────────────────────────────┘
                              │
                              ▼
┌─────────────────────────────────────────────────────────────────┐
│ Claude: "I notice the password hashing is in your route         │
│ handler. Based on patterns from your buildlog, this should      │
│ be a pre-save hook in the User model instead.                   │
│                                                                 │
│ Also, I see bcrypt.compare(hash, password) - the argument       │
│ order should be (plaintext, hash). This has tripped you up      │
│ before."                                                        │
└─────────────────────────────────────────────────────────────────┘

Your past mistakes now prevent future ones—automatically.

Embedding Backends

Deduplication uses text similarity. Choose your backend:

Backend	Install	Use Case
`token` (default)	Built-in	Fast, free, good for obvious duplicates
`sentence-transformers`	`pip install buildlog[embeddings]`	Local semantic similarity, no API calls
`openai`	`pip install buildlog[openai]`	Best quality, requires API key

# Token-based (default) - catches "run tests" ≈ "run testing"
buildlog skills

# Semantic - catches "use Redis for caching" ≈ "cache data in Redis"
buildlog skills --embeddings sentence-transformers

# OpenAI - best quality semantic matching
export OPENAI_API_KEY=...
buildlog skills --embeddings openai

Comparison

Input	Token	OpenAI
"Run tests before commit" ≈ "Run testing before committing"	Merged	Merged
"Use Redis for caching" ≈ "Cache data in Redis"	Separate	Merged

Practical Usage

1. Inject Rules into CLAUDE.md

buildlog skills --format markdown >> CLAUDE.md

Your AI agent now has access to every lesson you've learned:

## Learned Rules

Based on 23 buildlog entries, 31 actionable rules have emerged:

### Architectural (8 rules)
- Always validate inputs at the boundary (seen 4x)
- Use frozen dataclasses for data containers (seen 2x)

### Workflow (12 rules)
- Run tests after EVERY code change (seen 5x)
...

2. Create an Agent Skill

For on-demand loading instead of always-in-context:

# Via CLI (coming soon)
buildlog promote --target skill

# Via MCP
buildlog_promote(skill_ids=["arch-123"], target="skill")

3. Track Rule Evolution

Rules have stable IDs. Track which are reinforced over time:

# This week's new rules
buildlog skills --since 2026-01-10 -o this-week.yml

# Compare to baseline
diff baseline.yml this-week.yml

4. Find Your Blind Spots

buildlog stats --detailed

Buildlog Statistics

Entries: 23 total
Coverage: 87% with improvements

Category Breakdown:
  architectural:    12 insights (26%)
  workflow:         15 insights (33%)
  tool_usage:        8 insights (17%)
  domain_knowledge: 11 insights (24%)

Warnings:
  - 3 entries have empty Improvements sections

Commands

Command	Description
`buildlog init`	Initialize in current directory
`buildlog new <slug>`	Create entry for today
`buildlog new <slug> --date 2026-01-15`	Create entry for specific date
`buildlog list`	List all entries
`buildlog distill`	Extract patterns from all entries
`buildlog stats`	Show statistics and analytics
`buildlog skills`	Generate deduplicated rules
`buildlog update`	Update templates to latest

Skills Options

--output, -o PATH       # Write to file instead of stdout
--format [yaml|json|markdown]  # Output format (default: yaml)
--min-frequency N       # Only rules seen N+ times
--since YYYY-MM-DD      # Only entries from this date
--embeddings [token|sentence-transformers|openai]  # Similarity backend

Architecture

flowchart TB
    subgraph CLI["CLI Layer"]
        cli["cli.py"]
    end

    subgraph Core["Core Logic"]
        distill["distill.py<br/>Pattern extraction"]
        skills["skills.py<br/>Deduplication + scoring"]
        stats["stats.py<br/>Analytics"]
        embeddings["embeddings.py<br/>Similarity backends"]
        ops["core/operations.py<br/>status, promote, reject"]
    end

    subgraph Render["Render Adapters"]
        claude_md["claude_md.py"]
        settings_json["settings_json.py"]
        skill_render["skill.py"]
    end

    subgraph MCP["MCP Server"]
        server["server.py"]
        tools["tools.py"]
    end

    cli --> distill
    cli --> skills
    cli --> stats

    skills --> embeddings
    skills --> distill

    ops --> skills
    ops --> Render

    tools --> ops
    server --> tools

Data Flow

flowchart LR
    MD["buildlog/*.md"] --> Parse["Parse markdown"]
    Parse --> Extract["Extract Improvements"]
    Extract --> Distill["distill_all()"]
    Distill --> Patterns["Patterns by category"]
    Patterns --> Dedup["Deduplicate"]
    Dedup --> Score["Calculate confidence"]
    Score --> Rules["Rules with IDs"]
    Rules --> Format["Format output"]
    Format --> Out["YAML / JSON / Markdown / Agent Skill"]

Installation Options

# Basic install
pip install buildlog

# With local semantic embeddings (offline)
pip install buildlog[embeddings]

# With OpenAI embeddings
pip install buildlog[openai]

# Everything
pip install buildlog[all]

# Development
pip install buildlog[dev]

# With MCP server for Claude Code integration
pip install buildlog[mcp]

MCP Server (Claude Code Integration)

The MCP server lets Claude Code interact with your buildlog rules directly. Your agent can review learned patterns, promote them to rules, or reject false positives—all through natural conversation.

Setup for Claude Code CLI

Install with MCP support:

pip install buildlog[mcp]
# or with uv
uv pip install buildlog[mcp]

Add to your Claude Code settings (~/.claude/settings.json):

{
  "mcpServers": {
    "buildlog": {
      "command": "buildlog-mcp",
      "args": []
    }
  }
}

Start a new Claude Code session. The buildlog tools will be available.

Available Tools

Tool	Description
`buildlog_status`	Get rules grouped by category with confidence scores
`buildlog_promote`	Write rules to CLAUDE.md, settings.json, or Agent Skills
`buildlog_reject`	Mark rules to exclude from future suggestions
`buildlog_diff`	Show rules pending review (not yet promoted/rejected)

Promotion Targets via MCP

# Append to CLAUDE.md (default)
buildlog_promote(skill_ids=["arch-123"], target="claude_md")

# Merge into settings.json
buildlog_promote(skill_ids=["arch-123"], target="settings_json")

# Create Anthropic Agent Skill (NEW!)
buildlog_promote(skill_ids=["arch-123"], target="skill")

Example Conversation

You: What patterns have I learned?

Claude: [calls buildlog_status]
        Based on 23 buildlog entries, you have 31 rules:

        High confidence (ready to promote):
        - arch-b0fcb62a1e: "Always validate inputs at the boundary"
        - wf-96f12966f1: "Run tests before committing"

        Would you like me to add these to your CLAUDE.md or create an Agent Skill?

You: Create an Agent Skill so they load on-demand.

Claude: [calls buildlog_promote with target="skill"]
        Created skill at .claude/skills/buildlog-learned/SKILL.md

        This skill will load on-demand when relevant to your work,
        saving context for when you need it most.

Philosophy

1. Write Fast, Not Pretty

Refrigerator to-do list energy. Get it down before you forget.

2. Never Delete Mistakes

Wrong turns are the most valuable content. They're what makes tutorials actually useful.

3. Include the Journey

"We tried X, it failed because Y, so we did Z" > "We did Z"

4. Capture Improvements

Concrete learnings, not vague observations. "Always validate at boundary" > "validation is important"

5. Quality Bar

Each entry should be publishable as a $500+ tutorial. Real error messages. Honest about what didn't work. Code that runs.

Contributing

Fork the repository
Create a feature branch (git checkout -b feature/amazing)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing)
Open a Pull Request

License

MIT License — see LICENSE for details.

Your AI pair programmer should learn from your mistakes.

buildlog makes that possible.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.23.0

Mar 14, 2026

0.22.0

Mar 14, 2026

0.21.1

Mar 12, 2026

0.21.0

Mar 12, 2026

0.20.0

Mar 7, 2026

0.18.4

Feb 15, 2026

0.18.2

Feb 14, 2026

0.18.1

Feb 14, 2026

0.15.0

Feb 13, 2026

0.13.1

Feb 7, 2026

0.13.0

Feb 6, 2026

0.12.0

Feb 5, 2026

0.11.1

Feb 5, 2026

0.11.0

Feb 5, 2026

0.10.5

Feb 4, 2026

0.10.4

Feb 4, 2026

0.10.3

Feb 4, 2026

0.10.2

Feb 4, 2026

0.10.1

Feb 4, 2026

0.10.0

Feb 4, 2026

0.9.0

Feb 2, 2026

0.8.0

Jan 31, 2026

0.7.0

Jan 22, 2026

0.6.1

Jan 22, 2026

0.6.0

Jan 22, 2026

0.5.0

Jan 22, 2026

0.4.0

Jan 17, 2026

0.3.0

Jan 17, 2026

This version

0.2.0

Jan 16, 2026

0.1.0

Jan 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

buildlog-0.2.0.tar.gz (40.5 kB view details)

Uploaded Jan 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

buildlog-0.2.0-py3-none-any.whl (53.0 kB view details)

Uploaded Jan 16, 2026 Python 3

File details

Details for the file buildlog-0.2.0.tar.gz.

File metadata

Download URL: buildlog-0.2.0.tar.gz
Upload date: Jan 16, 2026
Size: 40.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for buildlog-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`20687c8affb69dfd92544088b6e11264839c6fd22a77bc4a5ff5ed3982969393`
MD5	`df1d57e60430d7cc1fc5e2dff3e9a19b`
BLAKE2b-256	`c6b3d64cd494259f8d27c58054dbbaf1720f58b227d6004172186fc9c057674b`

See more details on using hashes here.

File details

Details for the file buildlog-0.2.0-py3-none-any.whl.

File metadata

Download URL: buildlog-0.2.0-py3-none-any.whl
Upload date: Jan 16, 2026
Size: 53.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for buildlog-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f01a8babd5382415ac95406426ce097c65212cf82ecde2ef3c8cedb049b7da54`
MD5	`b9349e76a2a3024d752e3f9dd5148e9d`
BLAKE2b-256	`559165c2931282007708cbce4304b752a49372d5ece0bce341236802e16708e0`

See more details on using hashes here.

buildlog 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

buildlog

Engineering Notebook for AI-Assisted Development

The Problem

The Solution

Key Concepts

Features

Structured Capture

Pattern Distillation

Semantic Deduplication

Confidence Scoring

Multiple Promotion Targets

Pluggable Embeddings

Quick Start

The Pipeline

Stage 1: Capture (buildlog new)

Stage 2: Distill (buildlog distill)

Stage 3: Generate Rules (buildlog skills)

Patterns vs Rules

Deduplication in Action

Promotion Targets

Anthropic Agent Skills (New!)

Live Usage Scenario

Embedding Backends

Comparison

Practical Usage

1. Inject Rules into CLAUDE.md

2. Create an Agent Skill

3. Track Rule Evolution

4. Find Your Blind Spots

Commands

Skills Options

Architecture

Data Flow

Installation Options

MCP Server (Claude Code Integration)

Setup for Claude Code CLI

Available Tools

Promotion Targets via MCP

Example Conversation

Philosophy

1. Write Fast, Not Pretty

2. Never Delete Mistakes

3. Include the Journey

4. Capture Improvements

5. Quality Bar

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Stage 1: Capture (`buildlog new`)

Stage 2: Distill (`buildlog distill`)

Stage 3: Generate Rules (`buildlog skills`)