Claude Code extension that watches agent tool calls and suggests structured alternatives

These details have not been verified by PyPI

Project links

Project description

Kibitzer

The person watching your chess game who can't help offering opinions.

Kibitzer is a Claude Code extension that watches how agents use tools and suggests better alternatives. It applies mode-based path protection to write tools (canonicalized, fail-closed) and to the Bash write vectors it can see statically, intercepts bash commands that have structured alternatives, and coaches agents toward more effective tool usage — all without an LLM in the decision loop. It is a guidance layer that composes under the Claude Code permission system and an OS-level sandbox, not a replacement for either (see Scope and limits).

Install

pip install kibitzer
cd your-project/
kibitzer init --hooks --mcp

This registers PreToolUse/PostToolUse hooks in .claude/settings.json and starts an MCP server with two tools the agent can call: ChangeToolMode and GetFeedback.

For richer coaching with Fledgling conversation analytics:

pip install kibitzer[fledgling]

What it does

Path protection

Each mode defines which paths the agent can write to. The path guard checks every Edit, Write, and NotebookEdit call, and statically scans Bash commands for common write vectors (redirections, tee, cp, mv, rm, sed -i, ln, ...).

Before comparing, the guard canonicalizes the target against the project directory — symlinks are followed, ./.. are collapsed, absolute and relative spellings of the same location compare equal — and matches by path segment, so src/ never matches src_secret/. The guard fails closed: an unknown mode, a missing target path, an unparseable Bash command in a restricted mode, or an internal guard error denies the write.

Mode        Writable                  Use case
─────────── ───────────────────────── ───────────────────────────
free        everything                prototyping, floor only
implement   src/, lib/, tests/, test/ normal dev (tests included)
test        tests/, test/, spec/      writing tests — source protected
docs        docs/, README.md          documentation only
explore     nothing                   read-only investigation
review      nothing                   read-only code review

Writable entries may also be absolute or ~-prefixed glob patterns, expanded at check time — the implement/test/docs defaults include ~/.claude/projects/*/memory/ so Claude auto-memory writes don't require a mode flip.

Independent of the mode, a safety floor deny-list runs first on every write (including statically extracted Bash write targets): secret dirs (~/.ssh, ~/.aws, ...), system paths, raw .git internals, and writes outside any git working tree are blocked even in free mode. It's configurable via [floor] in .kibitzer/config.toml and fails open on its own internal errors.

When a write is denied, the agent sees why and how to fix it:

Path 'notebooks/explore.ipynb' is not writable in the current mode (writable: ['src/', 'lib/', 'tests/', 'test/', '~/.claude/projects/*/memory/']).
Use the ChangeToolMode tool to switch modes.

In testing, agents consistently read this message and call ChangeToolMode to switch — no documentation or pre-training needed.

Scope and limits

Path protection is one layer, not a sandbox:

Bash coverage is static. The guard analyzes the command string; it is not a shell interpreter. Writes hidden behind interpreter one-liners (python -c ...), scripts fed on stdin, make targets, or git commands that mutate the tree are not visible to it. Opaque write constructs it can spot — xargs rm, process substitution targets, unparseable quoting — are denied in restricted modes rather than waved through.
Check-then-act window. Symlinks are resolved when the call is checked; a race that swaps a path component between the check and the actual write is not preventable at this layer.
free mode skips the mode guard by design (writable = ["*"]) and is the default; only the mode-independent safety floor still applies. Set a restricted default_mode in .kibitzer/config.toml if you want the full guard active from the first call.

For a hard boundary, compose kibitzer with the Claude Code permission system and an OS-level sandbox (e.g. an umwelt-compiled policy under nsjail). Kibitzer keeps a cooperating-but-fallible agent on task; the sandbox bounds what a misbehaving process can do.

Interception

Interceptor plugins watch Bash calls for commands that have structured alternatives:

Bash command	Suggested alternative	Plugin
`git add -A && git commit -m '...'`	`jetsam save`	jetsam
`pytest tests/`	`blq run test`	blq
`grep -rn 'def handler' src/`	`FindDefinitions(...)`	fledgling

Three interception modes form a ratchet — start in observe (log silently), graduate to suggest (show alternative), then redirect (deny bash, require structured tool). Each graduation is a one-line config change.

Coaching

The coach fires every N tool calls and detects patterns from ~250 experimental runs. Suggestions only reference tools the agent actually has — discovered from .mcp.json at runtime.

State-based patterns (always available):

Repeated edit failures — "Edit failed 3 times on src/handler.py. Try Read() first to see exact content."
Edit streak without tests — "You've made 7 edits without running tests." (mentions blq run test if blq is available)
Sequential file reads — "You've read 5 files one at a time." (mentions FindDefinitions if fledgling is available)
Bash-heavy usage — "You've run 6 bash commands without using structured tools."
Analysis loop — "You've spent 18 turns reading without changes. Start with the most confident fix."
Semantic tool underuse — "FindDefinitions shows all functions in one call." (only fires if fledgling is available)
Mode oscillation — "Frequent mode switches. Consider using free mode."

TDD patterns:

Test overfit — "test_auth.py has been edited 4 times. Stabilize test expectations before adjusting further."
Implement before test — "You edited source before writing tests. Consider starting with a failing test."

Fledgling-powered patterns (when fledgling is installed):

Repeated search patterns — "You've searched for 'def handle_request' 4 times via Grep."
Replaceable bash commands — "You've run 'grep' 3 times. FindDefinitions provides structured output."

All patterns are mode-aware: the analysis loop doesn't fire in explore mode (not editing is correct there), edit-without-test doesn't fire in docs mode (docs don't need tests).

Auto-transitions

The mode controller watches for failure patterns:

3+ consecutive failures → auto-switch to explore
20+ turns in explore → auto-switch back to implement

An oscillation guard prevents rapid switching: if the agent just left a mode (< 5 turns), it won't auto-switch back. After 6+ total switches, auto-transitions stop.

Composes with Claude Code's permission model

The Claude Code harness has its own permission system (per-tool allow/deny, prompting before sensitive actions). Kibitzer's modes layer on top of these permissions — not beside them. The two regulate at different granularities:

Harness permissions: coarse, universal, set at install — "can this agent ever call Bash?"
Kibitzer modes: fine, contextual, changed per task — "is Bash appropriate given the current mode?"

They compose via set intersection: a tool call succeeds iff the harness permits it AND (if kibitzer is active) the current mode permits it. Kibitzer can narrow the harness's permissions; it can never widen them.

When a call is denied, the response identifies which layer denied it so the agent can respond correctly — switching modes (kibitzer) or asking the user to grant permission (harness) are different remedies.

MCP tools

The agent can call two tools explicitly:

ChangeToolMode(mode, reason?) — Switch modes. Returns the new mode's writable paths and strategy.

GetFeedback(status?, suggestions?, intercepts?) — Check current status, get coaching suggestions, and see which bash commands have been intercepted.

Configuration

Override defaults in .kibitzer/config.toml:

# Monorepo: widen writable paths
[modes.implement]
writable = ["packages/core/src/", "packages/api/src/"]

# Add custom modes
[modes.deploy]
writable = ["infra/", "deploy/"]
strategy = "Verify before applying."

# Graduate jetsam to suggest mode
[plugins.jetsam]
mode = "suggest"

# More aggressive coaching
[coach]
frequency = 3

Python API

Use kibitzer from Python without the hook protocol:

from kibitzer import KibitzerSession

with KibitzerSession(project_dir=".") as session:
    # Check if a tool call is allowed
    result = session.before_call("Edit", {"file_path": "src/auth.py"})

    # Record a completed tool call
    session.after_call("Edit", {"file_path": "src/auth.py"}, success=True)

    # Batch-validate a program's planned calls
    violations = session.validate_calls([
        {"tool": "Edit", "input": {"file_path": "tests/foo.py"}},
    ])

Full API docs at kibitzer.readthedocs.io/python-api.

Coordinates with

Kibitzer suggests but never wraps these tools — each is independent:

blq — structured build/test capture
jetsam — git workflow acceleration
Fledgling — AST-aware code intelligence

None are required. Kibitzer degrades gracefully — path guard and coach work with nothing else installed. When tools are available, suggestions reference them specifically. When they're not, suggestions give generic advice.

Documentation

Full docs at kibitzer.readthedocs.io:

Modes — path protection, switching, auto-transitions
Coach — all patterns, experimental evidence, model dependency
Interceptors — the observe/suggest/redirect ratchet
Configuration — full config.toml reference, resilience, optional deps
Architecture — how the pieces fit together
Integration — blq, jetsam, fledgling, superpowers

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.9.0

Jul 19, 2026

0.8.2

Jun 29, 2026

0.8.1

Jun 28, 2026

0.8.0

May 27, 2026

0.3.2

Apr 12, 2026

0.3.0

Apr 1, 2026

0.2.1

Mar 30, 2026

0.2.0

Mar 30, 2026

0.1.5

Mar 30, 2026

0.1.4

Mar 30, 2026

0.1.3

Mar 29, 2026

0.1.2

Mar 29, 2026

0.1.1

Mar 29, 2026

0.1.0

Mar 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kibitzer-0.9.0.tar.gz (200.4 kB view details)

Uploaded Jul 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kibitzer-0.9.0-py3-none-any.whl (70.4 kB view details)

Uploaded Jul 19, 2026 Python 3

File details

Details for the file kibitzer-0.9.0.tar.gz.

File metadata

Download URL: kibitzer-0.9.0.tar.gz
Upload date: Jul 19, 2026
Size: 200.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for kibitzer-0.9.0.tar.gz
Algorithm	Hash digest
SHA256	`670968652a8161d38e3cc44951368690e1929674e2ed337d85f226ae7834668b`
MD5	`001bf6539484f8fc1d19234fa66c822a`
BLAKE2b-256	`8ad944ed96dad3ffa34589acdeca4e1582735d1b28e7a682c671c4fbcc3fbffd`

See more details on using hashes here.

Provenance

The following attestation bundles were made for kibitzer-0.9.0.tar.gz:

Publisher: publish.yml on teaguesterling/kibitzer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kibitzer-0.9.0.tar.gz
- Subject digest: 670968652a8161d38e3cc44951368690e1929674e2ed337d85f226ae7834668b
- Sigstore transparency entry: 2195901196
- Sigstore integration time: Jul 19, 2026
Source repository:
- Permalink: teaguesterling/kibitzer@b214eccea380beb941c1ef7f8278edaf5460e0a9
- Branch / Tag: refs/tags/v0.9.0
- Owner: https://github.com/teaguesterling
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@b214eccea380beb941c1ef7f8278edaf5460e0a9
- Trigger Event: push

File details

Details for the file kibitzer-0.9.0-py3-none-any.whl.

File metadata

Download URL: kibitzer-0.9.0-py3-none-any.whl
Upload date: Jul 19, 2026
Size: 70.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for kibitzer-0.9.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3ae7f9d4e3e68d32ef48fb05914d338f03cf82937e4035da1906ddfcb71e7b03`
MD5	`631c3b970343f30a56e8eb9597748a45`
BLAKE2b-256	`5f0c3568601243e15085dbec1e7bcf0f506ae3f4d962e586089fd5a166404503`

See more details on using hashes here.

Provenance

The following attestation bundles were made for kibitzer-0.9.0-py3-none-any.whl:

Publisher: publish.yml on teaguesterling/kibitzer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kibitzer-0.9.0-py3-none-any.whl
- Subject digest: 3ae7f9d4e3e68d32ef48fb05914d338f03cf82937e4035da1906ddfcb71e7b03
- Sigstore transparency entry: 2195901248
- Sigstore integration time: Jul 19, 2026
Source repository:
- Permalink: teaguesterling/kibitzer@b214eccea380beb941c1ef7f8278edaf5460e0a9
- Branch / Tag: refs/tags/v0.9.0
- Owner: https://github.com/teaguesterling
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@b214eccea380beb941c1ef7f8278edaf5460e0a9
- Trigger Event: push

kibitzer 0.9.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Kibitzer

Install

What it does

Path protection

Scope and limits

Interception

Coaching

Auto-transitions

Composes with Claude Code's permission model

MCP tools

Configuration

Python API

Coordinates with

Documentation

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance