Pin the 5-10 rules your AI must not drift from during long tasks — Claude / Codex / Cursor, pure engineering, zero LLM, zero runtime deps, ~50-70ms hook latency

These details have not been verified by PyPI

Project links

Project description

karma

🇬🇧 English (current) · 🇨🇳 中文

Keeps your AI from forgetting your rules in long tasks. Pure engineering, zero LLM, ~50-70ms hook latency.

karma demo — 5 scenes, animated SVG

5-scene animated SVG (~80s loop): (1) compact rule reminder injected on every user prompt, (2) real-time block on UI-stalling commands, (3) Agent shortcut attempt caught ("Let me just hardcode this case"), (4) Agent silent stop → nudge to keep pushing, (5) long-context accumulation → mid-conversation full rule reinject (auto-detects each model's decay point) — all real screencaps, no manual mocks.

Andrej Karpathy's CLAUDE.md teaches AI how to write good code. karma solves the other half — how to keep AI from drifting off your rules in long tasks, and how violations get caught and corrected before they pile up.

Two sides of the same loop:

🛡️ Pin your rules → Agent stays aligned. 5-10 core directions injected at every prompt header; real-time hook checks before tool calls; survives compact, locale switches, and backend switches.

✨ Say it in plain words → karma writes the rule. Type /karma <natural language> in Claude Code / Codex (or .cursor/skills/karma/ per-project for Cursor) and the karma skill rephrases your intent into the validated "collaborative agreement" tone, previews the injection text, confirms with you, then writes to rules.yaml. Auto-installed on Claude Code + Codex CLI by karma init (Cursor is project-scoped, see post-install hint).

Chinese + English auto-detected — open an issue if you'd like other languages supported.

Table of contents: Agents' honest take · Real problems · Quick install · How it works · /karma natural-language rule input · Usage effects · Performance · 8 hook coverage · What karma doesn't do · Honest boundaries · FAQ · Mental model · Docs

Agents' honest take

Claude (Opus 4.7): Like having a senior tech director reviewing every one of my actions in real time — tiring, but it really delivers. A lot of what I did well in this session got slapped into shape by karma + the user together; without those two layers, my output would have a lot more behavior-the-user-didn't-want and lazy excuses.

Codex (GPT 5.5): I noticed myself being "behaviorally nudged," but I didn't strongly feel "blocked or interrupted."

— That actually matches karma's current positioning: most of the time it sits like guardrails + background reminder noise, it only speaks up when you actually hit a rule.

Real problems you face

Real pain	Failure scene	How karma solves
"I said use long-term solutions, not patches" — 30 turns later the Agent patches again	Turn 1: you say "use the cleanest solution," Agent answers "got it." Turn 50: "let me patch this quickly." Your preference got diluted by new content.	Pin 5-10 core directions at the prompt header on every turn — the Agent sees them first, not last
"I said don't block the frontend — keep working while tests run" — Agent runs `sleep` anyway	Agent runs `sleep 30`, UI blocks for 30s, you watch the progress bar — Agent never realized this is "stuck waiting"	Real-time block of `sleep` / `wait` / long tasks without background mode, hit → deny before tool runs
After compact the Agent compressed my preferences into vague words	At 80K context, compact triggers; after SessionStart, Agent compresses "no patches" into "write clean code," intent lost	Auto-dump full rule state pre-compact; auto-reload + strong-inject post-compact restart
Long context accumulation → attention decay → Agent drifts	At 60-80K accumulated context, headers get diluted — Agent isn't ignorant, attention decayed	Per-model adaptive threshold (different decay points per model), auto-reinject mid-conversation when accumulation hits threshold
Agent sees a reminder → reacts defensively or rationalizes around it	LLMs trained to please users — when faced with a violation reminder, the first reaction is to self-justify or find the shortest patch around it, not to genuinely correct	Rephrase rule tone as "collaborative agreement" tone. The Agent reads "the user you're working with hopes…" and switches to "let me realign" instead of "let me defend"
Agent finishes one small step, then stops to ask "what's next?" (you're fully delegating)	You give a clear direction → Agent finishes step 1 → "What should I do next?" → you come back from other work and find the Agent has been idle for 30 minutes	Stop hook catches silent stops and injects a continuation nudge — up to 2 in a row, then it lets the Agent saturate if it genuinely is stuck
"I want to add a rule but writing yaml is heavy / my phrasing doesn't change Agent behavior"	You know what behavior you want, but writing the rule is its own chore — wrong `violation_keywords` format triggers false positives, wrong tone makes the Agent defensive	Type `/karma <natural language>` in Claude Code / Codex (or `.cursor/skills/karma/` per-project for Cursor) — the karma skill refines tone, formats keywords, detects overlap with existing rules, previews the injection, confirms with you, then writes. ~30 seconds end-to-end

Zero-dependency pure engineering, 10-second install

git clone https://github.com/jhaizhou-ops/karma.git ~/karma
cd ~/karma && python -m venv .venv && .venv/bin/python -m pip install -e .
.venv/bin/karma init && .venv/bin/karma install-hooks

Restart Claude Code / Codex CLI / Cursor — all hook positions + default rules take effect immediately. For custom rules, just type /karma <natural-language rule>.

Or ask your AI client to install it

Paste this to Claude Code / Codex / Cursor (desktop or CLI):

Install karma (github.com/jhaizhou-ops/karma) — a lightweight hook system that keeps my core direction preferences from being lost in long tasks.
Steps:
1. git clone to ~/karma
2. Create .venv and pip install -e .
3. Run `karma init` to initialize the default rule template
4. Run `karma install-hooks` to install for my current client
5. Run `karma doctor` to verify installation

After install, the Agent shows a summary of default rules — you see at a glance which 5-7 rules are active. To modify any rule afterward, tell the Agent "remove karma rule X" / "change karma rule Y" — it knows to use the /karma skill.

Per-client manual install commands

Client	Install command	Note
Claude Code	`karma install-hooks` (default)	Takes effect immediately
Codex CLI	`karma install-hooks --backend codex`	Auto-trusts karma wrappers via Codex `trusted_hash` — no manual `/hooks` approval. Details in docs/CODEX_BACKEND.md.
Cursor	`karma install-hooks --backend cursor`	Cursor 1.7+ required. Hooks fire on every IDE Agent session — restart Cursor after install. `/karma` skill is project-scoped only (Cursor doesn't expose home-level global skills); see post-install notes for how to copy `SKILL.md` per project.

Uninstall

.venv/bin/karma uninstall-hooks                                # Remove hooks
cp ~/.claude/settings.json.before-karma ~/.claude/settings.json # Restore original

Cursor: Agent transcripts (response-level checks)

When you need it: Sticky injection, read-before-write, Shell/MCP gates, etc. work without transcripts.
Transcripts help: keep-pushing, chinese-plain, loud-failure, and similar rules that inspect the last assistant message.
The afterAgentResponse hook also passes assistant text directly, so response-level checks often work even when transcript_path is missing on early events.

Desktop Agent usually does not need a manual toggle. After pinrule install-hooks --backend cursor and a normal Agent / Composer session, Cursor writes ~/.cursor/projects/<workspace>/agent-transcripts/<session>.jsonl and passes transcript_path in hook stdin. Dogfood: stop / afterAgentResponse always had a path; sessionStart is always null (expected — file not created yet).

No reliable CLI to “turn on”: No documented settings.json key and no cursor subcommand for this. pinrule cannot auto-flip an internal Cursor flag (unlike Codex trusted_hash).

Verify (one command for users or agents):

pinrule doctor    # macOS: parses latest Cursor Hooks log for transcript_path stats

Or manually: Output → Hooks → stop / preToolUse INPUT should show a non-null .jsonl path.

If stop stays null: Check Cursor privacy mode (local Agent retention off), open Agent in a real project folder, Reload Window, new Composer. Claude Code / Codex users skip this step.

Usage effects

After install + restart, here's what you'll see karma doing automatically:

1. Every prompt header injects full rules + drift markers

On every user prompt, your client prepends your 5-10 core directions plus a marker on any rule that drifted in the last response. The Agent reads them before anything else:

[karma — Your long-term agreement with the user]
You're collaborating with a real human user who listed several
long-term priorities. This isn't rules and isn't a judgment — these
are the collaborative agreements they hope to build with you.

1. The user trusts you to dig into root causes...
   〔Last response had drift on this one — let's realign this turn〕
2. When sleep / wait / long tasks are running, the user is waiting...
3. Your user is non-technical — they want comprehensible reports...

2. Mid-conversation refresh when context accumulates

LLM attention decays in long contexts — header content gets diluted by everything that came after it. karma tracks accumulation per tool call, and once the current model's decay point is hit (each model has its own), injects a concise refresh right at the boundary:

[karma — After long context, recall the agreement with the user]
Context has accumulated for a while. Reminding you of the
long-term priorities (no need to respond, just refresh in mind
to avoid future drift):
  ▸ long-term-fundamental: The user trusts you to dig into root causes...
  ▸ non-blocking-parallel: When sleep / wait / long tasks are running...
  ▸ chinese-plain-no-jargon: Your user is non-technical...

3. Real-time check before tool calls

Before the Agent runs Bash / Edit / Write, karma scans command content, keywords, and behavioral timing across the session. A hit denies the tool call with a targeted suggestion:

$ Bash sleep 30
karma ⚠️: 'non-blocking-parallel' violation — sleep periods make the user
        feel "stuck." Use run_in_background=True; the task completion
        will notify you, freeing you to do the next thing.
[permission deny]

karma also catches behavioral timing, not just single commands. Example: tests failed → Agent immediately edits a file it never read this session. Classic "shallow patch" pattern (no looking at source before changing):

$ Edit /workspace/src/foo.py
karma ⚠️: 'deep-fix-not-bypass' violation — editing foo.py right after
        test failure but you haven't Read it this session. Read the
        source first to find the real root cause; the issue may be
        upstream rather than at the patch site.
[permission deny]

4. Subagent coverage

When the main Agent spawns a subagent via the Task tool, karma injects the full rule set there too, with its own monitoring state. The subagent is held to the same standard as the main Agent; state cleans up on completion so it doesn't bleed into the main session.

5. Survives compact

When the client auto-compacts a long session, karma dumps the full rule state to disk first. After the post-compact restart, it reads the snapshot back and re-injects — rules don't get summarized into vague paraphrases.

6. Silent-stop nudge + short-term intent detection

When the Agent finishes a wave and tries to stop with "what's next?", karma catches it and injects a continuation nudge:

[karma — Your last response showed no next-step signal]
The user is fully-delegating — they expect you to immediately
continue after finishing a wave. If you need their judgment, ask
clearly; if you're truly saturated, say where you're stuck — don't
silently wait.
(Reminder 1/2)

Up to 2 nudges in a row. If the Agent is genuinely saturated and says where it's stuck, karma backs off — it won't force-push past real saturation.

karma also reads the Agent's whole turn output at Stop time and catches short-term intent declarations — the patch-instead-of-root-cause language pattern:

Agent: "Let me just hardcode this case for now and ship it."
karma ⚠️: 'long-term-fundamental' violation — declaring a short-term
        intent contradicts the user's expectation of root-cause work.
        Pause and ask: is the cleanest solution the user would want
        worth a few more minutes of thought?

The check is combo-pattern based (intent prefix + short-term action verb within 12 chars), not raw keyword matching — so reflective phrases like "short-term patches won't work, dig the root cause" pass through cleanly.

`/karma <natural language>` — Agent writes the rule for you

This is karma's other half — the partner side, not the monitor side.

You (in Claude Code):   /karma When I say "done" I want test pass evidence attached
                        Don't accept vague "should work" claims.

Agent (karma skill walks 7 steps automatically):
  ① Understand intent — flags anchor-vs-scope ambiguity if any
  ② Check existing rules — semantic overlap detection (modify vs add)
  ③ Draft yaml inline — collaborative-agreement tone, locale-aware
  ④ karma rule preview — schema + REGISTRY validation
  ⑤ Confirm with you — adjust wording / keywords / engine-check
  ⑥ karma rule add — atomic write to rules.yaml
  ⑦ Report — count, takes-effect timing, redundancy suggestions

→ 30 seconds end-to-end, rule live on next UserPromptSubmit.

Type /karma with no arguments anytime to see the interception dashboard — which engine checks are firing most, real-vs-false-positive distribution, keyword-only fallback share. The Agent reads the data and tells you which directions the Agent violates most in your sessions, so you can decide whether to add or drop a rule.

What the skill handles for you

The /karma skill helps you phrase a rule in the way Agent responds to best:

Hard part of writing a rule	What the skill does
Tone — "you must always X" backfires on LLMs	Rewrites in karma's "collaborative agreement" phrasing. Long-term testing shows LLMs respond with "let me align" rather than "let me argue"
Format — bare keywords trigger false positives	Converts to "intent-prefix + action" format (e.g. `"I'll hardcode"` not `"hardcode"`) so discussion vs. action is distinguishable
Overlap — accidentally adding a duplicate rule wastes a slot	4-row decision table on overlap shape (full duplicate / superset / keyword-overlap / no overlap); offers modify-existing path instead of bloating to 11 rules
Scope ambiguity — "during X, do Y" is often anchor not scope	Surfaces the ambiguity verbatim ("just to check: whenever we collaborate, or strictly during X?") instead of silently guessing
Locale — mixing English skill body for Chinese user	Detects user's chat language; writes Chinese `preference` for Chinese users, English for English users. Built-in `violation_checks` function names stay English (stable identifiers)
Modify vs add — no separate `rule replace` command	Knows the `remove + add` recipe atomically; preserves `id` so violation history stays linked

Four backends — three auto-install, one per-project

Backend	Path (auto-installed)	Trigger in client
Claude Code	`~/.claude/skills/karma/SKILL.md`	`/karma <natural language>`
Codex CLI	`~/.agents/skills/karma/SKILL.md` (note: `~/.agents/` shared with Anthropic)	`/skills` menu, `$karma <description>` inline, or auto-trigger
Cursor	Not auto-installed (project-scoped only) — Cursor has no home-level global skills directory. Copy `skills/karma/SKILL.md` to `.cursor/skills/karma/` in each project that needs it.	`/karma` (per-project) or use `karma rule add` CLI directly

The repository ships one Markdown source of truth at skills/karma/SKILL.md; karma install-skill writes raw Markdown per backend.

Updating the skill after a karma upgrade

karma install-skill --force          # overwrite all backends with current version
karma install-skill --backend codex  # update one backend only

Without --force, the new version is written to a .new sibling file so you can diff your local edits against upstream before deciding. karma doctor reports per-backend skill status.

Why it works

System architecture at a glance:

flowchart LR
    R[(rules.yaml<br/>5-10 core directions)]
    K[karma engine<br/>regex + counting<br/>zero LLM, ~50-70ms]
    A[🤖 Agent<br/>Claude / Codex / Cursor]
    V[(violations.jsonl<br/>audit history)]

    R ==>|inject every turn| K
    K ==>|prompt header| A
    A ==>|tool call / response| K
    K -.->|hit → deny + log| V
    V -.->|next-turn drift marker| K

rules.yaml is karma's single core rule list — the only thing you maintain. karma auto-injects it into every prompt header. karma's zero-network engineering scan reads Agent tool calls + Agent responses looking for signs of rule violation, then prompts / warns / blocks accordingly, and feeds detected drift back into the next turn's marker. No retrieval, no scoring, no LLM in the loop.

karma isn't a linter, a scorer, or a retrieval system. It addresses four real but commonly-overlooked LLM collaboration problems:

1. Long-context attention decay is real

Modern LLMs decay later than early ones, but they still decay. Rules at the conversation top get diluted by everything that came after, and after dozens of turns the Agent isn't ignoring them — its attention has just moved on. karma re-injects at the exact context length each model's decay starts.

2. Each conversation "re-forgets" everything

Every AI client works by re-sending the full context to the model on each turn — the model doesn't persistently remember anything between turns. Your stated preferences have to be re-sent every time. karma does that for you, so you don't have to repeat yourself.

3. "Collaborative agreement" tone reads differently than "rule system" tone

When an LLM sees warnings like "you must always follow X" or "⚠️ violation," the first reaction is to defend or to look for a workaround — that wording activates a "being scolded" frame.

karma rephrases rules as "the human user you're working with hopes…" — the Agent reads it as an agreement to honor, not a verdict to escape. In sustained self-use, this is the single change that moves the needle most on whether reminders actually get internalized.

4. Hook coverage has no blind spots

karma installs at 8 hook positions (detailed below) — not just "inject once at conversation start." Before / after every tool call, subagent start / stop, pre / post compact, silent Agent stop — every drift opportunity has a targeted injection or check.

Performance

Dimension	Number	Note
Runtime dependencies	Zero	Just PyYAML — a 15-year mature Python standard. No LLM API key, no network calls, no ML framework
Source code	~8.6K lines Python	Readable, modifiable, no magic
Quality gates	lint / type-check / dead-code / 775 unit tests, all green (CI: 4 matrix jobs ubuntu+macos × py3.11+3.12)	Plus continuous real-world dogfooding
Hook latency	typically 50-70ms (Python startup-bound, machine-dependent — author's M-series Mac ~49ms, 67ms reported on lower-end machines)	Well within AI client protocol budget of 200ms
Token cost (cumulative)	1.8K SessionStart baseline (one-time, re-sent every turn) + per-turn anchor only lists rules violated this session (v0.13.0+; empty session = 0 anchor) + auto-refresh at model decay threshold (Opus 60K / Sonnet 40K / Haiku 30K)	Real dogfood (30 sessions post-v0.13.0): 60% of work sessions = 0 anchor (clean drift, full passthrough); ~40% with violations save 45-80% vs v0.12.x; median accumulated violated rules per session = 1 (ship-time projection assumed 3-5 — actual is much lower). sessionStart baseline + PostToolUse mid-session reinject cover anti-decay; anchor focuses purely on drift signal
API input bill share	SessionStart prefix + per-turn new anchor/catalog; earlier prefix mostly prompt cache reads (~10% of input price on Anthropic)	~1–3% typical engineering session; drift-heavy or Cursor per-turn id catalog ~2–4%; v0.12 listed all ids every turn ~10–15% (mechanism estimate, not invoice reconciliation). Context window fill at end of long session is a separate metric (~4–6%)
Disk usage	< 10MB	Config + violation history + session state
Model adaptation	Per-model decay-point thresholds	Each major model uses its own measured decay point
Supported clients	Claude Code / Codex CLI / Cursor	Add a backend via HOWTO
User languages	Chinese + English, extensible	All 7 detection signals externalized to `data/signals/<name>/{zh,en}.txt` (flat phrases) or `.yaml` (Cartesian templates + word vocab). Adding a new language = ~7 small files, zero Python code

8 hook positions, all covered

Conversation lifecycle showing where each hook fires (GitHub renders the diagram below):

flowchart TB
    Start([session start / compact restart]) --> SS[SessionStart<br/>inject full rule baseline]
    SS --> UPS[UserPromptSubmit<br/>compact anchor + drift markers]
    UPS --> Tool{Agent calls<br/>Bash / Edit / Write?}
    Tool -- yes --> PreT[PreToolUse<br/>scan command + timing;<br/>hit → deny]
    PreT --> PostT[PostToolUse<br/>track state +<br/>mid-conversation reinject<br/>at decay threshold]
    PostT --> Stop[Stop<br/>strong reminder +<br/>continuation nudge +<br/>short-term intent detection]
    Tool -- no --> Stop
    UPS -. spawn subagent .-> SubS[SubagentStart<br/>subagent inherits rules]
    SubS --> SubE[SubagentStop<br/>temp state destroyed]
    Stop -. long context .-> PreC[PreCompact<br/>dump full rule state]
    PreC -.-> Start

Hook position	Function + scenario	Pain point solved
Every user prompt (UserPromptSubmit)	Header injects full rules + drift markers	Agent forgets your preferences after long session
Before every tool call (PreToolUse)	Keyword + engine-layer double-check; hit → deny	Agent wants to run sleep / commit --no-verify / bypass rules
After every tool call (PostToolUse)	Track file read/edit/bash state + auto mid-conversation refresh when accumulation hits threshold	Long context accumulation → attention decay → Agent drifts
Agent stops generating (Stop)	Terminal stderr ⚠️ + desktop notify + silent-stop reflective intervention + short-term intent talk detection	Agent finishes one wave and stops to ask, user gets interrupted repeatedly
Every session start (SessionStart)	Inject rule baseline at session start; on compact-restart, read snapshot for strong-inject	Rules don't get lost across sessions / across compacts
Before AI client compresses history (PreCompact)	Dump full rule state to disk for SessionStart to re-read	After compact, Agent compresses rules into vague words
Subagent starts (SubagentStart)	Subagent auto-inherits full rule set + writes independent monitoring state	Subagents running independent tasks leave monitoring gaps
Subagent ends (SubagentStop)	Subagent temporary state auto-destroys, doesn't pollute main session	Multiple subagent spawns cause state accumulation, main session data gets confused

All hook outputs strictly comply with the AI client's official protocol schema — no UI error messages.

Tried and rejected (what karma doesn't do)

Several ideas looked attractive but failed in practice. Recorded here so the same alleys don't get re-walked:

Tried	Why it was rejected
LLM auto-distilling new rules	Latency hurts UX, and auto-distilled rules introduce noise — hearing a user say something once doesn't mean it's a core direction. karma keeps users in charge of their 5-10 rules instead
Retrieval / cosine recall	The real pain is "persistence," not "recall" — 5-10 rules can all be always-on, no selection needed. Retrieval adds latency and matching errors with no upside
More than 12 rules	Beyond ~12, LLMs pattern-match "a rule list exists" instead of reading it (see Mnilax's 30-codebase empirical study for the compliance cliff). Keeping the count under 10 is the empirically safe zone
Competing with memory systems	"Facts / preferences about the user" belong in the AI client's built-in memory. karma only does the one thing memory systems don't: pin behaviors you've already repeated
Adding an LLM dependency	Latency and cost, both. Pure-engineering keeps the hook in the 50-70ms range and the install reproducible
Reward / RL scoring	Behavior reminders aren't reward functions. Scoring rules makes the model optimize the score, not the behavior
Blocking compact	Compact is the client's protection mechanism — karma shouldn't fight it. PreCompact dump + SessionStart re-read bridges the gap instead
"Must follow X / Fix immediately / Don't repeat" warning wording	Activates defense or workaround-seeking. The collaborative-agreement rephrase changes the first reaction to "let me align" — the biggest single lever for actual compliance
Hardcoded numeric thresholds in `suggested_fix`	"34% < 40%" gets optimized by padding Chinese characters instead of fixing readability. Goal descriptions ("readable without looking up words") avoid the gaming

Honest tool boundaries

karma is regex matching + counting, not LLM semantic understanding. That means:

False positives happen. Table cells quoting a term, python -c string literals, commit messages describing a violation — all can hit the regex. karma audit flags suspected false positives with "⚠️ possible false positive" so you can report them
False negatives happen. Regex can't tell when a user is intentionally disguising a violation. karma assumes you're not cheating yourself
Zero triggers after a fix doesn't prove the fix is correct. The pattern might just be too wide, swallowing real cases. Audit numbers are hints, not ground truth

Think of karma as sitting between git and a linter — it gives signals, you make decisions.

FAQ

Nothing happens after install?

Run karma doctor to check:

Are all hook events ✓? (Claude Code 8 / Codex 4 / Cursor 5)
Did rules load successfully?
Did session state directory generate new files?

Too many false positives, what to do?

karma audit shows triggers marked "⚠️ possible false positive" — report to the author (GitHub Issue). Temporarily disable a rule: karma rule remove <id> or edit ~/.claude/karma/rules.yaml and remove violation_keywords / violation_checks fields while keeping preference.

Does this overlap with Andrej Karpathy's CLAUDE.md?

Completely complementary, no overlap:

Karpathy's 12 rules (complete version) are universal coding principles (cross-user, cross-project): "Think before coding," "Simplicity first," etc.
karma's rules are per-user personal preferences (each user differs): "I prefer Chinese over jargon," "I want full-delegation," etc.

Recommended setup: install Karpathy's 12 rules in CLAUDE.md (project-shared) + install your personal rules via karma (user-level). They run on the same AI client without conflict.

Custom rule sets for non-development scenarios (writing / research / legal)?

karma init defaults to "software development" scenario. For other scenarios, write ~/.claude/karma/rules.yaml manually — the framework (hook injection / real-time interception) is cross-scenario universal, but the 8 built-in violation_checks are dev-oriented. Other scenarios may need preference text reminders + custom keywords (without check functions).

How do I sync rules across multiple devices?

Just ask the Agent to copy rules.yaml over — no special tooling needed:

mac:    cat ~/.claude/karma/rules.yaml
linux:  "here's my karma rules.yaml, write it to ~/.claude/karma/rules.yaml"
linux:  karma doctor    # validate schema + rules count + violation_checks exist

Safe to sync (user preference config):

~/.claude/karma/rules.yaml — your rule definitions
~/.claude/karma/config.yaml — your threshold tuning (if customized)

Never sync (runtime data, per-device):

~/.claude/karma/violations.jsonl — append-only per-device violation log
~/.claude/karma/session-state/*.json — runtime hook state

karma's cross-process atomicity protects same-machine concurrency, but doesn't extend to cloud-synced folders (iCloud / Dropbox / OneDrive). Putting ~/.claude/karma/ in a sync folder can corrupt runtime state across devices. If you use dotfiles repos / chezmoi / ansible, scope them to rules.yaml + config.yaml only.

Mental model

A rules file isn't a wishlist. It's a behavioral contract closing out specific failure modes you've observed. Each rule should answer: what error is this rule preventing?

karma works the same way. 6 rules targeting failures you've actually hit beats 12 rules where 6 are aspirational.

The 7 default rules in data/rules.dev.example.yaml are real pain points accumulated from self-use — they aren't a template to copy verbatim. After install, run karma rule list, keep what matches your own failure scenes, and replace the rest with your own (via /karma <natural language>).

Documentation

All listed docs are bilingual (.md English + .zh.md Chinese):

docs/PRD.md — Product requirements, validation criteria, scenario positioning
docs/ARCHITECTURE.md — Technical architecture, hook protocol, 8 check implementations
CHANGELOG.md — Version change history (bilingual from v0.5.1 onward; pre-v0.5.1 release notes are Chinese-only)
docs/HANDOFF.md — Internal development handoff entry (English summary; full timeline in .zh.md)
docs/CODEX_BACKEND.md — Codex backend ownership boundary and 8-method contract
CLAUDE.md — Project charter for Claude Code collaboration

Translation PRs welcome for any bilingual gap (HANDOFF.md still summary-only).

Acknowledgments

Andrej Karpathy's CLAUDE.md coding-principles template — universal coding principles. Complementary to karma, not competing: Karpathy teaches the model how to write code; karma keeps your preferences from drifting in long tasks
Mnilax's 30-codebase 6-week CLAUDE.md rule-count study — karma's "soft cap 10 / hard cap 12" design comes directly from this empirical work

Contributing

Bug reports / suggestions: GitHub Issues
Add a new AI client backend: see karma/backends/HOWTO.md
Add scenario rule templates (writing / research / legal etc.): PR welcome to data/

karma is still early — new-user install friction and first-week false positives drive most of the iteration.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.18.3

May 18, 2026

0.18.2

May 18, 2026

0.18.1

May 18, 2026

0.18.0

May 18, 2026

This version

0.17.1

May 18, 2026

0.17.0

May 18, 2026

0.16.18

May 18, 2026

0.16.17

May 17, 2026

0.16.16

May 17, 2026

0.16.15

May 17, 2026

0.16.14

May 17, 2026

0.16.13

May 17, 2026

0.16.12

May 17, 2026

0.16.11

May 17, 2026

0.16.10

May 17, 2026

0.16.9

May 17, 2026

0.16.8

May 17, 2026

0.16.7

May 17, 2026

0.16.6

May 17, 2026

0.16.5

May 17, 2026

0.16.4

May 17, 2026

0.16.3

May 17, 2026

0.16.2

May 17, 2026

0.16.1

May 17, 2026

0.16.0

May 17, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pinrule-0.17.1.tar.gz (584.1 kB view details)

Uploaded May 18, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pinrule-0.17.1-py3-none-any.whl (247.2 kB view details)

Uploaded May 18, 2026 Python 3

File details

Details for the file pinrule-0.17.1.tar.gz.

File metadata

Download URL: pinrule-0.17.1.tar.gz
Upload date: May 18, 2026
Size: 584.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.12

File hashes

Hashes for pinrule-0.17.1.tar.gz
Algorithm	Hash digest
SHA256	`8a384d9dbe80b3e7eca62f216ee217797b4b295ebffbe9172585799bdc8198a3`
MD5	`96a32bbe028da88ca5fbbf78fec3eeff`
BLAKE2b-256	`69f3e44ed7e4c5497abb0099efeeb783e1fbb9c1d70e5e49a1e25523890eee74`

See more details on using hashes here.

File details

Details for the file pinrule-0.17.1-py3-none-any.whl.

File metadata

Download URL: pinrule-0.17.1-py3-none-any.whl
Upload date: May 18, 2026
Size: 247.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.12

File hashes

Hashes for pinrule-0.17.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`513121a0a96b98ff6b1f623159ccfe4950dbbccbbab09a3786dbc894adb51aa8`
MD5	`1a71f6d2be329d1cc3fef78266ff1784`
BLAKE2b-256	`31794fae522a1b5ee73ce8d2b29b72e9071a9e9f5df4fec3b184f728e2ad35b0`

See more details on using hashes here.

pinrule 0.17.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

karma

Agents' honest take

Real problems you face

Zero-dependency pure engineering, 10-second install

Or ask your AI client to install it

Per-client manual install commands

Uninstall

Cursor: Agent transcripts (response-level checks)

Usage effects

1. Every prompt header injects full rules + drift markers

2. Mid-conversation refresh when context accumulates

3. Real-time check before tool calls

4. Subagent coverage

5. Survives compact

6. Silent-stop nudge + short-term intent detection

/karma <natural language> — Agent writes the rule for you

What the skill handles for you

Four backends — three auto-install, one per-project

Updating the skill after a karma upgrade

Why it works

1. Long-context attention decay is real

2. Each conversation "re-forgets" everything

3. "Collaborative agreement" tone reads differently than "rule system" tone

4. Hook coverage has no blind spots

Performance

8 hook positions, all covered

Tried and rejected (what karma doesn't do)

Honest tool boundaries

FAQ

Mental model

Documentation

Acknowledgments

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`/karma <natural language>` — Agent writes the rule for you