How Am I Doing — local-only self-audit & coaching for Claude Code sessions

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

dv-hart

These details have not been verified by PyPI

Project description

How Am I Doing (HAID)

A self-audit and coaching layer for Claude Code sessions.

HAID reads your own Claude Code session transcripts, builds a graph of what happened, and produces annotated, coaching-oriented reports. The aim is less "here is your bill" and more "here is where you and the agent diverged, why, and what to change."

Nothing leaves your machine unless you explicitly choose to submit aggregate metrics.

Status: Phase 1 complete; the full scoring stack now runs on real sessions — the deterministic pipeline runs end to end on real transcripts (163 tests, stdlib-only, no model in the loop):

Session parsing (src/haid/session/) — forest-aware JSONL parsing: dedup, branch/rewind classification, subagent stitching, overflow resolution, SQLite cache.

Session graph (src/haid/graph/) — L0 spine + L1 action/IO graph (reads/produces/edits from structuredPatch), signatures, per-timeline scoping.

Waste metrics (src/haid/metrics/) — rereads, retries, retouched, unused_context: one rule each, run at session and window scope, as benchmarkable token-rates placed against a per-scope baseline.

Analysis window (src/haid/window.py) — the multi-session unit metrics run over (a project over a timeframe, default 30 days).

Scoring (src/haid/scoring/) — the relative achievement/cost value scorer (difficulty + cleanliness placement, volume, normalized-token cost, value combiner), calibration-validated. Built ahead of the earlier phases.

haid metrics (src/haid/metrics/{json_out,view}.py + CLI) — the measured substrate: four waste metrics at session and window scope, each placed against a per-scope baseline, as a Markdown inspection view + a JSON hand-off to the later "why" passes.

Bridge (src/haid/bridge/) — reconstructs an analysis window's net code diff from the transcript alone (replay, no git) plus its normalized-token cost, so haid bridge and haid value --project/--session now run the full scoring stack on real sessions (previously the scorer only ran on supplied/calibration diffs).

All validated on real transcripts (163 tests). The user-facing report and visualization are the final product, composing this substrate with the Phase-2/3 why-analysis and the value score. Next: the diagnosis router, episode segmentation (Phase 2), and the visualization (Phase 1.5). See plans/roadmap.md and plans/phase1-build.md.

Installation

HAID is on PyPI (stdlib-only, no dependencies, Python ≥ 3.10):

pip install haid

On Ubuntu/WSL without a venv set up, a user install works fine:

python3 -m pip install --user haid
# CLI lands at ~/.local/bin/haid — make sure that's on your PATH

Verify it works:

haid --help
haid metrics --project ~/path/to/some/project --days 30

haid metrics is fully deterministic (no model calls) and runs against the Claude Code transcripts already on your machine — it's the quickest smoke test.

Where to install: HAID reads transcripts from ~/.claude/projects/ on the machine where the sessions ran. If you use Claude Code inside WSL, install HAID inside WSL too. (A Windows-side install can still reach WSL transcripts via UNC --session paths like //wsl.localhost/Ubuntu/home/<user>/.claude/projects/<slug>/*.jsonl, but --project discovery won't cross the boundary.)

Activating in Claude Code

The haid CLI never calls a model itself. The full coaching pipeline (tag → episodes → score → why → report) is driven by Claude Code through the haid-report skill: the CLI writes job manifests at each model boundary, and the skill tells Claude how to fulfill them with subagents and resume.

Install the CLI (above) so haid is on the PATH of the machine/shell where Claude Code runs.

Copy the skill from this repo into Claude Code's skills directory:

# available in every project:
mkdir -p ~/.claude/skills/haid-report
cp .claude/skills/haid-report/SKILL.md ~/.claude/skills/haid-report/

# …or for a single project only:
mkdir -p <project>/.claude/skills/haid-report
cp .claude/skills/haid-report/SKILL.md <project>/.claude/skills/haid-report/

(If you're working inside this repo, the skill is already active — it's a project skill here.)

Start a new Claude Code session and ask "how am I doing?", or invoke /haid-report directly. Claude will run the chain and present the coaching report. For a zero-cost, fully deterministic answer, ask for the --digest-only report or just the waste metrics.

What this is not

Not another token counter. Raw usage accounting is already well covered (ccusage and similar). The entire value lives one layer up, in diagnosis and coaching — telling you not what you spent but how to get better. A tool that confidently misdiagnoses is worse than nothing, because people act on it, so trustworthiness of the diagnosis is the central design constraint throughout. See docs/trust-discipline.md.

The one big idea: the session graph

Underneath everything is one data structure: a graph of the session(s). Turns and tool-calls are nodes; edges capture responds-to, reads, and produces relationships. The two headline features are just two operations on this one graph:

"Why did you do X?" → a backwards traversal from X to its trigger.
"Where did the tokens go?" → a weighting over the same nodes.

Build the graph once; get both views from it. Design in docs/session-graph-design.md.

Two orthogonal analysis passes

User-anchored pass — catches misalignment. Works backwards from user messages; corrections are ground truth ("no, I meant…", "that's wrong").
Signature-scanning pass — catches silent inefficiency. Scans for objective, reasoning-free waste signatures (redundant re-reads, retry loops, re-touched lines, unused context).

The two are orthogonal: one finds where the agent did the wrong thing, the other where it did the right thing wastefully. See docs/architecture.md.

Documentation map

Doc	What's in it
docs/vision.md	The full concept, goals, and the canonical test case
docs/architecture.md	The two-pass method and how the pieces fit
docs/session-graph-design.md	Node/edge taxonomy, episodes, the two core operations
docs/detectors.md	Detector catalog + waste metrics as graph queries
docs/intent-taxonomy.md	Two-axis message classification + purpose timeline + drift
docs/scoring-rubric.md	Achievement vs. cost — the relative value verdict (revised; see ladder/playbook)
docs/difficulty-ladder.md	The validated difficulty scorer (reference ladder + placement)
docs/cleanliness-ladder.md	The cleanliness/parsimony scorer (reference ladder + placement)
docs/axis-calibration-playbook.md	Self-contained recipe to calibrate a new scoring axis (worked example: cleanliness; originality calibrated then dropped)
docs/calibration-pilot-1.md	Pilot report: why mined review-signals don't validate difficulty
docs/visualization.md	The time-layered bus diagram (left-in/right-out, bundled)
docs/claude-code-data-format.md	Verified Claude Code on-disk data reference
docs/data-inventory.md	Field catalog from 38 real sessions: what's auto-taggable vs. inferred
docs/data-structure-report.md	Real annotated records → the graph they produce (Tier 1 & Tier 2 walkthrough)
docs/trust-discipline.md	Cite-or-unknown, hedging, no-traceable-origin
docs/tooling-landscape.md	Existing tools and what to build on
docs/decisions/	Architecture Decision Records (ADRs)
plans/roadmap.md	Phased delivery plan
plans/mvp.md	The minimum thing that tests the core risk
plans/phase1-build.md	The concrete Phase-1 build sequence + progress
plans/open-questions.md	Decisions to make / behaviors to verify early

Repository layout

HAID/
├── README.md                 # you are here
├── docs/                     # design & reference documentation
│   └── decisions/            # ADRs
├── plans/                    # roadmap, MVP spec, open questions
├── research/                 # raw research reports (inputs to the docs)
├── calibration/              # the scoring-axis calibration harness (experiment code)
├── src/haid/                 # implementation
│   ├── session/              #   Phase-1 parse: forest model, subagents, overflow, cache
│   ├── graph/                #   L0 spine + L1 IO graph (incl. Bash read/write parsing)
│   ├── metrics/              #   the four waste metrics + baseline + `haid metrics` emitter
│   ├── window.py             #   the multi-session analysis window
│   ├── scoring/              #   relative value scorer (difficulty/cleanliness/volume/cost/value)
│   └── bridge/               #   transcript→(diff, usage) reconstruction (the bridge)
├── tests/                    # session/ graph/ metrics/ scoring/ bridge/ suites (+ fixtures/)
└── scripts/                  # build_metric_baselines.py + CLI helpers

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

dv-hart

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.0.16

Jun 30, 2026

0.0.15

Jun 29, 2026

0.0.14

Jun 29, 2026

0.0.13

Jun 28, 2026

0.0.12

Jun 28, 2026

0.0.11

Jun 28, 2026

0.0.10

Jun 27, 2026

0.0.9

Jun 27, 2026

0.0.8

Jun 25, 2026

0.0.7

Jun 25, 2026

0.0.6

Jun 25, 2026

0.0.5

Jun 25, 2026

0.0.3

Jun 24, 2026

This version

0.0.2

Jun 24, 2026

0.0.1

Jun 12, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

haid-0.0.2.tar.gz (203.7 kB view details)

Uploaded Jun 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

haid-0.0.2-py3-none-any.whl (235.2 kB view details)

Uploaded Jun 24, 2026 Python 3

File details

Details for the file haid-0.0.2.tar.gz.

File metadata

Download URL: haid-0.0.2.tar.gz
Upload date: Jun 24, 2026
Size: 203.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for haid-0.0.2.tar.gz
Algorithm	Hash digest
SHA256	`13a58c81df4148c41877c907fd24eda9b1751de2e6a4a554018fb9efc626127a`
MD5	`6a252bbe7ac59195a03bdc1e9cbd89ea`
BLAKE2b-256	`224fcd332af95796822ea2a8a303d16b3f0a08cbf039f533b801332e0ccf516a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for haid-0.0.2.tar.gz:

Publisher: publish.yml on dv-hart/haid

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: haid-0.0.2.tar.gz
- Subject digest: 13a58c81df4148c41877c907fd24eda9b1751de2e6a4a554018fb9efc626127a
- Sigstore transparency entry: 1935252191
- Sigstore integration time: Jun 24, 2026
Source repository:
- Permalink: dv-hart/haid@4b1b299887cb1ec461430bc477a3485a596565d3
- Branch / Tag: refs/tags/v0.0.2
- Owner: https://github.com/dv-hart
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@4b1b299887cb1ec461430bc477a3485a596565d3
- Trigger Event: push

File details

Details for the file haid-0.0.2-py3-none-any.whl.

File metadata

Download URL: haid-0.0.2-py3-none-any.whl
Upload date: Jun 24, 2026
Size: 235.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for haid-0.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`dcbf4939238dbd6f76fa1de0e046cc7a9187c41aeb1670902f02458037abc86f`
MD5	`b31f5e2568fc0fc2915ab6de48bff71f`
BLAKE2b-256	`36b289c812ffd308bb7924d821a4f7ad28a2495ceea48a1eae41d1361c734439`

See more details on using hashes here.

Provenance

The following attestation bundles were made for haid-0.0.2-py3-none-any.whl:

Publisher: publish.yml on dv-hart/haid

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: haid-0.0.2-py3-none-any.whl
- Subject digest: dcbf4939238dbd6f76fa1de0e046cc7a9187c41aeb1670902f02458037abc86f
- Sigstore transparency entry: 1935252213
- Sigstore integration time: Jun 24, 2026
Source repository:
- Permalink: dv-hart/haid@4b1b299887cb1ec461430bc477a3485a596565d3
- Branch / Tag: refs/tags/v0.0.2
- Owner: https://github.com/dv-hart
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@4b1b299887cb1ec461430bc477a3485a596565d3
- Trigger Event: push

haid 0.0.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

How Am I Doing (HAID)

Installation

Activating in Claude Code

What this is not

The one big idea: the session graph

Two orthogonal analysis passes

Documentation map

Repository layout

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance