Session monitor and handoff tool for Claude Code

These details have not been verified by PyPI

Project links

Project description

ANVL

   █████████   ██████   █████ █████   █████ █████
  ███▒▒▒▒▒███ ▒▒██████ ▒▒███ ▒▒███   ▒▒███ ▒▒███
 ▒███    ▒███  ▒███▒███ ▒███  ▒███    ▒███  ▒███
 ▒███████████  ▒███▒▒███▒███  ▒███    ▒███  ▒███
 ▒███▒▒▒▒▒███  ▒███ ▒▒██████  ▒▒███   ███   ▒███
 ▒███    ▒███  ▒███  ▒▒█████   ▒▒▒█████▒    ▒███      █
 █████   █████ █████  ▒▒█████    ▒▒███      ███████████
▒▒▒▒▒   ▒▒▒▒▒ ▒▒▒▒▒    ▒▒▒▒▒      ▒▒▒      ▒▒▒▒▒▒▒▒▒▒▒
                  ⚒ forged by IronDevz

Session monitor for Claude Code — saves your quota by detecting inflated sessions.

The problem

Claude Code resends the entire conversation history on every turn. This is how the API works — there's no persistent memory, so each request includes everything: system prompt, your messages, Claude's responses, tool results, all of it.

On turn 1, this might be 150K tokens. By turn 20, it's 500K. By turn 50, you're sending 1M+ tokens just to get a 2K response back. Your quota burns exponentially.

ANVL detects when this is happening and rotates your session before it wastes quota.

Quick start

1. Install

pip install anvl-monitor

2. Verify it works

anvl --version

If you get anvl: command not found (or similar), Python's Scripts folder isn't in your PATH. Use this instead:

python -m anvl --version

Windows users: pip install puts anvl.exe in Python's Scripts/ folder (e.g. C:\Users\YOU\AppData\Local\Programs\Python\Python313\Scripts\). If anvl isn't recognized, either:
Use python -m anvl everywhere (always works), or
Add the Scripts folder to your PATH:
# PowerShell (run once, then restart terminal)
$scripts = python -c "import sysconfig; print(sysconfig.get_path('scripts'))"
[Environment]::SetEnvironmentVariable("Path", "$env:Path;$scripts", "User")

3. Initialize

anvl init

That's it. ANVL runs silently in the background via Claude Code hooks. You'll only see it when your session starts inflating:

[ANVL] Session health: 45% (5.4x waste). Consider starting a new conversation soon.

When it's critical, ANVL blocks the session entirely:

[ANVL] Session blocked -- too inflated to continue efficiently.
       Handoff saved: handoff.md
       Start a new conversation and say: "Read handoff.md and continue where I left off"

How health is calculated

ANVL measures session health as a percentage from 0% to 100%, based on a single metric: waste factor.

Waste factor

waste = peak_avg_tokens_per_turn / calibrated_baseline

Calibrated baseline = the global median of "typical turn cost" across all your sessions. Each session contributes its median tokens/turn from the first 5 turns. With 50+ sessions of data, this is a stable reference for what a normal Claude Code turn costs in your workflow.
Peak avg = the worst 5-turn window average across the entire session. Health never improves — once a session inflates, it stays marked as inflated.

A fresh session has waste = 1.0x. As the conversation grows and Claude resends more history, tokens/turn increases, waste goes up.

Health percentage

Health maps waste linearly from 100% (fresh) to 0% (critical), weighted by confidence:

raw_health = 100% × (10 - waste) / 9
confidence = min(1.0, turns / 20)
health = raw_health × confidence + 100% × (1 - confidence)

Young sessions are blended toward 100% because waste isn't reliable with few data points. As turns increase, confidence grows and health reflects real waste:

Waste	5 turns	10 turns	20+ turns
1.0x	100%	100%	100%
2.0x	97%	94%	88%
5.0x	88%	77%	55%
8.0x	80%	61%	22%
10.0x	75%	50%	0%

Why tokens/turn matters

Claude Code pricing uses different rates for different token types:

Token type	Relative cost	What it is
Input tokens	1.0x	New content in the prompt
Cache read	0.1x	Previously cached context (90% cheaper)
Cache creation	1.25x	Writing new content to cache
Output tokens	5.0x	Claude's response

In early turns, most input is cache reads (cheap). As the conversation grows, cache creation increases and more raw input is sent — the cost per turn grows even though the token count might seem similar.

ANVL measures total tokens per turn (all categories combined) because this reflects the actual volume of data being processed, regardless of caching.

How it works

You're working in Claude Code
         |
    ANVL monitors every turn via hooks
         |
    Tokens/turn growing? Waste going up?
         |
    ┌────┴────┐
    │  < 60%  │ ──→ Warning: "Consider starting a new conversation"
    │  < 30%  │ ──→ Auto-saves handoff.md
    │   = 0%  │ ──→ Blocks session (exit code 2)
    └─────────┘
         |
    You open a new conversation
         |
    "Read handoff.md and continue where I left off"
         |
    Fresh session — back to 100% health

Hooks

ANVL installs three Claude Code hooks:

Hook	When it runs	What it does
`UserPromptSubmit`	Before Claude processes your message	Checks health, warns or blocks
`PostToolUse`	After each tool call	Same check during autonomous work
`SessionStart`	When a new session opens	Injects handoff.md context if it exists

The SessionStart hook is what makes handoffs seamless — when you open a new session, Claude automatically knows there's a handoff.md to read.

Handoff

When ANVL detects a critically inflated session, it generates handoff.md containing:

Session summary and what was being worked on
Files that were created or modified
Commands that were run
The last few conversation turns
Pending/next steps

This gives the new session full context without carrying the token debt.

Installation

# From PyPI (recommended)
pip install anvl-monitor

# From source (for development)
git clone https://github.com/juanlumanmx29/anvl.git
cd anvl
pip install -e .

Requirements: Python 3.11+ | Only dependency: rich (installed automatically)

Note: On all platforms, python -m anvl works as an alternative to the anvl command.

Setup

anvl init

Run once. This:

Creates config at ~/.anvl/config.json
Installs hooks in Claude Code (UserPromptSubmit, PostToolUse, SessionStart)
Writes CLAUDE.md with instructions for Claude to handle handoffs

The hooks are global — once installed, ANVL monitors all sessions in all projects automatically. You don't need to run anvl init per project.

Commands

Command	Description
`anvl init`	First-time setup (config + hooks + CLAUDE.md)
`anvl status`	Current session health, waste, tokens breakdown
`anvl status --json`	Machine-readable output
`anvl sessions`	All sessions with health status
`anvl sessions --active`	Only active sessions
`anvl monitor`	Live terminal monitor (auto-refreshes)
`anvl calibrate`	View global calibration baseline
`anvl calibrate --reset`	Reset calibration data
`anvl handoff`	Generate handoff manually
`anvl report`	Multi-session report

Live monitor

anvl monitor

Shows a live dashboard with:

Health bar with percentage and waste factor
Global calibrated baseline (median turn cost across all sessions)
Table of all active sessions with input/output totals
Tokens wasted by inflation and tokens saved by rotation

Auto-refreshes every 2 seconds. Press Ctrl+C to exit.

Calibration

ANVL learns what a "normal" turn costs by collecting baselines from all your sessions globally. Each session that reaches 5+ turns contributes its median tokens/turn to the global baseline.

anvl calibrate

Shows the current global baseline, session count, and range. With 3+ sessions, calibration activates and health works from turn 1 — no warmup period needed.

Configuration

File: ~/.anvl/config.json

Field	Default	Description
`waste_threshold`	2	Waste factor to start showing warnings
`handoff_waste_threshold`	10	Waste factor to block session + auto-handoff
`min_turns_for_alert`	5	Minimum turns before any alerts fire
`window_hours`	5	Rolling window for quota tracking

Alert levels

Health	Waste	Action
60-100%	1-5x	No alerts — session is healthy
30-60%	5-8x	Warning: "Consider starting a new conversation"
1-30%	8-10x	Handoff auto-generated, strong warning
0%	≥10x	Session blocked (exit code 2), handoff saved

Blocking requires at least 20 turns — ANVL won't block a short session even if waste is high, because short sessions are cheap regardless.

Bypass: If ANVL blocks your session and you need to send one more message, type anvl bypass before your message. ANVL will skip all checks for that message.

FAQ

Q: Does ANVL slow down Claude Code? No. The hook runs a lightweight scan of the session file (~10ms). It doesn't parse the full JSONL — it uses a fast token counter.

Q: What if I don't want blocking? Set handoff_waste_threshold to a very high number (e.g., 9999) in your config. You'll still get warnings.

Q: How much quota does session rotation actually save? Depends on session length. A 50-turn session that gets rotated at turn 25 typically saves 40-60% of what it would have consumed. Run anvl sessions to see estimated savings.

License

MIT — see LICENSE

⚒ Forged by IronDevz

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.5

Apr 23, 2026

0.3.4

Apr 23, 2026

0.3.3

Apr 21, 2026

0.3.2

Apr 21, 2026

0.3.0

Apr 15, 2026

0.2.5

Apr 12, 2026

0.2.4

Apr 9, 2026

0.2.3

Apr 9, 2026

0.2.2

Apr 9, 2026

0.2.1

Apr 9, 2026

This version

0.2.0

Apr 9, 2026

0.1.0

Apr 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

anvl_monitor-0.2.0.tar.gz (44.1 kB view details)

Uploaded Apr 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

anvl_monitor-0.2.0-py3-none-any.whl (41.0 kB view details)

Uploaded Apr 9, 2026 Python 3

File details

Details for the file anvl_monitor-0.2.0.tar.gz.

File metadata

Download URL: anvl_monitor-0.2.0.tar.gz
Upload date: Apr 9, 2026
Size: 44.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for anvl_monitor-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`07b035880007570830a7ea5f7f4a946a48959a9de39347692346e7829dcb3441`
MD5	`27d320d57c831a94d3647051d7ba69ad`
BLAKE2b-256	`7d9ab5fa81a9544c31f8b424e66cdd5734dcf2f1ba23af3fb77790e095d893f5`

See more details on using hashes here.

File details

Details for the file anvl_monitor-0.2.0-py3-none-any.whl.

File metadata

Download URL: anvl_monitor-0.2.0-py3-none-any.whl
Upload date: Apr 9, 2026
Size: 41.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for anvl_monitor-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`044128942e767872ef127faf2916ef79cd6787298246162bdcafc284d3df4e3a`
MD5	`caf467cdc7e7e96e11db7a4bd84f4520`
BLAKE2b-256	`ea45af947fb6f4fb3534d2d2ab09eeecc4c48fbfff42f3d41fa0b22e44b90939`

See more details on using hashes here.

anvl-monitor 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ANVL

The problem

Quick start

1. Install

2. Verify it works

3. Initialize

How health is calculated

Waste factor

Health percentage

Why tokens/turn matters

How it works

Hooks

Handoff

Installation

Setup

Commands

Live monitor

Calibration

Configuration

Alert levels

FAQ

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes