Session monitor and handoff tool for Claude Code

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

ANVL

   █████████   ██████   █████ █████   █████ █████
  ███▒▒▒▒▒███ ▒▒██████ ▒▒███ ▒▒███   ▒▒███ ▒▒███
 ▒███    ▒███  ▒███▒███ ▒███  ▒███    ▒███  ▒███
 ▒███████████  ▒███▒▒███▒███  ▒███    ▒███  ▒███
 ▒███▒▒▒▒▒███  ▒███ ▒▒██████  ▒▒███   ███   ▒███
 ▒███    ▒███  ▒███  ▒▒█████   ▒▒▒█████▒    ▒███      █
 █████   █████ █████  ▒▒█████    ▒▒███      ███████████
▒▒▒▒▒   ▒▒▒▒▒ ▒▒▒▒▒    ▒▒▒▒▒      ▒▒▒      ▒▒▒▒▒▒▒▒▒▒▒
                  ⚒ forged by IronDevz

Session monitor for Claude Code — saves your quota by detecting inflated sessions.

The problem

Claude Code resends the entire conversation history on every turn. This is how the API works — there's no persistent memory, so each request includes everything: system prompt, your messages, Claude's responses, tool results, all of it.

On turn 1, this might be 150K tokens. By turn 20, it's 500K. By turn 50, you're sending 1M+ tokens just to get a 2K response back. Your quota burns exponentially.

ANVL detects when this is happening and rotates your session before it wastes quota.

Quick start

1. Install

pip install anvl-monitor

2. Verify it works

anvl --version

If you get anvl: command not found (or similar), Python's Scripts folder isn't in your PATH. Use this instead:

python -m anvl --version

Windows users: pip install puts anvl.exe in Python's Scripts/ folder (e.g. C:\Users\YOU\AppData\Local\Programs\Python\Python313\Scripts\). If anvl isn't recognized, either:
Use python -m anvl everywhere (always works), or
Add the Scripts folder to your PATH:
# PowerShell (run once, then restart terminal)
$scripts = python -c "import sysconfig; print(sysconfig.get_path('scripts'))"
[Environment]::SetEnvironmentVariable("Path", "$env:Path;$scripts", "User")

3. Initialize

anvl init

That's it. ANVL runs silently in the background via Claude Code hooks. You'll only see it when your session starts inflating:

[ANVL] Session health: 45% (5.4x waste). Consider starting a new conversation soon.

When it's critical, ANVL blocks the session entirely:

[ANVL] Session blocked -- too inflated to continue efficiently.
       Handoff saved: handoff.md
       Start a new conversation and say: "Read handoff.md and continue where I left off"

How health is calculated

ANVL measures session health as a percentage from 0% to 100%, based on a single metric: waste factor.

Waste factor (dual-signal)

ANVL uses a growth-aware waste calculation that combines two signals:

Signal A (relative): actual_growth / expected_growth_p75
Signal B (absolute): current_cost / fresh_session_cost
waste = max(signal_a, signal_b)

Signal A compares your session's growth rate against the historical p75 growth curve. If your session is growing faster than 75% of past sessions at the same turn count, waste goes up.
Signal B compares the current cost per turn against what a fresh session costs. This catches sessions that started expensive and stayed expensive.
Calibrated baseline = the global median of "typical turn cost" across all your sessions. Each session contributes its median tokens/turn from the first 5 turns. With enough sessions, this is a stable reference for what a normal Claude Code turn costs in your workflow.

A fresh session has waste = 1.0x. As the conversation grows and Claude resends more history, tokens/turn increases, waste goes up.

Health percentage

Health maps waste linearly from 100% (fresh, 1x) to 0% (critical, 15x):

health = 100% × (15 - waste) / 14

Sessions with fewer than 5 turns always show 100% — waste isn't reliable with few data points.

Waste	Health
1.0x	100%
3.0x	85%
5.0x	71%
8.0x	50%
12.0x	21%
15.0x	0%

Weighted token cost

The monitor shows a weighted cost per session that approximates real quota impact using API pricing ratios:

Token type	Weight	What it is
Input tokens	1.0x	New content in the prompt
Cache read	0.1x	Previously cached context (90% cheaper)
Cache creation	1.25x	Writing new content to cache
Output tokens	5.0x	Claude's response

This gives you a single number that reflects actual cost, not just raw token volume. A session burning 500K tokens mostly on cache reads costs far less than one burning 500K on fresh input + output.

How it works

You're working in Claude Code
         |
    ANVL monitors every turn via hooks
         |
    Tokens/turn growing? Waste going up?
         |
    ┌────┴────┐
    │  < 60%  │ ──→ Warning: "Consider starting a new conversation"
    │  < 30%  │ ──→ Auto-saves handoff.md
    │   = 0%  │ ──→ Blocks session (exit code 2)
    └─────────┘
         |
    You open a new conversation
         |
    "Read handoff.md and continue where I left off"
         |
    Fresh session — back to 100% health

Hooks

ANVL installs three Claude Code hooks:

Hook	When it runs	What it does
`UserPromptSubmit`	Before Claude processes your message	Checks health, warns or blocks
`PostToolUse`	After each tool call	Same check during autonomous work
`SessionStart`	When a new session opens	Injects handoff.md context if it exists

The SessionStart hook is what makes handoffs seamless — when you open a new session, Claude automatically knows there's a handoff.md to read.

Handoff

When ANVL detects a critically inflated session, it generates handoff.md containing:

Session summary and what was being worked on
Files that were created or modified
Commands that were run
The last few conversation turns
Pending/next steps

This gives the new session full context without carrying the token debt.

Installation

# From PyPI (recommended)
pip install anvl-monitor

# From source (for development)
git clone https://github.com/juanlumanmx29/anvl.git
cd anvl
pip install -e .

Requirements: Python 3.11+ | Only dependency: rich (installed automatically)

Note: On all platforms, python -m anvl works as an alternative to the anvl command.

Setup

anvl init

Run once. This:

Creates config at ~/.anvl/config.json
Installs hooks in Claude Code (UserPromptSubmit, PostToolUse, SessionStart)
Writes CLAUDE.md with instructions for Claude to handle handoffs

The hooks are global — once installed, ANVL monitors all sessions in all projects automatically. You don't need to run anvl init per project.

Commands

Command	Description
`anvl init`	First-time setup (config + hooks + CLAUDE.md)
`anvl status`	Current session health, waste, tokens breakdown
`anvl status --json`	Machine-readable output
`anvl sessions`	All sessions with health status
`anvl sessions --active`	Only active sessions
`anvl monitor`	Live terminal monitor (auto-refreshes)
`anvl calibrate`	View global calibration baseline
`anvl calibrate --reset`	Reset calibration data
`anvl handoff`	Generate handoff manually
`anvl report`	Multi-session report

Live monitor

anvl monitor

Shows a live dashboard with:

Health bar with percentage and waste factor per session
Weighted token cost (approximates real quota impact using API pricing ratios)
Global calibrated baseline (median turn cost across all sessions)
Tokens wasted by inflation and tokens saved by rotation
Update notification when a new version is available on PyPI

Auto-refreshes every 2 seconds. Press Ctrl+C to exit.

Calibration

ANVL learns what a "normal" turn costs by collecting baselines from all your sessions globally. Each session that reaches 5+ turns contributes its median tokens/turn to the global baseline.

anvl calibrate

Shows the current global baseline, session count, and range. With 3+ sessions, calibration activates and health works from turn 1 — no warmup period needed.

Configuration

File: ~/.anvl/config.json

Field	Default	Description
`waste_threshold`	2	Waste factor to start showing warnings
`handoff_waste_threshold`	10	Waste factor to block session + auto-handoff
`min_turns_for_alert`	10	Minimum turns before any alerts fire
`window_hours`	5	Rolling window for quota tracking

Alert levels

Health	Action
50-100%	No alerts — session is healthy
20-49%	Warning: "Consider starting a new conversation soon"
10-19%	Inflated — handoff auto-generated, strong warning
< 10%	Critical — handoff saved, session flagged for rotation

Alerts require at least 10 turns (min_turns_for_alert in config) — ANVL won't warn on short sessions even if waste is high, because short sessions are cheap regardless.

Bypass: If ANVL blocks your session and you need to send one more message, type anvl bypass before your message. ANVL will skip all checks for that message.

FAQ

Q: Does ANVL slow down Claude Code? No. The hook runs a lightweight scan of the session file (~10ms). It doesn't parse the full JSONL — it uses a fast token counter.

Q: What if I don't want blocking? Set handoff_waste_threshold to a very high number (e.g., 9999) in your config. You'll still get warnings.

Q: How much quota does session rotation actually save? Depends on session length. A 50-turn session that gets rotated at turn 25 typically saves 40-60% of what it would have consumed. Run anvl sessions to see estimated savings.

License

MIT — see LICENSE

⚒ Forged by IronDevz

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

IronDevz

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.3.5

Apr 23, 2026

0.3.4

Apr 23, 2026

0.3.3

Apr 21, 2026

This version

0.3.2

Apr 21, 2026

0.3.0

Apr 15, 2026

0.2.5

Apr 12, 2026

0.2.4

Apr 9, 2026

0.2.3

Apr 9, 2026

0.2.2

Apr 9, 2026

0.2.1

Apr 9, 2026

0.2.0

Apr 9, 2026

0.1.0

Apr 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

anvl_monitor-0.3.2.tar.gz (41.4 kB view details)

Uploaded Apr 21, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

anvl_monitor-0.3.2-py3-none-any.whl (41.1 kB view details)

Uploaded Apr 21, 2026 Python 3

File details

Details for the file anvl_monitor-0.3.2.tar.gz.

File metadata

Download URL: anvl_monitor-0.3.2.tar.gz
Upload date: Apr 21, 2026
Size: 41.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for anvl_monitor-0.3.2.tar.gz
Algorithm	Hash digest
SHA256	`dedb2e5e891e1afcaa26919556cdb46fdb1f1d94f7d25100b9adc48dc96cc6f3`
MD5	`cd5b883cf0cdc3affea7d055cdedb16d`
BLAKE2b-256	`a7b87ad082d6b0f205fe8e4aaf1771b48df0ecdbe437b48408898877a88ccf13`

See more details on using hashes here.

Provenance

The following attestation bundles were made for anvl_monitor-0.3.2.tar.gz:

Publisher: publish.yml on juanlumanmx29/anvl

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: anvl_monitor-0.3.2.tar.gz
- Subject digest: dedb2e5e891e1afcaa26919556cdb46fdb1f1d94f7d25100b9adc48dc96cc6f3
- Sigstore transparency entry: 1351200685
- Sigstore integration time: Apr 21, 2026
Source repository:
- Permalink: juanlumanmx29/anvl@2fc482e137a4eae09414483cbd24836193b35e36
- Branch / Tag: refs/tags/v0.3.2
- Owner: https://github.com/juanlumanmx29
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@2fc482e137a4eae09414483cbd24836193b35e36
- Trigger Event: push

File details

Details for the file anvl_monitor-0.3.2-py3-none-any.whl.

File metadata

Download URL: anvl_monitor-0.3.2-py3-none-any.whl
Upload date: Apr 21, 2026
Size: 41.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for anvl_monitor-0.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`15b7237fc85f1f66c05072b0c898675c93a761874fcd0ff46e6cd6931c071738`
MD5	`0efb3d7a80a0b6b1d264f3d71e9f8798`
BLAKE2b-256	`cca23accb6ae3fa1fde46afe17bb6aca8a9f63b592467811c573a850c4e3ee64`

See more details on using hashes here.

Provenance

The following attestation bundles were made for anvl_monitor-0.3.2-py3-none-any.whl:

Publisher: publish.yml on juanlumanmx29/anvl

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: anvl_monitor-0.3.2-py3-none-any.whl
- Subject digest: 15b7237fc85f1f66c05072b0c898675c93a761874fcd0ff46e6cd6931c071738
- Sigstore transparency entry: 1351200749
- Sigstore integration time: Apr 21, 2026
Source repository:
- Permalink: juanlumanmx29/anvl@2fc482e137a4eae09414483cbd24836193b35e36
- Branch / Tag: refs/tags/v0.3.2
- Owner: https://github.com/juanlumanmx29
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@2fc482e137a4eae09414483cbd24836193b35e36
- Trigger Event: push

anvl-monitor 0.3.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

ANVL

The problem

Quick start

1. Install

2. Verify it works

3. Initialize

How health is calculated

Waste factor (dual-signal)

Health percentage

Weighted token cost

How it works

Hooks

Handoff

Installation

Setup

Commands

Live monitor

Calibration

Configuration

Alert levels

FAQ

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance