
TokenXRay

See where your AI coding tokens actually go.

We spent $104 in a single Claude Code session researching token waste. Then we audited ourselves and discovered something every developer using AI coding tools should know.

The Problem We Found

We analyzed 514 Claude Code sessions totaling $12,600+ in API costs. Here's what the data showed:

Session Segments:
  1-10 turns:   329 sessions (64%)  →  avg $0.19   →  <1% of total spend
  11-30 turns:   76 sessions (15%)  →  avg $4.01   →   2% of total spend
  31-100 turns:  61 sessions (12%)  →  avg $11.05  →   5% of total spend
  100+ turns:    48 sessions  (9%)  →  avg $241    →  92% of total spend
                                                       ^^^

9% of sessions burn 92% of the money. And within those sessions:

  • Your actual questions are 0.012% of total input tokens
  • 99% of tokens are the model re-reading old context
  • Cache creation (25% premium) is 51% of total cost — the single biggest expense
  • Tool results (bash output, file reads) ride in context forever, never pruned

No existing tool shows you this. Not your API dashboard. Not your billing page. We had to write custom scripts to see it. So we turned those scripts into TokenXRay.
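Those scripts boiled down to a few dozen lines of stdlib Python. A minimal sketch of that kind of audit, assuming each log line is a JSON object whose message.usage carries Anthropic-style counters (input_tokens, output_tokens, cache_read_input_tokens, cache_creation_input_tokens; verify the field names against your own logs):

```python
import json
from pathlib import Path
from collections import Counter

# Token counter fields assumed to appear under message.usage
USAGE_KEYS = (
    "input_tokens",
    "output_tokens",
    "cache_read_input_tokens",
    "cache_creation_input_tokens",
)

def tally_session(path: Path) -> Counter:
    """Sum per-turn usage counters across one session log."""
    totals = Counter()
    with path.open(encoding="utf-8") as fh:
        for line in fh:
            try:
                record = json.loads(line)
            except json.JSONDecodeError:
                continue  # skip partial or non-JSON lines
            msg = record.get("message") if isinstance(record, dict) else None
            usage = msg.get("usage", {}) if isinstance(msg, dict) else {}
            for key in USAGE_KEYS:
                totals[key] += usage.get(key, 0) or 0
    return totals

if __name__ == "__main__":
    for log in Path.home().glob(".claude/projects/**/*.jsonl"):
        totals = tally_session(log)
        if totals:
            print(log.stem, dict(totals))
```

Multiply each counter by the per-token price for the session's model and the cache-creation share jumps out immediately.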

Install

pip (recommended)

pip install tokenxray

From source (no venv needed)

git clone https://github.com/niajulhasan/tokenxray.git
cd tokenxray

# Option A: install it
pip install -e .

# Option B: just run it directly (zero setup)
PYTHONPATH=src python3 -m tokenxray

Zero external dependencies. Pure Python stdlib. No venv required.

Quick Start

# See all your sessions at a glance
tokenxray

# Deep dive into your most expensive session
tokenxray --session <id>

# Which projects are burning money?
tokenxray --projects

# Get actionable recommendations
tokenxray --diagnose

# Save a baseline, compare later
tokenxray --baseline
tokenxray --compare

# Interactive HTML dashboard
tokenxray --dashboard

# Export for spreadsheets
tokenxray --export csv > sessions.csv

# Install live cost tracking + auto-checkpoint in Claude Code
tokenxray --install-hook --confirm

What You'll See

Session Overview

TokenXRay - Session Overview
----------------------------------------------------------------------
  514 sessions    43,000+ total turns    $12,600+ total cost

  Segment Breakdown:
    1-10 turns:  329 sessions  avg  $0.19   total    $62   ░░░░░░░░░░  0%
         11-30:   76 sessions  avg  $4.01   total   $305   ░░░░░░░░░░  2%
        31-100:   61 sessions  avg $11.05   total   $674   █░░░░░░░░░  5%
          100+:   48 sessions  avg   $241   total $11,600  ████████░░ 92%

Session Deep Dive

TokenXRay - Session Deep Dive
----------------------------------------------------------------------
  Session:  060870e6-7cdd-425a-8e3b-8b9e29e7d3e8
  Duration: 2.4hrs
  Turns:    399

  Cost Breakdown:
    Cache read:           $47.30  (90% discount)
    Cache create:         $46.95  (25% premium)  <-- biggest cost!
    Output:               $10.20
    Total:                  $104

  The Waste Ratio:
    Your questions:          2.9K  (0.012% of input)
    Context re-sent:       24.4M  (99% waste from re-reading)

  Context Growth:
    Turn    1:    18.4K  ███░░░░░░░░░░░░░░░░░░░░░░░░░
    Turn  101:    67.7K  ██████████████░░░░░░░░░░░░░░░
    Turn  201:   131.3K  ████████████████████████████░░
    Turn  301:    52.3K  ███████████░░░░░░░░░░░░░░░░░░  (compacted)

Diagnosis

TokenXRay - Diagnosis & Recommendations
----------------------------------------------------------------------
  Potential savings: $16,000+

  [!!!] Marathon sessions are burning your wallet
        48 sessions with 100+ turns cost $11,600 (92% of total)
        Action: Start fresh sessions more often.

  [!!!] Cache creation is 51% of your total cost
        Paying $6,448 in cache creation fees (25% premium)
        Action: Reduce context growth with shorter responses.

  [!! ] Subagents cost $3,239 (26% of total)
        Action: Use direct tools (Grep, Read) over agents for lookups.

  [!  ] Opus usage: $12,600 across 237 sessions
        Same work on Sonnet would cost ~$2,500.
        Action: Use Sonnet for routine tasks.

Interactive Dashboard

Generate a self-contained HTML dashboard with charts and heatmaps:

tokenxray --dashboard

Opens an interactive dashboard with:

  • Cost breakdown by session segment (Chart.js bar/pie charts)
  • Session cost heatmap over time
  • Token type distribution (input, output, cache read, cache create)
  • Actionable recommendations based on your usage patterns

The dashboard is a single HTML file — no server, no dependencies, works offline.
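Producing a single self-contained file needs nothing beyond the stdlib. A minimal sketch of the approach (the template and data shape here are illustrative, not TokenXRay's actual dashboard):

```python
import json

# Inline the data as JSON so the page needs no server and works offline.
TEMPLATE = """<!DOCTYPE html>
<html>
<head><meta charset="utf-8"><title>Token report</title></head>
<body>
<h1>Sessions</h1>
<pre id="out"></pre>
<script>
const data = __DATA__;
document.getElementById("out").textContent = JSON.stringify(data, null, 2);
</script>
</body>
</html>
"""

def render(sessions):
    # json.dumps output is valid JavaScript, so it can be spliced in directly
    return TEMPLATE.replace("__DATA__", json.dumps(sessions))

with open("dashboard.html", "w", encoding="utf-8") as fh:
    fh.write(render([{"session": "abc123", "turns": 12, "cost": 0.42}]))
```

The same pattern scales to embedding a charting library inline, which is how a one-file dashboard stays dependency-free.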

Live Cost Tracking + Auto-Checkpoint

Install Claude Code hooks that track spending and auto-save session state:

tokenxray --install-hook --confirm

This installs two hooks:

Cost Hook (PostToolUse) — runs after every tool use:

  • Tracks running cost for each session
  • Alerts when you cross cost thresholds ($10, $25, $50, $100, $200, $500)
  • Shows periodic cost updates every 10 turns
  • Auto-checkpoints when sessions get expensive (80+ turns or $30+)

Sample output:

  [TokenXRay] Opus — turn 40, $12.50 total, ~$0.31/turn, ctx 85K
  [TokenXRay] $25.42 spent (crossed $25) — Opus, 142 turns, ctx 98K
  [TokenXRay] Consider splitting this session! (80 turns, $31.20, ctx 120K)
  [TokenXRay] Auto-checkpoint saved to .claude/checkpoint.md

Resume Hook (UserPromptSubmit) — runs when you start a new session:

  • Detects .claude/checkpoint.md from the previous session
  • Tells Claude to read it — full context restored automatically
  • Only fires once, then renames the file so it doesn't repeat

Zero user action required. Expensive session auto-saves, next session auto-resumes.
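For reference, Claude Code hooks are registered in settings (e.g. ~/.claude/settings.json), and the installed entries have roughly this shape. The command strings below are placeholders, not TokenXRay's actual entries; let --install-hook write the real ones:

```json
{
  "hooks": {
    "PostToolUse": [
      {
        "matcher": "",
        "hooks": [
          { "type": "command", "command": "<tokenxray cost hook command>" }
        ]
      }
    ],
    "UserPromptSubmit": [
      {
        "hooks": [
          { "type": "command", "command": "<tokenxray resume hook command>" }
        ]
      }
    ]
  }
}
```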

Configurable Guardrails

Customize thresholds via ~/.tokenxray/config.json:

{
    "split_turns": 80,
    "split_cost": 30,
    "alert_thresholds": [10, 25, 50, 100, 200, 500],
    "status_interval": 10
}

Key                Default                      Description
split_turns        80                           Warn + checkpoint after this many turns
split_cost         30                           Warn + checkpoint after this cost ($)
alert_thresholds   [10, 25, 50, 100, 200, 500]  Cost milestones that trigger alerts
status_interval    10                           Show status update every N turns
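Reading that file with safe fallbacks takes only a few lines of stdlib Python. A sketch of applying the defaults above (illustrative, not TokenXRay's actual loader):

```python
import json
from pathlib import Path

# Defaults mirror the table above
DEFAULTS = {
    "split_turns": 80,
    "split_cost": 30,
    "alert_thresholds": [10, 25, 50, 100, 200, 500],
    "status_interval": 10,
}

def load_config(path=None):
    path = Path(path) if path else Path.home() / ".tokenxray" / "config.json"
    config = dict(DEFAULTS)
    try:
        user = json.loads(path.read_text(encoding="utf-8"))
    except (OSError, json.JSONDecodeError):
        return config  # missing or malformed file: fall back to defaults
    if isinstance(user, dict):
        # Ignore unknown keys so a typo doesn't silently create a setting
        config.update({k: v for k, v in user.items() if k in DEFAULTS})
    return config
```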

Before / After Comparison

Save a baseline before changing your habits, then compare:

# Before: save your current state
tokenxray --baseline

# ... change habits, use Sonnet more, split sessions ...

# After: see the difference
tokenxray --compare

                      Baseline      Current       Delta
  Sessions                 514          520          +6
  Total cost            $6,414       $6,480        +$66
  Avg cost/session      $12.48       $12.46      -$0.02   <-- improving!

Supported Tools

Claude Code

Reads JSONL conversation logs from ~/.claude/projects/**/*.jsonl. Full token breakdown: input, output, cache read (90% discount), cache creation (25% premium).

Gemini CLI

Reads session JSON from ~/.gemini/tmp/*/chats/session-*.json. Token breakdown: input, output, cached, thinking tokens. No cache creation premium.

GitHub Copilot (VS Code)

Reads transcript JSONL from VS Code workspace storage. Limitation: Copilot marks token usage events as ephemeral (in-memory only), so token counts are estimated from message character lengths (~4 chars/token). Turn counts and tool usage are accurate.

# View only Claude sessions
tokenxray --source claude

# View only Gemini sessions
tokenxray --source gemini

# View only Copilot sessions
tokenxray --source copilot

# View everything (default)
tokenxray --source all
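The ~4 chars/token heuristic is simple enough to show. A sketch of the estimator (the ratio is a rough rule of thumb, not a real tokenizer):

```python
def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token estimate from character count (~4 chars/token)."""
    if not text:
        return 0
    # Any non-empty text costs at least one token
    return max(1, round(len(text) / chars_per_token))

print(estimate_tokens("x" * 200))   # prints 50
```

Good enough for ranking sessions by relative cost, but expect real tokenizer counts to differ, especially for code-heavy text.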

Your data stays local. TokenXRay reads files on your machine and writes to ~/.tokenxray/. Nothing is sent anywhere.

Supported Models

Model               Input        Output       Cache Read   Cache Create
Claude Opus 4.6     $15/MTok     $75/MTok     $1.50/MTok   $18.75/MTok
Claude Sonnet 4.6   $3/MTok      $15/MTok     $0.30/MTok   $3.75/MTok
Claude Sonnet 4.5   $3/MTok      $15/MTok     $0.30/MTok   $3.75/MTok
Claude Haiku 4.5    $0.80/MTok   $4/MTok      $0.08/MTok   $1.00/MTok
Gemini 2.5 Pro      $1.25/MTok   $10/MTok     $0.31/MTok   n/a
Gemini 2.5 Flash    $0.15/MTok   $0.60/MTok   $0.04/MTok   n/a
Copilot             (token counts estimated; see above)

Key Insights From Our Research

1. Cache creation is the hidden cost killer

Everyone talks about prompt caching saving money (and it does — 78% in our case). But nobody talks about cache creation costing 25% more than regular input. When your session context grows, every new token enters the cache at a premium. In our data, cache creation was 51% of total cost.
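The premium falls straight out of the price table: Opus cache creation is $18.75/MTok against $15/MTok for plain input. A quick check in Python, using the table's prices:

```python
# Opus prices from the table above, in $/MTok
PRICE = {"input": 15.0, "cache_read": 1.50, "cache_create": 18.75, "output": 75.0}

def cost(tokens: int, kind: str) -> float:
    return tokens / 1_000_000 * PRICE[kind]

# Cache creation carries a 25% premium over plain input
premium = PRICE["cache_create"] / PRICE["input"] - 1
print(f"{premium:.0%}")                 # prints 25%

# The same million tokens, three ways
print(cost(1_000_000, "input"))         # prints 15.0
print(cost(1_000_000, "cache_create"))  # prints 18.75
print(cost(1_000_000, "cache_read"))    # prints 1.5
```

Reads from cache are cheap; it's the steady stream of *new* context entering the cache that dominates a long session's bill.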

2. Your questions are noise in the token stream

Across 514 sessions, actual user questions averaged 0.01% of total input tokens. The other 99.99% is system prompts, tool definitions, conversation history, and tool results — all re-sent every turn.

3. Marathon sessions are exponentially expensive

A 100-turn session doesn't cost 10x a 10-turn session. It costs 100x+ because context accumulates and every new turn processes all previous context. The model re-reads your entire conversation history on every single turn.
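The compounding is easy to model. A toy sketch, assuming context grows by a flat 1K tokens per turn (real growth is lumpier, as the context chart above shows):

```python
def total_input_tokens(turns: int, growth_per_turn: int = 1_000) -> int:
    """Each turn re-sends all context accumulated so far."""
    return sum(turn * growth_per_turn for turn in range(1, turns + 1))

print(total_input_tokens(10))    # prints 55000
print(total_input_tokens(100))   # prints 5050000 (~92x more, not 10x)
```

Input tokens grow with the square of the turn count, which is why splitting one marathon into several short sessions is such a large win.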

4. Output verbosity compounds

A verbose 3K-token response doesn't just cost output tokens. That response enters the context and gets re-read on every subsequent turn. Over 100 more turns, that single response generates ~300K additional input tokens.
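The ~300K figure is plain multiplication; pricing it at the Opus cache-read rate (an assumption, since re-sent context is usually served from cache) shows the per-response cost:

```python
# One verbose reply, re-read by every later turn
response_tokens = 3_000
remaining_turns = 100

extra_input = response_tokens * remaining_turns
print(extra_input)                        # prints 300000

# At the Opus cache-read rate of $1.50/MTok, that re-reading costs:
print(extra_input * 1.50 / 1_000_000)     # prints 0.45
```

A few cents per reply sounds small, but multiply by hundreds of replies per session and dozens of sessions and it becomes the 51% cache-creation line item above.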

What's Next

TokenXRay gives you visibility (where tokens go) and guardrails (auto-checkpoint expensive sessions). Next up:

  • Session optimizer — intelligent context pruning and tool result summarization
  • Cache-layout optimization for maximum cache hits
  • Team usage dashboards

Requirements

  • Python 3.9+
  • Claude Code, Gemini CLI, and/or GitHub Copilot (reads their local session logs)
  • No external dependencies

License

MIT
