Skip to main content

Your AI activity ledger — reads every coding agent's local sessions (Claude Code, Codex, Hermes, OpenClaw, OpenHuman, Cursor) and shows what your tokens did.

Project description

tokenpayback

Your AI activity ledger — every agent, every activity, 100% local.
One CLI. Reads every coding agent on your machine. Tells you what your tokens did.

MIT PyPI stars local private

🌐 Live demo · 📦 Install · 🤖 Supported agents · 🔒 Privacy


The problem

You're paying for Claude Code + Codex + maybe Cursor, Hermes, OpenClaw, OpenHuman. Each agent keeps its own session log on your disk. No tool tells you what those tokens did across all of them.

Cost dashboards show you how many tokens you burned. None of them tell you:

  • Was this a new feature or a brainstorm?
  • Did the agent finally fix the bug or just dance around it?
  • How much of the spend went into shipping code vs. answering questions vs. organizing your life?

tokenpayback reads every agent's local data, classifies every session via LLM, and shows you the answer. It runs on your machine. Data never leaves.

This week — what your $264 of AI tokens did
─────────────────────────────────────────────
🚢 Code shipped         14 PRs · 3,120 lines
🐛 Bug fixed             3 sessions
🧹 Code cleaned          2 sessions
⚙️  Infra changed         5 sessions
📚 Info gathered         6 sessions
💡 Ideas explored        4 sessions
🎯 Life shipped          2 sessions (resumes, video drafts)
❓ Question answered     11 sessions

Every category gets credit. Asking a question and getting an answer is value. Sketching out an idea is value. Code is just one shape of value.


Supported agents

tokenpayback auto-detects which of these you have installed and reads each one:

Agent Local path Status
Claude Code ~/.claude/projects/ ✅ Full support
Codex CLI ~/.codex/sessions/ ✅ Full support
Hermes (Nous Research) ~/.hermes/ 🟡 Beta — SQLite reader
OpenClaw 🦞 ~/.openclaw/ or ~/Library/Application Support/OpenClaw/ 🟡 Beta — auto-detect
OpenHuman (tinyhumans.ai) ~/.openhuman/ 🟡 Beta — SQLite reader
Cursor ~/Library/Application Support/Cursor/User/ 🟡 Beta — composer data
Local proxy (anything that hits an LLM API) ~/.tokenpayback/proxy_log.jsonl ✅ Universal capture

Each agent has its own parser file in tokenpayback/parsers/. Adding a new agent = one file. PRs welcome.

Universal capture: the local proxy

For tools that don't keep local logs (your own scripts, OpenRouter clients, HuggingFace API calls, anything OpenAI-compatible), run tokenpayback proxy and point your tool at it:

# in one shell
tokenpayback proxy start --upstream openrouter --port 4000
# Reads OPENROUTER_API_KEY from env, forwards traffic, logs locally

# in another shell — point any tool at the proxy
export OPENAI_BASE_URL=http://localhost:4000/v1
export OPENAI_API_KEY=anything   # replaced by the proxy with your real key
# now run your script / aider / langchain / curl — all calls are captured

# anthropic-style tools
tokenpayback proxy start --upstream anthropic --port 4001
export ANTHROPIC_BASE_URL=http://localhost:4001

Supported upstreams out of the box: anthropic, openai, openrouter, groq, mistral, deepseek, huggingface, paigod. Add your own in ~/.tokenpayback/proxy.yaml. Set TOKENPAYBACK_PROXY_REDACT=1 to hash prompts before logging if you want extra paranoia.

tokenpayback proxy start    # default: anthropic on :4000
tokenpayback proxy status   # is it running?
tokenpayback proxy stop
tokenpayback proxy log      # tail of the captured traffic

Install

Requires Python 3.9+.

pipx install tokenpayback     # recommended (isolated env)
# or
pip install --user tokenpayback

You'll need an LLM API key for session classification — set ONE:

export ANTHROPIC_API_KEY=sk-ant-...        # recommended — you probably already have one
export OPENAI_API_KEY=sk-...
export LITELLM_API_KEY=...
export LITELLM_BASE_URL=https://your-proxy/v1
export LITELLM_MODEL=gpt-4o-mini

Skip classification entirely with tokenpayback --no-classify (still shows cost & agent activity).

Run

tokenpayback                  # scan all agents + classify + open dashboard in browser
tokenpayback scan             # just scan & write data
tokenpayback serve            # serve existing data on local port

First run takes ~60 seconds (LLM classifies each session). Subsequent runs use cache.


Categories are personalized — not hardcoded

The first time you run tokenpayback, the LLM looks at a sample of your real sessions and induces categories that fit how YOU use AI. An engineer's taxonomy will look very different from a creator's, a founder's, or a data scientist's.

tokenpayback                    # first run auto-generates ~/.tokenpayback/taxonomy.yaml
tokenpayback taxonomy show      # see what it came up with
tokenpayback taxonomy edit      # rename categories, change baselines, add new ones
tokenpayback taxonomy regen     # re-discover from scratch

Example: an engineer might end up with something like:

categories:
  - id: ship-feature
    icon: 🚢
    label: Ship feature
    description: Completing a new product feature end-to-end with commits
    baseline_usd: 80
    per_pr_usd: 700
    per_line_usd: 0.30
  - id: cf-worker-debug
    icon: 🛠
    label: CF Worker debugging
    description: Diagnosing issues in Cloudflare Worker deploys
    baseline_usd: 60
  ...

A content creator might end up with tiktok-edit, research, voiceover-prep, etc. Everyone's dashboard speaks their own language.

No baseline is $0 by default. Asking a question and getting an answer IS value. The point is to make the assumptions visible, not to hide them behind a SaaS pricing model.


Privacy

  • ❌ No tracking, no analytics, no phone-home
  • ❌ No account, no email, no sign-up
  • ❌ Your session data NEVER leaves your machine
  • ✅ The only outbound calls: (1) your chosen LLM for classification, (2) Anthropic/OpenAI usage APIs only if you opt in, (3) GitHub API only if you opt in
  • ✅ Open source. Read every line.

The LLM classification step sends a one-paragraph summary of each session (first prompt, tool call counts, sample bash commands) — not full prompts or code. Skip it entirely with --no-classify.


How it works

┌─────────────────┐  ┌─────────────────┐  ┌─────────────────┐  ┌─────────────────┐
│ ~/.claude/      │  │ ~/.codex/       │  │ ~/.hermes/      │  │ ~/Library/.../  │
│ projects/       │  │ sessions/       │  │ *.db (SQLite)   │  │ Cursor/User/    │
│ *.jsonl         │  │ *.jsonl         │  │                 │  │ state.vscdb     │
└────────┬────────┘  └────────┬────────┘  └────────┬────────┘  └────────┬────────┘
         │                    │                    │                    │
         └────────────────────┴─────┬──────────────┴────────────────────┘
                                    │
                            tokenpayback/parsers
                                    │
                                    ▼
                       ┌─────────────────────────┐
                       │  normalized Session[]   │
                       │   agent, project,       │
                       │   tokens, tool_counts,  │
                       │   est_cost_usd...       │
                       └────────────┬────────────┘
                                    │
                                    ▼  LLM classifier
                       ┌─────────────────────────┐
                       │   + category, summary,  │
                       │     value_signal        │
                       └────────────┬────────────┘
                                    │
                       ┌────────────┴───────────┐
                       │                         │
                       ▼                         ▼
              activity ledger            engineering ROI
              (every category)           (GitHub PR/commits)
                       │                         │
                       └────────────┬────────────┘
                                    ▼
                       local dashboard (localhost)

What it's not

  • Not a SaaS. No cloud, no signup, nothing to sell you.
  • Not a tracker. It cares about your spend, not your activity in aggregate.
  • Not an attribution oracle. Value heuristics are estimates. We're transparent about it.
  • Not a replacement for evals. Use Braintrust / Langfuse / Inspect for output quality.

Roadmap

For v0.3:

  • Sankey diagram: from agent → category → outcome
  • Time-series of how your category mix shifts week-over-week
  • Per-tool cost breakdown (which Bash patterns cost you the most?)
  • Native Mac app via Tauri or pywebview (no more "open browser" feel)
  • LLM-graded value (replace flat baselines with case-by-case judgment)

PRs welcome. Open an issue first for anything non-trivial.


Contributing

Add support for a new agent:

git clone https://github.com/gongyibob-ctrl/tokenpayback.git
cd tokenpayback
python3 -m venv .venv && .venv/bin/pip install -e .
# Create tokenpayback/parsers/<your_agent>.py — subclass BaseParser
# Register in tokenpayback/parsers/__init__.py ALL_PARSERS
# Test: .venv/bin/tokenpayback scan

Each parser is ~50 lines. See parsers/claude_code.py as the reference.

Code style: small modules, no premature abstraction, transparent heuristics.


Why "tokenpayback"?

Because the question isn't "how many tokens did I burn?" — every tool answers that. The question is "did those tokens come back as something?" — and "something" doesn't have to be code. A clear answer, a written note, a fixed bug, a planned weekend — those count too.

Built by @gongyibob-ctrl. Made in a weekend, open sourced because it shouldn't have to be a startup.


License

MIT. Use it, fork it, sell improvements built on it.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tokenpayback-0.6.1.tar.gz (57.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tokenpayback-0.6.1-py3-none-any.whl (65.6 kB view details)

Uploaded Python 3

File details

Details for the file tokenpayback-0.6.1.tar.gz.

File metadata

  • Download URL: tokenpayback-0.6.1.tar.gz
  • Upload date:
  • Size: 57.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for tokenpayback-0.6.1.tar.gz
Algorithm Hash digest
SHA256 ff915b50c3a28fbf21b52c89aa7211e14657368c1af1e00be2378122e76db0c4
MD5 99b6d7067abab4247b0257737c5edbbb
BLAKE2b-256 74cb7164aca1c781573586af560914436dd12815a3019041d4330e6960608f78

See more details on using hashes here.

File details

Details for the file tokenpayback-0.6.1-py3-none-any.whl.

File metadata

  • Download URL: tokenpayback-0.6.1-py3-none-any.whl
  • Upload date:
  • Size: 65.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for tokenpayback-0.6.1-py3-none-any.whl
Algorithm Hash digest
SHA256 57ff20d4bf3f33415104885f671206ea2d949b032dd2a2181e540f4b19fe0bdd
MD5 06488f4a4bac072818e5572c008e7a3a
BLAKE2b-256 952faa42ae08ebb8eb9daee4bf5a63c83b2857d769278328be53430ede661780

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page