TokenJam — local-first OTel-native observability for Autonomous AI agents

These details have not been verified by PyPI

Project links

Project description

TokenJam

Token Efficiency For AI Agents

TokenJam reads your agent's telemetry and tells you when to downsize, when to trim prompts, what to cache, and what to script. The result is a lower AI bill. Runs entirely on your machine.

pipx install tokenjam

_{Don't have pipx? brew install pipx on macOS, apt install pipx on Debian/Ubuntu, or see docs/installation.md. pip install tokenjam also works in a clean venv.}

No cloud · No signup · No vendor lock-in

Four Analyzers. One Install.

TokenJam reads telemetry from every major agent runtime, framework, provider, and observability tool and surfaces savings across four areas.

🪶 Downsize

Flags sessions where a cheaper model in the same family is worth a look. Never claims quality equivalence — surfaces examples so you can spot-check.

tj optimize downsize

Details →

💾 Cache

Shows your current caching ratio per (provider, model) and suggests Anthropic prompt-cache breakpoints from stable prefixes in your real usage.

tj optimize cache

Details →

📜 Script

Finds clusters of deterministic (tool_name, arg_shape) sequences that match the shape of work a plain script could replace.

tj optimize script

Details →

✂️ Trim

Predicts which regions of your prompts the model gives little weight to. Surfaces what's safe to cut.

tj optimize trim

Details →

Run all four with tj optimize. Run several with tj optimize downsize cache trim.

30-second quickstart

For Claude Code users — zero code, auto-backfills your last 30 days:

pipx install 'tokenjam[mcp]'
tj onboard --claude-code
tj optimize          # cost-saving candidates from your actual usage

For any Python agent:

from tokenjam.sdk import watch
from tokenjam.sdk.integrations.anthropic import patch_anthropic

patch_anthropic()

@watch(agent_id="my-agent")
def run(task: str) -> str:
    ...

→ Python SDK · TypeScript SDK · Codex · OTel-compatible agents

Why local-first matters

Your spans contain prompts, completions, tool inputs, and customer data. Shipping that to a SaaS vendor for "observability" is a data-egress decision most teams aren't ready to make.

	TokenJam	LangSmith	Langfuse	Datadog LLM Obs
Signup required	❌	✅	✅	✅
Data leaves your machine	❌	✅	cloud only	✅
Cost-optimization analyzers (Downsize, Cache, Script, Trim)	✅	❌	❌	❌
Real-time sensitive-action alerts	✅	❌	❌	❌
Behavioral drift detection	✅	❌	❌	❌
OTel GenAI SemConv native	✅	partial	partial	partial
Works with any agent / framework	✅	LangChain-first	partial	❌
Free, MIT licensed	✅	freemium	freemium	paid

Web UI

tj serve runs a local dashboard at http://127.0.0.1:7391/ with status, traces, cost breakdown, alerts, budget, and drift.

Beyond optimization

TokenJam is also a full observability stack. The four analyzers ride on top.

Real-time cost tracking — every LLM call priced as it happens
Safety alerts — 13 alert types, 6 channels (ntfy, Discord, Telegram, webhook, file, stdout)
Behavioral drift detection — Z-score baselines, no LLM required
Schema validation — declare or infer JSON Schema for tool outputs
OTel-native — point any OTLP exporter at tj serve and you're done
MCP server — 14 tools letting Claude Code query its own telemetry mid-session

CLI

tj optimize            # all four cost-optimization analyzers
tj optimize downsize   # one analyzer
tj status              # current cost, tokens, active alerts
tj cost --since 7d     # spend by agent / model / day / tool
tj alerts              # everything that fired while you were away
tj drift               # behavioral drift Z-scores
tj backfill claude-code # ingest historical ~/.claude/projects/ sessions
tj serve               # start the web UI + REST API

Full CLI reference →

Documentation

Topic	Where
🪶 Downsize / Cache / Script / Trim deep-dives	docs/optimize/
Claude Code & Codex integration	docs/claude-code-integration.md
Python SDK reference	docs/python-sdk.md
TypeScript SDK reference	docs/typescript-sdk.md
Framework support (LangChain / CrewAI / etc.)	docs/framework-support.md
Alert channels & rule reference	docs/alerts.md
Backfill from Langfuse / Helicone / OTLP	docs/backfill/
Configuration	docs/configuration.md
Architecture deep-dive	docs/architecture.md
Installation extras (Trim, framework patches)	docs/installation.md
Export to Grafana / Datadog / NDJSON	docs/export.md
NemoClaw sandbox observer	docs/nemoclaw-integration.md

Roadmap

Shipped in 0.3.x: Downsize · Cache · Script · Trim · Claude Code + Codex onboarding · MCP server · Web UI · Backfill adapters (Langfuse, Helicone, OTLP) · Period comparison · Routing-config export · Read-only policy preview

Up next:

tj policy add | edit | apply — unified rule surface
tj replay — replay captured sessions against new model versions
TypeScript framework patches (LangChain JS, OpenAI Agents SDK)
Vercel AI SDK & Mastra integrations
Docker image
GitHub Actions for CI drift/cost checks

tokenjam.dev · PyPI · npm · Issues

MIT License · Built by Metabuilder Labs

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.4.2

Jun 22, 2026

0.4.1

Jun 20, 2026

0.4.0

Jun 19, 2026

0.3.5

Jun 16, 2026

This version

0.3.4

Jun 15, 2026

0.3.3

Jun 9, 2026

0.3.2

Jun 9, 2026

0.3.1

May 29, 2026

0.3.0

May 29, 2026

0.2.3

May 18, 2026

0.2.2

May 12, 2026

0.2.1

May 12, 2026

0.2.0

May 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tokenjam-0.3.4.tar.gz (1.1 MB view details)

Uploaded Jun 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tokenjam-0.3.4-py3-none-any.whl (254.5 kB view details)

Uploaded Jun 15, 2026 Python 3

File details

Details for the file tokenjam-0.3.4.tar.gz.

File metadata

Download URL: tokenjam-0.3.4.tar.gz
Upload date: Jun 15, 2026
Size: 1.1 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for tokenjam-0.3.4.tar.gz
Algorithm	Hash digest
SHA256	`5e7c2c81674609c97b9b01f37abacc0e23d651c607191c6ca605615d997bc4d4`
MD5	`c1143c66e5c71142dc89662874788ac6`
BLAKE2b-256	`2d429eff9081af77948e20f001810bcd11073e76f5b6f52397c5c04221c69369`

See more details on using hashes here.

Provenance

The following attestation bundles were made for tokenjam-0.3.4.tar.gz:

Publisher: publish-pypi.yml on Metabuilder-Labs/tokenjam

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: tokenjam-0.3.4.tar.gz
- Subject digest: 5e7c2c81674609c97b9b01f37abacc0e23d651c607191c6ca605615d997bc4d4
- Sigstore transparency entry: 1827103137
- Sigstore integration time: Jun 15, 2026
Source repository:
- Permalink: Metabuilder-Labs/tokenjam@78584e6f14969d9848092551e79bf8e0fd8cdba7
- Branch / Tag: refs/tags/v0.3.4
- Owner: https://github.com/Metabuilder-Labs
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@78584e6f14969d9848092551e79bf8e0fd8cdba7
- Trigger Event: release

File details

Details for the file tokenjam-0.3.4-py3-none-any.whl.

File metadata

Download URL: tokenjam-0.3.4-py3-none-any.whl
Upload date: Jun 15, 2026
Size: 254.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for tokenjam-0.3.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0af48fa5882ad99711ebebb20ea63b6e87ca16afd434550030b0c228e90a5d4b`
MD5	`4985672adca4978a774d018640e2694a`
BLAKE2b-256	`9ce3e1598456952193a06c87fc158d2718937371c4f844afefe5f67d073bc38c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for tokenjam-0.3.4-py3-none-any.whl:

Publisher: publish-pypi.yml on Metabuilder-Labs/tokenjam

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: tokenjam-0.3.4-py3-none-any.whl
- Subject digest: 0af48fa5882ad99711ebebb20ea63b6e87ca16afd434550030b0c228e90a5d4b
- Sigstore transparency entry: 1827103397
- Sigstore integration time: Jun 15, 2026
Source repository:
- Permalink: Metabuilder-Labs/tokenjam@78584e6f14969d9848092551e79bf8e0fd8cdba7
- Branch / Tag: refs/tags/v0.3.4
- Owner: https://github.com/Metabuilder-Labs
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@78584e6f14969d9848092551e79bf8e0fd8cdba7
- Trigger Event: release

tokenjam 0.3.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TokenJam

Token Efficiency For AI Agents

Four Analyzers. One Install.

🪶 Downsize

💾 Cache

📜 Script

✂️ Trim

30-second quickstart

Why local-first matters

Web UI

Beyond optimization

CLI

Documentation

Roadmap

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance