
tokenly


One line to track every AI API cost. Sentry for AI costs. No proxy, no account, free forever.

import tokenly
tokenly.init()

That's it. Now every OpenAI / Anthropic / Google call you make is logged — tokens, cost, latency, cache hits — to a local SQLite file.

$ tokenly stats

  tokenly · Today
  ────────────────────────────────────────────────────
  Spend                    $4.21
  Calls                       89
  Input               1,240,500 tokens
  Output                210,400 tokens
  Cache read             87,200 tokens
  Avg latency            842 ms

Why

  • Your monthly AI bill came back at $847 and you have no idea which feature caused it.
  • Your bill swings 2-3× every quarter for no reason you can explain.
  • Every existing tool wants you to change your base URL, run a proxy, or create an account.
  • tokenly is a tracker, not a gateway. One line, zero config, local first.

Install

pip install tokenly

Python 3.10+. Zero runtime dependencies.

Use it

import tokenly
tokenly.init()

import openai
client = openai.OpenAI()
client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "hi"}],
)

Then any time:

tokenly stats              # today
tokenly stats --week       # last 7 days
tokenly stats --month      # this month
tokenly stats --by=model   # group by model
tokenly tail               # live stream
tokenly export > calls.csv
tokenly doctor             # diagnose setup

Tag calls by user / feature

tokenly.configure(tags={"user": "alice", "feature": "chat"})

Then:

tokenly stats --by=tag.user
tokenly stats --by=tag.feature
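
Grouping by tag is ultimately a SQL group-by. A minimal sketch against a stand-in table (illustrative only: tokenly's real schema and column names may differ; tags are assumed here to be stored as a JSON blob per call):

```python
import sqlite3

# Stand-in table; tokenly's real schema and column names may differ.
con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE calls (cost_usd REAL, tags TEXT)")
con.executemany(
    "INSERT INTO calls VALUES (?, ?)",
    [
        (0.002, '{"user": "alice", "feature": "chat"}'),
        (0.001, '{"user": "bob", "feature": "chat"}'),
        (0.003, '{"user": "alice", "feature": "search"}'),
    ],
)

# Roughly what `tokenly stats --by=tag.user` aggregates:
for user, spend in con.execute(
    "SELECT json_extract(tags, '$.user') AS u, SUM(cost_usd) "
    "FROM calls GROUP BY u ORDER BY u"
):
    print(user, round(spend, 6))
```

Note that json_extract needs a SQLite build with the JSON1 functions, which the interpreters bundled with modern Python releases include.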

Budget alerts

export TOKENLY_DAILY_BUDGET=10   # raise BudgetExceeded when spend hits $10/day
export TOKENLY_DAILY_WARN=5      # warn at $5/day, keep going

Or in code:

tokenly.init(budget_usd_day=10, warn_usd_day=5)
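
The intended semantics can be sketched in plain Python. This is illustrative only: check_budget is a hypothetical stand-in, not tokenly's internals, though BudgetExceeded is the exception named above:

```python
import warnings

class BudgetExceeded(RuntimeError):
    """Stand-in for the exception tokenly raises at the hard limit."""

def check_budget(spend_usd: float, budget: float = 10.0, warn: float = 5.0) -> None:
    # Hard limit: stop the run once daily spend reaches the budget.
    if spend_usd >= budget:
        raise BudgetExceeded(f"daily spend ${spend_usd:.2f} hit budget ${budget:.2f}")
    # Soft limit: warn past the threshold, but keep going.
    if spend_usd >= warn:
        warnings.warn(f"daily spend ${spend_usd:.2f} passed warn level ${warn:.2f}")

check_budget(4.21)   # under both thresholds: does nothing
```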

Works with

Provider                         Tracks
OpenAI                           prompt / completion tokens, cached tokens, cost
Anthropic                        input / output tokens, cache read, cache write, cost
Google Gemini                    prompt / output tokens, cached content tokens, cost
DeepSeek, xAI, Mistral, Cohere   cost via pricing DB; patches coming
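
Cost is derived from a local pricing table keyed by model. A minimal sketch of the arithmetic, with made-up per-million-token prices (not tokenly's real pricing data):

```python
# Hypothetical prices in USD per million tokens; illustrative numbers only.
PRICING = {
    "gpt-4o-mini": {"input": 0.15, "output": 0.60},
}

def call_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """USD cost of one call; unknown models log at $0 cost."""
    prices = PRICING.get(model)
    if prices is None:
        return 0.0   # unknown model: log the call, but at zero cost
    return (input_tokens * prices["input"] + output_tokens * prices["output"]) / 1_000_000

print(round(call_cost("gpt-4o-mini", 1_000_000, 100_000), 4))   # 0.21
```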

Because tokenly patches the underlying SDKs, LangChain, LlamaIndex, and any other framework built on these SDKs work automatically — no integration needed. See examples/langchain_example.py and examples/llamaindex_example.py.

OpenTelemetry GenAI export (optional)

When enabled, tokenly emits an OpenTelemetry span per tracked call, following the GenAI semantic conventions (gen_ai.provider.name, gen_ai.request.model, gen_ai.usage.input_tokens, gen_ai.usage.output_tokens). That means tokenly plugs straight into Grafana, Datadog, Honeycomb, Jaeger, or any OTel-compatible backend with no extra integration.

pip install tokenly[otel]
import tokenly
tokenly.init(otel=True)   # or: export TOKENLY_OTEL=1

Span start_time is reconstructed from the measured latency so backends see a span that actually covers the model call, not a zero-width marker. The GenAI semconv is still experimental upstream — we track the latest and will bump as it stabilizes.
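
That reconstruction amounts to subtracting the measured latency from the completion timestamp; a sketch of the idea (not tokenly's actual code):

```python
import time

def reconstruct_span_times(latency_ms: float) -> tuple[int, int]:
    """Return (start_ns, end_ns) so the span covers the model call's duration."""
    end_ns = time.time_ns()                     # when the response was recorded
    start_ns = end_ns - int(latency_ms * 1e6)   # back-date by the measured latency
    return start_ns, end_ns

start_ns, end_ns = reconstruct_span_times(842.0)   # span is 842 ms wide
```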

Where is the data?

By default: ~/.tokenly/log.db — a single SQLite file. One table, ten columns. Move it, query it, back it up, delete it. It's yours.
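
Because it is plain SQLite, you can query it directly. A sketch against a stand-in table (tokenly's real table and column names may differ):

```python
import sqlite3

# Stand-in for ~/.tokenly/log.db with a few plausible columns.
con = sqlite3.connect(":memory:")
con.execute(
    "CREATE TABLE calls (model TEXT, input_tokens INT, output_tokens INT, cost_usd REAL)"
)
con.executemany(
    "INSERT INTO calls VALUES (?, ?, ?, ?)",
    [
        ("gpt-4o-mini", 1200, 300, 0.0004),
        ("gpt-4o-mini", 800, 150, 0.0003),
        ("claude-3-5-haiku", 500, 200, 0.0006),
    ],
)

# Roughly the shape of `tokenly stats --by=model`:
for model, calls, spend in con.execute(
    "SELECT model, COUNT(*) AS calls, SUM(cost_usd) AS spend "
    "FROM calls GROUP BY model ORDER BY spend DESC"
):
    print(model, calls, round(spend, 6))
```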

Pick any backend

SQLite is the default and needs nothing. For a team setup, point tokenly at your own MySQL or Postgres:

# One of these:
export TOKENLY_DB_URL="sqlite:///~/.tokenly/log.db"                 # default
export TOKENLY_DB_URL="mysql://user:pass@host:3306/tokenly"         # pip install tokenly[mysql]
export TOKENLY_DB_URL="postgresql://user:pass@host:5432/tokenly"    # pip install tokenly[postgres]

Or in code:

tokenly.init(db_url="postgresql://user:pass@db.internal/tokenly")

The schema is created automatically on first connect. The legacy TOKENLY_DB=/path/to.db env var still works (treated as a SQLite path).
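
Scheme-based backend selection can be sketched with the stdlib; the scheme-to-driver mapping below is illustrative, and the driver package names are assumptions, not tokenly's actual internals:

```python
from urllib.parse import urlparse

# Hypothetical mapping for illustration; driver names are assumptions.
DRIVERS = {"sqlite": "sqlite3", "mysql": "pymysql", "postgresql": "psycopg"}

def pick_driver(db_url: str) -> str:
    """Pick a DB driver from the URL scheme; bare paths fall back to SQLite."""
    scheme = urlparse(db_url).scheme or "sqlite"
    try:
        return DRIVERS[scheme]
    except KeyError:
        raise ValueError(f"unsupported backend: {scheme!r}") from None

print(pick_driver("postgresql://user:pass@db.internal/tokenly"))   # psycopg
print(pick_driver("/path/to.db"))                                  # sqlite3 (legacy path form)
```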

Local dashboard

tokenly dashboard

Boots a local, read-only web dashboard on http://127.0.0.1:8787 (auto-picks a free port if that's taken) and opens your browser. Spend cards, cost-by-model bars, cost-over-time line chart, and a live table of recent calls. Tabs for Today / Week / Month / All. Refreshes every 5 seconds.

Stdlib HTTP server, no JS framework, Chart.js via CDN. Stays zero-dep. Pass --no-open for headless, --host 0.0.0.0 to expose on your LAN (no auth — only do this on trusted networks).
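
The port fallback is the standard bind-to-port-0 trick: ask the OS for an ephemeral port when the preferred one is taken. A stdlib sketch (not tokenly's actual code):

```python
import socket

def pick_port(host: str = "127.0.0.1", preferred: int = 8787) -> int:
    """Return `preferred` if it is free, else an OS-assigned free port."""
    for port in (preferred, 0):   # port 0 asks the OS to choose
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
            try:
                s.bind((host, port))
                return s.getsockname()[1]   # the port actually bound
            except OSError:
                continue   # preferred port taken; fall through to port 0
    raise RuntimeError("no free port available")

port = pick_port()
```

Probing then closing leaves a small race (another process can grab the port before the server binds it), which is why real servers typically bind once and keep the socket open.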

vs other tools

                        tokenly   LiteLLM   Helicone   Langfuse
One-line setup             ✓         ✗          ✗          ✗
Requires URL change        ✗         ✓          ✓          ✗
Needs account              ✗         ✗          ✓          ✓
Local-first                ✓         ~          ✗          ~
Gateway / routing          ✗         ✓          ✗          ✗
Pure cost tracking         ✓         ~          ~          ~
Zero runtime deps          ✓         ✗          ✗          ✗

tokenly is tracking-only by design. If you want routing, fallbacks, or an auth proxy, use LiteLLM or Portkey. If you just want to know what you're spending, use tokenly.

Roadmap

  • OpenAI, Anthropic, Google auto-patch
  • CLI: stats, tail, export, reset, doctor
  • Tags and budget alerts
  • Streaming-response support (OpenAI, Anthropic)
  • Multi-DB backend: SQLite (default), MySQL, Postgres
  • Local web dashboard (tokenly dashboard)
  • OpenTelemetry GenAI export (pip install tokenly[otel])
  • Weekly auto-updated pricing DB
  • Node / TypeScript SDK (same storage)

License

MIT © 2026 Deependra Vishwakarma.

Pricing numbers are best-effort; verify with the provider before basing decisions on them. Unknown models log with $0 cost; please PR them in src/tokenly/pricing.json.

Download files

Download the file for your platform.

Source Distribution

tokenly-0.2.0.tar.gz (35.4 kB)


Built Distribution


tokenly-0.2.0-py3-none-any.whl (30.6 kB)


File details

Details for the file tokenly-0.2.0.tar.gz.

File metadata

  • Download URL: tokenly-0.2.0.tar.gz
  • Size: 35.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for tokenly-0.2.0.tar.gz
Algorithm Hash digest
SHA256 f6e410b62c4e30fa4c1d45a2613f5d15a5df94f83d35cd6df38304614fcd33e7
MD5 7acbe8f64f2d634d49b282c0da834eed
BLAKE2b-256 0e040f212447ae098ea6733d227d0941bfe831834826c28c8012a57f7f999b4c


Provenance

The following attestation bundles were made for tokenly-0.2.0.tar.gz:

Publisher: release.yml on deependra04/tokenly

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file tokenly-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: tokenly-0.2.0-py3-none-any.whl
  • Size: 30.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for tokenly-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 35217393ea973f14579335e824693f1c556591c8a1e287d3f2dce1905ef3c0c3
MD5 4400fd9d6a27abc5a581084d49eeb268
BLAKE2b-256 5b2dd2b7c7f119179b95e385e53c862f2080765bbffca7e95bd67e06726fac5a


Provenance

The following attestation bundles were made for tokenly-0.2.0-py3-none-any.whl:

Publisher: release.yml on deependra04/tokenly

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
