Token usage and cost tracking for LLMs

These details have not been verified by PyPI

Project description

Tokenator : Track, analyze, compare LLM token usage and costs

Have you ever wondered :

How many tokens does your AI agent consume?
How much does it cost to run a complex AI workflow with multiple LLM providers?
Which LLM is more cost effective for my use case?
How much money/tokens did you spend today on developing with LLMs?

Afraid not, tokenator is here! With tokenator's easy to use functions, you can start tracking LLM usage in a matter of minutes.

Get started with just 3 lines of code!

Tokenator supports the official SDKs from openai, anthropic and google-genai(the new one). LLM providers which use the openai SDK like perplexity, deepseek and xAI are also supported.

Installation

pip install tokenator

Usage

OpenAI

from openai import OpenAI
from tokenator import tokenator_openai

openai_client = OpenAI(api_key="your-api-key")

# Wrap it with Tokenator
client = tokenator_openai(openai_client)

# Use it exactly like the OpenAI client
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}]
)

Works with AsyncOpenAI and streaming=True as well! Note : When streaming, don't forget to add stream_options={"include_usage": True} to the create() call!

Cost Analysis

from tokenator import usage

# Get usage for different time periods
usage.last_hour()
usage.last_day()
usage.last_week()
usage.last_month()

# Custom date range
usage.between("2024-03-01", "2024-03-15")

# Get usage for different LLM providers
usage.last_day("openai")
usage.last_day("anthropic")
usage.last_day("google")

Example `usage` object

print(cost.last_hour().model_dump_json(indent=4))

{
    "total_cost": 0.0004,
    "total_tokens": 79,
    "prompt_tokens": 52,
    "completion_tokens": 27,
    "providers": [
        {
            "total_cost": 0.0004,
            "total_tokens": 79,
            "prompt_tokens": 52,
            "completion_tokens": 27,
            "provider": "openai",
            "models": [
                {
                    "total_cost": 0.0004,
                    "total_tokens": 79,
                    "prompt_tokens": 52,
                    "completion_tokens": 27,
                    "model": "gpt-4o-2024-08-06"
                }
            ]
        }
    ]
}

Cookbooks

Want more code, example use cases and ideas? Check out our amazing cookbooks!

Features

Drop-in replacement for OpenAI, Anthropic client
Automatic token usage tracking
Cost analysis for different time periods
SQLite storage with zero configuration
Thread-safe operations
Minimal memory footprint
Minimal latency footprint

Anthropic

from anthropic import Anthropic, AsyncAnthropic
from tokenator import tokenator_anthropic

anthropic_client = AsyncAnthropic(api_key="your-api-key")

# Wrap it with Tokenator
client = tokenator_anthropic(anthropic_client)

# Use it exactly like the Anthropic client
response = await client.messages.create(
    model="claude-3-5-haiku-20241022",
    messages=[{"role": "user", "content": "hello how are you"}],
    max_tokens=20,
)

print(response)

print(usage.last_execution().model_dump_json(indent=4))
"""
{
    "total_cost": 0.0001,
    "total_tokens": 23,
    "prompt_tokens": 10,
    "completion_tokens": 13,
    "providers": [
        {
            "total_cost": 0.0001,
            "total_tokens": 23,
            "prompt_tokens": 10,
            "completion_tokens": 13,
            "provider": "anthropic",
            "models": [
                {
                    "total_cost": 0.0004,
                    "total_tokens": 79,
                    "prompt_tokens": 52,
                    "completion_tokens": 27,
                    "model": "claude-3-5-haiku-20241022"
                }
            ]
        }
    ]
}
"""

Google (Gemini - through AI studio)

from google import genai
from tokenator import tokenator_gemini

gemini_client = genai.Client(api_key=os.getenv("GEMINI_API_KEY"))

# Wrap it with Tokenator
client = tokenator_gemini(gemini_client)

# Use it exactly like the google-genai client
response = models.generate_content(
    model="gemini-2.0-flash",
    contents="hello how are you",
)

print(response)

print(usage.last_execution().model_dump_json(indent=4))
"""
{
    "total_cost": 0.0001,
    "total_tokens": 23,
    "prompt_tokens": 10,
    "completion_tokens": 13,
    "providers": [
        {
            "total_cost": 0.0001,
            "total_tokens": 23,
            "prompt_tokens": 10,
            "completion_tokens": 13,
            "provider": "gemini",
            "models": [
                {
                    "total_cost": 0.0004,
                    "total_tokens": 79,
                    "prompt_tokens": 52,
                    "completion_tokens": 27,
                    "model": "gemini-2.0-flash"
                }
            ]
        }
    ]
}
"""

xAI

You can use xAI models through the openai SDK and track usage using provider parameter in tokenator.

from openai import OpenAI
from tokenator import tokenator_openai

xai_client = OpenAI(
            api_key=os.getenv("XAI_API_KEY"),
            base_url="https://api.x.ai/v1"
        )

# Wrap it with Tokenator
client = tokenator_openai(xai_client, db_path=temp_db, provider="xai")

# Use it exactly like the OpenAI client but with xAI models
response = client.chat.completions.create(
    model="grok-2-latest",
    messages=[{"role": "user", "content": "Hello!"}]
)

print(response)

print(usage.last_execution())

Other AI model providers through openai SDKs

Today, a variety of AI companies have made their APIs compatible to the openai SDK. You can track usage of any such AI models using tokenator's provider parameter.

For example, let's see how we can track usage of perplexity tokens.

from openai import OpenAI
from tokenator import tokenator_openai

perplexity_client = OpenAI(
            api_key=os.getenv("PERPLEXITY_API_KEY"),
            base_url="https://api.perplexity.ai"
        )

# Wrap it with Tokenator
client = tokenator_openai(perplexity_client, db_path=temp_db, provider="perplexity")

# Use it exactly like the OpenAI client but with perplexity models
response = client.chat.completions.create(
    model="sonar",
    messages=[{"role": "user", "content": "Hello!"}]
)

print(response)

print(usage.last_execution())

print(usage.provider("perplexity"))

Most importantly, none of your data is ever sent to any server.

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.2.1

Mar 18, 2025

0.2.0

Mar 3, 2025

0.1.16

Jan 26, 2025

0.1.15

Jan 20, 2025

0.1.14

Jan 16, 2025

0.1.13

Jan 13, 2025

0.1.12

Jan 13, 2025

0.1.11

Jan 6, 2025

0.1.10

Dec 28, 2024

0.1.9

Dec 22, 2024

0.1.8

Dec 22, 2024

0.1.7

Dec 22, 2024

0.1.6

Dec 22, 2024

0.1.5

Dec 21, 2024

0.1.4

Dec 21, 2024

0.1.3

Dec 21, 2024

0.1.2

Dec 21, 2024

0.1.1

Dec 21, 2024

0.1.0

Dec 21, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tokenator-0.2.1.tar.gz (18.8 kB view details)

Uploaded Mar 18, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tokenator-0.2.1-py3-none-any.whl (26.0 kB view details)

Uploaded Mar 18, 2025 Python 3

File details

Details for the file tokenator-0.2.1.tar.gz.

File metadata

Download URL: tokenator-0.2.1.tar.gz
Upload date: Mar 18, 2025
Size: 18.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.1 CPython/3.10.16 Linux/6.8.0-1021-azure

File hashes

Hashes for tokenator-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`2958586773aa54e69be21a48402ad32a9ae815e59d6e3ea2037bae1c431a96b2`
MD5	`737fa1c63a512c02377d67bcd6416aed`
BLAKE2b-256	`0b660dd7a3b352447ff55d947a262bc8826ecf56ac45d40ad2c2a67c7ef93dbb`

See more details on using hashes here.

File details

Details for the file tokenator-0.2.1-py3-none-any.whl.

File metadata

Download URL: tokenator-0.2.1-py3-none-any.whl
Upload date: Mar 18, 2025
Size: 26.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.1 CPython/3.10.16 Linux/6.8.0-1021-azure

File hashes

Hashes for tokenator-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a3b77f45f94393ff101f2a1afdbf40eacf073fad7f5dabdfe9925c6828305979`
MD5	`7abc474ccd7a7221c39924350b663ebd`
BLAKE2b-256	`da3a257f2e69c6f0feaffa1ae90cf0aa2ad08d64f679cfe9d730e02ad394a29d`

See more details on using hashes here.

tokenator 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Tokenator : Track, analyze, compare LLM token usage and costs

Installation

Usage

OpenAI

Cost Analysis

Example `usage` object

Cookbooks

Features

Anthropic

Google (Gemini - through AI studio)

xAI

Other AI model providers through openai SDKs

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

tokenator 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Tokenator : Track, analyze, compare LLM token usage and costs

Installation

Usage

OpenAI

Cost Analysis

Example usage object

Cookbooks

Features

Anthropic

Google (Gemini - through AI studio)

xAI

Other AI model providers through openai SDKs

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Example `usage` object