Aden
LLM Observability & Cost Control SDK (Python)
Aden automatically tracks every LLM API call in your application—usage, latency, costs—and gives you real-time controls to prevent budget overruns. Works with OpenAI, Anthropic, and Google Gemini.
```python
from aden import instrument, MeterOptions, create_console_emitter
from openai import OpenAI

# One line to start tracking everything
instrument(MeterOptions(emit_metric=create_console_emitter()))

# Use your SDK normally - metrics collected automatically
client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}],
)
```
Table of Contents
- Why Aden?
- Installation
- Quick Start
- Sending Metrics to Your Backend
- Cost Control
- Multi-Provider Support
- What Metrics Are Collected?
- Metric Emitters
- Advanced Configuration
- API Reference
- Examples
- Troubleshooting
Why Aden?
Building with LLMs is expensive and unpredictable:
- No visibility: You don't know which features or users consume the most tokens
- Runaway costs: One bug or bad prompt can blow through your budget in minutes
- No control: Once a request is sent, you can't stop it
Aden solves these problems:
| Problem | Aden Solution |
|---|---|
| No visibility into LLM usage | Automatic metric collection for every API call |
| Unpredictable costs | Real-time budget tracking and enforcement |
| No per-user limits | Context-based controls (per user, per feature, per tenant) |
| Expensive models used unnecessarily | Automatic model degradation when approaching limits |
Installation
```bash
pip install aden
```
Install with specific provider support:
```bash
# Individual providers
pip install aden[openai]      # OpenAI/GPT models
pip install aden[anthropic]   # Anthropic/Claude models
pip install aden[gemini]      # Google Gemini models

# All providers
pip install aden[all]

# Framework support
pip install aden[pydantic-ai] # PydanticAI integration
pip install aden[livekit]     # LiveKit voice agents
```
Quick Start
Step 1: Add Instrumentation
Add this once at your application startup (before creating any LLM clients):
```python
from aden import instrument, MeterOptions, create_console_emitter

instrument(MeterOptions(
    emit_metric=create_console_emitter(pretty=True),
))
```
Step 2: Use Your SDK Normally
That's it! Every API call is now tracked:
```python
from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Explain quantum computing"}],
)

# Console output:
# + [a1b2c3d4] openai gpt-4o 1234ms
#   tokens: 12 in / 247 out
```
Step 3: Clean Up on Shutdown
```python
from aden import uninstrument

# In your shutdown handler
uninstrument()
```
Sending Metrics to Your Backend
For production, send metrics to your backend instead of the console:
Option A: Custom Handler
```python
import os

import httpx

API_KEY = os.environ["METRICS_API_KEY"]  # your backend's auth token (example name)

async def http_emitter(event):
    async with httpx.AsyncClient() as client:
        await client.post(
            "https://api.yourcompany.com/v1/metrics",
            json={
                "trace_id": event.trace_id,
                "model": event.model,
                "input_tokens": event.usage.input_tokens if event.usage else 0,
                "output_tokens": event.usage.output_tokens if event.usage else 0,
                "latency_ms": event.latency_ms,
                "error": event.error,
            },
            headers={"Authorization": f"Bearer {API_KEY}"},
        )

instrument(MeterOptions(emit_metric=http_emitter))
```
Option B: Aden Control Server
For real-time cost control (budgets, throttling, model degradation), connect to an Aden control server:
```python
import os

from aden import instrument, MeterOptions

instrument(MeterOptions(
    api_key=os.environ["ADEN_API_KEY"],
    server_url=os.environ.get("ADEN_API_URL"),
))
```
This enables all the Cost Control features described below.
Cost Control
Aden's cost control system lets you set budgets, throttle requests, and automatically downgrade to cheaper models—all in real-time.
Control Actions
The control server can apply these actions to requests:
| Action | What It Does | Use Case |
|---|---|---|
| allow | Request proceeds normally | Default when within limits |
| block | Request is rejected with an error | Budget exhausted |
| throttle | Request is delayed before proceeding | Rate limiting |
| degrade | Request uses a cheaper model | Approaching budget limit |
| alert | Request proceeds, notification sent | Warning threshold reached |
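To make these actions concrete, here is a standalone sketch of how a caller might act on a decision. The `Decision` shape below is an illustrative stand-in, not Aden's actual `ControlDecision` type:

```python
import time
from dataclasses import dataclass

@dataclass
class Decision:
    action: str               # "allow" | "block" | "throttle" | "degrade" | "alert"
    delay_ms: int = 0         # used by "throttle"
    fallback_model: str = ""  # used by "degrade"

def apply_decision(decision: Decision, model: str) -> str:
    """Return the model to use for the request, or raise if it is blocked."""
    if decision.action == "block":
        raise RuntimeError("Budget exhausted: request blocked")
    if decision.action == "throttle":
        time.sleep(decision.delay_ms / 1000)  # delay before proceeding
    if decision.action == "degrade":
        return decision.fallback_model        # swap to a cheaper model
    # "allow" and "alert" both proceed with the original model
    return model
```

When you use the control server, this mapping happens inside the instrumented SDK call; the sketch only shows the semantics of each action.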
Local Cost Control (No Server)
For local development or testing, see the cost_control_local.py example which demonstrates implementing a policy engine locally. This pattern is useful for:
- Understanding how cost control decisions work
- Testing policy configurations before deploying a server
- Simple use cases that don't need a full control server
```python
# See examples/cost_control_local.py for a complete example
# that implements budget limits, throttling, and model degradation
# without requiring a control server.
```
Control Server
For production cost control, connect to an Aden control server:
```python
from aden import instrument, MeterOptions, create_control_agent, ControlAgentOptions

agent = create_control_agent(ControlAgentOptions(
    server_url="https://your-control-server.com",
    api_key="your-api-key",
    on_alert=lambda alert: print(f"[{alert.level}] {alert.message}"),
))

instrument(MeterOptions(
    control_agent=agent,
))
```
Multi-Provider Support
Aden works with all major LLM providers. Instrumentation automatically detects available SDKs:
```python
from aden import instrument, MeterOptions, create_console_emitter

# Instrument all available providers at once
result = instrument(MeterOptions(
    emit_metric=create_console_emitter(pretty=True),
))

print(f"OpenAI: {result.openai}")
print(f"Anthropic: {result.anthropic}")
print(f"Gemini: {result.gemini}")
```
OpenAI
```python
from openai import OpenAI

client = OpenAI()

# Chat completions
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello"}],
)

# Streaming
stream = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Tell me a story"}],
    stream=True,
)
for chunk in stream:
    print(chunk.choices[0].delta.content or "", end="")
# Metrics emitted when stream completes
```
Anthropic
```python
from anthropic import Anthropic

client = Anthropic()
response = client.messages.create(
    model="claude-3-5-sonnet-latest",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello"}],
)
```
Google Gemini
```python
import os

import google.generativeai as genai

genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
model = genai.GenerativeModel("gemini-2.0-flash")
response = model.generate_content("Explain quantum computing")
```
What Metrics Are Collected?
Every LLM API call generates a `MetricEvent`:

```python
@dataclass
class MetricEvent:
    # Identity
    trace_id: str                # Unique ID for this request
    span_id: str                 # Span ID (OTel compatible)
    request_id: str | None       # Provider's request ID

    # Request details
    provider: str                # "openai", "anthropic", "gemini"
    model: str                   # e.g., "gpt-4o", "claude-3-5-sonnet"
    stream: bool
    timestamp: str               # ISO timestamp

    # Performance
    latency_ms: float
    error: str | None

    # Token usage
    usage: NormalizedUsage | None
    # - input_tokens: int
    # - output_tokens: int
    # - total_tokens: int
    # - reasoning_tokens: int    # For o1/o3 models
    # - cached_tokens: int       # Prompt cache hits

    # Tool usage
    tool_calls: list[ToolCallMetric] | None

    # Custom metadata
    metadata: dict | None
```
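Because the event carries normalized token counts, an emitter can estimate spend directly. A minimal sketch, using placeholder per-million-token prices (substitute your provider's current rates):

```python
# Illustrative (not current) USD prices per million tokens: (input, output).
PRICES_PER_M = {
    "gpt-4o": (2.50, 10.00),
    "gpt-4o-mini": (0.15, 0.60),
}

def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Rough cost estimate for one request; unknown models cost 0.0."""
    in_price, out_price = PRICES_PER_M.get(model, (0.0, 0.0))
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000
```

An emitter could call `estimate_cost(event.model, event.usage.input_tokens, event.usage.output_tokens)` and accumulate the result per user or per feature.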
Metric Emitters
Emitters determine where metrics go. You can use built-in emitters or create custom ones.
Built-in Emitters
```python
from aden import (
    create_console_emitter,    # Log to console (development)
    create_batch_emitter,      # Batch before sending
    create_multi_emitter,      # Send to multiple destinations
    create_filtered_emitter,   # Filter events
    create_transform_emitter,  # Transform events
    create_file_emitter,       # Write to JSON files
    create_memory_emitter,     # Store in memory (testing)
    create_noop_emitter,       # Discard all events
)
```
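As an illustration of what the batch emitter does, here is a standalone sketch of the pattern. It is simplified (no flush-interval timer, which the real factory also supports) and is not Aden's internal implementation:

```python
def make_batch_emitter(handler, batch_size: int = 10):
    """Buffer events and deliver them to `handler` in batches."""
    buffer: list = []

    def emit(event) -> None:
        buffer.append(event)
        if len(buffer) >= batch_size:
            handler(list(buffer))  # deliver a copy of the full batch
            buffer.clear()

    return emit
```

Batching cuts the number of outbound requests your metrics backend sees, at the cost of a short delay before events arrive.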
Console Emitter (Development)
```python
instrument(MeterOptions(
    emit_metric=create_console_emitter(pretty=True),
))

# Output:
# + [a1b2c3d4] openai gpt-4o 1234ms
#   tokens: 12 in / 247 out
```
Multiple Destinations
```python
instrument(MeterOptions(
    emit_metric=create_multi_emitter([
        create_console_emitter(pretty=True),  # Log locally
        my_backend_emitter,                   # Send to backend
    ]),
))
```
Filtering Events
```python
instrument(MeterOptions(
    emit_metric=create_filtered_emitter(
        my_emitter,
        lambda event: event.usage and event.usage.total_tokens > 100,  # Only large requests
    ),
))
```
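The transform emitter follows the same wrapper pattern. Below is a standalone sketch of the idea (not Aden's factory), here redacting metadata before events leave the process; the dict-shaped event is a simplification for illustration:

```python
def make_transform_emitter(downstream, transform):
    """Rewrite each event with `transform` before passing it downstream."""
    def emit(event) -> None:
        downstream(transform(event))
    return emit

def redact_metadata(event: dict) -> dict:
    # Drop any keys you don't want exported (here: all custom metadata).
    return {k: v for k, v in event.items() if k != "metadata"}
```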
File Logging
```python
from aden import create_file_emitter

instrument(MeterOptions(
    emit_metric=create_file_emitter(log_dir="./logs"),
))

# Creates: ./logs/metrics-2024-01-15.jsonl
```
Custom Emitter
```python
def my_emitter(event):
    # Store in your database
    db.llm_metrics.insert({
        "trace_id": event.trace_id,
        "model": event.model,
        "tokens": event.usage.total_tokens if event.usage else 0,
        "latency_ms": event.latency_ms,
    })

    # Check for anomalies
    if event.latency_ms > 30000:
        alert_ops(f"Slow LLM call: {event.latency_ms}ms")

instrument(MeterOptions(emit_metric=my_emitter))
```
Advanced Configuration
Full Options Reference
```python
instrument(MeterOptions(
    # === Metrics Destination ===
    emit_metric=my_emitter,                # Required unless api_key is set

    # === Control Server (enables cost control) ===
    api_key="aden_xxx",                    # Your Aden API key
    server_url="https://...",              # Control server URL (optional)

    # === Context Tracking ===
    get_context_id=lambda: get_user_id(),  # For per-user budgets
    request_metadata={"env": "prod"},      # Custom metadata

    # === Pre-request Hook ===
    before_request=my_budget_checker,

    # === Local Control Agent ===
    control_agent=my_control_agent,
))
```
before_request Hook
Implement custom rate limiting or request modification:
```python
from aden import BeforeRequestResult

def budget_check(params, context):
    # Check your own rate limits
    if not check_rate_limit(context.metadata.get("user_id")):
        return BeforeRequestResult.cancel("Rate limit exceeded")

    # Optionally delay the request
    if should_throttle():
        return BeforeRequestResult.throttle(delay_ms=1000)

    # Optionally switch to a cheaper model
    if should_degrade():
        return BeforeRequestResult.degrade(
            to_model="gpt-4o-mini",
            reason="High load",
        )

    return BeforeRequestResult.proceed()

instrument(MeterOptions(
    emit_metric=my_emitter,
    before_request=budget_check,
    request_metadata={"user_id": get_current_user_id()},
))
```
Legacy Per-Instance Wrapping
For backward compatibility, you can still wrap individual clients:
```python
from aden import make_metered_openai, MeterOptions
from openai import OpenAI

client = OpenAI()
metered = make_metered_openai(client, MeterOptions(
    emit_metric=my_emitter,
))
```
API Reference
Core Functions
| Function | Description |
|---|---|
| `instrument(options)` | Instrument all available LLM SDKs globally |
| `uninstrument()` | Remove instrumentation |
| `is_instrumented()` | Check if instrumented |
| `get_instrumented_sdks()` | Get which SDKs are instrumented |
Provider-Specific Functions
| Function | Description |
|---|---|
| `instrument_openai(options)` | Instrument OpenAI only |
| `instrument_anthropic(options)` | Instrument Anthropic only |
| `instrument_gemini(options)` | Instrument Gemini only |
| `uninstrument_openai()` | Remove OpenAI instrumentation |
| `uninstrument_anthropic()` | Remove Anthropic instrumentation |
| `uninstrument_gemini()` | Remove Gemini instrumentation |
Emitter Factories
| Function | Description |
|---|---|
| `create_console_emitter(pretty=False)` | Log to console |
| `create_batch_emitter(handler, batch_size, flush_interval)` | Batch events |
| `create_multi_emitter(emitters)` | Multiple destinations |
| `create_filtered_emitter(emitter, filter_fn)` | Filter events |
| `create_transform_emitter(emitter, transform_fn)` | Transform events |
| `create_file_emitter(log_dir)` | Write to JSON files |
| `create_memory_emitter()` | Store in memory |
| `create_noop_emitter()` | Discard events |
Control Agent
| Function | Description |
|---|---|
| `create_control_agent(options)` | Create local control agent |
| `create_control_agent_emitter(agent)` | Create emitter from agent |
Types
```python
from aden import (
    MetricEvent,
    MeterOptions,
    NormalizedUsage,
    ToolCallMetric,
    BeforeRequestResult,
    BeforeRequestContext,
    ControlPolicy,
    ControlDecision,
    AlertEvent,
    RequestCancelledError,
    BudgetExceededError,
)
```
Examples
Run the examples with `python examples/<name>.py`:

| Example | Description |
|---|---|
| `openai_basic.py` | Basic OpenAI instrumentation |
| `anthropic_basic.py` | Basic Anthropic instrumentation |
| `gemini_basic.py` | Basic Gemini instrumentation |
| `cost_control_local.py` | Cost control without a server |
| `pydantic_ai_example.py` | PydanticAI framework integration |
Troubleshooting
Metrics not appearing
- Check instrumentation order: Call `instrument()` before creating SDK clients

  ```python
  # Correct
  instrument(MeterOptions(...))
  client = OpenAI()

  # Wrong - client created before instrumentation
  client = OpenAI()
  instrument(MeterOptions(...))
  ```
- Check the SDK is installed: Aden only instruments SDKs that are importable

  ```bash
  pip install openai anthropic google-generativeai
  ```
- Verify the emitter is working: Test with the console emitter first

  ```python
  instrument(MeterOptions(
      emit_metric=create_console_emitter(pretty=True),
  ))
  ```
Budget not enforcing
- Check the control agent is connected: Budget enforcement requires a control agent connected to your server

  ```python
  agent = create_control_agent(ControlAgentOptions(
      server_url="https://your-server.com",
      api_key="your-api-key",
  ))
  instrument(MeterOptions(
      control_agent=agent,  # Required!
  ))
  ```
- Verify the server policy is configured: Check that your control server has a budget configured for your context
Streaming not tracked
- Consume the stream: Metrics are emitted when the stream completes

  ```python
  stream = client.chat.completions.create(..., stream=True)
  for chunk in stream:  # Must iterate through the stream
      print(chunk.choices[0].delta.content or "", end="")
  # Metrics emitted here
  ```
License
MIT