Skip to main content

Per-agent LLM cost attribution for Python

Project description

spaturzu (Python)

Per-agent LLM cost attribution + budget enforcement + cross-provider fallback for Python. Wraps your provider client (OpenAI, Anthropic, Bedrock, Gemini, Mistral) and emits a metering row to the spaturzu gateway on every call, with frame-based agent attribution and free-form tags.

# Published on PyPI as `spaturzu` (Python 3.10+).
pip install spaturzu                  # core
pip install "spaturzu[openai]"        # + OpenAI integration
pip install "spaturzu[anthropic]"     # + Anthropic
pip install "spaturzu[bedrock]"       # + boto3 for Bedrock Converse
pip install "spaturzu[gemini]"        # + google-genai
pip install "spaturzu[mistral]"       # + mistralai
pip install "spaturzu[all]"           # everything

Quickstart

from spaturzu import spaturzu
from openai import OpenAI

spaturzu = spaturzu(
    base_url="https://spaturzu-api.example.com",
    api_key="...",
    tags={"env": "prod", "region": "us-east-1"},
)

openai = spaturzu.wrap_openai(OpenAI())

with spaturzu.run("researcher"):
    r = openai.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": "Hello"}],
    )

# Short-lived processes — flush before exit:
spaturzu.flush()

Both sync (OpenAI, Anthropic, Mistral) and async (AsyncOpenAI, AsyncAnthropic, client.chat.complete_async, client.aio.models.*) shapes are supported on a single wrap. Python Bedrock is sync-only in v1 (boto3); aioboto3 support is a future addition.

Drop-in (one-line) instrumentation

Change only the import — construction and call sites stay the same:

- from openai import OpenAI
+ from spaturzu.openai import OpenAI
  client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
  client.chat.completions.create(model="gpt-4o-mini", messages=[...])

Spaturzu reads its own config from SPATURZU_API_KEY / SPATURZU_BASE_URL. Call spaturzu.configure(...) once at startup (before constructing any client) for process-wide tags or an on_error handler.

Swap from …to Exports
from openai import OpenAI, AsyncOpenAI from spaturzu.openai import … OpenAI, AsyncOpenAI
from anthropic import Anthropic, AsyncAnthropic from spaturzu.anthropic import … Anthropic, AsyncAnthropic
from google.genai import Client from spaturzu.google import Client Client
from mistralai import Mistral from spaturzu.mistral import Mistral Mistral
boto3.client("bedrock-runtime", …) from spaturzu.bedrock import BedrockRuntime BedrockRuntime(...)

Agent attribution without a with block

client.with_agent("writer").chat.completions.create(...)

.with_agent(name) tags that call (and nests under any enclosing with spaturzu.run("planner"): as a sub-agent). For multi-call workflows use the context manager; top-level run/flush are importable from spaturzu:

from spaturzu import run, flush
with run("workflow"):
    client.chat.completions.create(...)
flush()

Budget / fallback on the drop-in path

client = OpenAI(api_key=..., spaturzu={"budget": {"hard_cap": True}})

Supported providers

Wrap method Client Methods intercepted
wrap_openai(client) openai chat.completions.create (sync + async)
wrap_anthropic(client) anthropic messages.create (sync + async)
wrap_bedrock(client) boto3.client("bedrock-runtime") converse, converse_stream (sync only in v1)
wrap_gemini(client) google.genai.Client models.* (sync) + aio.models.* (async)
wrap_mistral(client) mistralai.Mistral chat.complete, chat.stream, chat.complete_async, chat.stream_async

Bedrock

import boto3
client = boto3.client("bedrock-runtime", region_name="us-east-1")
wrapped = spaturzu.wrap_bedrock(client)

with spaturzu.run("agent"):
    r = wrapped.converse(
        modelId="anthropic.claude-3-5-sonnet-20241022-v2:0",
        messages=[{"role": "user", "content": [{"text": "hello"}]}],
    )

Gemini

from google import genai
client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
wrapped = spaturzu.wrap_gemini(client)

with spaturzu.run("agent"):
    # sync
    r = wrapped.models.generate_content(
        model="gemini-2.5-pro",
        contents=[{"role": "user", "parts": [{"text": "hello"}]}],
    )

    # async — via client.aio.models
    r = await wrapped.aio.models.generate_content(
        model="gemini-2.5-pro",
        contents=[{"role": "user", "parts": [{"text": "hello"}]}],
    )

Mistral

from mistralai import Mistral
client = Mistral(api_key=os.environ["MISTRAL_API_KEY"])
wrapped = spaturzu.wrap_mistral(client)

with spaturzu.run("agent"):
    # sync
    r = wrapped.chat.complete(
        model="mistral-large-latest",
        messages=[{"role": "user", "content": "hello"}],
    )

    # async
    r = await wrapped.chat.complete_async(
        model="mistral-large-latest",
        messages=[{"role": "user", "content": "hello"}],
    )

Agent frames + tags

async with spaturzu.run("research"):
    await openai.chat.completions.create(...)         # agent_path=["research"]

    async with spaturzu.run("synthesize", tags={"phase": "draft"}):
        await anthropic.messages.create(...)          # path=["research","synthesize"]

Use with for sync code, async with for async. Frames propagate via contextvars.ContextVar — each asyncio.Task gets its own copy, so parallel tasks see independent frames.

Budget enforcement

openai = spaturzu.wrap_openai(OpenAI(), budget={"hard_cap": True})
# or budget={"hard_cap": True, "on_breach": "warn"}

Raises BudgetExceededError (importable from spaturzu) before the upstream provider is hit.

Cross-provider fallback

from anthropic import Anthropic
import boto3

openai = spaturzu.wrap_openai(
    OpenAI(),
    fallback=[
        {
            "provider": "anthropic",
            "client": Anthropic(),
            "model": "claude-3-5-haiku-20241022",
        },
        {
            "provider": "bedrock",
            "client": boto3.client("bedrock-runtime"),
            "model": "anthropic.claude-3-5-haiku-20241022-v1:0",
        },
    ],
)

All 20 directional pairs (5 providers × 4 other-providers) are supported. Limitations are identical to the Node SDK: non-streaming, text only, no tools, no response_format.

Note on response shape after fallback: On the happy path, the wrap returns the provider's native typed object (attribute access). When a fallback target serves the call, the response is a plain dict in the primary provider's shape — use subscript access (resp["choices"][0]["message"]["content"], etc.) in code paths that may run after a fallback.

API reference

spaturzu(...)

Parameter Type Default
base_url str $SPATURZU_BASE_URL ?? hosted gateway
api_key str $SPATURZU_API_KEY
timeout_s float 10.0
backoff_ms list[int] [1000, 2000, 4000, 8000, 16000]
max_concurrent int 50
on_error (exc, entry) → None silent
tags dict[str, str | int | float | bool]

spaturzu.run(name, *, tags=None) → context manager

Yields a RunFrame. Both sync (with) and async (async with) usage are supported on the same returned object.

spaturzu.flush(timeout_s=None) / spaturzu.shutdown()

Block until queued log POSTs settle. shutdown also stops BudgetGuard's SSE + polling threads.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spaturzu-0.1.6.tar.gz (74.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

spaturzu-0.1.6-py3-none-any.whl (79.5 kB view details)

Uploaded Python 3

File details

Details for the file spaturzu-0.1.6.tar.gz.

File metadata

  • Download URL: spaturzu-0.1.6.tar.gz
  • Upload date:
  • Size: 74.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for spaturzu-0.1.6.tar.gz
Algorithm Hash digest
SHA256 f5f30bdc251223685ad4ab8be88c02177d391eaa61a7f046b42f35e57563ff20
MD5 aa76993ea5550a7daead2c5974a42ad3
BLAKE2b-256 1fe1a216066289b37525bf4d9837ddc158844bb2f26609f6b27f265887a01529

See more details on using hashes here.

File details

Details for the file spaturzu-0.1.6-py3-none-any.whl.

File metadata

  • Download URL: spaturzu-0.1.6-py3-none-any.whl
  • Upload date:
  • Size: 79.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for spaturzu-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 4c7f369075d5fb100d8eb9f124036eb6c371fd416bcc1b612e0535ca4157086d
MD5 532fd018b175c80cccff5078aef7046d
BLAKE2b-256 c9e3ab1c5313f3c078bd56eba27dd6670ee23512be624f851081a29ff76976fe

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page