Skip to main content

Per-agent LLM cost attribution for Python

Project description

spaturzu (Python)

Per-agent LLM cost attribution + budget enforcement + cross-provider fallback for Python. Wraps your provider client (OpenAI, Anthropic, Bedrock, Gemini, Mistral) and emits a metering row to the spaturzu gateway on every call, with frame-based agent attribution and free-form tags.

# Distributed as a wheel from https://spaturzu-sdk.superchiu.org (not on PyPI).
# The newest version is listed there; this is the current one:
WHL=https://spaturzu-sdk.superchiu.org/sdks/python/spaturzu-0.1.4-py3-none-any.whl

pip install "spaturzu @ $WHL"                 # core
pip install "spaturzu[openai] @ $WHL"         # + OpenAI integration
pip install "spaturzu[anthropic] @ $WHL"      # + Anthropic
pip install "spaturzu[bedrock] @ $WHL"        # + boto3 for Bedrock Converse
pip install "spaturzu[gemini] @ $WHL"         # + google-genai
pip install "spaturzu[mistral] @ $WHL"        # + mistralai
pip install "spaturzu[all] @ $WHL"            # everything

Quickstart

from spaturzu import spaturzu
from openai import OpenAI

spaturzu = spaturzu(
    base_url="https://spaturzu-api.example.com",
    api_key="...",
    tags={"env": "prod", "region": "us-east-1"},
)

openai = spaturzu.wrap_openai(OpenAI())

with spaturzu.run("researcher"):
    r = openai.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": "Hello"}],
    )

# Short-lived processes — flush before exit:
spaturzu.flush()

Both sync (OpenAI, Anthropic, Mistral) and async (AsyncOpenAI, AsyncAnthropic, client.chat.complete_async, client.aio.models.*) shapes are supported on a single wrap. Python Bedrock is sync-only in v1 (boto3); aioboto3 support is a future addition.

Drop-in (one-line) instrumentation

Change only the import — construction and call sites stay the same:

- from openai import OpenAI
+ from spaturzu.openai import OpenAI
  client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
  client.chat.completions.create(model="gpt-4o-mini", messages=[...])

Spaturzu reads its own config from SPATURZU_API_KEY / SPATURZU_BASE_URL. Call spaturzu.configure(...) once at startup (before constructing any client) for process-wide tags or an on_error handler.

Swap from …to Exports
from openai import OpenAI, AsyncOpenAI from spaturzu.openai import … OpenAI, AsyncOpenAI
from anthropic import Anthropic, AsyncAnthropic from spaturzu.anthropic import … Anthropic, AsyncAnthropic
from google.genai import Client from spaturzu.google import Client Client
from mistralai import Mistral from spaturzu.mistral import Mistral Mistral
boto3.client("bedrock-runtime", …) from spaturzu.bedrock import BedrockRuntime BedrockRuntime(...)

Agent attribution without a with block

client.with_agent("writer").chat.completions.create(...)

.with_agent(name) tags that call (and nests under any enclosing with spaturzu.run("planner"): as a sub-agent). For multi-call workflows use the context manager; top-level run/flush are importable from spaturzu:

from spaturzu import run, flush
with run("workflow"):
    client.chat.completions.create(...)
flush()

Budget / fallback on the drop-in path

client = OpenAI(api_key=..., spaturzu={"budget": {"hard_cap": True}})

Supported providers

Wrap method Client Methods intercepted
wrap_openai(client) openai chat.completions.create (sync + async)
wrap_anthropic(client) anthropic messages.create (sync + async)
wrap_bedrock(client) boto3.client("bedrock-runtime") converse, converse_stream (sync only in v1)
wrap_gemini(client) google.genai.Client models.* (sync) + aio.models.* (async)
wrap_mistral(client) mistralai.Mistral chat.complete, chat.stream, chat.complete_async, chat.stream_async

Bedrock

import boto3
client = boto3.client("bedrock-runtime", region_name="us-east-1")
wrapped = spaturzu.wrap_bedrock(client)

with spaturzu.run("agent"):
    r = wrapped.converse(
        modelId="anthropic.claude-3-5-sonnet-20241022-v2:0",
        messages=[{"role": "user", "content": [{"text": "hello"}]}],
    )

Gemini

from google import genai
client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
wrapped = spaturzu.wrap_gemini(client)

with spaturzu.run("agent"):
    # sync
    r = wrapped.models.generate_content(
        model="gemini-2.5-pro",
        contents=[{"role": "user", "parts": [{"text": "hello"}]}],
    )

    # async — via client.aio.models
    r = await wrapped.aio.models.generate_content(
        model="gemini-2.5-pro",
        contents=[{"role": "user", "parts": [{"text": "hello"}]}],
    )

Mistral

from mistralai import Mistral
client = Mistral(api_key=os.environ["MISTRAL_API_KEY"])
wrapped = spaturzu.wrap_mistral(client)

with spaturzu.run("agent"):
    # sync
    r = wrapped.chat.complete(
        model="mistral-large-latest",
        messages=[{"role": "user", "content": "hello"}],
    )

    # async
    r = await wrapped.chat.complete_async(
        model="mistral-large-latest",
        messages=[{"role": "user", "content": "hello"}],
    )

Agent frames + tags

async with spaturzu.run("research"):
    await openai.chat.completions.create(...)         # agent_path=["research"]

    async with spaturzu.run("synthesize", tags={"phase": "draft"}):
        await anthropic.messages.create(...)          # path=["research","synthesize"]

Use with for sync code, async with for async. Frames propagate via contextvars.ContextVar — each asyncio.Task gets its own copy, so parallel tasks see independent frames.

Budget enforcement

openai = spaturzu.wrap_openai(OpenAI(), budget={"hard_cap": True})
# or budget={"hard_cap": True, "on_breach": "warn"}

Raises BudgetExceededError (importable from spaturzu) before the upstream provider is hit.

Cross-provider fallback

from anthropic import Anthropic
import boto3

openai = spaturzu.wrap_openai(
    OpenAI(),
    fallback=[
        {
            "provider": "anthropic",
            "client": Anthropic(),
            "model": "claude-3-5-haiku-20241022",
        },
        {
            "provider": "bedrock",
            "client": boto3.client("bedrock-runtime"),
            "model": "anthropic.claude-3-5-haiku-20241022-v1:0",
        },
    ],
)

All 20 directional pairs (5 providers × 4 other-providers) are supported. Limitations are identical to the Node SDK: non-streaming, text only, no tools, no response_format.

Note on response shape after fallback: On the happy path, the wrap returns the provider's native typed object (attribute access). When a fallback target serves the call, the response is a plain dict in the primary provider's shape — use subscript access (resp["choices"][0]["message"]["content"], etc.) in code paths that may run after a fallback.

API reference

spaturzu(...)

Parameter Type Default
base_url str $SPATURZU_BASE_URL ?? hosted gateway
api_key str $SPATURZU_API_KEY
timeout_s float 10.0
backoff_ms list[int] [1000, 2000, 4000, 8000, 16000]
max_concurrent int 50
on_error (exc, entry) → None silent
tags dict[str, str | int | float | bool]

spaturzu.run(name, *, tags=None) → context manager

Yields a RunFrame. Both sync (with) and async (async with) usage are supported on the same returned object.

spaturzu.flush(timeout_s=None) / spaturzu.shutdown()

Block until queued log POSTs settle. shutdown also stops BudgetGuard's SSE + polling threads.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spaturzu-0.1.4.tar.gz (74.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

spaturzu-0.1.4-py3-none-any.whl (79.6 kB view details)

Uploaded Python 3

File details

Details for the file spaturzu-0.1.4.tar.gz.

File metadata

  • Download URL: spaturzu-0.1.4.tar.gz
  • Upload date:
  • Size: 74.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for spaturzu-0.1.4.tar.gz
Algorithm Hash digest
SHA256 08468ba3f559f3537161593f630457e9cfd38b2a751a0cc07c91f104a8a830f2
MD5 d4bd0cb63d7300e2c98677d4ab1cec9a
BLAKE2b-256 527f164ec36aaec6dfaa8cd10db8488fdba53e814866f21cff3a01d4b23157f3

See more details on using hashes here.

File details

Details for the file spaturzu-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: spaturzu-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 79.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for spaturzu-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 95c3a5fde201cd11f74e3a10e0ae159a87e69a2caa215647adc18bbe91b2824c
MD5 f441404018bd03b114294be83c2a4a5a
BLAKE2b-256 83fc413ea56a6a8011c03ea6a4929beaaed861c2c54ac7acd5d7bfd433776a41

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page