Per-agent LLM cost attribution for Python

These details have not been verified by PyPI

Project description

spaturzu (Python)

Per-agent LLM cost attribution + budget enforcement + cross-provider fallback for Python. Wraps your provider client (OpenAI, Anthropic, Bedrock, Gemini, Mistral) and emits a metering row to the spaturzu gateway on every call, with frame-based agent attribution and free-form tags.

# Distributed as a wheel from https://spaturzu-sdk.superchiu.org (not on PyPI).
# The newest version is listed there; this is the current one:
WHL=https://spaturzu-sdk.superchiu.org/sdks/python/spaturzu-0.1.4-py3-none-any.whl

pip install "spaturzu @ $WHL"                 # core
pip install "spaturzu[openai] @ $WHL"         # + OpenAI integration
pip install "spaturzu[anthropic] @ $WHL"      # + Anthropic
pip install "spaturzu[bedrock] @ $WHL"        # + boto3 for Bedrock Converse
pip install "spaturzu[gemini] @ $WHL"         # + google-genai
pip install "spaturzu[mistral] @ $WHL"        # + mistralai
pip install "spaturzu[all] @ $WHL"            # everything

Quickstart

from spaturzu import spaturzu
from openai import OpenAI

spaturzu = spaturzu(
    base_url="https://spaturzu-api.example.com",
    api_key="...",
    tags={"env": "prod", "region": "us-east-1"},
)

openai = spaturzu.wrap_openai(OpenAI())

with spaturzu.run("researcher"):
    r = openai.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": "Hello"}],
    )

# Short-lived processes — flush before exit:
spaturzu.flush()

Both sync (OpenAI, Anthropic, Mistral) and async (AsyncOpenAI, AsyncAnthropic, client.chat.complete_async, client.aio.models.*) shapes are supported on a single wrap. Python Bedrock is sync-only in v1 (boto3); aioboto3 support is a future addition.

Drop-in (one-line) instrumentation

Change only the import — construction and call sites stay the same:

- from openai import OpenAI
+ from spaturzu.openai import OpenAI
  client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
  client.chat.completions.create(model="gpt-4o-mini", messages=[...])

Spaturzu reads its own config from SPATURZU_API_KEY / SPATURZU_BASE_URL. Call spaturzu.configure(...) once at startup (before constructing any client) for process-wide tags or an on_error handler.

Swap from	…to	Exports
`from openai import OpenAI, AsyncOpenAI`	`from spaturzu.openai import …`	`OpenAI`, `AsyncOpenAI`
`from anthropic import Anthropic, AsyncAnthropic`	`from spaturzu.anthropic import …`	`Anthropic`, `AsyncAnthropic`
`from google.genai import Client`	`from spaturzu.google import Client`	`Client`
`from mistralai import Mistral`	`from spaturzu.mistral import Mistral`	`Mistral`
`boto3.client("bedrock-runtime", …)`	`from spaturzu.bedrock import BedrockRuntime`	`BedrockRuntime(...)`

Agent attribution without a `with` block

client.with_agent("writer").chat.completions.create(...)

.with_agent(name) tags that call (and nests under any enclosing with spaturzu.run("planner"): as a sub-agent). For multi-call workflows use the context manager; top-level run/flush are importable from spaturzu:

from spaturzu import run, flush
with run("workflow"):
    client.chat.completions.create(...)
flush()

Budget / fallback on the drop-in path

client = OpenAI(api_key=..., spaturzu={"budget": {"hard_cap": True}})

Supported providers

Wrap method	Client	Methods intercepted
`wrap_openai(client)`	`openai`	`chat.completions.create` (sync + async)
`wrap_anthropic(client)`	`anthropic`	`messages.create` (sync + async)
`wrap_bedrock(client)`	`boto3.client("bedrock-runtime")`	`converse`, `converse_stream` (sync only in v1)
`wrap_gemini(client)`	`google.genai.Client`	`models.` (sync) + `aio.models.` (async)
`wrap_mistral(client)`	`mistralai.Mistral`	`chat.complete`, `chat.stream`, `chat.complete_async`, `chat.stream_async`

Bedrock

import boto3
client = boto3.client("bedrock-runtime", region_name="us-east-1")
wrapped = spaturzu.wrap_bedrock(client)

with spaturzu.run("agent"):
    r = wrapped.converse(
        modelId="anthropic.claude-3-5-sonnet-20241022-v2:0",
        messages=[{"role": "user", "content": [{"text": "hello"}]}],
    )

Gemini

from google import genai
client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
wrapped = spaturzu.wrap_gemini(client)

with spaturzu.run("agent"):
    # sync
    r = wrapped.models.generate_content(
        model="gemini-2.5-pro",
        contents=[{"role": "user", "parts": [{"text": "hello"}]}],
    )

    # async — via client.aio.models
    r = await wrapped.aio.models.generate_content(
        model="gemini-2.5-pro",
        contents=[{"role": "user", "parts": [{"text": "hello"}]}],
    )

Mistral

from mistralai import Mistral
client = Mistral(api_key=os.environ["MISTRAL_API_KEY"])
wrapped = spaturzu.wrap_mistral(client)

with spaturzu.run("agent"):
    # sync
    r = wrapped.chat.complete(
        model="mistral-large-latest",
        messages=[{"role": "user", "content": "hello"}],
    )

    # async
    r = await wrapped.chat.complete_async(
        model="mistral-large-latest",
        messages=[{"role": "user", "content": "hello"}],
    )

Agent frames + tags

async with spaturzu.run("research"):
    await openai.chat.completions.create(...)         # agent_path=["research"]

    async with spaturzu.run("synthesize", tags={"phase": "draft"}):
        await anthropic.messages.create(...)          # path=["research","synthesize"]

Use with for sync code, async with for async. Frames propagate via contextvars.ContextVar — each asyncio.Task gets its own copy, so parallel tasks see independent frames.

Budget enforcement

openai = spaturzu.wrap_openai(OpenAI(), budget={"hard_cap": True})
# or budget={"hard_cap": True, "on_breach": "warn"}

Raises BudgetExceededError (importable from spaturzu) before the upstream provider is hit.

Cross-provider fallback

from anthropic import Anthropic
import boto3

openai = spaturzu.wrap_openai(
    OpenAI(),
    fallback=[
        {
            "provider": "anthropic",
            "client": Anthropic(),
            "model": "claude-3-5-haiku-20241022",
        },
        {
            "provider": "bedrock",
            "client": boto3.client("bedrock-runtime"),
            "model": "anthropic.claude-3-5-haiku-20241022-v1:0",
        },
    ],
)

All 20 directional pairs (5 providers × 4 other-providers) are supported. Limitations are identical to the Node SDK: non-streaming, text only, no tools, no response_format.

Note on response shape after fallback: On the happy path, the wrap returns the provider's native typed object (attribute access). When a fallback target serves the call, the response is a plain dict in the primary provider's shape — use subscript access (resp["choices"][0]["message"]["content"], etc.) in code paths that may run after a fallback.

API reference

`spaturzu(...)`

Parameter	Type	Default
`base_url`	`str`	`$SPATURZU_BASE_URL` ?? hosted gateway
`api_key`	`str`	`$SPATURZU_API_KEY`
`timeout_s`	`float`	`10.0`
`backoff_ms`	`list[int]`	`[1000, 2000, 4000, 8000, 16000]`
`max_concurrent`	`int`	`50`
`on_error`	`(exc, entry) → None`	silent
`tags`	`dict[str, str \| int \| float \| bool]`	—

`spaturzu.run(name, *, tags=None)` → context manager

Yields a RunFrame. Both sync (with) and async (async with) usage are supported on the same returned object.

`spaturzu.flush(timeout_s=None)` / `spaturzu.shutdown()`

Block until queued log POSTs settle. shutdown also stops BudgetGuard's SSE + polling threads.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.7

Jun 11, 2026

0.1.6

Jun 11, 2026

0.1.5

Jun 8, 2026

This version

0.1.4

Jun 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spaturzu-0.1.4.tar.gz (74.5 kB view details)

Uploaded Jun 8, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

spaturzu-0.1.4-py3-none-any.whl (79.6 kB view details)

Uploaded Jun 8, 2026 Python 3

File details

Details for the file spaturzu-0.1.4.tar.gz.

File metadata

Download URL: spaturzu-0.1.4.tar.gz
Upload date: Jun 8, 2026
Size: 74.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for spaturzu-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`08468ba3f559f3537161593f630457e9cfd38b2a751a0cc07c91f104a8a830f2`
MD5	`d4bd0cb63d7300e2c98677d4ab1cec9a`
BLAKE2b-256	`527f164ec36aaec6dfaa8cd10db8488fdba53e814866f21cff3a01d4b23157f3`

See more details on using hashes here.

File details

Details for the file spaturzu-0.1.4-py3-none-any.whl.

File metadata

Download URL: spaturzu-0.1.4-py3-none-any.whl
Upload date: Jun 8, 2026
Size: 79.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for spaturzu-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`95c3a5fde201cd11f74e3a10e0ae159a87e69a2caa215647adc18bbe91b2824c`
MD5	`f441404018bd03b114294be83c2a4a5a`
BLAKE2b-256	`83fc413ea56a6a8011c03ea6a4929beaaed861c2c54ac7acd5d7bfd433776a41`

See more details on using hashes here.

spaturzu 0.1.4

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

spaturzu (Python)

Quickstart

Drop-in (one-line) instrumentation

Agent attribution without a `with` block

Budget / fallback on the drop-in path

Supported providers

Bedrock

Gemini

Mistral

Agent frames + tags

Budget enforcement

Cross-provider fallback

API reference

`spaturzu(...)`

`spaturzu.run(name, *, tags=None)` → context manager

`spaturzu.flush(timeout_s=None)` / `spaturzu.shutdown()`

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

spaturzu 0.1.4

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

spaturzu (Python)

Quickstart

Drop-in (one-line) instrumentation

Agent attribution without a with block

Budget / fallback on the drop-in path

Supported providers

Bedrock

Gemini

Mistral

Agent frames + tags

Budget enforcement

Cross-provider fallback

API reference

spaturzu(...)

spaturzu.run(name, *, tags=None) → context manager

spaturzu.flush(timeout_s=None) / spaturzu.shutdown()

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Agent attribution without a `with` block

`spaturzu(...)`

`spaturzu.run(name, *, tags=None)` → context manager

`spaturzu.flush(timeout_s=None)` / `spaturzu.shutdown()`