Per-agent LLM cost attribution for Python
Project description
spaturzu (Python)
Per-agent LLM cost attribution + budget enforcement + cross-provider fallback for Python. Wraps your provider client (OpenAI, Anthropic, Bedrock, Gemini, Mistral) and emits a metering row to the spaturzu gateway on every call, with frame-based agent attribution and free-form tags.
Part of spaturzu — full docs at https://spaturzu.superchiu.org/docs.
# Published on PyPI as `spaturzu` (Python 3.10+).
pip install spaturzu # core
pip install "spaturzu[openai]" # + OpenAI integration
pip install "spaturzu[anthropic]" # + Anthropic
pip install "spaturzu[bedrock]" # + boto3 for Bedrock Converse
pip install "spaturzu[gemini]" # + google-genai
pip install "spaturzu[mistral]" # + mistralai
pip install "spaturzu[all]" # everything
Quickstart
from spaturzu import spaturzu
from openai import OpenAI
spaturzu = spaturzu(
base_url="https://spaturzu-api.example.com",
api_key="...",
tags={"env": "prod", "region": "us-east-1"},
)
openai = spaturzu.wrap_openai(OpenAI())
with spaturzu.run("researcher"):
r = openai.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Hello"}],
)
# Short-lived processes — flush before exit:
spaturzu.flush()
Both sync (OpenAI, Anthropic, Mistral) and async (AsyncOpenAI, AsyncAnthropic, client.chat.complete_async, client.aio.models.*) shapes are supported on a single wrap. Python Bedrock is sync-only in v1 (boto3); aioboto3 support is a future addition.
Drop-in (one-line) instrumentation
Change only the import — construction and call sites stay the same:
- from openai import OpenAI
+ from spaturzu.openai import OpenAI
client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
client.chat.completions.create(model="gpt-4o-mini", messages=[...])
Spaturzu reads its own config from SPATURZU_API_KEY / SPATURZU_BASE_URL.
Call spaturzu.configure(...) once at startup (before constructing any
client) for process-wide tags or an on_error handler.
| Swap from | …to | Exports |
|---|---|---|
from openai import OpenAI, AsyncOpenAI |
from spaturzu.openai import … |
OpenAI, AsyncOpenAI |
from anthropic import Anthropic, AsyncAnthropic |
from spaturzu.anthropic import … |
Anthropic, AsyncAnthropic |
from google.genai import Client |
from spaturzu.google import Client |
Client |
from mistralai import Mistral |
from spaturzu.mistral import Mistral |
Mistral |
boto3.client("bedrock-runtime", …) |
from spaturzu.bedrock import BedrockRuntime |
BedrockRuntime(...) |
Agent attribution without a with block
client.with_agent("writer").chat.completions.create(...)
.with_agent(name) tags that call (and nests under any enclosing
with spaturzu.run("planner"): as a sub-agent). For multi-call workflows use
the context manager; top-level run/flush are importable from spaturzu:
from spaturzu import run, flush
with run("workflow"):
client.chat.completions.create(...)
flush()
Budget / fallback on the drop-in path
client = OpenAI(api_key=..., spaturzu={"budget": {"hard_cap": True}})
Supported providers
| Wrap method | Client | Methods intercepted |
|---|---|---|
wrap_openai(client) |
openai |
chat.completions.create (sync + async) |
wrap_anthropic(client) |
anthropic |
messages.create (sync + async) |
wrap_bedrock(client) |
boto3.client("bedrock-runtime") |
converse, converse_stream (sync only in v1) |
wrap_gemini(client) |
google.genai.Client |
models.* (sync) + aio.models.* (async) |
wrap_mistral(client) |
mistralai.Mistral |
chat.complete, chat.stream, chat.complete_async, chat.stream_async |
Bedrock
import boto3
client = boto3.client("bedrock-runtime", region_name="us-east-1")
wrapped = spaturzu.wrap_bedrock(client)
with spaturzu.run("agent"):
r = wrapped.converse(
modelId="anthropic.claude-3-5-sonnet-20241022-v2:0",
messages=[{"role": "user", "content": [{"text": "hello"}]}],
)
Gemini
from google import genai
client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
wrapped = spaturzu.wrap_gemini(client)
with spaturzu.run("agent"):
# sync
r = wrapped.models.generate_content(
model="gemini-2.5-pro",
contents=[{"role": "user", "parts": [{"text": "hello"}]}],
)
# async — via client.aio.models
r = await wrapped.aio.models.generate_content(
model="gemini-2.5-pro",
contents=[{"role": "user", "parts": [{"text": "hello"}]}],
)
Mistral
from mistralai import Mistral
client = Mistral(api_key=os.environ["MISTRAL_API_KEY"])
wrapped = spaturzu.wrap_mistral(client)
with spaturzu.run("agent"):
# sync
r = wrapped.chat.complete(
model="mistral-large-latest",
messages=[{"role": "user", "content": "hello"}],
)
# async
r = await wrapped.chat.complete_async(
model="mistral-large-latest",
messages=[{"role": "user", "content": "hello"}],
)
Agent frames + tags
async with spaturzu.run("research"):
await openai.chat.completions.create(...) # agent_path=["research"]
async with spaturzu.run("synthesize", tags={"phase": "draft"}):
await anthropic.messages.create(...) # path=["research","synthesize"]
Use with for sync code, async with for async. Frames propagate via contextvars.ContextVar — each asyncio.Task gets its own copy, so parallel tasks see independent frames.
Budget enforcement
openai = spaturzu.wrap_openai(OpenAI(), budget={"hard_cap": True})
# or budget={"hard_cap": True, "on_breach": "warn"}
Raises BudgetExceededError (importable from spaturzu) before the upstream provider is hit.
Cross-provider fallback
from anthropic import Anthropic
import boto3
openai = spaturzu.wrap_openai(
OpenAI(),
fallback=[
{
"provider": "anthropic",
"client": Anthropic(),
"model": "claude-3-5-haiku-20241022",
},
{
"provider": "bedrock",
"client": boto3.client("bedrock-runtime"),
"model": "anthropic.claude-3-5-haiku-20241022-v1:0",
},
],
)
All 20 directional pairs (5 providers × 4 other-providers) are supported. Limitations are identical to the Node SDK: non-streaming, text only, no tools, no response_format.
Note on response shape after fallback: On the happy path, the wrap returns the provider's native typed object (attribute access). When a fallback target serves the call, the response is a plain dict in the primary provider's shape — use subscript access (resp["choices"][0]["message"]["content"], etc.) in code paths that may run after a fallback.
API reference
spaturzu(...)
| Parameter | Type | Default |
|---|---|---|
base_url |
str |
$SPATURZU_BASE_URL ?? hosted gateway |
api_key |
str |
$SPATURZU_API_KEY |
timeout_s |
float |
10.0 |
backoff_ms |
list[int] |
[1000, 2000, 4000, 8000, 16000] |
max_concurrent |
int |
50 |
on_error |
(exc, entry) → None |
silent |
tags |
dict[str, str | int | float | bool] |
— |
spaturzu.run(name, *, tags=None) → context manager
Yields a RunFrame. Both sync (with) and async (async with) usage are supported on the same returned object.
spaturzu.flush(timeout_s=None) / spaturzu.shutdown()
Block until queued log POSTs settle. shutdown also stops BudgetGuard's SSE + polling threads.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file spaturzu-0.1.7.tar.gz.
File metadata
- Download URL: spaturzu-0.1.7.tar.gz
- Upload date:
- Size: 74.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1dd6af77acdb155a508e778cc185cb3303ecfb23fd1fa48263d117302a29a3cc
|
|
| MD5 |
e70b04c5dd2a0c8023e1d5850fc60865
|
|
| BLAKE2b-256 |
f92790e98a13b71f7591aa5ce72f48495b5688a2270a1ffbb616d8a597f32b65
|
File details
Details for the file spaturzu-0.1.7-py3-none-any.whl.
File metadata
- Download URL: spaturzu-0.1.7-py3-none-any.whl
- Upload date:
- Size: 79.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b435f0fd04b32e0595343167a4c3e1f5aae456f27521ec655823153e144167d8
|
|
| MD5 |
5d07d0aa1442c3e132f21ab136c7baab
|
|
| BLAKE2b-256 |
66ae602aa4b0c930d0171ff8777deea63ce854498512501828f654711ed7394f
|