Anthropic provider for lmux
Project description
lmux-anthropic
Anthropic provider for lmux. Wraps the anthropic SDK.
Supports chat completions and streaming.
Part of the lmux ecosystem: standardized interface, cost tracking on every response, and registry-based routing across providers.
Auth
Set ANTHROPIC_API_KEY in your environment. The default AnthropicEnvAuthProvider reads it automatically.
from lmux_anthropic import AnthropicProvider
provider = AnthropicProvider()
Usage
Chat
from lmux import UserMessage
response = provider.chat("claude-sonnet-4-20250514", [UserMessage(content="Hello")])
print(response.content)
print(response.cost)
Streaming
for chunk in provider.chat_stream("claude-sonnet-4-20250514", [UserMessage(content="Hello")]):
if chunk.delta:
print(chunk.delta, end="")
Async
All methods have async variants: achat, achat_stream.
Registry
Use with the lmux registry to route across multiple providers:
from lmux import Registry
registry = Registry()
registry.register("anthropic", provider)
response = registry.chat("anthropic/claude-sonnet-4-20250514", messages)
Provider Params
from lmux_anthropic import AnthropicParams
response = provider.chat(
"claude-sonnet-4-20250514",
messages,
provider_params=AnthropicParams(inference_geo="us"),
)
| Parameter | Type | Description |
|---|---|---|
thinking |
dict |
Extended thinking configuration |
metadata |
dict[str, str] |
Request metadata |
top_k |
int |
Top-k sampling |
service_tier |
"auto" | "standard_only" |
Service tier selection |
inference_geo |
"us" |
Inference geography (affects cost) |
cache_control |
dict |
Top-level prompt-cache control — auto-places a breakpoint on the last cacheable block (e.g. {"type": "ephemeral"}) |
Prompt Caching
Two ways to opt in:
- Top-level (auto-placement): pass
cache_controlviaAnthropicParams(above) to cache the full rendered prefix. - Explicit breakpoints: place
CachePointContentparts inUserMessagecontent. A cache point marks the end of the stable prefix; it attachescache_controlto the preceding content block. A cache point with no preceding block in its message applies to whatever came before it: the prior message's last block, or the system text seen so far (system text after the marker stays outside the cached prefix). A marker with nothing cacheable before it is dropped, and when two markers resolve to the same block the first one wins.
from lmux import CachePointContent, TextContent, UserMessage
messages = [
UserMessage(content=[TextContent(text=big_stable_context), CachePointContent(ttl="1h")]),
UserMessage(content="What changed since yesterday?"),
]
Cache reads/writes are reported on response.usage (cache_read_tokens, cache_creation_tokens, and the per-TTL cache_creation_tokens_by_ttl breakdown) and priced into response.cost, including the 2x write rate for ttl="1h".
Claude on Vertex AI
Requires the vertex extra, which pulls in google-auth via anthropic[vertex]:
uv add "lmux-anthropic[vertex]"
AnthropicVertexProvider serves Claude through GCP Vertex AI with the same chat/streaming interface:
from lmux_anthropic import AnthropicVertexProvider
provider = AnthropicVertexProvider(project_id="my-project", region="global")
response = provider.chat("claude-sonnet-4-5@20250929", [UserMessage(content="Hello")])
print(response.provider) # "anthropic-vertex"
print(response.cost)
project_id falls back to the ANTHROPIC_VERTEX_PROJECT_ID environment variable, then to the project resolved by the auth provider (e.g. the gcloud default project under ADC, or the service account key file's project). region falls back to CLOUD_ML_REGION; a request without a region raises at first call. region accepts "global", a multi-region ("us", "eu"), or a specific region ("us-east5", ...). Model IDs use Vertex's @-versioned format (claude-sonnet-4-5@20250929) or plain names for newer models (claude-opus-4-6).
Vertex Auth
Application Default Credentials by default; a service account file is also supported:
from lmux_anthropic import AnthropicVertexServiceAccountAuthProvider
provider = AnthropicVertexProvider(
project_id="my-project",
region="global",
auth=AnthropicVertexServiceAccountAuthProvider(service_account_file="/path/to/key.json"),
)
Any AuthProvider that returns google.auth Credentials works — either bare, or as a (credentials, project_id) tuple so the provider can infer the project.
Vertex Params Caveat
AnthropicParams.service_tier and AnthropicParams.inference_geo are Anthropic-API-only: the Vertex provider drops them from outgoing requests, and the inference_geo US cost multiplier never applies.
Constructor Options
AnthropicProvider(
auth=..., # AuthProvider[str], default: AnthropicEnvAuthProvider()
base_url=..., # Optional base URL override
timeout=..., # Request timeout in seconds
max_retries=..., # Max retry attempts
default_max_tokens=..., # Default max tokens (default: 4096)
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file lmux_anthropic-0.6.0.tar.gz.
File metadata
- Download URL: lmux_anthropic-0.6.0.tar.gz
- Upload date:
- Size: 14.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
43097b0ab57955d4cc044f498d8bc45690e96e3e773ff5f6d8be2e0af3075c6a
|
|
| MD5 |
33ff0d226cbec478912de247cf28cb94
|
|
| BLAKE2b-256 |
151f747779cc353456e91339e51cfe2c3cd86b43bb9f4b3a3687ab410cdd949c
|
Provenance
The following attestation bundles were made for lmux_anthropic-0.6.0.tar.gz:
Publisher:
publish.yml on cluebbehusen/lmux
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
lmux_anthropic-0.6.0.tar.gz -
Subject digest:
43097b0ab57955d4cc044f498d8bc45690e96e3e773ff5f6d8be2e0af3075c6a - Sigstore transparency entry: 1777254531
- Sigstore integration time:
-
Permalink:
cluebbehusen/lmux@65753e7ac27dae8c9ab286e1aaa8115cb4098218 -
Branch / Tag:
refs/tags/lmux-anthropic-v0.6.0 - Owner: https://github.com/cluebbehusen
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@65753e7ac27dae8c9ab286e1aaa8115cb4098218 -
Trigger Event:
push
-
Statement type:
File details
Details for the file lmux_anthropic-0.6.0-py3-none-any.whl.
File metadata
- Download URL: lmux_anthropic-0.6.0-py3-none-any.whl
- Upload date:
- Size: 17.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
79eba6c752ba0146645383b273026d1d91ddf33cf8fdeaee00759a73c9c60842
|
|
| MD5 |
48c37b0775bc502e1a55f65c27f47ca3
|
|
| BLAKE2b-256 |
8720ccb5a4b77c3b494b4baee0a4f2348420085fb86907cc4171466ee147f00e
|
Provenance
The following attestation bundles were made for lmux_anthropic-0.6.0-py3-none-any.whl:
Publisher:
publish.yml on cluebbehusen/lmux
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
lmux_anthropic-0.6.0-py3-none-any.whl -
Subject digest:
79eba6c752ba0146645383b273026d1d91ddf33cf8fdeaee00759a73c9c60842 - Sigstore transparency entry: 1777255124
- Sigstore integration time:
-
Permalink:
cluebbehusen/lmux@65753e7ac27dae8c9ab286e1aaa8115cb4098218 -
Branch / Tag:
refs/tags/lmux-anthropic-v0.6.0 - Owner: https://github.com/cluebbehusen
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@65753e7ac27dae8c9ab286e1aaa8115cb4098218 -
Trigger Event:
push
-
Statement type: