Thin configuration helper for the Pennsieve LLM Governor — returns a pre-configured anthropic.Anthropic client.

These details have not been verified by PyPI

Project description

pennsieve-llm

Thin Python configuration helper for the Pennsieve LLM Governor.

This library returns a pre-configured anthropic.Anthropic client pointed at the Pennsieve LLM Governor with SigV4 auth wired up. Streaming, tool use, prompt caching, extended thinking — every Anthropic SDK feature works because you're using the real Anthropic SDK.

Installation

pip install pennsieve-llm

Requires Python 3.10+. anthropic, httpx, and boto3 are installed as transitive dependencies.

Quick Start

from pennsieve_llm import Governor, MODEL_SONNET_45

gov = Governor()  # auto-configures from $LLM_GOVERNOR_URL + AWS creds

resp = gov.client().messages.create(
    model=MODEL_SONNET_45,
    messages=[{"role": "user", "content": "Hello, world!"}],
    max_tokens=1024,
)
print(resp.content[0].text)

The object returned by gov.client() is anthropic.Anthropic. Everything in the Anthropic Python SDK docs applies.

Streaming

gov = Governor()

with gov.client().messages.stream(
    model=MODEL_SONNET_45,
    messages=[{"role": "user", "content": "Write a poem"}],
    max_tokens=1024,
) as stream:
    for text in stream.text_stream:
        print(text, end="", flush=True)

Tool use

resp = gov.client().messages.create(
    model=MODEL_SONNET_45,
    max_tokens=1024,
    tools=[{
        "name": "get_weather",
        "description": "Get current weather",
        "input_schema": {...},
    }],
    messages=[{"role": "user", "content": "What's the weather in SF?"}],
)

Files on EFS

The governor supports referencing files on EFS without base64-encoding them into the request — use the efs_document content block builder:

gov = Governor()
resp = gov.client().messages.create(
    model=MODEL_SONNET_45,
    messages=[{"role": "user", "content": [
        {"type": "text", "text": "Summarize this PDF:"},
        gov.efs_document("workdir/paper.pdf"),
    ]}],
    max_tokens=1024,
)

Files are read server-side by the governor with execution-scoped access controls.

Backend selection

Env var set	Behavior
`LLM_GOVERNOR_URL`	Returns `anthropic.Anthropic` configured for the Pennsieve governor (HTTPS + SigV4)
(none)	Returns `MockClient` for tests / offline use

For local development against api.anthropic.com, just use anthropic.Anthropic() directly with your API key. That's not the SDK's job to wrap.

Environment variables

Variable	Source	Purpose
`LLM_GOVERNOR_URL`	Platform-injected (compute node)	Governor Function URL
`EXECUTION_RUN_ID`	Platform-injected (workflow)	Cost attribution; attached as `x-execution-run-id` header on every request
`AWS_REGION`	Default or override	SigV4 signing region (default `us-east-1`)

Constructor arguments override env vars:

gov = Governor(
    url="https://abc.lambda-url.us-east-1.on.aws",
    execution_run_id="run-123",
    region="us-east-1",
)

Governor-specific operations

Governor.check_budget() and Governor.list_models() query Pennsieve-specific endpoints (GET /v1/budget, GET /v1/models) — these aren't part of the Anthropic API.

budget = gov.check_budget()
print(f"${budget['periodRemainingUsd']:.2f} remaining this {budget['budgetPeriod']}")

models = gov.list_models()
for m in models["models"]:
    print(m["modelId"], m["status"])

Testing

Without LLM_GOVERNOR_URL set, Governor() auto-selects MockClient. You can also inject one explicitly:

from pennsieve_llm import Governor, MockClient

mock = MockClient()
mock.set_response(text="42", input_tokens=5, output_tokens=1)

gov = Governor(client=mock)
resp = gov.client().messages.create(
    model="test", messages=[{"role": "user", "content": "what is 6*7?"}], max_tokens=10
)
assert resp.content[0].text == "42"

# Inspect what was sent
assert mock.calls[0]["model"] == "test"

The MockClient mimics the parts of anthropic.Anthropic's surface that typical caller code uses — enough for unit tests without making real network calls.

Available model constants

Constant	Bedrock inference profile ID
`MODEL_HAIKU_45`	`us.anthropic.claude-haiku-4-5-20251001-v1:0`
`MODEL_SONNET_4`	`us.anthropic.claude-sonnet-4-20250514-v1:0`
`MODEL_SONNET_45`	`us.anthropic.claude-sonnet-4-5-20250929-v1:0`
`MODEL_SONNET_46`	`us.anthropic.claude-sonnet-4-6`
`MODEL_OPUS_47`	`us.anthropic.claude-opus-4-7`

us.* keeps inference in US AWS regions — HIPAA-friendly default for Pennsieve customers.

Error handling

Errors from the chat path (messages.create) come back as Anthropic SDK exceptions — use the Anthropic SDK's exception hierarchy:

import anthropic
from pennsieve_llm import Governor

gov = Governor()
try:
    resp = gov.client().messages.create(...)
except anthropic.BadRequestError as e:
    # 400 — bad request shape, model not allowed, etc.
    print(e.body)
except anthropic.RateLimitError as e:
    # 429
    pass

For governor-specific endpoints (check_budget, list_models), the SDK raises GovernorError:

from pennsieve_llm import Governor, GovernorError

try:
    budget = gov.check_budget()
except GovernorError as e:
    print(f"{e.code}: {e.msg}")

Migration from v0.3.x

v0.4.0 dropped the parallel type system (InvokeRequest, InvokeResponse, Backend, LambdaBackend, AnthropicBackend) in favor of returning the real anthropic.Anthropic client. The convenience methods (gov.ask, gov.ask_with_system, gov.ask_about_file) are gone — call gov.client().messages.create(...) directly. Net effect: less SDK code, more Anthropic features available.

Before (v0.3.x):

text = gov.ask(MODEL_SONNET_46, "Hello")

After (v0.4.0+):

resp = gov.client().messages.create(
    model=MODEL_SONNET_46,
    messages=[{"role": "user", "content": "Hello"}],
    max_tokens=1024,
)
text = resp.content[0].text

Development

python -m venv .venv
source .venv/bin/activate
pip install -e ".[dev]"
pytest

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.5.0

May 16, 2026

0.4.4

May 15, 2026

0.4.3

May 15, 2026

0.4.2

May 15, 2026

0.4.1

May 15, 2026

This version

0.4.0

May 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pennsieve_llm-0.4.0.tar.gz (9.7 kB view details)

Uploaded May 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pennsieve_llm-0.4.0-py3-none-any.whl (10.5 kB view details)

Uploaded May 15, 2026 Python 3

File details

Details for the file pennsieve_llm-0.4.0.tar.gz.

File metadata

Download URL: pennsieve_llm-0.4.0.tar.gz
Upload date: May 15, 2026
Size: 9.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pennsieve_llm-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`be7a464ce7a673707e681f790661b20011d546a3bee6087eb9cdc65e8226b52c`
MD5	`fcd1ca92c5fa5330b3ad13043f140e7a`
BLAKE2b-256	`12fb3d8776c7f0d298e37d0eed84f8af2b98a6d3acc7a730d53af7a0a1e5ac20`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pennsieve_llm-0.4.0.tar.gz:

Publisher: publish-pypi.yml on Pennsieve/pennsieve-python-llm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pennsieve_llm-0.4.0.tar.gz
- Subject digest: be7a464ce7a673707e681f790661b20011d546a3bee6087eb9cdc65e8226b52c
- Sigstore transparency entry: 1548628813
- Sigstore integration time: May 15, 2026
Source repository:
- Permalink: Pennsieve/pennsieve-python-llm@fa16298723a24f0b5858f1ec3be3f878720dd158
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Pennsieve
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@fa16298723a24f0b5858f1ec3be3f878720dd158
- Trigger Event: push

File details

Details for the file pennsieve_llm-0.4.0-py3-none-any.whl.

File metadata

Download URL: pennsieve_llm-0.4.0-py3-none-any.whl
Upload date: May 15, 2026
Size: 10.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pennsieve_llm-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`173acc1b8c4dbaa8f46c1f530e87ecd1a3025d23478c1e6cfd7494ba223fc01c`
MD5	`523b3654cc6102e4727cc679fc0ea244`
BLAKE2b-256	`0186ae2ba2ab43679ace5ad5eb7b85f8cbed8d0e2ad38bd341378595dc7ee40f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pennsieve_llm-0.4.0-py3-none-any.whl:

Publisher: publish-pypi.yml on Pennsieve/pennsieve-python-llm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pennsieve_llm-0.4.0-py3-none-any.whl
- Subject digest: 173acc1b8c4dbaa8f46c1f530e87ecd1a3025d23478c1e6cfd7494ba223fc01c
- Sigstore transparency entry: 1548628850
- Sigstore integration time: May 15, 2026
Source repository:
- Permalink: Pennsieve/pennsieve-python-llm@fa16298723a24f0b5858f1ec3be3f878720dd158
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Pennsieve
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@fa16298723a24f0b5858f1ec3be3f878720dd158
- Trigger Event: push

pennsieve-llm 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

pennsieve-llm

Installation

Quick Start

Streaming

Tool use

Files on EFS

Backend selection

Environment variables

Governor-specific operations

Testing

Available model constants

Error handling

Migration from v0.3.x

Development

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance