Lightweight drop-in wrapper for the OpenAI Python client that logs telemetry and detects PII risks.

Project description

AgentWatch: Proactive LLM Governance Platform

AgentWatch is an ultra-low-latency API proxy and SDK designed to solve the "runaway agent" problem for enterprise engineering teams. It intercepts and manages LLM API requests at the edge and enforces budget constraints on them, acting as a proactive governance layer between your application and upstream providers like OpenAI and Anthropic.

The Problem It Solves

As engineering teams adopt autonomous LLM agents (e.g., coding assistants, research bots, recursive planners), they face a critical financial vulnerability: the runaway loop.

If an agent gets stuck in a recursive error-correction loop, it can execute hundreds of API calls per minute. Because each iteration typically appends the previous output to the context window, the prompt grows with every call, so the cumulative tokens billed grow roughly quadratically with the number of iterations. A single stuck agent can therefore burn thousands of dollars in minutes, a scenario that passive monitoring tools report only after the budget is already gone.
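
To make the growth concrete, here is a small worked example. The figures (a 1,000-token starting prompt and 500 tokens appended per iteration) are illustrative assumptions, not measurements from AgentWatch.

```python
def prompt_tokens(iteration: int, base: int = 1_000, appended: int = 500) -> int:
    """Prompt size at a 0-indexed iteration: grows linearly as output is appended."""
    return base + appended * iteration


def cumulative_tokens(iterations: int, base: int = 1_000, appended: int = 500) -> int:
    """Total prompt tokens sent across the first `iterations` calls."""
    return sum(prompt_tokens(i, base, appended) for i in range(iterations))


# Columns: iterations, size of the last prompt, total prompt tokens so far.
for n in (10, 100, 200):
    print(n, prompt_tokens(n - 1), cumulative_tokens(n))
# 10 5500 32500
# 100 50500 2575000
# 200 100500 10150000
```

Doubling the loop from 100 to 200 iterations roughly doubles the size of a single prompt but roughly quadruples the total tokens sent, which is why the cumulative bill escalates so quickly.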

AgentWatch was built from the ground up to prevent this.

Core Features

1. Session-Aware Identity Tracking

Instead of treating API requests as isolated events, AgentWatch tracks iterative agent loops as Sessions.

Every request is tagged with a session_id and an iteration_index.
The cumulative token count for a session is securely computed and maintained server-side in the AgentWatch Global Distributed State. This ensures that even if a local agent process crashes, restarts, or runs in parallel, the session's financial state cannot be bypassed or reset.
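
As a rough illustration of session-scoped accounting, the sketch below keeps a running total per session_id in memory. `SessionStore` and `SessionState` are hypothetical names, and a single-process dictionary is only a stand-in for the server-side distributed state described above.

```python
from dataclasses import dataclass
from threading import Lock


@dataclass
class SessionState:
    session_id: str
    iteration_index: int = 0
    cumulative_tokens: int = 0


class SessionStore:
    """In-memory stand-in for server-side session state (hypothetical)."""

    def __init__(self) -> None:
        self._sessions = {}
        self._lock = Lock()

    def record(self, session_id: str, tokens: int) -> SessionState:
        """Add one iteration's tokens to the session and return a snapshot."""
        with self._lock:
            state = self._sessions.setdefault(session_id, SessionState(session_id))
            state.iteration_index += 1
            state.cumulative_tokens += tokens
            # Return a copy so callers cannot mutate the stored totals.
            return SessionState(
                state.session_id, state.iteration_index, state.cumulative_tokens
            )


store = SessionStore()
store.record("ci-run-123", 1200)
snapshot = store.record("ci-run-123", 1800)  # e.g. after the agent process restarted
print(snapshot.iteration_index, snapshot.cumulative_tokens)  # 2 3000
```

Because the total is keyed by session_id rather than held in the agent process, a restarted or parallel worker that reuses the same session_id continues from the existing total.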

2. Synchronous Pre-Call Budget Enforcement

AgentWatch acts as a strict financial gatekeeper for agent sessions.

Developers define a budget ceiling (e.g., $2.00) per session via the AgentWatch Python SDK.
Before any upstream LLM call is made, the SDK performs a sub-millisecond synchronous pre-flight check to the AgentWatch Edge Engine.
If the session's cumulative token cost exceeds the limit, the SDK instantly blocks the execution and raises an AgentBudgetExceeded exception.
Fail-Open Resilience: By default, if the AgentWatch infrastructure is unreachable, the budget check silently fails open: the upstream call proceeds without budget enforcement. This ensures that an AgentWatch outage never causes a hard outage for your production traffic.
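
The control flow described above can be sketched as follows. This is not the SDK's implementation: `preflight_check` and `fetch_spend_usd` are hypothetical names, `AgentBudgetExceeded` is redefined locally as a stand-in, and treating `ConnectionError` and `TimeoutError` as "downtime" is an assumption.

```python
from typing import Callable


class AgentBudgetExceeded(Exception):
    """Local stand-in for the SDK exception named above."""


def preflight_check(
    session_id: str,
    budget_usd: float,
    fetch_spend_usd: Callable[[str], float],
    fail_open: bool = True,
) -> bool:
    """Return True if the upstream call may proceed; raise if over budget."""
    try:
        spent = fetch_spend_usd(session_id)
    except (ConnectionError, TimeoutError):
        if fail_open:
            return True  # budget backend unreachable: allow the call
        raise
    if spent > budget_usd:
        raise AgentBudgetExceeded(
            f"session {session_id!r} spent ${spent:.2f} of ${budget_usd:.2f}"
        )
    return True


def spend_over(_session_id: str) -> float:
    return 2.37  # hypothetical recorded spend above the $2.00 ceiling


def backend_down(_session_id: str) -> float:
    raise ConnectionError("budget backend unreachable")


allowed_during_outage = preflight_check("ci-run-123", 2.00, backend_down)

try:
    preflight_check("ci-run-123", 2.00, spend_over)
    blocked_message = ""
except AgentBudgetExceeded as exc:
    blocked_message = str(exc)

print(allowed_during_outage)  # True
print(blocked_message)        # session 'ci-run-123' spent $2.37 of $2.00
```

Note the trade-off this sketch makes visible: with fail_open=True an outage of the budget backend also suspends enforcement, so teams that prefer a hard stop would pass fail_open=False in a design like this one.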

3. Inline Anomaly Detection

AgentWatch heuristically detects runaway behavior before the budget is even exhausted.

The AgentWatch Edge Engine maintains a rolling window of the last 5 iterations for every active session.
It calculates the token growth ratio synchronously on every ingest request.
If three consecutive iterations show >1.4x prompt growth (a hallmark of a context-appending loop), it asynchronously fires a webhook alert to your security team without adding latency to the critical API path.
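
One way to read the heuristic above is sketched below. `GrowthDetector` is a hypothetical name, and the sketch assumes that "three consecutive iterations" means three back-to-back growth steps, each more than 1.4x the previous prompt size. The webhook call is left out; `observe` only reports whether an alert would fire.

```python
from collections import deque


class GrowthDetector:
    WINDOW = 5       # iterations kept per session
    RATIO = 1.4      # growth threshold between consecutive prompts
    CONSECUTIVE = 3  # growth steps required to flag the session

    def __init__(self) -> None:
        self._sizes = deque(maxlen=self.WINDOW)

    def observe(self, prompt_tokens: int) -> bool:
        """Record one iteration; return True if the latest steps look runaway."""
        self._sizes.append(prompt_tokens)
        sizes = list(self._sizes)
        # One flag per adjacent pair: did the prompt grow by more than RATIO?
        flags = [a > 0 and b > self.RATIO * a for a, b in zip(sizes, sizes[1:])]
        return len(flags) >= self.CONSECUTIVE and all(flags[-self.CONSECUTIVE:])


runaway = GrowthDetector()
runaway_results = [runaway.observe(t) for t in (1000, 1500, 2300, 3500)]

steady = GrowthDetector()
steady_results = [steady.observe(t) for t in (1000, 1100, 1200, 1300, 1400)]

print(runaway_results)  # [False, False, False, True]
print(steady_results)   # [False, False, False, False, False]
```

A window of five sizes yields at most four growth steps, so the check stays constant-time per request regardless of how long the session runs.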

4. Zero-Latency Proxying & Resilient Telemetry

Ultra-Low Latency: The hot path of the proxy only handles authentication, routing, and credential management.
Asynchronous Telemetry: Payload logging and risk scanning are offloaded to our Async Telemetry Pipeline. The client receives the provider's response immediately.
Data Security & Compliance: Strict data retention policies automatically purge payloads according to your enterprise compliance requirements.
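
The split between the hot path and background telemetry can be illustrated with a queue and a worker thread. This is a generic pattern using only the Python standard library, not the AgentWatch pipeline itself; `handle_request`, `telemetry_worker`, and the `logged` list are hypothetical.

```python
import queue
import threading

telemetry_queue = queue.Queue()
logged = []  # stand-in for a payload log / risk scanner


def telemetry_worker() -> None:
    """Drain the queue in the background, off the request path."""
    while True:
        record = telemetry_queue.get()
        logged.append(record)
        telemetry_queue.task_done()


def handle_request(prompt: str, call_upstream) -> str:
    """Hot path: call the provider, enqueue telemetry, return immediately."""
    response = call_upstream(prompt)
    telemetry_queue.put({"prompt": prompt, "response": response})
    return response


threading.Thread(target=telemetry_worker, daemon=True).start()

reply = handle_request("ping", lambda p: p.upper())  # fake upstream provider
telemetry_queue.join()  # only for the demo: wait until the worker has logged

print(reply)   # PING
print(logged)  # [{'prompt': 'ping', 'response': 'PING'}]
```

In this sketch the caller gets `reply` as soon as the upstream call returns; the `join()` exists only so the demo can show the logged record deterministically.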

Python SDK Integration

AgentWatch wraps the standard OpenAI client via composition, so existing call sites keep working:

from agentwatch import WatchedOpenAI

client = WatchedOpenAI(
    agentwatch_api_key="your_aw_key",
    agentwatch_project="checkout-service",
    agentwatch_team="payments-eng",
    agentwatch_session_id="ci-run-123",
    agentwatch_session_budget_usd=2.00,  # Strict $2 limit
    agentwatch_enforcement_mode=True
)

# Standard OpenAI API usage
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Refactor this module..."}]
)

Supported Providers

AgentWatch acts as a unified proxy. The client token is never forwarded upstream; instead, AgentWatch authenticates each request to the target provider.

OpenAI: Supported fully.
Anthropic: Supported fully.
Groq: Supported fully.

Project details

Release history

1.0.0

Jun 11, 2026

This version

0.1.2

Jun 11, 2026

0.1.1

Jun 11, 2026

0.1.0

Jun 11, 2026

Download files

Source Distribution

aw_sdk-0.1.2.tar.gz (8.0 kB view details)

Uploaded Jun 11, 2026 Source

Built Distribution

aw_sdk-0.1.2-py3-none-any.whl (8.1 kB view details)

Uploaded Jun 11, 2026 Python 3

File details

Details for the file aw_sdk-0.1.2.tar.gz.

File metadata

Download URL: aw_sdk-0.1.2.tar.gz
Upload date: Jun 11, 2026
Size: 8.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for aw_sdk-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`ac91a760b06937b0bc456d544ee1dccdc9a635486c3edae64cd0a68de3fbc700`
MD5	`f461dc2411e045f6be37022a7fd62483`
BLAKE2b-256	`6a48374e5bb95b195ab999da6dfcb689f67637a59dfaeeb0a1fc1038764fd4ad`

File details

Details for the file aw_sdk-0.1.2-py3-none-any.whl.

File metadata

Download URL: aw_sdk-0.1.2-py3-none-any.whl
Upload date: Jun 11, 2026
Size: 8.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for aw_sdk-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ba5b34ba48f1f0c63926f0c4c0b2c7e9581656cb261251738618dd7249cb6259`
MD5	`ecf114ea0309a9fb7693e95257e1eb75`
BLAKE2b-256	`1a8f17bc7c8456282322821f776a5a022037ad261b1774e16bfdfe0b796fccc3`
