Runtime-enforced governance kernel for AI agents
Project description
AgenWatch
Runtime-enforced execution governance for AI agents.
AgenWatch is a Python SDK that guarantees AI agents stop when they must.
Budgets, iteration limits, and execution boundaries are enforced during runtime, not observed after failure.
This is not an observability tool.
This is not a prompt framework.
AgenWatch is an execution kernel.
For architectural details and guarantees, see ARCHITECTURE.md
Why AgenWatch Exists
Most agent frameworks answer:
“How do I make my agent smarter?”
AgenWatch answers a different question:
“Can I mathematically guarantee this agent will stop?”
AgenWatch enforces hard limits before tools or LLM calls execute:
- No runaway costs
- No infinite reasoning loops
- No silent retry storms
- No post-mortem surprises
What AgenWatch Is (and Is Not)
AgenWatch is
- A bounded execution controller
- A deterministic agent runtime
- A runtime safety and governance layer for agents
AgenWatch is not
- A prompt-engineering framework
- A workflow orchestrator
- A UI or observability dashboard
- A LangChain replacement
AgenWatch does not try to make agents smarter.
It makes them governable.
Installation
pip install agenwatch
AgenWatch-Only Example
This example shows pure AgenWatch execution with runtime enforcement.
import os
from agenwatch import Agent, tool
from agenwatch.providers import OpenAIProvider
@tool("Echo input text")
def echo(**kwargs) -> dict:
"""Echo back the provided text"""
text = kwargs.get("text", "")
return {"echo": text}
agent = Agent(
tools=[echo],
llm=OpenAIProvider(
api_key=os.getenv("OPENAI_API_KEY"),
model="gpt-4o-mini"
),
budget=1.0, # Hard execution budget
max_iterations=5
)
result = agent.run("Echo hello")
print(f"Success: {result.success}")
print(f"Cost: {result.cost}")
print(f"Output: {result.output}")
Guarantees
- If budget is exhausted, no further LLM or tool calls can occur
- Retries are idempotent (no double charging)
- Termination is enforced inside the kernel, not user code
Budget Kill-Switch (Runtime Enforcement)
AgenWatch enforces budgets as a runtime kill switch, not a warning.
Behavior:
- First allowed call executes and is charged
- Retries do not double-charge
- Next call is blocked before execution
- Agent terminates with
budget_exceeded
This enforcement happens synchronously inside the kernel.
LangChain + AgenWatch Integration
AgenWatch can be used alongside LangChain.
Important distinction:
- LangChain produces prompts / logic
- AgenWatch governs execution
- The LLM decides whether to call tools
AgenWatch cannot force an LLM to call a tool. It only governs tool calls if and when they occur.
Working Integration Example
# Pattern: LangChain Logic + AgenWatch Enforcement
import os
from langchain_core.prompts import ChatPromptTemplate
from agenwatch import Agent, tool
from agenwatch.providers import OpenAIProvider
# Step 1: Define tools (AgenWatch-governed)
@tool("Echo text safely")
def echo(**kwargs) -> dict:
return {"echo": kwargs.get("text", "")}
# Step 2: Create AgenWatch Agent (the kernel)
agent = Agent(
tools=[echo],
llm=OpenAIProvider(
api_key=os.getenv("OPENAI_API_KEY"),
model="gpt-4o-mini"
),
budget=1.0,
max_iterations=3
)
# Step 3: Let LangChain produce the task
prompt = ChatPromptTemplate.from_messages([
("human", "Say hello using the echo tool")
])
task = prompt.format_messages()[0].content
# Step 4: Execute through AgenWatch
result = agent.run(task)
print(f"Success: {result.success}")
print(f"Cost: {result.cost}")
print(f"Output: {result.output}")
Example Output (Observed)
Success: True
Cost: 0.0
Output: It seems that I don't have access to an echo tool to assist with that.
However, I can say hello directly:
Hello!
Important Clarification
This behavior is expected.
- The LLM chose not to call the tool
- AgenWatch correctly enforced execution boundaries
- No unauthorized calls occurred
- Budget and iteration limits were still active
AgenWatch does not influence LLM reasoning. It governs execution, not decision-making.
Streaming & Inspection
AgenWatch exposes execution events for inspection:
for event in agent.stream("Analyze input"):
print(event.type)
- Event order is guaranteed
- Streaming does not affect execution behavior
- This is inspection, not control
Deterministic Execution (v0.1)
AgenWatch records execution decisions so failures can be inspected without re-running the agent.
In v0.1:
- Execution ledger is in-memory
- Replay is read-only
- No crash recovery or persistence
This is intentional and documented.
Who Should Use AgenWatch
AgenWatch is designed for:
- Platform engineers
- Infrastructure teams
- Safety & governance layers
- Production systems with strict cost or control requirements
When to Use AgenWatch
Use AgenWatch when you need:
- Hard runtime stop guarantees
- Budget or iteration enforcement
- Deterministic termination behavior
- Protection against runaway agents
AgenWatch prioritizes predictability over flexibility.
When NOT to Use AgenWatch
AgenWatch may not be a good fit if:
- You only need post-execution observability
- You want rapid prototyping without hard limits
- You expect the SDK to force tool usage
Relationship to Other Frameworks
AgenWatch is complementary to frameworks like:
- LangChain
- CrewAI
- LangGraph
Those frameworks focus on agent capability. AgenWatch focuses on agent control.
Status
Version: 0.1.0
- Kernel and budget enforcement are stable
- Public API is minimal and frozen
- Governance primitives will evolve incrementally
License
MIT License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agenwatch-0.1.2.tar.gz.
File metadata
- Download URL: agenwatch-0.1.2.tar.gz
- Upload date:
- Size: 109.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0a1cdd4ae5604ee0defb86ce0469345690879422ebba6b7a9dcc4cc18c846e51
|
|
| MD5 |
5d46f11098a1399ab18766c084d53db3
|
|
| BLAKE2b-256 |
7ec2869d9a37ad9c119fda35588b39e001da74d47f184fb5be0961dbec263f24
|
File details
Details for the file agenwatch-0.1.2-py3-none-any.whl.
File metadata
- Download URL: agenwatch-0.1.2-py3-none-any.whl
- Upload date:
- Size: 126.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4c0a362991936e626b2cb97a58d5fb8d44e4fec5ad8f598a99a1a040cce5f9a8
|
|
| MD5 |
eb6e9cc528ddf243f6ed43dce4b7452e
|
|
| BLAKE2b-256 |
831dafe9180b4754240a21d26cd1e211a77c596614c6df0e517fb9369e5bfd15
|