Governance layer for autonomous AI agents — pre-flight checks, runtime monitoring, and post-session reporting.

These details have not been verified by PyPI

Project links

Project description

AgentGuard

Governance layer for autonomous AI agents — pre-flight checks, runtime monitoring, and post-session reporting.

"You wouldn't launch a rocket without a pre-launch checklist. Why run an autonomous agent without one?"

Maximum instruction, minimum interpretation.

AgentGuard doesn't eliminate the probability of failure. It reduces the impact.

The Problem: Observability vs Governance — the Gap

The AI agent tooling landscape is rich with observability tools — LangSmith, Langfuse, Helicone, Arize. They answer: "What did the agent do?"

But they don't answer: "Should the agent have started at all?"

AgentGuard fills the gap before execution: it checks that governance prerequisites are in place, monitors for loops and stalls at runtime, and produces a post-session governance report.

Read the full context: The Blind Spot of Agentic AI Systems

AgentGuard vs Observability Tools

Feature	LangSmith	Langfuse	Helicone	Arize	AgentGuard
Pre-flight governance check	—	—	—	—	Yes
Owner / scope / escalation enforcement	—	—	—	—	Yes
Killswitch verification	—	—	—	—	Yes
Instruction file validation	—	—	—	—	Yes
Runtime loop detection	Partial	—	—	Partial	Yes
Post-session governance report	—	Partial	—	—	Yes
Token / cost monitoring	Yes	Yes	Yes	Yes	Threshold
Trace visualization	Yes	Yes	—	Yes	—
Prompt replay / debugging	Yes	Yes	—	—	—
Works with any agent framework	—	—	—	—	Yes

AgentGuard is not a replacement for observability tools — it is the layer that runs before they do.

Prerequisite Level 0: The "Fuel in the Car" Metaphor

Before you drive, you check: fuel, seatbelt, mirrors. These are non-negotiable prerequisites — you don't skip them because you're in a hurry.

AgentGuard's Level 0 checks are the equivalent for autonomous agents:

Check	What it verifies
OWNER	Someone is responsible for this agent
SCOPE	The agent knows what it's allowed to do
ESCALATION	There is a human to contact when things go wrong
KILLSWITCH	There is a documented way to stop the agent

These are CRITICAL by default. An agent without an owner is an unaccountable system. An agent without a killswitch is a runaway process waiting to happen.

Quick Start

pip install agentguard-governance
cd my-agent-project
agentguard check

If your project lacks governance prerequisites, you'll see:

╭─────────── AGENTGUARD — PRE-FLIGHT CHECK ────────────╮
│   Project:  ./my-agent-project                       │
│   Checked:  2026-06-06 15:00:00                      │
│                                                      │
│   🔴 CRITICAL   No agent owner defined               │
│   🔴 CRITICAL   No authorized scope defined          │
│   🔴 CRITICAL   No prohibited actions defined        │
│   🔴 CRITICAL   No escalation path configured        │
│   🔴 CRITICAL   No killswitch defined                │
│   🔴 CRITICAL   No CLAUDE.md or AGENTS.md found      │
│                 (fix: create CLAUDE.md first)        │
│                                                      │
│   RESULT: BLOCKED — 6 critical gaps                  │
│                                                      │
│   This agent cannot start until governance           │
│   gaps are resolved or explicitly overridden.        │
│                                                      │
│   agentguard init --interactive                      │
│   agentguard override --reason "..."                 │
╰──────────────────────────────────────────────────────╯

Fix it interactively:

agentguard init --interactive

CLI Commands

`agentguard check`

Run a pre-flight governance check.

agentguard check                          # check current directory
agentguard check --path ./my-project      # check a specific path
agentguard check --config ./gov.yaml      # use a specific governance.yaml
agentguard check --format json            # machine-readable output
agentguard check --ai-review              # include AI-powered scope quality review

Exit codes:

0 — OK or warnings only (with or without AI review)
1 — CRITICAL findings found
2 — Config error

--ai-review requires AGENTGUARD_AI_PROVIDER and AGENTGUARD_AI_API_KEY in .env or environment. Without them, AI review is silently skipped.

`agentguard init`

Initialize governance for a project.

agentguard init --guided        # AI-powered 5-step concretization (requires API key)
agentguard init --interactive   # guided Q&A with inline examples,
                                # generates governance.yaml + CLAUDE.md block
agentguard init --template-only # copies governance.yaml.example to ./governance.yaml

Interactive mode guides you through:

Agent owner, scope (authorized / prohibited / confirmation-required)
Escalation contact with format validation
Escalation method (log / terminal / file)
Killswitch definition

All free-text inputs are sanitized — quote characters are stripped automatically to prevent YAML parse errors.

`agentguard init --guided`

AI-powered 5-step guided concretization. Answer 5 questions in plain language — AgentGuard uses an AI provider to transform your intent into enforceable rules.

agentguard init --guided

Requires AGENTGUARD_AI_PROVIDER and AGENTGUARD_AI_API_KEY in .env. Without them, use agentguard init --interactive instead.

What it does:

Owner — who is responsible for this session
Mission — free-text description → AI splits into authorized / prohibited / confirmation scope
Hard Limits — things the agent must never do → AI adds to prohibited scope
Escalation — how to reach you when something goes wrong
Killswitch — how to stop the agent

After all 5 steps, a review panel shows the full governance before saving. You can adjust individual fields or start over.

Adjustment loop: Each AI-concretized field offers up to 3 rounds of refinement. If the AI cannot improve further, your raw input is saved with a warning.

What gets written:

governance.yaml — with metadata comment block (Generated by: agentguard init --guided)
.claude/settings.json — PreToolUse hook (merge-safe)
CLAUDE.md — AgentGuard governance block appended

Graceful degradation: API failure → raw input saved, flow continues. Ctrl+C → prompted to save progress before exiting.

`agentguard watch`

Start runtime observer. Reads a JSON tool-call log from the agent harness.

agentguard watch                          # watch agent.log (default)
agentguard watch --log ./my-agent.log     # watch a specific log file
agentguard watch --interval 5             # poll every 5 seconds

Emits LOOP_WARNING, STALL_WARNING, and BURN_WARNING events to console and appends to agentguard.log.

Expected log format (one JSON object per line):

{"tool": "bash", "tokens": 150}
{"tool": "bash", "tokens": 150}
{"tool": "read_file", "tokens": 80}

`agentguard report`

Generate a post-session Markdown governance report.

agentguard report                                      # reads agentguard.log, writes report.md
agentguard report --session ./run1.log --output r.md   # custom paths

`agentguard review`

Review and update existing governance.yaml interactively.

agentguard review                          # interactive field-by-field review
agentguard review --guided                 # AI-assisted rule concretization
agentguard review --field authorized       # review a specific field only
agentguard review --path ./my-project      # review a project in another directory

Shows a summary of current governance, then offers:

Review all fields — walk through each scope field, keep/add/remove/replace rules
Review specific field — focus on one field
Add new rules — append to an existing field
Mark ambiguities as resolved — close open ambiguities with an audit timestamp
View full governance.yaml — Rich syntax-highlighted display

All changes are logged in governance_history with the date, tool, and changed fields.

`agentguard override`

Override CRITICAL findings and proceed. The --reason flag is mandatory and the override is logged.

agentguard override --reason "Emergency hotfix — owner notified verbally"
agentguard override --reason "Demo environment — no real escalation needed" --path ./demo

Override log is written to agentguard-overrides.log.

`agentguard verify`

Verify governance.yaml was generated consistently. Detects drift if prompts or outputs changed since last pin.

agentguard verify                              # verify current directory
agentguard verify --config path/to/governance.yaml

After agentguard init --guided, a concretization_pins block is written to governance.yaml. Each pin records the SHA-256 hashes of the prompt, output, model, provider, and temperature used during AI concretization.

agentguard verify checks:

All required pin fields are present
temperature is 0 (deterministic output guaranteed)

Exit code	Meaning
`0`	All pins verified — governance is reproducible
`1`	Pin issues found (missing, incomplete, or temperature drift)
`2`	governance.yaml not found

Consistency & Reproducibility

When agentguard init --guided generates governance rules, it records prompt-pins alongside each concretized field:

concretization_pins:
  - field: "mission"
    input_hash: "abc123def456abcd"
    prompt_hash: "def456abc123ef01"
    output_hash: "1234567890abcdef"
    model: "claude-sonnet-4-20250514"
    provider: "anthropic"
    temperature: 0
    date: "2026-06-09"

This answers: "How were these governance rules generated — and can we reproduce them?"

The hashes are SHA-256 truncated to 16 chars for readability. They don't re-verify the AI output automatically (the AI is non-deterministic even at temperature=0 across versions), but they document the exact conditions under which governance was created. Use agentguard verify to check structural integrity.

How AgentGuard Works — Four Layers

Layer 1 — Before the agent starts (Pre-Flight)

agentguard check validates governance prerequisites. agentguard check --ai-review adds AI-powered scope quality scoring.

Layer 2 — While the agent runs (Enforcement)

agentguard enforce runs as a Claude Code PreToolUse hook. Deterministic — no LLM. Checks every tool call against governance.yaml. Exit 2 = blocked. Exit 0 = allowed.

Layer 3 — Monitoring (Runtime Watch)

agentguard watch reads native Claude Code JSONL transcripts. Detects loops, stalls, and token burn in real time.

Layer 4 — After the session (Reporting & Audit)

agentguard report generates a Markdown governance report. agentguard verify checks governance consistency via prompt pins. agentguard review updates governance for changed projects.

Complete Command Reference

Command	Purpose	Requires API Key
`agentguard check`	Pre-flight governance validation	No
`agentguard check --ai-review`	+ AI scope quality scoring	Yes
`agentguard init --interactive`	Basic guided setup	No
`agentguard init --guided`	AI-concretized governance setup	Yes
`agentguard enforce`	PreToolUse hook handler	No
`agentguard watch`	Runtime JSONL monitoring	No
`agentguard report`	Post-session governance report	No
`agentguard review`	Update existing governance	No
`agentguard review --guided`	AI-assisted governance update	Yes
`agentguard verify`	Check governance consistency/drift	No
`agentguard override`	Proceed despite critical gaps	No
`agentguard web`	Browser UI — check, governance, terminal	No (API key optional)
`agentguard web --path p1 --path p2`	Multi-project browser UI	No

AI-Powered Scope Review (Optional)

AgentGuard can use an AI provider to assess the quality of your governance scope — catching vague, incomplete, or ungovernable definitions that string-based checks miss.

Model selection: AgentGuard uses different models for different tasks:

Scope review (--ai-review): provider default (e.g. claude-haiku, gpt-4o-mini)
Governance concretization (--guided): higher-capability model (claude-sonnet for Anthropic, gpt-4o for OpenAI) for schema reliability
All concretization calls use temperature=0 for consistency

Setup

Create a .env file in your project root (see .env.example):

AGENTGUARD_AI_PROVIDER=anthropic
AGENTGUARD_AI_API_KEY=your-api-key-here

Supported providers:

Provider	Value	Default Model
Anthropic	`anthropic`	claude-haiku-4-5-20251001
OpenAI	`openai`	gpt-4o-mini
Anysphere (Cursor)	`anysphere`	cursor-small
OpenAI-compatible	`openai-compatible`	set `AGENTGUARD_AI_MODEL`

Usage

agentguard check --ai-review

AI review is always opt-in. Without --ai-review, AgentGuard runs fully offline with no API calls and no external dependencies.

How AgentGuard Enforces — Layer 2

After agentguard init, your project contains .claude/settings.json:

{
  "hooks": {
    "PreToolUse": [{
      "matcher": "Bash|Write|Edit|MultiEdit|NotebookEdit",
      "hooks": [{"type": "command", "command": "agentguard enforce"}]
    }]
  }
}

Every tool call Claude Code attempts fires agentguard enforce first. AgentGuard reads your governance.yaml and checks:

Does this action violate the prohibited scope?
Does this action require human confirmation?

If yes → exit 2. Claude Code receives the denial reason and cannot proceed with that action. This is deterministic — it fires every time, regardless of model behavior or context length.

All enforcement decisions are logged to agentguard-enforcement.log.

What AgentGuard cannot do

AgentGuard enforces at the tool execution layer. It cannot prevent Claude from reasoning toward a blocked action — only from executing it. For production systems, combine with OS-level sandboxing.

See What AgentGuard Cannot Do for the full list.

governance.yaml Reference

# Required (CRITICAL if missing)
owner: "Jane Smith"

scope:
  authorized:
    - action: "Read and write Python files in ./src"
      reason: "Core task — agent must modify source files"
      added: "2026-06-07"

  prohibited:
    - action: "No database schema changes or production writes"
      reason: "Production data changes require human review — no exceptions"
      severity: "HARD_LIMIT"
      added: "2026-06-07"
    - action: "No git push to main branch"
      reason: "All changes must go through pull request review"
      severity: "HARD_LIMIT"
      added: "2026-06-07"

  requires_confirmation:
    - action: "Any file deletion outside ./tmp"
      reason: "File deletion is irreversible — requires explicit sign-off"
      added: "2026-06-07"

escalation:
  contact: "jane@example.com"
  method: "log"              # log | terminal | file
  trigger: "2+ critical failures or loop detected"

killswitch: "Ctrl+C"

governance_history:
  - date: "2026-06-09"
    action: "Initial governance created"
    tool: "agentguard init --guided"
    version: "0.5.1"

# Concretization consistency (added by agentguard init --guided)
concretization_pins:
  - field: "mission"
    input_hash: "a1b2c3d4e5f6g7h8"
    prompt_hash: "b2c3d4e5f6g7h8i9"
    output_hash: "c3d4e5f6g7h8i9j0"
    model: "claude-sonnet-4-20250514"
    provider: "anthropic"
    temperature: 0
    date: "2026-06-09"

# Severity overrides (critical | warning | info)
severity:
  no_owner: critical
  no_scope: critical
  no_escalation: critical
  no_killswitch: critical
  no_instruction_file: critical
  no_loop_detection: warning
  no_root_cause_rule: warning
  no_api_research_rule: info
  no_attempt_counter: warning
  no_action_log: warning
  no_skill_md: warning

# Runtime thresholds
runtime:
  loop_threshold: 2
  progress_check_interval: 10
  token_burn_threshold: 5000
  progress_scoring: false        # requires ANTHROPIC_API_KEY

# Override policy
override:
  allowed: true
  require_reason: true
  log_overrides: true

Why structured governance matters

Each governance rule includes:

action — what is allowed, prohibited, or requires confirmation
reason — why this decision was made (critical for future reference)
severity — HARD_LIMIT, CRITICAL, or WARNING (prohibited rules only)
added — when the rule was created

Six months from now — after staff changes, project handovers, or simply forgetting — the reason field answers: "What did we mean by this?"

Governance without context is a checklist. Governance with context is institutional memory.

Legacy flat-string format is still supported for backward compatibility.

What AgentGuard Checks

Level 0 — Governance Prerequisites (CRITICAL)

Check	Rule
Owner	`governance.yaml` has non-empty `owner` field
Scope	`governance.yaml` has non-empty `scope` field
Escalation	`governance.yaml` has `escalation.contact` field
Killswitch	`governance.yaml` has `killswitch` field
Instruction file	`CLAUDE.md` or `AGENTS.md` present in project root
security.md absent	INFO — consider documenting security policies

Prompt Quality (WARNING)

Check	Keywords scanned in CLAUDE.md / AGENTS.md
Loop detection	loop, iteration, attempt, stuck, retry
Root-cause analysis	root cause, root_cause, diagnose before, confirm before
External API research	fetch, documentation, never rely on memory, aktuelle

Harness Quality (WARNING)

Check	Patterns scanned in `*.py` files
Attempt counter	`attempt_count`, `retry_count`, `max_attempts`
Action log	`action_log`, `log_action`, `append.*log`
Error pattern detection	`same_error`, `error_pattern`, `consecutive_errors`

Governance Review Cycle

Governance defined today may not fit your project in three months. agentguard review ensures governance stays current.

# Review all governance fields interactively
agentguard review

# Review with AI-assisted concretization
agentguard review --guided

# Review a specific field only
agentguard review --field authorized

# Review a project in another directory
agentguard review --path ./my-project

Use agentguard review when:

The project scope has changed significantly
Team members have changed (handover situation)
Unresolved ambiguities need to be addressed
A governance audit is due
The agent produced unexpected results

All changes are logged in governance_history — full audit trail of when governance changed, what changed, and which tool was used.

Pre-Inquiry — Quality In, Quality Out

The quality of your governance is directly proportional to the quality of your preparation.

AgentGuard cannot fill knowledge gaps — it exposes them. The owner bears responsibility for what they define.

Before running agentguard init --guided, know:

Which directories and files the agent may touch
Which external APIs or services are involved
What success looks like — in measurable terms
Who is accountable when something goes wrong
What the agent must never do — without exceptions

Vague input produces vague governance. Vague governance produces unenforceable rules. Unenforceable rules produce incidents.

What AgentGuard Cannot Do

Guarantee model behavior — AgentGuard enforces at the tool execution layer. It cannot prevent Claude from reasoning toward a blocked action, only from executing it.
Fill knowledge gaps — Ambiguities in your governance definition reflect real gaps in your understanding of the agent's scope. AgentGuard documents them; you must resolve them.
Replace security practices — For production systems, combine AgentGuard with OS-level sandboxing (Docker, seccomp, file ACLs).
Enforce on non-hook frameworks — Enforcement requires Claude Code hooks. For other frameworks, use agentguard enforce manually in your own harness.

Regulatory Alignment

AgentGuard is designed to be compatible with:

Singapore IMDA Model Governance Framework — human oversight, accountability, and documentation requirements
Anthropic — Building Effective Agents — loop detection, progress monitoring, and controlled escalation
EU AI Act GPAI provisions (effective August 2, 2026) — transparency, human oversight, and risk management for general-purpose AI systems

AgentGuard does not provide legal compliance. It provides the technical prerequisites that compliance frameworks require.

Web Interface

pip install "agentguard-governance[web]"
agentguard web

Opens http://localhost:8767 with:

Tab	Purpose
Pre-Flight Check	Run governance validation, see results visually
Governance	View all governance rules with color-coded sections
Verify Pins	Check concretization consistency
Terminal	Run any agentguard command interactively
Setup Governance	Guided, interactive, or template setup
Review & Update	Update governance as project evolves

All commands including interactive ones (init --guided, review --guided) run directly in the browser terminal. Click "▶ Run in Terminal" in Setup or Review to launch any command without leaving the browser.

agentguard web                                    # single project (current dir)
agentguard web --path ./my-project                # specific project
agentguard web --path ./proj1 --path ./proj2      # multiple projects
agentguard web --port 8888                        # custom port
agentguard web --no-browser                       # don't auto-open browser

Multiple projects: pass --path multiple times. The sidebar shows a project switcher — all panels update when you switch projects. Projects with governance.yaml show ✓, projects without show ⚠.

Requires macOS or Linux (Python pty module).

Building the frontend (for development)

cd web
npm install
npm run build   # builds to web/dist/ — served by FastAPI
npm run dev     # hot-reload dev server (proxies API to :8767)

Development

git clone https://github.com/MyPatric69/agentguard
cd agentguard
pip install -e ".[dev]"
pytest --tb=short
ruff check agentguard tests

License

MIT — see LICENSE for details.

Built for developers who believe that governance should be a first-class concern, not an afterthought.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.7.0

Jun 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentguard_governance-0.7.0.tar.gz (90.5 kB view details)

Uploaded Jun 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentguard_governance-0.7.0-py3-none-any.whl (50.6 kB view details)

Uploaded Jun 10, 2026 Python 3

File details

Details for the file agentguard_governance-0.7.0.tar.gz.

File metadata

Download URL: agentguard_governance-0.7.0.tar.gz
Upload date: Jun 10, 2026
Size: 90.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agentguard_governance-0.7.0.tar.gz
Algorithm	Hash digest
SHA256	`6de1c90b26ceece565fb9c8705fa4abff6bcd682aa80893112391e914b2f25db`
MD5	`563f0154d71f02b232da4be915a7dd2b`
BLAKE2b-256	`69308654485718b334f1c1bb3503f9e0699b1c024b954d81abc5403dbab362cb`

See more details on using hashes here.

File details

Details for the file agentguard_governance-0.7.0-py3-none-any.whl.

File metadata

Download URL: agentguard_governance-0.7.0-py3-none-any.whl
Upload date: Jun 10, 2026
Size: 50.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agentguard_governance-0.7.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a9f0de0f625d5f3169845a42ad2f9217c6f701c27fe81902c2762303b0830998`
MD5	`9f135464d478d3caee54145e80eb2d2b`
BLAKE2b-256	`cdfd6997f91e8615843e7444dc3cea5c36d6d94d6ca99f6a270bf4092b181bb4`

See more details on using hashes here.

agentguard-governance 0.7.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AgentGuard

The Problem: Observability vs Governance — the Gap

AgentGuard vs Observability Tools

Prerequisite Level 0: The "Fuel in the Car" Metaphor

Quick Start

CLI Commands

agentguard check

agentguard init

agentguard init --guided

agentguard watch

agentguard report

agentguard review

agentguard override

agentguard verify

Consistency & Reproducibility

How AgentGuard Works — Four Layers

Layer 1 — Before the agent starts (Pre-Flight)

Layer 2 — While the agent runs (Enforcement)

Layer 3 — Monitoring (Runtime Watch)

Layer 4 — After the session (Reporting & Audit)

Complete Command Reference

AI-Powered Scope Review (Optional)

Setup

Usage

How AgentGuard Enforces — Layer 2

What AgentGuard cannot do

governance.yaml Reference

Why structured governance matters

What AgentGuard Checks

Level 0 — Governance Prerequisites (CRITICAL)

Prompt Quality (WARNING)

Harness Quality (WARNING)

Governance Review Cycle

Pre-Inquiry — Quality In, Quality Out

What AgentGuard Cannot Do

Regulatory Alignment

Web Interface

Building the frontend (for development)

Development

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`agentguard check`

`agentguard init`

`agentguard init --guided`

`agentguard watch`

`agentguard report`

`agentguard review`

`agentguard override`

`agentguard verify`