Constitutional Governance Kernel for AI Agents — trust scoring, approvals, audit trail

These details have not been verified by PyPI

Project links

Project description

AgentGuard

Constitutional Governance Kernel for AI Agents

"Your AI agents are only as safe as the system governing them." "If your agent can break AgentGuard, it better be good at breaking Bitcoin."

Install

pip install agentguard-kernel

The Problem

You're building with AI agents. Your agent has access to your database, your payments, your users.

What happens when it does something it shouldn't?

Most teams have no answer. No pause button. No audit trail. No trust scoring. No approval gate. Just an agent with full access and a prayer.

AgentGuard changes that.

Quick Start

from agentguard import TrustEngine

engine = TrustEngine(db_path="trust.db")

# New agent starts TRUSTED
level = engine.get_trust_level("my-agent-001")
print(level)  # → TRUSTED

# Quick permission check
if engine.is_allowed("my-agent-001"):
    # proceed with action
    pass

# Agent does something bad
result = engine.apply_penalty("my-agent-001", "spam")
print(result)  # → {"new_points": 20, "new_level": "LIMITED"}

# Agent does something worse
result = engine.apply_penalty("my-agent-001", "abuse")
print(result)  # → {"new_points": 50, "new_level": "OBSERVED"}

# Agent recovers with good behaviour
engine.upgrade_trust("my-agent-001", reason="passed_review")

# Emergency: hard revoke
engine.revoke("my-agent-001", reason="critical_violation")

# Full audit trail
history = engine.get_history("my-agent-001")

# See all agents
agents = engine.list_agents()

Trust Levels

TRUSTED → LIMITED → OBSERVED → SUSPICIOUS → QUARANTINED → REVOKED

Level	Points	What it means
TRUSTED	0–19	Full access, normal operation
LIMITED	20–39	Restricted actions, monitored
OBSERVED	40–59	All actions logged, some blocked
SUSPICIOUS	60–79	Most actions require approval
QUARANTINED	80–99	Almost fully restricted
REVOKED	100	Completely blocked

Violation Penalties

Violation	Points
fake_verification	50
hate_speech	60
drug_listing	40
unauthorized_access	40
high_risk_activity	35
abuse	30
multiple_flags	25
spam	20

API Reference

`TrustEngine(db_path="trust.db")`

Initialize with a SQLite database path. Uses trust.db by default.

`get_trust_level(user_id) → str`

Returns the current trust level string.

`get_trust_score(user_id) → dict`

Returns {"level": ..., "points": ..., "appeal_status": ...}.

`is_allowed(user_id) → bool`

Returns False for QUARANTINED or REVOKED agents. Use this as your fast gate check.

`apply_penalty(user_id, violation_type, details="") → dict`

Applies a penalty. Returns {"new_points": ..., "new_level": ...}.

`upgrade_trust(user_id, reason="") → dict`

Rewards good behaviour, reduces points by 15.

`revoke(user_id, reason="") → dict`

Hard stop — immediately sets agent to REVOKED.

`submit_appeal(user_id, reason, evidence) → dict`

Agent submits an appeal for review.

`get_history(user_id, limit=50) → list`

Full audit trail for an agent.

`list_agents() → list`

All tracked agents and their trust state.

Architecture

┌─────────────────────────────────────────────────┐
│                   Your App                       │
├─────────────────────────────────────────────────┤
│         AgentGuard Kernel                        │
│    Trust Engine │ Risk Scoring │ Audit Log       │
├─────────────────────────────────────────────────┤
│              SQLite (Community)                  │
│         PostgreSQL (Enterprise)                  │
└─────────────────────────────────────────────────┘

Zero dependencies. Drop into any project in 5 minutes.

Enterprise Edition

The community kernel is free and open source.

Enterprise adds: REST API, JWT auth, Ed25519 capability tokens, human approval gate, real-time event stream, dashboard UI, global emergency stop, multi-tenant orgs, PostgreSQL + Redis, SSO/SAML, webhooks, SLA, and priority support.

📧 bkdk62309@gmail.com — response within 24 hours.

License

MIT — free to use, modify, and build on.

Author

Dheeraj Kumar Biswakarma 🐙 github.com/Mangomindai/agentguard

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Jun 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentguard_kernel-0.1.0-py3-none-any.whl (7.3 kB view details)

Uploaded Jun 2, 2026 Python 3

File details

Details for the file agentguard_kernel-0.1.0-py3-none-any.whl.

File metadata

Download URL: agentguard_kernel-0.1.0-py3-none-any.whl
Upload date: Jun 2, 2026
Size: 7.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for agentguard_kernel-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9814d64ae7657c512b276bdd8a1649323c9811ec85031d46a6bbb8b45f7415cc`
MD5	`8b88b86bdc07b26ba10864bfcc9a49a0`
BLAKE2b-256	`002c17b9c395817d2553bdac3ad6f293d1a1a2f748cefb6a28ef759bec9c4968`

See more details on using hashes here.

agentguard-kernel 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AgentGuard

Install

The Problem

Quick Start

Trust Levels

Violation Penalties

API Reference

TrustEngine(db_path="trust.db")

get_trust_level(user_id) → str

get_trust_score(user_id) → dict

is_allowed(user_id) → bool

apply_penalty(user_id, violation_type, details="") → dict

upgrade_trust(user_id, reason="") → dict

revoke(user_id, reason="") → dict

submit_appeal(user_id, reason, evidence) → dict

get_history(user_id, limit=50) → list

list_agents() → list

Architecture

Enterprise Edition

License

Author

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

`TrustEngine(db_path="trust.db")`

`get_trust_level(user_id) → str`

`get_trust_score(user_id) → dict`

`is_allowed(user_id) → bool`

`apply_penalty(user_id, violation_type, details="") → dict`

`upgrade_trust(user_id, reason="") → dict`

`revoke(user_id, reason="") → dict`

`submit_appeal(user_id, reason, evidence) → dict`

`get_history(user_id, limit=50) → list`

`list_agents() → list`