Deterministic validation layer for AI agents and autonomous systems

These details have not been verified by PyPI

Project links

Project description

agentguard-trustlayer

AgentGuard-TrustLayer is a runtime safety layer that prevents AI agents from taking invalid or unsafe actions—even when they try.

Prevents AI agents from executing invalid or unsafe actions before they happen.

Why this exists

AI agents can generate actions.

But they don't understand consequences.

Without a validation layer:

they can break invariants
corrupt system state
execute invalid operations

agentguard-trustlayer sits between AI and execution.

It ensures:

every action is checked
every rule is enforced
every failure is contained

Core Idea

agentguard-trustlayer separates:

decision-making (AI) from execution (validated system)

How it works

AI Agent  -->  Proposal  -->  TrustLayer  -->  Execution
                                   ^
                              Constraints

Every update passes through four gates:

Auth — is the token valid and unexpired?
Locks — is the target key frozen?
Constraints — does the new state pass all rules?
Rollback — if anything fails, state is fully restored

Features

Constraint-based validation with composable logic (&, |, ~)
Delta-aware constraints — rules can compare proposed vs original state
Authenticated authority (HMAC-signed tokens with TTL)
Safe state updates with automatic rollback
set, increment, and update action types
Async agent loop with retry, backoff, and error feedback to model
Tamper-evident audit chain — every ValidationEvent carries a SHA-256 hash linked to the previous event
GuardedAgent high-level API — one object, one call
Zero dependencies (standard library only)

Practical Use Cases

Prevent AI agents from breaking business rules
Enforce invariants in automated systems
Add a safety layer to LLM workflows
Control multi-agent environments with authority levels

Quick Start

Install:

pip install trustlayer-py

Or clone and run a demo:

git clone https://github.com/AILIFE1/agentguard-trustlayer
cd agentguard-trustlayer
python examples/demo.py

🔥 Try to break the agent

python examples/demo_break_the_agent.py

An agent tries to set balance = 1,000,000. TrustLayer blocks it. The error is fed back into the prompt. The agent self-corrects and increments safely instead.

[MODEL OUTPUT] Attempting INVALID action...

[MODEL INPUT]
Increase balance as much as possible
Last error: balance <= max_limit

[MODEL OUTPUT] Attempting SAFE action...

FINAL STATE
{'balance': 110, 'max_limit': 200}

RESULT
[OK] Increase balance as much as possible

GuardedAgent — one-liner setup

import asyncio, json
from trustlayer import GuardedAgent, LambdaConstraint

async def my_model(prompt: str) -> str:
    return json.dumps({"type": "set", "target": "score", "value": 75})

agent = GuardedAgent(
    model=my_model,
    rules=[LambdaConstraint("score 0-100", lambda v: 0 <= v.get("score", 0) <= 100)],
    initial_state={"score": 50},
)

result = asyncio.run(agent.run("raise the score"))
print(result)
# {'status': 'success', 'state': {'score': 75}, 'audit': '<sha256>'}

Full API example

import asyncio, json
from trustlayer import (
    Agent, AuthorityLevel, AuthToken, Cathedral,
    LambdaConstraint, RetryConfig, State, Validator,
)

SECRET = b"my-secret"

score_ok = LambdaConstraint("score_ok", lambda v: 0 <= v.get("score", 0) <= 100)

state     = State(values={"score": 50})
validator = Validator(state, [score_ok], SECRET)
token     = AuthToken.issue(AuthorityLevel.SYSTEM, "agent", ttl_seconds=60, secret=SECRET)

async def model(prompt: str) -> str:
    return json.dumps({"type": "set", "target": "score", "value": 75})

async def main():
    cathedral = Cathedral(validator, Agent(model), retry=RetryConfig(max_attempts=3))
    event = await cathedral.step("raise the score", token)
    print(event)           # [OK] raise the score
    print(event.audit_hash)  # sha256 chain link
    print(state.values)    # {'score': 75}

asyncio.run(main())

Project Structure

agentguard-trustlayer/
├── trustlayer/
│   ├── __init__.py       # Public API + logging setup
│   ├── auth.py           # AuthToken, AuthorityLevel
│   ├── constraints.py    # Constraint, LambdaConstraint, And/Or/Not
│   ├── types.py          # State, Action, Update
│   ├── validator.py      # Validator, ValidationEvent, audit chain
│   └── engine.py         # Agent, Cathedral, GuardedAgent, RetryConfig
└── examples/
    ├── demo.py                    # Basic walkthrough
    └── demo_break_the_agent.py    # Constraint enforcement + self-correction

Philosophy

agentguard-trustlayer doesn't make decisions — it decides whether decisions are allowed.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

3.3.0

May 15, 2026

2.1.0

May 6, 2026

This version

2.0.1

May 6, 2026

2.0.0

Apr 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

trustlayer_py-2.0.1.tar.gz (10.4 kB view details)

Uploaded May 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

trustlayer_py-2.0.1-py3-none-any.whl (9.6 kB view details)

Uploaded May 6, 2026 Python 3

File details

Details for the file trustlayer_py-2.0.1.tar.gz.

File metadata

Download URL: trustlayer_py-2.0.1.tar.gz
Upload date: May 6, 2026
Size: 10.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for trustlayer_py-2.0.1.tar.gz
Algorithm	Hash digest
SHA256	`5e6fdc63a8667e4b2c66bbef120135847f02aada3393a0908b0900dcce34eb0c`
MD5	`31430f11d6ec70efc1e825737f526f6a`
BLAKE2b-256	`1924f7c684c698283385ca091c15ac0ac4169804398979a5c5922701943f6100`

See more details on using hashes here.

File details

Details for the file trustlayer_py-2.0.1-py3-none-any.whl.

File metadata

Download URL: trustlayer_py-2.0.1-py3-none-any.whl
Upload date: May 6, 2026
Size: 9.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for trustlayer_py-2.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`72cd8a2c0eec7fd149270a824cd23a24d627ecccf9647105752319ff7dcbb40e`
MD5	`25d7b0d302255bc2eab6bd98829c5423`
BLAKE2b-256	`fa1946d1bbadc4334502e0c51bb0cfec49f24cd492c7f6a7a25ad5933b9fa667`

See more details on using hashes here.

trustlayer-py 2.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

agentguard-trustlayer

Why this exists

Core Idea

How it works

Features

Practical Use Cases

Quick Start

🔥 Try to break the agent

GuardedAgent — one-liner setup

Full API example

Project Structure

Philosophy

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes