Skip to main content

A quality-assurance engine for LLM-generated code

Project description

AgentGuard

PyPI version Python License: MIT Downloads CI

Agent-native quality engine for LLM code generation. AgentGuard provides structured guidance to your AI agent โ€” it never calls an LLM itself.

878+ monthly installs ยท Explore Marketplace ยท Documentation


๐ŸŽฌ Video Tutorials

New to AgentGuard? Watch the demo series:

# Video Duration
00 What is AgentGuard? โ€” Explainer 3:54
01 Installation & Setup from Zero 3:44
02 Create a Project from Scratch with AI ~6 min
03 Automatic Documentation โ€” ADR, PRD & Dev Guide ~5 min
04 Migration: Spaghetti โ†’ CQRS Architecture ~6 min

๐Ÿ‡ช๐Ÿ‡ธ Spanish versions also available โ€” see full playlist


Why AgentGuard

AI agents generate code fast. But without structure, they hallucinate imports, skip error handling, and produce inconsistent architecture.

AgentGuard gives your agent a disciplined framework โ€” without replacing it. It's the quality layer between your AI and your codebase.

What It Does

AgentGuard is an MCP server that gives your AI agent (Claude, GPT, Gemini, etc.) a disciplined process for generating production-ready code:

  1. Skeleton โ†’ file tree with responsibilities
  2. Contracts & Wiring โ†’ typed stubs with import connections
  3. Logic โ†’ function-by-function implementation
  4. Challenge โ†’ self-review against quality criteria
  5. Validate โ†’ static analysis (syntax, lint, types, imports)

Your agent does the thinking. AgentGuard provides the framework.

Installation

pip install rlabs-agentguard

That's it. One command, no extras, no API keys needed.

Configure Your IDE

Claude Desktop / Claude Code

Add to your MCP config:

{
  "mcpServers": {
    "agentguard": {
      "command": "agentguard-mcp"
    }
  }
}

Cursor / Windsurf

Add to .cursor/mcp.json or equivalent:

{
  "mcpServers": {
    "agentguard": {
      "command": "agentguard-mcp"
    }
  }
}

Python (direct)

python -m agentguard  # starts MCP server on stdio

Quick Example โ€” What You Get

Ask your agent: "Build a REST API for task management using AgentGuard"

AgentGuard's skeleton tool returns a structured file tree:

src/
โ”œโ”€โ”€ app.py              # FastAPI application entry point
โ”œโ”€โ”€ config.py           # Environment and app configuration
โ”œโ”€โ”€ models/
โ”‚   โ””โ”€โ”€ task.py         # Task SQLAlchemy model
โ”œโ”€โ”€ routers/
โ”‚   โ””โ”€โ”€ tasks.py        # CRUD endpoints for /tasks
โ”œโ”€โ”€ services/
โ”‚   โ””โ”€โ”€ task_service.py # Business logic layer
โ””โ”€โ”€ tests/
    โ””โ”€โ”€ test_tasks.py   # Endpoint + service tests

Then contracts_and_wiring generates typed stubs with all imports wired:

# src/routers/tasks.py
from fastapi import APIRouter, Depends, HTTPException
from ..services.task_service import TaskService
from ..models.task import Task, TaskCreate, TaskUpdate

router = APIRouter(prefix="/tasks", tags=["tasks"])

@router.get("/", response_model=list[Task])
async def list_tasks(service: TaskService = Depends()) -> list[Task]: ...

@router.post("/", response_model=Task, status_code=201)
async def create_task(payload: TaskCreate, service: TaskService = Depends()) -> Task: ...

Your agent fills in the logic. AgentGuard ensures the structure is right from the start.

Tools

Agent-Native (structured guidance โ€” no API key)

Tool Purpose
skeleton L1: file tree with responsibilities
contracts_and_wiring L2+L3: typed stubs with imports (saves ~15K tokens vs separate calls)
contracts L2 only: typed function/class stubs
wiring L3 only: import and call-chain connections
logic L4: implement one function body
get_challenge_criteria Self-review criteria for an archetype
digest Compact project summary for efficient review
debug Structured debugging protocol
migrate Migration plan with compatibility checks

Utility

Tool Purpose
validate Mechanical code checks (syntax, lint, types, structure)
list_archetypes List all available project archetypes
get_archetype Get detailed archetype configuration
reload_archetypes Pick up newly installed archetypes
trace_summary Get cost & token tracking summary
docs Get AgentGuard documentation on any topic
update_agentguard Update to the latest version from PyPI

Built-In Archetypes

Archetype Tech Stack
api_backend Python + FastAPI (production)
library Python reusable package (production)
cli_tool Python CLI with subcommands
react_spa TypeScript + React SPA (production)
web_app Python + TypeScript full-stack (production)
script Python one-off automation
debug_backend Python/FastAPI debugging protocol
debug_frontend React/TypeScript debugging protocol

+ 61 community archetypes on the AgentGuard Marketplace โ€” browse, install, and publish your own.

Marketplace

Install community archetypes:

# From the AgentGuard marketplace (agentguard.rlabs.cl)
# Use the reload_archetypes tool after installing

How It Works

AgentGuard is agent-native: every tool returns structured prompts and criteria that your AI agent processes. The tool never calls an external LLM.

Your Agent (Claude, GPT, etc.)
    โ”‚
    โ”œโ”€โ”€ calls skeleton(spec, archetype) โ”€โ”€โ”€โ”€โ”€โ†’ returns L1 file tree prompt
    โ”œโ”€โ”€ calls contracts_and_wiring(spec, skeleton) โ†’ returns L2+L3 stubs prompt
    โ”œโ”€โ”€ calls logic(file, function) โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ†’ returns L4 implementation prompt
    โ””โ”€โ”€ calls get_challenge_criteria() โ”€โ”€โ”€โ”€โ”€โ”€โ†’ returns review criteria
    โ””โ”€โ”€ calls validate(files) โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ†’ returns static analysis results

The agent reads the prompt, generates the code, validates it, and loops back if criteria aren't met. AgentGuard provides the structure โ€” your agent provides the intelligence.

Development

pip install -e ".[dev]"
pytest tests/
ruff check agentguard/

Latest Release

See CHANGELOG.md for the full version history.

Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines on:

  • Setting up your dev environment
  • Creating custom archetypes
  • Submitting pull requests

Looking for a place to start? Check out issues labeled good first issue.

License

MIT โ€” see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rlabs_agentguard-0.14.0.tar.gz (144.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rlabs_agentguard-0.14.0-py3-none-any.whl (157.0 kB view details)

Uploaded Python 3

File details

Details for the file rlabs_agentguard-0.14.0.tar.gz.

File metadata

  • Download URL: rlabs_agentguard-0.14.0.tar.gz
  • Upload date:
  • Size: 144.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rlabs_agentguard-0.14.0.tar.gz
Algorithm Hash digest
SHA256 8faeb51f1b548954d602e7599f30fcfcad407f57a92caea2bd0b89cca7a49fa1
MD5 e2794d0f1bf6a774a9d50ad39e08d3b2
BLAKE2b-256 f86e0832d3bbb9b3782f114a0491d52561e9708f2918e1c05ac56530a63c78a9

See more details on using hashes here.

Provenance

The following attestation bundles were made for rlabs_agentguard-0.14.0.tar.gz:

Publisher: publish-pypi.yml on rlabs-cl/agentguard-lib

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file rlabs_agentguard-0.14.0-py3-none-any.whl.

File metadata

File hashes

Hashes for rlabs_agentguard-0.14.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0b00be82b77c8f6633f0dc29ebe8f9353f137ae19e58e182df717f74e22b5c3b
MD5 f78f21a5a487523822e739fb4d995854
BLAKE2b-256 21a6dd1b7544bc0f8cae128bfe60bba2420d069c8dde8dd6d62e2a44003e55e7

See more details on using hashes here.

Provenance

The following attestation bundles were made for rlabs_agentguard-0.14.0-py3-none-any.whl:

Publisher: publish-pypi.yml on rlabs-cl/agentguard-lib

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page