An AI coding agent you can actually trust - with built-in impact preview

🛡️ Safe Agent

Guardrails for AI code agents.

Safe Agent previews every file edit with impact-preview so AI helpers can’t quietly ship risky changes. Drop it into CI or run locally and require approvals before writes.

pip install safe-agent-cli
safe-agent "add error handling to api.py" --dry-run

Project Map

impact-preview (Agent Polis): the guardrail layer that previews and scores risky actions.
safe-agent-cli (this repo): a reference coding agent that uses impact-preview for approvals.
Roadmap: staged execution plan in ROADMAP.md.
Compatibility Matrix: version contract in docs/compatibility-matrix.md.
Monday Packet: current assignment bundle in docs/monday-assignment-packet.md.

The Problem

AI coding agents are powerful but dangerous:

Replit Agent deleted a production database
Cursor YOLO mode deleted an entire system
You can't see what's about to happen until it's too late

The Solution

Safe Agent previews every change before execution:

$ safe-agent "update database config to use production"

📋 Task: update database config to use production

📝 Planned Changes
┌────────┬─────────────────┬─────────────────────────┐
│ Action │ File            │ Description             │
├────────┼─────────────────┼─────────────────────────┤
│ MODIFY │ config/db.yaml  │ Update database URL     │
└────────┴─────────────────┴─────────────────────────┘

Step 1/1

╭─────────────── Impact Preview ───────────────╮
│ Update database URL                          │
│                                              │
│ **File:** `config/db.yaml`                   │
│ **Action:** MODIFY                           │
│ **Risk:** 🔴 CRITICAL                        │
╰──────────────────────────────────────────────╯

Risk Factors:
  ⚠️  Production pattern detected: production
  ⚠️  Database configuration change

Diff:
- url: postgresql://localhost:5432/dev
+ url: postgresql://prod-server:5432/production

⚠️  CRITICAL RISK - Please review carefully!
Apply this change? [y/N]:
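
The risk factors in the preview above ("Production pattern detected", "Database configuration change") suggest some form of pattern-based scoring. The sketch below illustrates one way that could work. It is not impact-preview's actual rule set: `RULES`, `LEVELS`, `Assessment`, and `assess_change` are hypothetical names, and the level assigned to each pattern is an assumption.

```python
import re
from dataclasses import dataclass, field

# Hypothetical rules: (regex, risk level, message). Not impact-preview's real rules.
RULES = [
    (re.compile(r"\bprod(uction)?\b", re.IGNORECASE), "CRITICAL", "Production pattern detected"),
    (re.compile(r"\b(db|database)\b", re.IGNORECASE), "HIGH", "Database configuration change"),
]
LEVELS = ["LOW", "MEDIUM", "HIGH", "CRITICAL"]  # ordered from least to most severe


@dataclass
class Assessment:
    risk: str = "LOW"
    factors: list = field(default_factory=list)


def assess_change(path: str, added_lines: list) -> Assessment:
    """Match the file path and added lines against RULES; keep the highest level hit."""
    result = Assessment()
    haystack = "\n".join([path, *added_lines])
    for pattern, level, message in RULES:
        if pattern.search(haystack):
            result.factors.append(message)
            if LEVELS.index(level) > LEVELS.index(result.risk):
                result.risk = level
    return result
```

Under these assumed rules, calling `assess_change("config/db.yaml", ["url: postgresql://prod-server:5432/production"])` reports both factors and a CRITICAL level, mirroring the preview shown above.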

Installation

pip install safe-agent-cli

Set your Anthropic API key:

export ANTHROPIC_API_KEY=your-key-here

Usage

Basic Usage

# Run a coding task
safe-agent "add input validation to user registration"

# Preview only (no execution)
safe-agent "refactor auth module" --dry-run

# Auto-approve low-risk changes
safe-agent "add docstrings" --auto-approve-low

CI / Non-interactive mode

Use --non-interactive to avoid prompts (auto-approves LOW/MEDIUM, rejects HIGH/CRITICAL). Combine with --fail-on-risk to fail the process if risky changes are proposed:

safe-agent "scan repository for risky config changes" --dry-run --non-interactive --fail-on-risk high
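
The approval rule described for `--non-interactive` and the threshold behaviour of `--fail-on-risk` can be modelled in a few lines. This is a sketch of the documented policy only, not the tool's source: the `Risk` enum, both function names, and the specific exit code `1` are assumptions (the text only says the process exits non-zero).

```python
from enum import IntEnum
from typing import List, Optional


class Risk(IntEnum):
    LOW = 1
    MEDIUM = 2
    HIGH = 3
    CRITICAL = 4


def auto_decision(risk: Risk) -> bool:
    """Non-interactive rule from the text: approve LOW/MEDIUM, reject HIGH/CRITICAL."""
    return risk <= Risk.MEDIUM


def exit_code(risks: List[Risk], fail_on: Optional[Risk]) -> int:
    """Return 1 (assumed value) if any change meets or exceeds the fail_on threshold."""
    if fail_on is not None and any(r >= fail_on for r in risks):
        return 1
    return 0
```

With `--fail-on-risk high`, a run that proposes even one HIGH or CRITICAL change would fail the CI job under this model, while a run with only LOW and MEDIUM changes would pass.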

Interactive Mode

safe-agent --interactive

From File

safe-agent --file task.md

How It Works

1. Plan - Claude analyzes your task and plans file changes
2. Preview - Each change runs through impact-preview for risk analysis
3. Approve - You see the diff and risk level before anything executes
4. Execute - Only approved changes are applied
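
The plan, preview, approve, execute flow above can be sketched as a small loop. This is an illustration under stated assumptions rather than Safe Agent's implementation: `Change`, `run_pipeline`, and the `approve`/`apply` callbacks are hypothetical, and the planned changes are passed in already scored instead of being produced by Claude and impact-preview.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class Change:
    path: str
    description: str
    risk: str  # e.g. "LOW" or "CRITICAL"; assumed to be filled in by the preview step


def run_pipeline(
    planned: Iterable[Change],
    approve: Callable[[Change], bool],
    apply: Callable[[Change], None],
    dry_run: bool = False,
) -> List[Change]:
    """Show each planned change, ask for approval, and apply only approved ones."""
    applied = []
    for change in planned:
        # Preview: always shown, even in dry-run mode
        print(f"[{change.risk}] {change.path}: {change.description}")
        # Approve: in dry-run mode nothing is approved or written
        if dry_run or not approve(change):
            continue
        # Execute: only reached for approved changes
        apply(change)
        applied.append(change)
    return applied
```

Keeping approval and application as injected callbacks means the same loop could serve an interactive prompt, a CI policy, or a test double.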

Options

Flag	Description
`--dry-run`	Preview changes without executing
`--auto-approve-low`	Auto-approve low-risk changes
`--non-interactive`	Run without prompts (CI-friendly)
`--fail-on-risk`	Exit non-zero if any change meets or exceeds the given risk level (e.g. `high`)
`--interactive`, `-i`	Interactive mode
`--file`, `-f`	Read task from file
`--model`	Claude model to use (default: claude-sonnet-4-20250514)

MCP Server (For Other AI Agents)

Safe Agent can be used as an MCP server, letting other AI agents delegate coding tasks safely.

# Start the MCP server
safe-agent-mcp

Claude Desktop Integration

Add the following to ~/Library/Application Support/Claude/claude_desktop_config.json (the macOS location):

{
  "mcpServers": {
    "safe-agent": {
      "command": "safe-agent-mcp"
    }
  }
}
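
If the config file already lists other MCP servers, the entry above has to be merged in rather than pasted over the file. The helper below is a minimal sketch of that merge using only the standard library; `add_mcp_server` is a hypothetical name and is not shipped with Safe Agent.

```python
import json
from pathlib import Path


def add_mcp_server(config_path: Path, name: str = "safe-agent",
                   command: str = "safe-agent-mcp") -> dict:
    """Add (or overwrite) one entry under "mcpServers", keeping other settings intact."""
    # Start from the existing config if there is one; an empty or invalid
    # JSON file would raise here and should be fixed by hand.
    config = json.loads(config_path.read_text()) if config_path.exists() else {}
    config.setdefault("mcpServers", {})[name] = {"command": command}
    config_path.parent.mkdir(parents=True, exist_ok=True)
    config_path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

It could be called as `add_mcp_server(Path.home() / "Library/Application Support/Claude/claude_desktop_config.json")`. Backing up the file first is a sensible precaution.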

Available MCP Tools

Tool	Description	Safety
`run_coding_task`	Execute a coding task with preview	🔴 Destructive
`preview_coding_task`	Preview changes without executing	🟢 Read-only
`get_agent_status`	Check agent status and capabilities	🟢 Read-only

Moltbook Integration

Safe Agent is available as a Moltbook skill for AI agent networks.

See moltbook-skill.json for the skill definition.

Demo Producer

Set up a canned risky-edit scenario and print recording commands:

safe-agent-demo prepare  # creates a demo repo with config/db.yaml
cd /tmp/safe-agent-demo-*  # or your chosen path
safe-agent-demo record     # shows asciinema + GIF commands

By default the demo runs safe-agent --dry-run "switch database config to production" against the prepared repo.

For AI Agents

If you're an AI agent wanting to use Safe Agent programmatically:

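import asyncio

from safe_agent import SafeAgent


async def main() -> None:
    agent = SafeAgent(
        auto_approve_low_risk=True,  # Skip approval for low-risk changes
        dry_run=False,               # Set True to preview only
    )
    # agent.run is awaited, so it has to be called from inside a coroutine
    result = await agent.run("add error handling to api.py")
    print(result)


asyncio.run(main())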

Powered By

impact-preview - Impact analysis and diff generation
Claude - AI planning and code generation
Rich - Beautiful terminal output
MCP - Model Context Protocol for agent interoperability

Marketing Helpers

A lightweight CLI to generate headline variants, channel-specific copy (HN, Twitter/X, LinkedIn), and README hero blocks:

safe-agent-marketing generate --audience "Teams running AI code agents in CI" \
  --hypothesis "Guardrail that blocks risky edits" --update-readme

This writes JSON/Markdown bundles to marketing/ and (optionally) refreshes the README hero block. Queue posts with:

safe-agent-marketing queue --slot 2026-02-05T15:00:00Z --slot 2026-02-05T20:00:00Z

Log traction daily:

safe-agent-marketing analytics --repo agent-polis/safe-agent --log experiments/experiments.csv

License

MIT License - see LICENSE for details.

Built by developers who want AI agents they can actually trust.

Release history

Version	Released
0.4.4	Feb 20, 2026
0.4.3	Feb 18, 2026
0.4.2	Feb 18, 2026
0.4.1	Feb 16, 2026
0.4.0	Feb 14, 2026
0.3.0 (this version)	Feb 11, 2026
0.2.0	Feb 3, 2026

Download files

File	Type	Size	Uploaded	Tags
safe_agent_cli-0.3.0.tar.gz	Source distribution	95.2 kB	Feb 11, 2026	Source
safe_agent_cli-0.3.0-py3-none-any.whl	Built distribution (wheel)	25.3 kB	Feb 11, 2026	Python 3

Both files were uploaded via twine/6.2.0 CPython/3.14.2, without Trusted Publishing.

File hashes

File	Algorithm	Hash digest
safe_agent_cli-0.3.0.tar.gz	SHA256	`03a187b35c1b6ce633ea27d75a189f68a8aa4e650f298a756fe33a92b3061749`
safe_agent_cli-0.3.0.tar.gz	MD5	`c4940c849816c07fb00bad8dada8e759`
safe_agent_cli-0.3.0.tar.gz	BLAKE2b-256	`92b3ebe839defc5b3ff63d70a8bd419018acf507bd6b9cce3e7edbd790cf6e8e`
safe_agent_cli-0.3.0-py3-none-any.whl	SHA256	`4d8e52d1b8b0f0051baea672b7255b428a2d06a28ebeeb7c7abeae618c69b251`
safe_agent_cli-0.3.0-py3-none-any.whl	MD5	`454bbfb7d646d4cc95d380bd5ef8953c`
safe_agent_cli-0.3.0-py3-none-any.whl	BLAKE2b-256	`73bae704536780a11ff9d1c7b14945ad3a743de0c88e6791e3b1afb1e47f6dc0`
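
A downloaded file can be checked against the SHA256 digests listed above with the standard library alone. The helper below is a minimal sketch; `sha256_matches` is a hypothetical name, and the file path passed to it is whatever location the archive was saved to.

```python
import hashlib
from pathlib import Path


def sha256_matches(path: Path, expected_hex: str, chunk_size: int = 1 << 20) -> bool:
    """Stream the file through SHA-256 and compare with the published digest."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        # Read in chunks so large archives are not loaded into memory at once
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hex.strip().lower()
```

For example, `sha256_matches(Path("safe_agent_cli-0.3.0.tar.gz"), "03a187b35c1b6ce633ea27d75a189f68a8aa4e650f298a756fe33a92b3061749")` should return `True` for an unmodified copy of the source distribution.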

safe-agent-cli 0.3.0
