Skip to main content

Safe, reliable local coding agent backend. Forge + coding-specific guardrails.

Project description

coding-guardrails

Safe, reliable local coding agent backend. Open-source, pip-installable.

coding-guardrails is a proxy that sits between your coding agent and a local LLM, adding two layers of protection:

  1. Forge (Layer 1) — Rescue parsing, retries, validation. Makes local models actually work for tool calling.
  2. Coding Guardrails (Layer 2) — Read-before-edit, path safety, command blocking, secret masking, test-after-change suggestions.

One command to go from "I have a GPU" to "I have a safe local coding agent backend."

Quick Start

# Install
pip install coding-guardrails

# Start llama-server (your local LLM backend)
llama-server -m Qwen3.5-9B-UD-Q4_K_XL.gguf --jinja --flash-attn auto \
  --port 8080 -c 200000 --spec-type draft-mtp -np 1

# Start the proxy
coding-guardrails serve \
  --backend-url http://localhost:8080 \
  --model Qwen3.5-9B-UD-Q4_K_XL \
  --port 8081

# Point your agent at http://localhost:8081/v1

That's it. Your agent sees a standard OpenAI-compatible API.

What It Blocks

Rule Blocks Example
Path safety Reads/writes outside workspace read_file("/etc/passwd")
Command safety Destructive shell commands bash("rm -rf /")
Secret detection API keys, tokens, private keys bash("export AWS_SECRET_ACCESS_KEY=...")
Prerequisites Edit before read (soft nudge) edit_file() without read_file() ⚠️
Sequencing Missing test runs (soft nudge) Edit without pytest ⚠️
Tool resolution Empty/error results (soft nudge) Tool returns "" ⚠️

All rules are configurable. See docs/rules.md.

Supported Models

Optimized for consumer GPUs (24 GB VRAM) with llama-server:

Model VRAM Context Speed Notes
Qwen3.5-9B 18 GB 200K ~53 tok/s Dense, MTP, best quality
Gemma 4 26B-A4B 21 GB 200K ~50 tok/s MoE, vision, Google
Qwen3.6-35B-A3B 22.5 GB 32K ~22 tok/s Legacy

Works with any OpenAI-compatible backend. See docs/models.md.

Agent Setup

Point any OpenAI-compatible agent at http://localhost:8081/v1:

  • Piapi_base: "http://localhost:8081/v1"
  • Claude CodeOPENAI_BASE_URL=http://localhost:8081/v1
  • OpenCode — add provider with baseURL: http://localhost:8081/v1
  • AiderOPENAI_API_BASE=http://localhost:8081/v1
  • Continue"apiBase": "http://localhost:8081/v1"
  • Cline / Roo — set API base in settings

See docs/agents.md for detailed setup guides.

Configuration

Create a guardrail-config.yaml (or use defaults):

path_safety:
  enabled: true
  blocked_prefixes: ["/etc/", "/sys/", "/proc/"]

command_safety:
  enabled: true
  strength: hard  # hard = block, soft = warn

secrets:
  enabled: true
  strength: hard
  mask_value: "[REDACTED]"

Pass with --config guardrail-config.yaml.

Architecture

Agent → coding-guardrails (:8081) → llama-server (:8080) → GPU
            │
            ├─ Layer 1 (Forge): rescue, validate, retry
            └─ Layer 2 (Guardrails): 6 safety rules

See docs/architecture.md for details.

Docker

docker compose up

Or standalone:

docker run -p 8081:8081 ghcr.io/stawils/coding-guardrails:latest \
  serve --backend-url http://host.docker.internal:8080 --model your-model

Eval

coding-guardrails eval --backend-url http://localhost:8081

Runs scenarios from eval/scenarios/ and reports pass/fail by category.

Development

git clone https://github.com/stawils/coding-guardrails.git
cd coding-guardrails
uv venv && source .venv/bin/activate
uv pip install -e ".[dev]"

# Run tests
pytest tests/unit/ -v

# Run against live backend
pytest tests/integration/ -v -m integration

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

coding_guardrails-0.3.1.tar.gz (45.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

coding_guardrails-0.3.1-py3-none-any.whl (44.8 kB view details)

Uploaded Python 3

File details

Details for the file coding_guardrails-0.3.1.tar.gz.

File metadata

  • Download URL: coding_guardrails-0.3.1.tar.gz
  • Upload date:
  • Size: 45.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for coding_guardrails-0.3.1.tar.gz
Algorithm Hash digest
SHA256 99fb6f0536c1c81ebc77c2eaa22df25026ff5ae290f9d56d61bd4c48c9589682
MD5 91c2f7e073f2188aba951ad532bfcc00
BLAKE2b-256 45e5c762f9c39da538a3239e3081528dbeb267b80f961b65b27cd43937286994

See more details on using hashes here.

Provenance

The following attestation bundles were made for coding_guardrails-0.3.1.tar.gz:

Publisher: ci.yaml on stawils/coding-guardrails

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file coding_guardrails-0.3.1-py3-none-any.whl.

File metadata

File hashes

Hashes for coding_guardrails-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6625d28850991a910b568f41b3828ea2009b3aa532fb66841f1942b6bf5c4a2c
MD5 40baca032148f3e8373db558affc5ec1
BLAKE2b-256 94f5c7abdea56eb12e2b9860720b91c2294161889419618b22f9f5f724d0aeb6

See more details on using hashes here.

Provenance

The following attestation bundles were made for coding_guardrails-0.3.1-py3-none-any.whl:

Publisher: ci.yaml on stawils/coding-guardrails

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page