Guardrails service for AI agents — safety evaluation, audit, and approval workflows

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Genunix

These details have not been verified by PyPI

Project description

intaris

Guardrails service for AI agents. Intaris sits between your AI agent and its tools, evaluating every tool call for safety and alignment before allowing execution. Works with OpenCode, Claude Code, OpenClaw, and any MCP-compatible client.

Default-deny. Every tool call is classified and evaluated. Read-only operations are fast-pathed; everything else goes through LLM safety evaluation. Unknown tools are never auto-approved.

Real-time. Sub-second evaluation with a priority-ordered decision matrix. Read-only calls resolve in under 1ms. LLM evaluations complete within the 5-second circuit breaker. WebSocket streaming for live monitoring.

Self-hosted. Single Python process, SQLite or PostgreSQL storage, no external dependencies beyond an LLM API key. Your code and audit trail stay under your control.

Part of the Cognara platform (Cognis controller, Intaris guardrails, Mnemory memory).

Features

Default-deny classifier -- Explicit read-only allowlist with critical pattern detection. Everything not allowlisted goes through LLM evaluation.
LLM safety evaluation -- OpenAI-compatible structured output for alignment checking, risk assessment, and decision reasoning.
Priority-ordered decision matrix -- Critical risk auto-denies, aligned low/medium approves, high risk and misalignment escalate for human review.
Session management -- Hierarchical parent/child sessions with intention tracking, lifecycle states, and idle sweep.
Intention tracking -- User-driven intention model with IntentionBarrier for real-time updates and AlignmentBarrier for parent/child enforcement.
MCP proxy -- Sits between clients and upstream MCP servers, evaluating every tool call with per-tool preference overrides.
Audit trail -- Every evaluation is logged with decision, reasoning, risk level, classification, latency, and redacted arguments.
Secret redaction -- API keys, passwords, tokens, and connection strings are automatically redacted before audit storage.
Filesystem path protection -- Working directory enforcement with approved path prefix learning from LLM approvals.
Session recording -- Full-fidelity event logs with live tailing, playback, and chunked ndjson storage (filesystem or S3).
Behavioral analysis -- Three-layer system: per-call data collection, session summaries, and cross-session behavioral profiling.
Management UI -- Built-in web dashboard with session tree view, audit log, approval queue, MCP server management, and real-time charts.
Judge auto-resolution -- Escalated tool calls can be automatically reviewed by a more capable LLM (gpt-5.4), reducing human intervention while maintaining safety. Three modes: disabled, auto, advisory.
Webhook callbacks -- HMAC-signed escalation notifications for external approval systems.
Notification channels -- Per-user push notifications (Pushover, Slack, webhook) with one-click approve/deny action links.
Rate limiting -- Per-session sliding window rate limiter to prevent runaway agents.
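
The last feature above names a per-session sliding window rate limiter. As a rough illustration of that general technique, and not Intaris's actual implementation, the sketch below keeps a queue of recent call timestamps per session and rejects a call once the window is full. The class name `SlidingWindowLimiter`, its parameters, and the injectable `clock` are all hypothetical.

```python
import time
from collections import deque
from typing import Callable, Deque, Dict


class SlidingWindowLimiter:
    """Hypothetical per-session sliding window limiter (illustrative only)."""

    def __init__(
        self,
        max_calls: int,
        window_seconds: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.max_calls = max_calls
        self.window_seconds = window_seconds
        self._clock = clock
        # One timestamp queue per session ID, oldest call first.
        self._calls: Dict[str, Deque[float]] = {}

    def allow(self, session_id: str) -> bool:
        now = self._clock()
        calls = self._calls.setdefault(session_id, deque())
        # Drop timestamps that have slid out of the window.
        while calls and now - calls[0] >= self.window_seconds:
            calls.popleft()
        if len(calls) >= self.max_calls:
            return False  # Window is full: reject this call.
        calls.append(now)
        return True
```

Because each session has its own queue, a runaway agent in one session would not consume the budget of another.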

Quick Start

Intaris needs an OpenAI-compatible API key for safety evaluation. It picks up LLM_API_KEY from your environment automatically.

LLM_API_KEY=sk-your-key uvx intaris

That's it. Intaris starts on http://localhost:8060, with the management UI at http://localhost:8060/ui.

Next, integrate Intaris with your agent. Integrations ship for several clients. For OpenCode, for example, install the plugin:

export INTARIS_URL=http://localhost:8060
cp integrations/opencode/intaris.ts ~/.config/opencode/plugins/

Intaris can also serve as an MCP proxy that adds an audit trail and guardrails to tool calls. To use it this way, configure any MCP client to use Intaris as its single MCP server:

{
  "mcpServers": {
    "intaris": {
      "type": "streamable-http",
      "url": "http://localhost:8060/mcp"
    }
  }
}

Then add the upstream MCP servers through the Intaris UI or config.

Intaris can also be run with Docker, installed with pip, or deployed in a production setup. See the full quick start guide for more clients and options.

Screenshots

Dashboard -- evaluation metrics, decision distribution, performance stats, and activity timeline

Sessions -- hierarchical tree view with expandable session details and recent evaluations

Approvals -- pending escalations with reasoning, arguments, and one-click approve/deny

Analysis -- behavioral risk profile with per-agent risk indicators and trends

Analysis -- cross-session behavioral trend tracking over time

Sessions -- suspicious session detail with evaluation reasoning and risk assessment

Audit -- critical tool execution denied with detailed reasoning

See the Management UI docs for all tabs and features.

Supported Clients

Client	Integration	Setup Guide
OpenCode	Plugin (`intaris.ts`)	OpenCode Guide
Claude Code	Hooks (bash scripts)	Claude Code Guide
OpenClaw	Plugin (`@fpytloun/openclaw-intaris`)	OpenClaw Guide
Any MCP client	MCP proxy (`/mcp` endpoint)	MCP Proxy Guide

Plugins and hooks give fine-grained control: custom error messages, fail-open/fail-closed behavior, session lifecycle management, and behavioral analysis. The MCP proxy needs only configuration and no code, but offers less control over the user experience.

How It Works

Intercept. The client integration (plugin, hooks, or MCP proxy) captures every tool call before execution and sends it to Intaris for evaluation.

Classify. The classifier checks the tool against a priority chain: session policy denies, tool preference overrides, critical patterns, the read-only allowlist, and filesystem path policy. Read-only tools are auto-approved. Critical patterns are auto-denied.

Evaluate. Tool calls classified as WRITE go through LLM safety evaluation. The LLM assesses alignment with the session intention, risk level (low/medium/high/critical), and recommends a decision -- all within a 4-second timeout.

Decide. The decision matrix applies priority-ordered rules: critical risk always denies, aligned low/medium risk approves, high risk and misalignment escalate for human review. The decision, reasoning, and full context are recorded in the audit trail.
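
The Decide step can be sketched as a small first-match-wins function. This is a minimal illustration of the three rules described above, not Intaris's actual code: the names `Risk`, `Decision`, `Evaluation`, and `decide` are hypothetical, and the real decision matrix may contain additional rules.

```python
from dataclasses import dataclass
from enum import Enum


class Risk(Enum):
    LOW = "low"
    MEDIUM = "medium"
    HIGH = "high"
    CRITICAL = "critical"


class Decision(Enum):
    APPROVE = "approve"
    DENY = "deny"
    ESCALATE = "escalate"


@dataclass(frozen=True)
class Evaluation:
    """Hypothetical shape of the LLM's structured assessment."""
    aligned: bool  # Does the call match the session intention?
    risk: Risk


def decide(ev: Evaluation) -> Decision:
    # Rules are checked in priority order; the first match wins.
    if ev.risk is Risk.CRITICAL:
        return Decision.DENY  # Critical risk always denies.
    if ev.risk is Risk.HIGH or not ev.aligned:
        return Decision.ESCALATE  # High risk or misalignment goes to a human.
    return Decision.APPROVE  # Aligned with low or medium risk.
```

Ordering matters here: a call that is both critical and misaligned is denied rather than escalated, because the critical rule is checked first.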

See the Architecture and Evaluation Pipeline docs for the full technical details.

Benchmark Results

In the benchmark suite, Intaris catches 100% of critical threats (destructive commands, data exfiltration, RCE) with zero false positives. Across 41 benchmark scenarios, including adversarial attacks, social engineering, and cross-session patterns, it achieves a 93.7% F1 score with 100% precision: no legitimate developer work was blocked in any scenario.

Metric	Value
Precision	100%
F1 Score	93.7%
False Positive Rate	0.0%
Critical Misses	0
Avg Latency	1.1s
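
The table reports precision and F1 but not recall. Assuming the standard definition F1 = 2PR / (P + R), recall can be derived from the two reported values. The helper name `recall_from_f1` is hypothetical, and the result is a derived estimate rather than a figure published in the benchmark.

```python
def recall_from_f1(precision: float, f1: float) -> float:
    """Solve F1 = 2PR / (P + R) for R, given P and F1."""
    return f1 * precision / (2 * precision - f1)


# Values taken from the table above.
recall = recall_from_f1(precision=1.0, f1=0.937)
print(f"implied recall: {recall:.1%}")  # implied recall: 88.1%
```

Under that assumption, the misses implied by an F1 below 100% fall outside the critical category, since the table lists zero critical misses.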

See the Benchmarking docs for methodology, scenario details, and how to run your own benchmarks.

Documentation

Document	Description
Quick Start	Get running in 5 minutes
Architecture	System design, layers, and key decisions
Evaluation Pipeline	Classification, LLM evaluation, and decision matrix
Configuration	Environment variable reference
REST API	Full API endpoint reference
MCP Proxy	MCP proxy setup, tool namespacing, and preferences
Management UI	Built-in web dashboard
Deployment	Production deployment guide
Development	Contributing, tests, and code conventions
OpenCode Integration	OpenCode plugin setup
Claude Code Integration	Claude Code hooks setup
OpenClaw Integration	OpenClaw extension setup
Benchmarking	Guardrails benchmark system

License

Business Source License 1.1 — see LICENSE for the full text.

The Licensed Work is (c) 2026 Filip Pytloun. You may use the Software for your own internal business operations free of charge. Commercial use (SaaS, managed services, or as a component of a commercial product) requires a separate license. On the Change Date (2030-03-15), the license converts to Apache License 2.0.

For alternative licensing arrangements, contact: filip@pytloun.cz

Project details

Release history

Version	Date
0.4.3	Apr 15, 2026
0.4.2	Apr 15, 2026
0.4.1	Apr 11, 2026
0.4.0	Apr 2, 2026
0.3.2	Mar 28, 2026
0.3.1 (this version)	Mar 26, 2026
0.3.0	Mar 26, 2026
0.2.0	Mar 20, 2026
0.1.0	Mar 16, 2026

Download files

Source Distribution

intaris-0.3.1.tar.gz (6.4 MB)

Uploaded Mar 26, 2026 Source

Built Distribution

intaris-0.3.1-py3-none-any.whl (475.0 kB)

Uploaded Mar 26, 2026 Python 3

File details

Details for the file intaris-0.3.1.tar.gz.

File metadata

Download URL: intaris-0.3.1.tar.gz
Upload date: Mar 26, 2026
Size: 6.4 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for intaris-0.3.1.tar.gz
Algorithm	Hash digest
SHA256	`cf8fce956b6d404576af2a90de3836a536b013dc587108e70e2391126d526861`
MD5	`eb9bafcd5f029f202cc37a245275709b`
BLAKE2b-256	`be38a6c0b7f5f26b17ccd70354197e57afdeb2658db1f83756167c581316632b`

Provenance

The following attestation bundles were made for intaris-0.3.1.tar.gz:

Publisher: python-publish.yml on fpytloun/intaris

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: intaris-0.3.1.tar.gz
- Subject digest: cf8fce956b6d404576af2a90de3836a536b013dc587108e70e2391126d526861
- Sigstore transparency entry: 1186131273
- Sigstore integration time: Mar 26, 2026
Source repository:
- Permalink: fpytloun/intaris@a1920b976b6c4c14d9aaf26efd9af620b986eb3d
- Branch / Tag: refs/tags/v0.3.1
- Owner: https://github.com/fpytloun
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@a1920b976b6c4c14d9aaf26efd9af620b986eb3d
- Trigger Event: release

File details

Details for the file intaris-0.3.1-py3-none-any.whl.

File metadata

Download URL: intaris-0.3.1-py3-none-any.whl
Upload date: Mar 26, 2026
Size: 475.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for intaris-0.3.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bf70e9936a4505b5ed46e716a1f9f4df0802ce924dbc086650a889d9977c5970`
MD5	`d910d845a1ecd0856e134d55a35619f3`
BLAKE2b-256	`26cf69897023729bae5ade94b0a721363845bba98395e3ff20d27d83c868780f`

Provenance

The following attestation bundles were made for intaris-0.3.1-py3-none-any.whl:

Publisher: python-publish.yml on fpytloun/intaris

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: intaris-0.3.1-py3-none-any.whl
- Subject digest: bf70e9936a4505b5ed46e716a1f9f4df0802ce924dbc086650a889d9977c5970
- Sigstore transparency entry: 1186131302
- Sigstore integration time: Mar 26, 2026
Source repository:
- Permalink: fpytloun/intaris@a1920b976b6c4c14d9aaf26efd9af620b986eb3d
- Branch / Tag: refs/tags/v0.3.1
- Owner: https://github.com/fpytloun
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@a1920b976b6c4c14d9aaf26efd9af620b986eb3d
- Trigger Event: release
