Framework-agnostic runtime that makes any AI agent safe and reliable enough to put in production: policy-gated execution, durable checkpointing, hash-chained audit.

These details have not been verified by PyPI

Project description

Lynx

Make any AI agent safe and reliable enough to put in production. Open-source Python runtime that wraps any agent (LangGraph, CrewAI, OpenAI Agents SDK, Anthropic Agent SDK, or a plain Python loop) and gives you three things every team currently rebuilds from scratch:

Policy-gated execution — every tool call passes through a declarative YAML policy engine. Dry-run, deny, transform, or require human approval.
Durable execution — every step is checkpointed before its side effect. Crash mid-run, resume exactly where you left off, no double-execution.
Hash-chained audit log — content-addressed, tamper-evident, regulator-grade trail of every decision and action.

Think Envoy + Temporal + OPA, but for AI agents.

Why

Agent reliability is the #1 unmet need in 2026 (Gartner: 40% of agentic AI projects will fail). Capabilities are up, reliability is lagging. Real incidents from the last 12 months:

An AI agent deleted a developer's entire D: drive when asked to clear a cache folder.
An AI agent wiped a production AWS environment, causing a 13-hour outage.
Meta's AI safety director was unable to stop her own agent from deleting her inbox.
An n8n v2.4.7→v2.6.3 upgrade silently broke function-calling schemas across the user base.

Every team building agents reinvents the same scaffolding: retry logic, dry-runs, approval flows, audit trails. Lynx is the missing layer.

Quickstart (under 2 minutes)

pip install lynx-agent
lynx init

# my_agent.py
import asyncio
from lynx import tool, runtime, ToolCall, FinalAnswer, Message

@tool(cost="low", reversible=False, scope=["filesystem:write"])
async def shell(cmd: str) -> str:
    proc = await asyncio.create_subprocess_shell(
        cmd, stdout=asyncio.subprocess.PIPE, stderr=asyncio.subprocess.PIPE
    )
    out, err = await proc.communicate()
    return (out + err).decode()

@shell.shadow
async def _shell_shadow(cmd: str) -> dict:
    return {"would_run": cmd}

class MyAgent:
    """Replace with any LLM-backed agent."""
    async def step(self, conversation):
        # Pretend the LLM proposed a dangerous command
        return ToolCall(tool="shell", args={"cmd": "rm -rf /"}, call_id="c1")

async def main():
    result = await runtime.run(
        MyAgent(),
        task="clean up the workspace",
        policy="./policy.yaml",
    )
    print(result.status, result.final_answer)

asyncio.run(main())

$ python my_agent.py
$ lynx ps                    # list runs
$ lynx trace <run_id>        # see every step + policy decision
$ lynx audit verify <run_id> # verify the hash chain

The default policy will deny the rm -rf / and feed the denial back to the agent as a tool result, so the agent can retry with something safer.

How it works

                ┌────────────────────────────────────────────────┐
                │  Agent (LangGraph / CrewAI / SDK / any)        │
                └──────────────────┬─────────────────────────────┘
                                   │ proposed tool call
                                   ▼
              ╔════════════════════════════════════════════════╗
              ║                  AGENT RUNTIME                 ║
              ║  ┌────────────┐  ┌────────────┐  ┌──────────┐  ║
              ║  │  Scheduler │→ │ Policy PDP │→ │ Mediator │  ║
              ║  │ (durable)  │  │  (pure)    │  │  (PEP)   │  ║
              ║  └────────────┘  └────────────┘  └──────────┘  ║
              ║          ↓             ↓              ↓        ║
              ║  ┌──────────────────────────────────────────┐  ║
              ║  │      SQLite journal + audit chain        │  ║
              ║  └──────────────────────────────────────────┘  ║
              ╚════════════════════════════════════════════════╝
                                   │ approved + recorded
                                   ▼
              ┌────────────────────────────────────────────────┐
              │ Real world (shell, browser, DB, AWS, etc.)     │
              └────────────────────────────────────────────────┘

Every action passes through the Mediator. The PDP returns one of five verdicts (allow, deny, dry_run, approve_required, transform). The Mediator dispatches accordingly. Before any side effect, a checkpoint is written. Every step emits a hash-chained audit event.

Policy example

# policy.yaml
version: 1
defaults:
  on_missing_shadow: approve_required
  on_no_match: deny

rules:
  - id: read-only-allow
    match: { declared.scope.contains_any: ["filesystem:read", "net:read"] }
    decision: allow

  - id: shell-rm-rf-root
    match:
      tool: shell
      args.cmd.matches: '^\s*rm\s+(-[rRf]+\s+)+/(\s|$)'
    decision: deny
    reason: "rm -rf / is never allowed"

  - id: prod-mutations-need-approval
    match:
      context.environment: prod
      declared.scope.contains_any: ["filesystem:write", "db:write", "cloud:write"]
    decision: approve_required
    approvers: ["@oncall"]

  - id: irreversible-dry-run-first
    match: { declared.reversible: false }
    decision: dry_run

Three layers, increasing expressiveness:

YAML rules — 80% of cases
Predicates — reusable named patterns
Python escape hatch — @policy.rule for edge cases

See docs/02-policy-language.md for the full grammar.

CLI

lynx init                    # set up a project
lynx run <script>            # run an agent script
lynx ps                      # list recent runs
lynx trace <run-id>          # step-by-step trace
lynx approvals               # list pending approvals
lynx approve <approval-id>   # approve a pending request
lynx audit verify <run-id>   # verify the hash chain
lynx audit export <run-id>   # emit jsonl for compliance
lynx policy lint             # validate policy.yaml
lynx policy bundle-id        # content-addressed bundle ID

Repo layout

lynx/
├── docs/
│   ├── 00-execution-plan.md      ← read first
│   ├── 01-data-model.md
│   ├── 02-policy-language.md
│   └── 03-sdk-and-cli.md
├── src/lynx/
│   ├── core/                     ← pure kernel, no I/O
│   │   ├── types.py
│   │   ├── policy.py             ← PDP
│   │   ├── mediator.py           ← PEP
│   │   └── scheduler.py          ← step loop
│   ├── stores/                   ← pluggable I/O
│   │   └── sqlite.py
│   ├── cli/main.py
│   ├── decorators.py             ← @tool, @shadow
│   ├── policy.py                 ← top-level re-exports
│   ├── runtime.py                ← public Runtime facade
│   └── sdk.py                    ← Agent protocol + Message types
├── tests/
├── examples/                    ← 12 numbered examples + framework integrations
│   ├── 01_hello_allow.py ... 10_devops_assistant.py
│   ├── 11_flask_service.py
│   ├── 12_django_service.py
│   └── policies/                ← multi-rule YAMLs used by examples 07/08/10
├── benchmarks/
└── pyproject.toml

Architectural rule: core/ has zero I/O. All I/O lives in stores/, adapters/, and cli/. This is why the PDP runs in microseconds, why tests are flake-free, and why upgrading from SQLite to Postgres to a gRPC sidecar is a deployment change, not a rewrite.

Roadmap

Shipped in v1.0 (current release):

Core kernel: policy PDP, action mediator, scheduler with pre-execution checkpointing
All five verdicts: allow / deny / dry_run / approve_required / transform
Hash-chained, tamper-evident audit log with lynx audit verify
Stores: SQLite (default), Postgres (production)
Adapters: Anthropic Claude, OpenAI, LangGraph, CrewAI, MCP
Shadow library: shell, filesystem, SQL, HTTP
Subprocess sandbox with POSIX rlimits
Crash-resume + approval-resume
Prometheus + OpenTelemetry hooks
Full CLI + 12 examples + STRIDE threat model

On the table for v1.x (no firm dates):

lynx replay <run-id> --from-step N --edit for run inspection
Container sandbox mode (the v1.0 sandbox is POSIX-subprocess only)
Webhook + Slack approval transports
gRPC sidecar mode for non-Python apps
HSM-signed audit events (current chain is hash-only)
Control-plane / multi-tenant dashboards (probably commercial)

Performance

What	Number
Policy evaluation (typical, ≤100 rules)	~100 µs / call
Policy evaluation (worst case, 1000 rules)	~1 ms / call
End-to-end overhead per step	~3 ms (SQLite-bound)
Test suite	57 tests in 1.1 s

For real agents where each step is a 500 ms – 5 s LLM call, Lynx's overhead is under 1%. Reproducible numbers in benchmarks/.

Documentation

Start here if you're new:

Doc	What it answers
Why Lynx	When should I use this? When shouldn't I?
Getting started	5-minute walkthrough from install to first denial
Concepts	Vocabulary: Tool, Policy, Verdict, Run, AuditEvent
Policy cookbook	Copy-pasteable rules for common patterns
FAQ	Common first-time questions

Reference docs:

Doc	What it covers
Data model	The six core types + SQLite schema
Policy language	Full YAML grammar + predicates + Python escape hatch
SDK + CLI	The public Python API + every CLI command
Threat model	STRIDE analysis + guarantees + non-goals
How v1.0 was built	The execution plan that got us to v1.0 (historical)

Examples — a learning path of 12. Each lead with a plain-language SCENARIO so the use case is clear:

#	Demo	What it shows
01	`01_hello_allow.py`	Smallest possible loop. ALLOW verdict.
02	`02_block_dangerous.py`	Block `rm -rf /` before it can run. DENY verdict.
03	`03_preview_writes.py`	See a file's contents BEFORE saving. DRY_RUN verdict.
04	`04_human_approval.py`	Pause for human sign-off on irreversible actions.
05	`05_real_llm_blocked.py`	Real Claude / GPT agent gated by Lynx.
06	`06_compliance_audit.py`	Hash-chain verification + tamper detection.
07	`07_refund_workflow.py`	Multi-tier refund rules (allow / approve / deny).
08	`08_sql_transform.py`	TRANSFORM verdict auto-injects `tenant_id` into SQL.
09	`09_fastapi_service.py`	Drop-in FastAPI integration.
10	`10_devops_assistant.py`	All five verdicts in one realistic DevOps scenario.
11	`11_flask_service.py`	Same as 09 but Flask (sync via `runtime.run_sync`).
12	`12_django_service.py`	Same as 09 but Django (async views, 4.1+).

See examples/README.md for the full index + how to run them.

Status

v1.0 — public API committed. SemVer from here: minor versions add features, patch versions fix bugs, major versions are reserved for breaking changes with documented deprecation cycles. Internal modules (lynx.core.*) are not part of the public API and may change in any minor release.

Production-ready for the documented scope (SQLite store, all five adapters, subprocess sandbox, hash-chained audit). See CHANGELOG.md for the full v1.0 surface area covered by the SemVer commitment.

License

Apache 2.0.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

2.0.0

Jun 10, 2026

This version

1.0.1

Jun 10, 2026

1.0.0

Jun 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lynx_agent-1.0.1.tar.gz (106.5 kB view details)

Uploaded Jun 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lynx_agent-1.0.1-py3-none-any.whl (53.0 kB view details)

Uploaded Jun 10, 2026 Python 3

File details

Details for the file lynx_agent-1.0.1.tar.gz.

File metadata

Download URL: lynx_agent-1.0.1.tar.gz
Upload date: Jun 10, 2026
Size: 106.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for lynx_agent-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`f6b8b76fbe8f49d8eefe9920c546afa3b1f35a71f40a8b7389b1b80ba415e5b1`
MD5	`2aa6339de326d2c26fb6e4dd940328c8`
BLAKE2b-256	`082ebecbe0a1399bfb9d73f05f8f6c2fedc69f9b1df6d6a63fcd789558dead95`

See more details on using hashes here.

Provenance

The following attestation bundles were made for lynx_agent-1.0.1.tar.gz:

Publisher: release.yml on hadihonarvar/lynx

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lynx_agent-1.0.1.tar.gz
- Subject digest: f6b8b76fbe8f49d8eefe9920c546afa3b1f35a71f40a8b7389b1b80ba415e5b1
- Sigstore transparency entry: 1772747329
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: hadihonarvar/lynx@24d025a09ae1440f587f8e91d4f8d5581f31178a
- Branch / Tag: refs/tags/v1.0.1
- Owner: https://github.com/hadihonarvar
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@24d025a09ae1440f587f8e91d4f8d5581f31178a
- Trigger Event: push

File details

Details for the file lynx_agent-1.0.1-py3-none-any.whl.

File metadata

Download URL: lynx_agent-1.0.1-py3-none-any.whl
Upload date: Jun 10, 2026
Size: 53.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for lynx_agent-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`51a41b16fe75fb15f6ea3929ba5a605af14120e3231e98e56e6d2e94f541bfdb`
MD5	`8f9eab5a1d3354ddb91a142e2b11d7d2`
BLAKE2b-256	`48cad19e221e6188468654b14b176138a876a18a99f574adaf380d9162885420`

See more details on using hashes here.

Provenance

The following attestation bundles were made for lynx_agent-1.0.1-py3-none-any.whl:

Publisher: release.yml on hadihonarvar/lynx

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lynx_agent-1.0.1-py3-none-any.whl
- Subject digest: 51a41b16fe75fb15f6ea3929ba5a605af14120e3231e98e56e6d2e94f541bfdb
- Sigstore transparency entry: 1772747526
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: hadihonarvar/lynx@24d025a09ae1440f587f8e91d4f8d5581f31178a
- Branch / Tag: refs/tags/v1.0.1
- Owner: https://github.com/hadihonarvar
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@24d025a09ae1440f587f8e91d4f8d5581f31178a
- Trigger Event: push

lynx-agent 1.0.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Lynx

Why

Quickstart (under 2 minutes)

How it works

Policy example

CLI

Repo layout

Roadmap

Performance

Documentation

Status

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance