Skip to main content

Multi-agent verification orchestration -- LLMs propose, formal tools prove, every change is machine-checked

Project description

kairos

Multi-agent verification orchestration for hardware and software.

LLMs can propose RTL optimizations, but they cannot prove those optimizations are correct. A single model call that claims "area reduced by 12%" has no way to guarantee functional equivalence with the original design. Kairos closes that gap: it orchestrates specialized agents (proposer, reviewer, counterexample analyst) through formal verification tools (EBMC, yosys, Lean) in a closed loop where every proposed change is machine-checked before acceptance.

kairos optimize block.sv runs a multi-agent loop: the proposer generates candidates, EBMC k-induction proves functional equivalence, yosys measures the real area delta, and the reviewer challenges the result. What ships is not what the LLM said works -- it is what the formal tools proved correct.

Kairos is a CLI and MCP server. Connect it to Claude Code, Kiro, or Cursor, and your agent gets 40+ verification tools that return machine-checked results, not LLM opinions.

Why agents need guardrails

Every agent in the kairos loop has a different role and a different incentive. The proposer optimizes, the reviewer challenges, the counterexample analyst actively tries to break the proposal. Only changes that survive all three reach formal verification. And formal verification is not simulation or testing -- it is mathematical proof over all possible inputs.

The guardrail stack:

  • Pre-verification gates reject bad proposals before they reach formal tools: structural gate (cell count must decrease), toggle power gate (no >20% power regression)
  • Bounded model checking (EBMC) proves properties hold for all states up to depth k
  • Equivalence checking (yosys) proves the optimized netlist matches the original
  • Counterexample generation refutes invalid proposals with concrete failing traces
  • Trust boundary separates LLM-touching code from certificate generation (AST-lint enforced)
  • Block memory tracks refuted approaches so agents cannot re-propose known failures
  • Certificate chain with sha256 provenance from source through proof to output

Focus areas

Domain Verification engine What kairos checks
General software mypy, hypothesis, CrossHair, mutmut Types, properties, symbolic contracts, mutation testing
Chip design (RTL) EBMC, yosys Equivalence, safety properties, area/power optimization
Accelerator kernels NKI runtime, Lean 4 Custom kernel correctness, formal proof gate
Networking protocols EBMC, z3 Congestion control properties (BBR, CUBIC, Reno convergence)
Theorem proving Lean 4 Proof verification, sorry closure, axiom auditing
Authorization policy Cedar permit/forbid policy correctness

Kairos is built by kairos. We use our own verification and anti-hallucination tools to develop, test, and ship kairos itself. Every bug found by kairos repair --review on our own codebase makes the tool stronger.

Quick start

pip install

pip install athanor-sdk          # imports as: kairos
pip install athanor-sdk[types]   # adds mypy for Python type checking
kairos setup                     # zero-config wizard
kairos doctor                    # verify your environment

# Verify any codebase
kairos verify block.sv           # chip design (RTL)
kairos verify kernel.py          # accelerator kernel
kairos verify src/               # Python (types + properties + mutation)
kairos repair --review .         # find bugs across the whole project

Docker (includes all verification engines)

docker pull ghcr.io/athanor-ai/kairos:latest
docker run --rm -v $PWD:/work ghcr.io/athanor-ai/kairos:latest verify /work/block.sv
docker run --rm -v $PWD:/work ghcr.io/athanor-ai/kairos:latest verify /work/train.py
docker run --rm -e ANTHROPIC_API_KEY -v $PWD:/work ghcr.io/athanor-ai/kairos:latest optimize /work/block.sv

The Docker image includes yosys, EBMC, verilator, and all other engines. No additional installs needed.

Commands

design -- create RTL from natural language

kairos design "8-entry FIFO with sync reset"     # spec + architecture + RTL
kairos design --spec fifo_spec.sv                 # skip spec-gen, start from SVA
kairos design "pipelined ALU" --output ./alu/     # custom output directory

optimize -- area/power reduction with formal proof

kairos optimize block.sv                          # propose + verify + measure
kairos optimize block.sv --estimate-only          # preview strategies + cost, no LLM
kairos optimize block.sv --compound 5             # 5 iterative rounds
kairos optimize block.sv --liberty cells.lib      # technology-mapped cell counts
kairos optimize block.sv --local-only             # zero network except LLM API
kairos optimize kernel.py                         # auto-detects NKI kernels

verify -- auto-detect domain, multi-layer checking

kairos verify block.sv                            # RTL: yosys + EBMC bounded model check
kairos verify src/                                # directory: walks all files
kairos verify my_cca.py                           # Python CCA: telos + types + mutation (auto-detected)
kairos verify proof.lean                          # Lean: lake build + sorry check

Auto-detects the verification domain from file content, not just extension. A Python file with networking patterns (cwnd, ssthresh, pacing_rate) gets telos protocol verification THEN Python type checking and mutation testing -- stacked, not either/or.

repair -- find + fix + verify

kairos repair block.sv                            # detect issues, propose fixes, verify
kairos repair --review block.sv                   # diagnose only, no changes
kairos repair --focus security src/               # focus on security findings

explore -- fork and compare design variants

kairos explore block.sv                           # architecture exploration
kairos explore block.sv --fork area timing power  # parallel variant comparison

doctor -- environment health check

kairos doctor                                     # check deps, license, engines, models
kairos doctor --json                              # machine-readable output

setup -- first-run configuration

kairos setup                                      # interactive wizard

How it works

  1. Propose: LLM generates optimization candidates (local or cloud)
  2. Verify: EBMC k-induction proves functional equivalence
  3. Measure: yosys synthesis confirms real area reduction
  4. Report: honest verdict with claim verification

Every measurement from a tool, not an LLM estimate. Area from yosys synthesis, equivalence from EBMC, timing from OpenSTA (when .lib provided).

Configuration (kairos.yaml)

Drop a kairos.yaml in your project root:

model: claude-sonnet-4-6

methodology:
  verification:
    require_equivalence_proof: true
    minimum_bound_k: 10
  certificate:
    require_all_properties_proved: true

agents:
  reviewer:
    enabled: true
    severity_threshold: medium

Precedence: CLI flags > kairos.yaml > environment variables > defaults.

Agent integration (MCP)

{
  "mcpServers": {
    "kairos": {
      "command": "kairos",
      "args": ["mcp", "serve"]
    }
  }
}

Works with Claude Code, Kiro, Cursor, and any MCP-compatible IDE. 41 tools available including kairos_optimize, kairos_verify, kairos_repair, kairos_explore, and kairos_doctor.

Local-only mode

kairos optimize block.sv --local-only
kairos optimize block.sv --proposer-backend local  # RTL never leaves machine

No trace upload, no telemetry. Only the LLM API call leaves your machine. Use --proposer-backend local with ollama/vLLM for full air-gapped operation.

Requirements

  • Python 3.10+
  • kairos setup installs everything else

Optional extras:

  • pip install athanor-sdk[types]: mypy for Python type checking layer
  • pip install athanor-sdk[verify]: CrossHair for symbolic contract checking
  • pip install athanor-sdk[mcp]: MCP server for IDE integration

Optional engines (for full verification):

  • yosys + EBMC (hardware equivalence)
  • Lean 4 (theorem proving)
  • OpenSTA (timing analysis, requires Liberty .lib)

Run kairos doctor to check your setup.

Documentation

In Docker: see /opt/kairos-docs/ for bundled guides.

  • Getting Started (getting-started.md)
  • Installation Guide (installation.md)
  • Enterprise RTL Quickstart (enterprise-rtl-quickstart.md)

License

Commercial license required for production use. Free evaluation available. Visit athanor-ai.com for access.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

athanor_sdk-0.8.10.tar.gz (1.8 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

athanor_sdk-0.8.10-py3-none-any.whl (2.1 MB view details)

Uploaded Python 3

File details

Details for the file athanor_sdk-0.8.10.tar.gz.

File metadata

  • Download URL: athanor_sdk-0.8.10.tar.gz
  • Upload date:
  • Size: 1.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for athanor_sdk-0.8.10.tar.gz
Algorithm Hash digest
SHA256 57fa29e8359d4246966f2cc8cb2778e104684c559c75055004f230d8a3ccf5f3
MD5 742624bdfcf5d68d77fac2ba3f27c426
BLAKE2b-256 7bb47083b8a52aea70aec2c7d8c3b520d3fd03504a6ddb6cc346c49aff14be20

See more details on using hashes here.

File details

Details for the file athanor_sdk-0.8.10-py3-none-any.whl.

File metadata

  • Download URL: athanor_sdk-0.8.10-py3-none-any.whl
  • Upload date:
  • Size: 2.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for athanor_sdk-0.8.10-py3-none-any.whl
Algorithm Hash digest
SHA256 bbb516421353712a434e80fa1dd07f80493ddc49f0fa808f7e6b90d0ff8c72c2
MD5 43688d44282e66d9aaff33024d7510e1
BLAKE2b-256 19613199a1433dcae4064d260616505dcf7345095fea575f30f029c62c36ca2c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page