Skip to main content

Decision Intelligence Runtime - deterministic Kernel Space for agentic AI

Project description

Decision Intelligence Runtime (DIR)

An architectural framework for building reliable, accountable, and stateful AI decision systems.

Responsibility-Oriented Agents

Project Goal

Current "agent frameworks" treat Large Language Models (LLMs) as autonomous executors, resulting in non-deterministic behaviors, hallucinations in critical control paths, and insufficient accountability mechanisms.

Decision Intelligence Runtime addresses this by enforcing a strict separation between Reasoning (LLM-based, semantic, probabilistic) and Execution (code-based, deterministic, verifiable). This separation is implemented through a Kernel Space / User Space architectural boundary, inspired by operating system design principles.

Unlike purely theoretical frameworks, DIR provides a concrete, executable implementation of safe delegation patterns.

Origins

DIR emerged from production constraints in the AIvestor automated trading system, where the cost of reasoning failures is measured in capital loss. The patterns documented here are not theoretical-they represent battle-tested solutions to "Day Two" failure modes: state drift, non-idempotent operations, TOCTOU vulnerabilities, and the collapse of accountability in multi-agent systems.

This repository contains the architectural concepts, formal specifications, and reference implementations of the DIR framework.


Architectural Convergence & Validation

The publication of Intelligent AI Delegation by Google DeepMind (February 2026, arXiv:2602.11865) confirms the architectural direction we have been developing since early 2025. Their formalization of "Responsibility Transfer," "Auditability," and "Permission Handling" as fundamental requirements for agentic systems aligns with the patterns we have been validating in production environments.

The independent convergence is notable:

  • Google's "Responsibility Transfer"ROA Responsibility Contracts
  • Google's "Auditability"DecisionFlow ID (DFID)
  • Google's "Permission Handling"Decision Integrity Module (DIM)

This alignment reinforces that these patterns are not vendor-specific abstractions-they are architectural necessities for production-grade agentic systems.

Industry Validation: Google DeepMind vs. DIR Implementation

While Google's Intelligent AI Delegation (2026) outlines the theoretical requirements for safe agentic systems, DIR provides the open-source architectural implementation available today.

Requirement (Google DeepMind) DIR Implementation (This Repo) Status
Verifiable Task Completion Topology C (DL+PCI)
Cryptographic Proof-Carrying Intents
✅ Implemented
Transfer of Authority Responsibility Contracts
Machine-readable scope definitions
✅ Implemented
Permission Handling Kernel Space (DIM)
Deterministic policy enforcement
✅ Implemented
Structural Transparency DecisionFlow ID (DFID)
End-to-end distributed tracing
✅ Implemented

Read the full framework comparison: Google DeepMind vs. DIR/ROA


Quick Start

The fastest way to see the DIR architecture in action:

pip install -e .
python samples/00_quick_start/run.py

This sample demonstrates protection against catastrophic actions (e.g. parsing error turning 15.5 ETH into 15,500 ETH) and prompt injection in external data. High-level overview of the full architecture. See samples/00_quick_start.


Start Here

If you are new to DIR/ROA, begin with the introduction article. It explains the core motivation ("Day Two" failures in production systems), the Kernel Space vs. User Space architectural boundary, and why traditional agentic loops are insufficient for high-stakes environments.

Read: Beyond Prompt Engineering - Building a Deterministic Runtime for Responsible AI Agents

Why use DIR today?

Big Tech providers are beginning to theorize about "Agent Runtimes" as proprietary cloud services. DIR is the first open-source implementation of the Zero Trust Agents philosophy - agents propose, the runtime verifies; no implicit trust in LLM output.

  • No Vendor Lock-in: Run your agents on AWS, Azure, GCP, or on-premise. The runtime logic is yours.
  • Ready for Production: Solves "Day Two" problems (loops, drifts, audit) that simple orchestration libraries ignore.
  • Works with Existing Frameworks: DIR is not a competitor to LangChain, AutoGen, or LangGraph - it's the execution shell. Wrap task-oriented agents in ROA contracts, inject mission boundaries, and enforce deterministic validation. (See: 34_langchain_roa_wrapper, 35_crewai_roa_wrapper)
  • Language Agnostic Architecture: While the reference implementation is Python, the patterns (Kernel/User separation, DFID) are universal.
  • Context as Code: Documentation is the new compiler - Markdown files act as system prompts for AI agents. (Context as Code)

Core Concepts

1. Responsibility-Oriented Agents (ROA)

Current Status: Published

ROA is the architectural pattern for agents themselves. Instead of open-ended control loops, ROA constrains agents through explicit contracts:

  • Responsibility Contracts: Formal scope boundaries and authority limits.
  • Missions: Explicit optimization objectives defining agent purpose.
  • Stateful Existence: Long-lived memory and persistent identity.
  • Decision Lifecycle: Explain → Policy → Proposal (strict separation of reasoning from execution).

Read the ROA Manifesto

2. The Runtime Architecture

Current Status: Published

The execution environment that enforces deterministic guarantees:

  • Decision Integrity Module (DIM): Kernel-space validation layer (schema enforcement, RBAC, state consistency checks).
  • Context Compilation: Immutable state snapshots preventing TOCTOU vulnerabilities.
  • DecisionFlow ID (DFID): Distributed tracing for reasoning chain reconstruction.
  • Safety Invariants: Idempotency guarantees, TTL enforcement, and escalation protocols.

Read the DIR Architectural Pattern

3. Decision Intelligence Topologies

Current Status: Work in Progress

A pluralistic approach to agent orchestration. No single execution model satisfies all operational requirements. DIR defines three distinct topologies optimized for different constraint profiles:

  • Topology A (EOAM): Event-Oriented Agent Mesh for decentralized strategy coordination.
  • Topology B (SDS): Structural Decision Streams for high-velocity, grammar-constrained execution.
  • Topology C (DL+PCI): Decision Ledger with Proof-Carrying Intents for cryptographically verifiable audit trails.

Read Decision Intelligence Topologies

4. Post-Execution Governance & Drift

Current Status: Work in Progress

An architectural defense against "Day Three" problems, where an agent's individual decisions are technically valid (Kernel Compliance), but its aggregate behavior over time erodes business intent (Business Health).

  • Agent Drift Taxonomy: Formal classification of Optimization (Reward Hacking), Semantic, and Environmental drift.
  • Rolling Window Monitors: Asynchronous analysis of execution logs and context snapshots linked via DFID.
  • Circuit Breaking: Automated mechanism to transition an agent's Registry status to SUSPENDED upon detecting statistically significant drift.

Read Governance & Agent Drift


Getting Started

Prerequisites

  • Python 3.12+
  • SQLite3 (optional - only for samples that use persistent storage)
  • Ollama (optional - for local LLM inference in samples such as 31_finance_trading; use USE_MOCK_LLM=1 to run without it)

Installation

  1. Clone the repository.
  2. Install the package in editable mode:
pip install -e .

This installs the dir_core package (source in src/dir_core), making it available to all sample implementations.

Repository Structure

decision-intelligence-runtime/
├── README.md
├── FAQ.md                    # Frequently asked questions (architecture, adoption, compliance)
├── pyproject.toml            # pip install -e . installs dir_core (incl. dir_core.utils for samples)
├── requirements.txt          # Shared dependencies
├── src/
│   └── dir_core/             # Core DIR/ROA components (per docs spec)
│       # DFID, EventBus, DIM, Context Store, models, arbitration, PCI, etc.
│       └── utils/            # Supporting utilities for samples
│           # logging_utils, config_loader, llm_client (synthetic market: samples/31_finance_trading/mocks/)
├── samples/                  # Reference implementations (01–11 mechanics, 31+ use cases)
│   ├── README.md             # Sample catalog and run instructions
│   ├── 00_quick_start/ … 08_custom_repo_psql/ … 11_topology_c_dl_pci/
│   ├── 31_finance_trading/ … 35_crewai_roa_wrapper/ … 36_drift_optimization_discount/ … 37_drift_semantic_refund/ … 38_drift_environmental_bidding/
│   └── 88_meta_context_engineering/   # Meta-sample: System Prompt Toolkit
├── docs/                     # Architectural documentation
│   ├── 00-introduction/      # DIR intro, framework mapping
│   ├── 01-roa-manifesto/
│   ├── 02-decision-runtime/
│   ├── 03-topologies/
│   ├── 07-dir-minified/      # Machine-optimized single-file spec (DIR-minified.md)
│   └── 08-conclusion/        # Context as Code concept
└── assets/                   # Images, diagrams

Samples & Reference Implementations

Execute any sample from the repository root: python samples/<folder>/run.py

Full list and details: samples/README.md

Mechanics & Topologies (00–11)

# Sample Focus Description
00 00_quick_start Quick Start High-level overview: comma catastrophe, prompt injection
01 01_roa_agent ROA Manifesto Contract, Explain → Policy → Proposal
02 02_dfid_propagation DIR Pattern DecisionFlow ID: generation, propagation, logging
03 03_idempotency_guard DIR Pattern Idempotency: preventing duplicate side effects
04 04_context_store DIR Pattern 4 Layers of Context: Session, State, Memory, Artifacts
05 05_dim_validation DIR Pattern Decision Integrity Module: deterministic validation gate
06 06_agent_registry DIR Pattern Agent Registry: manifests and capability handshake
07 07_event_bus_swappable Infrastructure In-memory Event Bus; note on swapping for Kafka/PubSub
08 08_custom_repo_psql Infrastructure PostgreSQL StorageBundle: setup_environment with database.provider: postgres and env DB_*; minimal classic ROA + DIM run so audit and context land in the shared PostgreSQL adapter. See sample README.
09 09_topology_a_eoam Topology A Event-Oriented Agent Mesh
10 10_topology_b_sds Topology B Sovereign Decision Stream
11 11_topology_c_dl_pci Topology C Decision Ledger & Proof-Carrying Intents
88 88_meta_context_engineering Meta-Sample Travel/insurance: System Prompt Toolkit. Domain: flight delay refunds. Generates Flight Delay Refund System (Topology C).

Business Use Cases (31+)

# Sample Domain What it checks
31 31_finance_trading Finance/trading Market quotes, news, parallel agents; position spawning and execution. (Topology: EOAM)
32 32_fraud_gate Fraud detection YAML-driven fraud gate: ROA, DIM, JIT drift via verify_drift, StorageBundle audit, mock settlement idempotency. (Topology: classic + scenarios)
33 33_insurance_underwriting Insurance underwriting Risk evaluation with cryptographic Proof-Carrying Intents (PCI). (Topology: DL+PCI)
34 34_langchain_roa_wrapper FinOps LangChain ReAct → ROA. Cloud cost management. Verifies mission injection blocks PROD termination.
35 35_crewai_roa_wrapper Customer claims/refunds CrewAI Crew → ROA. E-commerce refunds (EUR). Verifies ACCEPT/ESCALATE/REJECT by category, return window, amount; NL intake.
36 36_drift_optimization_discount Retention discounts Optimization drift (reward hacking): agent offers discounts within DIM hard cap but profitability decays in aggregate. PerformanceMonitor detects rolling-average breach and suspends the agent. Shows that kernel compliance ≠ business health. (See Governance & Drift)
37 37_drift_semantic_refund Support refunds Semantic drift (emotional manipulation): refunds stay under a EUR cap so DIM accepts, but the delay policy is violated. ComplianceMonitor uses rolling REFUND_EXECUTED telemetry (delay_hours in audit details) and suspends when the semantic violation rate exceeds the threshold. (See Governance & Drift)
38 38_drift_environmental_bidding AdTech / bidding Environmental drift: market CPC escalates; bids stay under a contract cap so DIM accepts, but rolling average CPC exceeds LTV. BusinessROIMonitor joins execution_log to market_snapshots and suspends after consecutive negative ROI cycles. (See Governance & Drift)

Documentation

  • DIR Introduction - Architectural motivation and Kernel Space / User Space boundary.
  • ROA Manifesto - Responsibility-oriented agent design.
  • DIR Architecture - Runtime components and invariants.
  • Topologies - Operational modes (EOAM, SDS, DL+PCI).
  • Governance & Drift - Managing aggregate safety and business health over time.
  • Context as Code - Treating documentation as a system prompt.
  • FAQ - Answers to common engineering questions about DIR/ROA: "Day Two" failure modes, Kernel vs User Space, comparison with orchestration frameworks, JIT state verification, idempotency, compliance (e.g. EU AI Act and Proof-Carrying Intents), and incremental adoption.

Machine-optimized specification: DIR-minified.md

DIR-minified is a single-file, machine-optimized version of the framework specification. It is intended for use as context by LLMs and code-generation agents (e.g. Cursor, Claude, Devin), not as primary reading for humans.

The document is intentionally radical (dense, exhaustive) and redundant (repeats key constraints and examples in multiple places) so that a model loading it as context has a complete, self-contained picture of ROA, DIR, and the three topologies without chasing links. If you are feeding this repo to an AI to implement or extend DIR, attach DIR-minified.md as the main spec; the human-oriented docs in docs/ remain the narrative and tutorial layer.


License

This project is licensed under the Apache License, Version 2.0 – see the LICENSE file for details.


Author

Artur Huk - LinkedIn


This repository represents an evolving architectural framework derived from production constraints in high-stakes AI decision systems.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dir_core-0.1.0.tar.gz (68.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dir_core-0.1.0-py3-none-any.whl (62.6 kB view details)

Uploaded Python 3

File details

Details for the file dir_core-0.1.0.tar.gz.

File metadata

  • Download URL: dir_core-0.1.0.tar.gz
  • Upload date:
  • Size: 68.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for dir_core-0.1.0.tar.gz
Algorithm Hash digest
SHA256 2132c2547e29d3aeb288e9cd4bbf8172e0fd84b14ab86ab620405b4478d028c2
MD5 fb0944a103d11919d66bf21186a33cfc
BLAKE2b-256 e860cf381a76e8ca28876a6c90219bdf95a36ca994b808702344948448b78548

See more details on using hashes here.

File details

Details for the file dir_core-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: dir_core-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 62.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for dir_core-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b586bd879726e102be5d5b7be3267b064012547cc5a77267c601921dabeeee0c
MD5 0f72cb941ef94de25e461c3542bf1f0d
BLAKE2b-256 f863a74c1bfb2fa0462f2fe7f5758fd056b3a61ea02f929b3470110ab6026e0d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page