Agent Utilities for Pydantic AI Agents
Project description
Agent Utilities - AGI Harness
Version: 0.46.0
Table of Contents
- The Technical Novel: Narrative Journey
- Overview
- Key Features
- Intelligence Graph
- First Principles Architecture
- Concept Map
- Architecture & Orchestration
- Multi-Model Config & Secret Storage
- Installation
- Quick Start
- Creating an Agent
- Building MCP Servers
- API Documentation
- Documentation
- Contributing
- License
🌌 The Journey of Agent Utilities: The Technical Novel
[!NOTE] New to the project? Rather than reading dry configuration tables, experience
agent-utilitieslive! Read our comprehensive technical biography tracing the lifecycle of a high-stakes quantitative rebalancing mandate.
🌌 Mission & Future State: Distributed Evolution
The core vision for agent-utilities transcends being just an execution harness—it is the bedrock for Distributed Agentic Evolution and the substrate for the AI-First Autonomous Organization.
As autonomous agents leverage this ecosystem to solve complex problems, they continuously learn, adapt, and refine their own capabilities. Our future state envisions a community of independent, self-improving agents that not only run on this harness but dynamically contribute their localized evolutionary breakthroughs—new skills, optimized TeamConfigs, refined prompts, and advanced reasoning traces—back to the open-source collective.
By tying our unified Knowledge Graph, capability auto-activation, and cross-agent communication protocols together, agent-utilities becomes an interconnected hive mind where the evolution of one agent elevates the intelligence of all. The harness is not just a way to run an agent; it is the heartbeat of a distributed, self-evolving intelligence network.
Key Features
- Multi-Domain Expert System: Scale across finance, medical, and scientific domains using Temporally-Aware Epistemic Memory (TKG) and specialized MCP tools.
- Unified Intelligence Graph: A tiered pipeline combining native Rust in-memory processing (
EpistemicGraphBackend, the default L1 working store) with a durable PostgreSQL/pggraph persistence tier and OWL (Apache Jena Fuseki) semantics. (LadybugDB / Neo4j / FalkorDB Cypher backends remain available underbackends/contrib/.) - Centralized Sessions & Goals (API-First Gateway): A highly-resilient, centralized REST API gateway running on Port
8100that handles background goal loops, durable turns, and user session reply orchestration. - High-Performance Rust Compute Engine 🔬: A compiled Rust Graph Compute Engine via
epistemic-graphrunning over high-speed Unix Sockets, providing fast AST parsing, VF2 subgraph matching, and a Redpanda-backed Reactive State Ledger designed to scale seamlessly up to 100,000,000 concurrent agents. - Spec-Driven Development (SDD): High-fidelity orchestration that decomposes goals into structured specs, implementation plans, and parallel tasks.
- Emergent Architecture: Dynamic AgentCapability auto-activation, TeamConfig coalition promotion, and evolutionary skill refinement via self-models.
- Agent OS & Safety: Built-in Universal Tool Guards, structural vulnerability scanning, and transparent process lifecycle management.
- Dynamic Company Brain Ingestion: Dynamic data ingestion from external platforms like Jira, GitLab/GitHub, enterprise architecture repositories (e.g., Essential Project, Archi), and databases with automatic ontology alignment and GraphQL/REST extraction.
- Company Brain Runtime (Trust, Permissions, Feedback): The 6-layer "Single Company Brain" wired end-to-end behind
KG_BRAIN_ENFORCE— source-authority conflict resolution with trust decay and field-level survivorship (durable per-attribute provenance / MDM golden record), data-level ACLs + tenant scoping + read audit on the retrieval path, a human-correction → durable rule → eval feedback loop, and token-budgeted, task-scoped retrieval. - Vendor-Neutral Enterprise Ontology: One ArchiMate-aligned upper ontology + crosswalk so ServiceNow↔ERPNext, Camunda↔Archi, etc. are interchangeable — a single query resolves all sources regardless of which vendor tool produced the data.
- Enterprise Agent Governance: Production-grade mutation governance with risk-scored change proposals, human-in-the-loop approval gates, AGENTS.md self-improvement, lint enforcement hooks, plugin bundle distribution, permission policies, staleness auditing, and unified governance workflow pipeline.
- Global Workspace Attention & Social-System Swarm: After every multi-agent wave the parallel engine scores, selects, and broadcasts winning specialist proposals (read back as runtime standing, with an engine-mismatch telemetry guard) and snapshots Multi-Agent Social System health (archetype heterogeneity, co-evolution, Wasserstein drift) into
ExecutionResult.telemetry. - In-House Training Substrate: Fine-tune the framework's own open-weight models end-to-end — a deterministic reward/data engine, torch/PEFT SFT/DPO/GRPO trainers (
data-science-mcp[training]), a pure-Rust loss/optimizer performance path (epistemic-graph), checkpoint→reliability-suite eval hooks, and a model-registry role deploy seam that goes live with no hot-path edit. Build-now / run-later on the GB10 (first run: OpenSeeker SFT).
📖 View the Comprehensive Feature List & Architecture Deep Dives
🗺 Concept Map
→ Full Concept Map: docs/concept_map.md — canonical concept registry. → Single Source of Truth: docs/concepts.yaml — machine-generated registry of every concept marker in code. → Concept Index: docs/overview.md — all pillars with descriptions and code paths.
Synthesized from concept markers in the codebase into 106 canonical concepts across 12 pillars.
This count and the table below are generated from
docs/concepts.yamlbyscripts/gen_docs.py. Do not edit by hand.
| Pillar | ID Range | Count | Focus |
|---|---|---|---|
| AHE-3 Agentic Harness Engineering | AHE-3.x – AHE-3.14 | 11 | Telemetry-Driven Optimization, Agentic Harness Engineering / Evolution, Adversarial verification passed — no issues found, Optional convergence monitor for multi-loop tasks, Check for matching TeamConfig before LLM planning, Detected mathematical/quantitative topology. Escalate to reasoning model, Distills updated tool description back into Python function docstring, GitOps Git Commit Automation |
| CTX-1 Context Management | CTX-1.0 | 1 | Nested Subfolder Instructions |
| ECO-4 Ecosystem & Peripherals | ECO-4.0 – ECO-4.24 | 14 | Register PlannerGraphSkill when graph_bundle is available, Live MCP server connection for tool metadata caching, Company Infrastructure Orchestration, Infrastructure Blueprint Library, Pluggable Event Queue Backend, Team-Specific Startup Context, Deterministic Lint Enforcement Hook, Plugin Bundle Distribution System |
| KG-1 Knowledge Graph Core | KG-1.0 | 1 | Centralized KG Coordination Protocol |
| KG-2 Epistemic Knowledge Graph | KG-2.0 – KG-2.32 | 33 | Provides git-like transactional mutation for KG evolution, Self-Model proficiency + R5 ACO pheromone affinities, Entity-Claim Extraction for MAGMA Epistemic View, Lazy embedding model — defer HTTP connection to first use, Compute positional interaction encoding for structural generalization, /2.15/2.34/2.35 — Topological Analysis Engine, Generates actionable LLM artifacts from KG-ingested research, / KG-2.10 — research assimilation + orchestration synthesis |
| LGC-1 Logic & Governance Core | LGC-1.0 | 1 | Logic & Governance Core |
| ORCH-1 Graph Orchestration | ORCH-1.0 – ORCH-1.36 | 28 | Inject signal board observations from prior adaptive_agent_router, Current nesting depth for recursive graph orchestration, Invalidate hot cache so routing reflects new self-knowledge, Visibility allow-list of upstream step ids whose results, Session ID of the parent graph if this state was forked, Dependency cycle detected — falling back, Autonomous Department Orchestration, Graph-Native Reactive Event Sourcing and OS Guardrails |
| ORCH-2 Orchestration Extensions | ORCH-2.0 | 1 | Orchestration Engine |
| ORCH-5 Orchestration Runtime | ORCH-5.0 | 1 | / TUI-20 |
| OS-5 Agent OS Infrastructure | OS-5.0 – OS-5.13 | 13 | FileWatcher — watchdog-triggered graph execution, refactoring. This module re-exports it to avoid breaking, MaintenanceCron — scheduled autonomous maintenance, Reactive Multi-Axis Budget Guardrails, WASM Micro-Agent Sandbox & Runner, Distributed Coordinator with Semantic Sharding, Deterministic Replay Engine, Epistemic dynamic priority & quota scaling based on KG Centrality |
| SAFE-1 Safety & Guardrails | SAFE-1.0 | 1 | Tool-Agnostic File Safety Hooks |
| UTIL-1 Shared Utilities | UTIL-1.0 | 1 | Data Type Conversion |
🏗️ Architecture & Pillar Reference
The detailed architectural diagrams and deep-dive documentation for agent-utilities have been moved to their respective Pillar documentation pages in /docs.
- 1. Graph Orchestration & Planning
- Contains: First Principles Architecture, SDD Lifecycle, Execution Flow (Dynamic Multi-Layer Parallelism).
- 2. Epistemic Knowledge Graph
- Contains: Graph-OS Native Ingestion Pipeline, MAGMA Reasoning Views, Persistent Task Tracking.
- 3. Agentic Harness Engineering
- Contains: Self-Models, Evolution, Evaluation.
- 4. Ecosystem Peripherals
- Contains: graph-os MCP Tools, Server Endpoints, MCP Loading & Registry Architecture.
- 5. Agent OS Infrastructure
- Contains: Human-in-the-Loop Tool Approval, Process Lifecycle, Auth/Security.
- 6. GeniusBot Desktop Cockpit
- Contains: Premium Systems Cockpit, swappable plugins tab matrix, sandboxed terminal widget, visual finance trading dashboard.
- C4 Architecture Diagrams
- Contains: Ecosystem Dependency Graph, C4 Container Diagram, Cross-Pillar Data Flows.
- Memory Architecture
- Contains: Multi-Timescale Memory, Memento Context Management, Observational Memory Bridge.
- Company Brain Runtime
- Contains: the 6-layer model wired end-to-end — trust/conflict resolution & field-level survivorship, data permissions/tenancy/audit, feedback→rule→eval, retrieval budget, streams,
KG_BRAIN_ENFORCE.
- Contains: the 6-layer model wired end-to-end — trust/conflict resolution & field-level survivorship, data permissions/tenancy/audit, feedback→rule→eval, retrieval budget, streams,
- Vendor-Neutral Enterprise Ontology
- Contains: the canonical ArchiMate crosswalk, vendor adapters, code→capability realization, and virtual REST federation.
External Agent Discovery (mcp_config.json)
Register the Knowledge Graph in your IDE's mcp_config.json using the standard CLI pattern:
{
"mcpServers": {
"graph-os": {
"command": "uv",
"args": ["run", "graph-os"],
"env": {
"AGENT_ID": "local-developer",
"WORKSPACE_PATH": "${workspaceFolder}"
}
}
}
}
Note: Model selection, routing logic, and system configurations are centralized in your XDG
~/.config/agent-utilities/config.json. Only local workspace paths, local agent IDs, or environment overrides remain in the environment.
Multi-Model Config & Secret Storage
All LLM providers, model registries, safety guardrails, and scheduler policies are managed centrally via the XDG-compliant configuration file at ~/.config/agent-utilities/config.json.
Every field in the config.json has a 1-to-1 environment variable override. The environment variables (detailed in .env.example) act as secondary overrides for all settings.
Centralized config.json Template
Here is a fully-populated and production-ready config.json file representing the absolute source of truth for the agent-utilities Pydantic AgentConfig schema:
{
"default_agent_name": "Agent",
"agent_description": "AI Agent",
"agent_system_prompt": null,
"host": "0.0.0.0",
"port": 9000,
"debug": false,
"enable_web_ui": false,
"enable_terminal_ui": false,
"enable_web_logs": true,
"enable_acp": false,
"acp_port": 8001,
"acp_session_root": ".acp-sessions",
"mcp_config": null,
"max_upload_size": 10485760,
"agent_api_key": null,
"enable_api_auth": false,
"auth_jwt_jwks_uri": null,
"auth_jwt_issuer": null,
"auth_jwt_audience": null,
"allowed_origins": null,
"allowed_hosts": null,
"tool_guard_mode": "strict",
"sensitive_tool_patterns": [
".*delete.*",
".*remove.*",
".*rm_.*",
".*prune.*",
".*kill.*",
".*exec.*",
".*run_command.*"
],
"secrets_backend": "inmemory",
"secrets_sqlite_path": null,
"secrets_vault_url": null,
"secrets_vault_mount": "secret",
"routing_strategy": "hybrid",
"graph_persistence_type": "file",
"graph_persistence_path": "~/.local/share/agent-utilities/graph_state",
"enable_llm_validation": false,
"graph_router_timeout": 300.0,
"graph_verifier_timeout": 300.0,
"graph_direct_execution": true,
"min_confidence": 0.4,
"validation_mode": false,
"approval_timeout": 0.0,
"enable_kg_embeddings": true,
"kg_backups": 3,
"knowledge_graph_sync_background": true,
"enable_otel": true,
"otel_exporter_otlp_endpoint": "http://langfuse.arpa/api/public/otel",
"otel_exporter_otlp_headers": null,
"otel_exporter_otlp_public_key": "lf_pk_...",
"otel_exporter_otlp_secret_key": "lf_sk_...",
"otel_exporter_otlp_protocol": "http/protobuf",
"langfuse_host": "http://langfuse.arpa",
"langfuse_public_key": "lf_pk_...",
"langfuse_secret_key": "lf_sk_...",
"langfuse_dataset_capture_threshold": 0.0,
"a2a_broker": "in-memory",
"a2a_broker_url": null,
"a2a_storage": "in-memory",
"a2a_storage_url": null,
"a2a_config": null,
"a2a_refresh_interval": 300,
"max_tokens": 16384,
"temperature": 0.7,
"top_p": 1.0,
"timeout": 32400.0,
"tool_timeout": 32400.0,
"parallel_tool_calls": true,
"seed": null,
"presence_penalty": 0.0,
"frequency_penalty": 0.0,
"logit_bias": null,
"stop_sequences": null,
"extra_headers": null,
"extra_body": null,
"cognitive_scheduler_enabled": true,
"max_concurrent_agents": 5,
"agent_token_quota": 100000,
"preemption_threshold_pct": 0.85,
"agent_policies_path": null,
"permissions_signing_key": null,
"specialist_registry_path": null,
"homeostatic_downgrade_enabled": true,
"adversarial_verification": false,
"maintenance_token_budget": 0,
"maintenance_priority": "LOW",
"watchdog_patterns": [
"pyproject.toml",
"mcp_config.json",
"requirements*.txt"
],
"custom_skills_directory": null,
"skill_types": null,
"chat_models": [
{
"id": "qwen/qwen3.5-9b",
"provider": "openai",
"base_url": "http://vllm.arpa/v1",
"supports_json": false,
"vision": true,
"reasoning": true,
"tools_enabled": true,
"parallel_instances": 3,
"context_window": 256000,
"intelligence_level": "normal",
"can_route": true,
"can_kg": true
}
],
"embedding_models": [
{
"id": "text-embedding-nomic-embed-text-v2-moe",
"provider": "openai",
"base_url": "http://vllm-embed.arpa/v1",
"parallel_instances": 4,
"chunk_size": 768
}
],
"workspace_path": "/home/apps/workspace",
"agent_utilities_config_dir": "~/.config/agent-utilities"
}
Note: JSON does not support comments. The JSON key names correspond exactly to their uppercase environment variable overrides (e.g.
default_agent_name→DEFAULT_AGENT_NAME).
For comprehensive definitions and capabilities of specific variables, see the Configuration Guide and Local Secret Storage Guide.
Installation
Install via pip:
pip install agent-utilities
To install with all optional dependencies (including MCP servers, UI, and external graph backends):
pip install "agent-utilities[all]"
For more details, see the Installation Guide.
Zero-infrastructure by default
Out of the box, agent-utilities runs as a single self-contained binary with no
external system dependencies. The default knowledge-graph backend is tiered —
the always-included Rust-native epistemic_graph (L1) in front of an embedded
LadybugDB (L2). No Postgres/Neo4j server is required to get started.
To use a durable PostgreSQL tier in production, just set a DSN — the tiered L2 auto-switches to Postgres (your existing configuration keeps working unchanged):
export GRAPH_BACKEND=tiered
export GRAPH_DB_URI=postgresql://agent:agent@localhost:5432/agent_kg
Deployment
Full deployment instructions — running graph-os and mcp-multiplexer as
standard stdio or streamable-http servers, the centralized REST API
gateway, Docker composes, and production hardening — are in the
Deployment Guide.
Quick Start
You can quickly launch the graph-os MCP server (a thin FastMCP wrapper):
uv run graph-os # stdio (default)
uv run graph-os --transport streamable-http --host 0.0.0.0 --port 8004
Or start the standalone agent from your code:
from agent_utilities.core.config import config
from agent_utilities.agent.factory import create_agent
# Configuration is automatically loaded from config.json
agent = create_agent(name="MyAgent")
response = agent.run_sync("Analyze the knowledge graph for recent updates.")
print(response.data)
For a comprehensive walkthrough, see the Quick Start Guide.
📚 Guides & Tutorials
For detailed tutorials, installation options, and configuration guides, refer to the docs/guides/ directory:
- Quick Start
- Installation Guide
- Bare-metal, pip packages, Docker
- Deployment Guide
- Zero-infra default, graph-os & multiplexer (stdio/streamable-http), API gateway, production hardening
- Configuration & Environment Variables
- Multi-tiered LLM setup, Models Config
- Local Secret Storage (Vault & SQLite)
- Creating an Agent
- Building MCP Servers & API Wrappers
- API Documentation & Swagger
Documentation
Comprehensive system documentation is available in the docs/ directory:
New to the project? Start with the Concept Overview Map to get oriented.
Core References
| Guide | Description |
|---|---|
| Overview Map | The Concept Galaxy — canonical concepts (see the Concept Map above for the authoritative count), query lifecycle, concept index |
| Concept Map | Canonical concept registry (single source of truth) |
| C4 Architecture | System context, container, and component diagrams |
| Company Brain Runtime | The 6-layer brain wired end-to-end: trust/survivorship, permissions, feedback→rule→eval, retrieval budget (KG_BRAIN_ENFORCE) |
| Vendor-Neutral Enterprise Ontology | ArchiMate crosswalk + vendor adapters making ServiceNow↔ERPNext↔Camunda interchangeable |
| Global Workspace Attention | GWT loop: score→select→broadcast specialist proposals + get_attention_score read-back + engine-mismatch telemetry |
| Multi-Agent Social System | Swarm as S=(f,g,G): archetypes, local observability, co-evolution, P1–P4 swarm health |
| In-House Training Substrate | Cross-repo: reward/data engine → torch/PEFT trainers → Rust kernels → deploy seam (GB10 fine-tunes) |
| Graph-Native Assimilation Engine | Self-evolution loop: ingest papers/OSS/repos/docs → dedup → gap → synergy → rank → grounded plans; idempotent, runs via graph_orchestrate(action="assimilate") + golden-loop daemon |
| Evolution Pipeline | Assimilation governance, wire-or-discard heuristic, 4-phase pipeline |
Pillar Deep-Dives
| Pillar | Guide |
|---|---|
| Graph Orchestration | docs/pillars/1_graph_orchestration.md |
| Epistemic Knowledge Graph | docs/pillars/2_epistemic_knowledge_graph.md |
| Agentic Harness Engineering | docs/pillars/3_agentic_harness_engineering.md |
| Ecosystem & Peripherals | docs/pillars/4_ecosystem_peripherals.md |
| Agent OS Infrastructure | docs/pillars/5_agent_os_infrastructure.md |
Contributing
Contributions are welcome. Please follow these guidelines:
- Fork the repository and create a feature branch.
- Write tests for new functionality — all tests must include assertions.
- Follow existing patterns — use the established Pydantic models, structured prompts, and concept markers.
- Run the test suite before submitting:
uv run pytest tests/ -q.Note: All tests are strictly bounded by a 60-second timeout via
pytest-timeout. Any test that sleeps or hangs indefinitely will fail automatically. Ensure you don't usetime.sleepwithout bounds. - Update documentation in
docs/if your changes affect public APIs.
See AGENTS.md for project-specific conventions and architecture rules.
License
This project is licensed under the terms specified in the LICENSE file.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agent_utilities-0.46.0.tar.gz.
File metadata
- Download URL: agent_utilities-0.46.0.tar.gz
- Upload date:
- Size: 2.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
412b392933cc46323ad48769ab26034149b8afd5020f862c40ef3c539f6dcfb1
|
|
| MD5 |
721bd5bf09cd059f59751a3251d7c24c
|
|
| BLAKE2b-256 |
805476df1db1e5946b5f10b3d8343b2de5ffa5998c04ad7044400ae9dc0aca56
|
File details
Details for the file agent_utilities-0.46.0-py3-none-any.whl.
File metadata
- Download URL: agent_utilities-0.46.0-py3-none-any.whl
- Upload date:
- Size: 3.4 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d4f5a91b1e2f8e70c5d94b3f8f11d1e12b5db3dc530013989b40b249d04aab3d
|
|
| MD5 |
6bc4fd95ddc0117fe083cfb02d691a4e
|
|
| BLAKE2b-256 |
3a8b8c81222532341f4cce14d78ccdc3ba8ab65477b24a6c865ca66798d016ca
|