Agent Utilities for Pydantic AI Agents
Project description
Agent Utilities - AGI Harness
Version: 0.41.0
Table of Contents
- The Technical Novel: Narrative Journey
- Overview
- Key Features
- Intelligence Graph
- First Principles Architecture
- Concept Map
- Architecture & Orchestration
- Multi-Model Config & Secret Storage
- Installation
- Quick Start
- Creating an Agent
- Building MCP Servers
- API Documentation
- Documentation
- Contributing
- License
🌌 The Journey of Agent Utilities: The Technical Novel
[!NOTE] New to the project? Rather than reading dry configuration tables, experience
agent-utilitieslive! Read our comprehensive technical biography tracing the lifecycle of a high-stakes quantitative rebalancing mandate.
🌌 Mission & Future State: Distributed Evolution
The core vision for agent-utilities transcends being just an execution harness—it is the bedrock for Distributed Agentic Evolution and the substrate for the AI-First Autonomous Organization.
As autonomous agents leverage this ecosystem to solve complex problems, they continuously learn, adapt, and refine their own capabilities. Our future state envisions a community of independent, self-improving agents that not only run on this harness but dynamically contribute their localized evolutionary breakthroughs—new skills, optimized TeamConfigs, refined prompts, and advanced reasoning traces—back to the open-source collective.
By tying our unified Knowledge Graph, capability auto-activation, and cross-agent communication protocols together, agent-utilities becomes an interconnected hive mind where the evolution of one agent elevates the intelligence of all. The harness is not just a way to run an agent; it is the heartbeat of a distributed, self-evolving intelligence network.
Key Features
- Multi-Domain Expert System: Scale across finance, medical, and scientific domains using Temporally-Aware Epistemic Memory (TKG) and specialized MCP tools.
- Unified Intelligence Graph: A tiered pipeline combining native Rust in-memory processing (
EpistemicGraphBackend, the default L1 working store) with a durable PostgreSQL/pggraph persistence tier and OWL (Apache Jena Fuseki) semantics. (LadybugDB / Neo4j / FalkorDB Cypher backends remain available underbackends/contrib/.) - Centralized Sessions & Goals (API-First Gateway): A highly-resilient, centralized REST API gateway running on Port
8100that handles background goal loops, durable turns, and user session reply orchestration. - High-Performance Rust Compute Engine 🔬: A compiled Rust Graph Compute Engine via
epistemic-graphrunning over high-speed Unix Sockets, providing fast AST parsing, VF2 subgraph matching, and a Redpanda-backed Reactive State Ledger designed to scale seamlessly up to 100,000,000 concurrent agents. - Spec-Driven Development (SDD): High-fidelity orchestration that decomposes goals into structured specs, implementation plans, and parallel tasks.
- Emergent Architecture: Dynamic AgentCapability auto-activation, TeamConfig coalition promotion, and evolutionary skill refinement via self-models.
- Agent OS & Safety: Built-in Universal Tool Guards, structural vulnerability scanning, and transparent process lifecycle management.
- Dynamic Company Brain Ingestion: Dynamic data ingestion from external platforms like Jira, GitLab/GitHub, enterprise architecture repositories (e.g., Essential Project, Archi), and databases with automatic ontology alignment and GraphQL/REST extraction.
- Enterprise Agent Governance: Production-grade mutation governance with risk-scored change proposals, human-in-the-loop approval gates, AGENTS.md self-improvement, lint enforcement hooks, plugin bundle distribution, permission policies, staleness auditing, and unified governance workflow pipeline.
📖 View the Comprehensive Feature List & Architecture Deep Dives
🗺 Concept Map
→ Full Concept Map: docs/concept_map.md — canonical concept registry. → Single Source of Truth: docs/concepts.yaml — machine-generated registry of every concept marker in code. → Concept Index: docs/overview.md — all pillars with descriptions and code paths.
Synthesized from concept markers in the codebase into 70 canonical concepts across 12 pillars.
This count and the table below are generated from
docs/concepts.yamlbyscripts/gen_docs.py. Do not edit by hand.
| Pillar | ID Range | Count | Focus |
|---|---|---|---|
| AHE-3 Agentic Harness Engineering | AHE-3.x – AHE-3.11 | 8 | Telemetry-Driven Optimization, Agentic Harness Engineering / Evolution, Adversarial verification passed — no issues found, Optional convergence monitor for multi-loop tasks, Check for matching TeamConfig before LLM planning, Detected mathematical/quantitative topology. Escalate to reasoning model, Distills updated tool description back into Python function docstring, GitOps Git Commit Automation |
| CTX-1 Context Management | CTX-1.0 | 1 | Nested Subfolder Instructions |
| ECO-4 Ecosystem & Peripherals | ECO-4.0 – ECO-4.23 | 13 | Register PlannerGraphSkill when graph_bundle is available, Live MCP server connection for tool metadata caching, Company Infrastructure Orchestration, Infrastructure Blueprint Library, Pluggable Event Queue Backend, Team-Specific Startup Context, Deterministic Lint Enforcement Hook, Plugin Bundle Distribution System |
| KG-1 Knowledge Graph Core | KG-1.0 | 1 | Centralized KG Coordination Protocol |
| KG-2 Epistemic Knowledge Graph | KG-2.0 – KG-2.23 | 14 | Provides git-like transactional mutation for KG evolution, Self-Model proficiency + R5 ACO pheromone affinities, Entity-Claim Extraction for MAGMA Epistemic View, Lazy embedding model — defer HTTP connection to first use, Compute positional interaction encoding for structural generalization, /2.15/2.34/2.35 — Topological Analysis Engine, Generates actionable LLM artifacts from KG-ingested research, Bidirectional sync to ensure both the persistent |
| LGC-1 Logic & Governance Core | LGC-1.0 | 1 | Logic & Governance Core |
| ORCH-1 Graph Orchestration | ORCH-1.0 – ORCH-1.26 | 18 | Inject signal board observations from prior adaptive_agent_router, Current nesting depth for recursive graph orchestration, Invalidate hot cache so routing reflects new self-knowledge, Execution Budget. Defines caps for this session, Session ID of the parent graph if this state was forked, Dependency cycle detected — falling back, Autonomous Department Orchestration, Graph-Native Reactive Event Sourcing and OS Guardrails |
| ORCH-2 Orchestration Extensions | ORCH-2.0 | 1 | Orchestration Engine |
| ORCH-5 Orchestration Runtime | ORCH-5.0 | 1 | / TUI-20 |
| OS-5 Agent OS Infrastructure | OS-5.0 – OS-5.10 | 10 | FileWatcher — watchdog-triggered graph execution, refactoring. This module re-exports it to avoid breaking, MaintenanceCron — scheduled autonomous maintenance, Reactive Multi-Axis Budget Guardrails, WASM Micro-Agent Sandbox & Runner, Distributed Coordinator with Semantic Sharding, Deterministic Replay Engine, Epistemic dynamic priority & quota scaling based on KG Centrality |
| SAFE-1 Safety & Guardrails | SAFE-1.0 | 1 | Tool-Agnostic File Safety Hooks |
| UTIL-1 Shared Utilities | UTIL-1.0 | 1 | Data Type Conversion |
🏗️ Architecture & Pillar Reference
The detailed architectural diagrams and deep-dive documentation for agent-utilities have been moved to their respective Pillar documentation pages in /docs.
- 1. Graph Orchestration & Planning
- Contains: First Principles Architecture, SDD Lifecycle, Execution Flow (Dynamic Multi-Layer Parallelism).
- 2. Epistemic Knowledge Graph
- Contains: Graph-OS Native Ingestion Pipeline, MAGMA Reasoning Views, Persistent Task Tracking.
- 3. Agentic Harness Engineering
- Contains: Self-Models, Evolution, Evaluation.
- 4. Ecosystem Peripherals
- Contains: graph-os MCP Tools, Server Endpoints, MCP Loading & Registry Architecture.
- 5. Agent OS Infrastructure
- Contains: Human-in-the-Loop Tool Approval, Process Lifecycle, Auth/Security.
- 6. GeniusBot Desktop Cockpit
- Contains: Premium Systems Cockpit, swappable plugins tab matrix, sandboxed terminal widget, visual finance trading dashboard.
- C4 Architecture Diagrams
- Contains: Ecosystem Dependency Graph, C4 Container Diagram, Cross-Pillar Data Flows.
- Memory Architecture
- Contains: Multi-Timescale Memory, Memento Context Management, Observational Memory Bridge.
External Agent Discovery (mcp_config.json)
Register the Knowledge Graph in your IDE's mcp_config.json using the standard CLI pattern:
{
"mcpServers": {
"graph-os": {
"command": "uv",
"args": ["run", "graph-os"],
"env": {
"AGENT_ID": "local-developer",
"WORKSPACE_PATH": "${workspaceFolder}"
}
}
}
}
Note: Model selection, routing logic, and system configurations are centralized in your XDG
~/.config/agent-utilities/config.json. Only local workspace paths, local agent IDs, or environment overrides remain in the environment.
Multi-Model Config & Secret Storage
All LLM providers, model registries, safety guardrails, and scheduler policies are managed centrally via the XDG-compliant configuration file at ~/.config/agent-utilities/config.json.
Every field in the config.json has a 1-to-1 environment variable override. The environment variables (detailed in .env.example) act as secondary overrides for all settings.
Centralized config.json Template
Here is a fully-populated and production-ready config.json file representing the absolute source of truth for the agent-utilities Pydantic AgentConfig schema:
{
"default_agent_name": "Agent",
"agent_description": "AI Agent",
"agent_system_prompt": null,
"host": "0.0.0.0",
"port": 9000,
"debug": false,
"enable_web_ui": false,
"enable_terminal_ui": false,
"enable_web_logs": true,
"enable_acp": false,
"acp_port": 8001,
"acp_session_root": ".acp-sessions",
"mcp_config": null,
"max_upload_size": 10485760,
"agent_api_key": null,
"enable_api_auth": false,
"auth_jwt_jwks_uri": null,
"auth_jwt_issuer": null,
"auth_jwt_audience": null,
"allowed_origins": null,
"allowed_hosts": null,
"tool_guard_mode": "strict",
"sensitive_tool_patterns": [
".*delete.*",
".*remove.*",
".*rm_.*",
".*prune.*",
".*kill.*",
".*exec.*",
".*run_command.*"
],
"secrets_backend": "inmemory",
"secrets_sqlite_path": null,
"secrets_vault_url": null,
"secrets_vault_mount": "secret",
"routing_strategy": "hybrid",
"graph_persistence_type": "file",
"graph_persistence_path": "~/.local/share/agent-utilities/graph_state",
"enable_llm_validation": false,
"graph_router_timeout": 300.0,
"graph_verifier_timeout": 300.0,
"graph_direct_execution": true,
"min_confidence": 0.4,
"validation_mode": false,
"approval_timeout": 0.0,
"enable_kg_embeddings": true,
"kg_backups": 3,
"knowledge_graph_sync_background": true,
"enable_otel": true,
"otel_exporter_otlp_endpoint": "http://langfuse.arpa/api/public/otel",
"otel_exporter_otlp_headers": null,
"otel_exporter_otlp_public_key": "lf_pk_...",
"otel_exporter_otlp_secret_key": "lf_sk_...",
"otel_exporter_otlp_protocol": "http/protobuf",
"langfuse_host": "http://langfuse.arpa",
"langfuse_public_key": "lf_pk_...",
"langfuse_secret_key": "lf_sk_...",
"langfuse_dataset_capture_threshold": 0.0,
"a2a_broker": "in-memory",
"a2a_broker_url": null,
"a2a_storage": "in-memory",
"a2a_storage_url": null,
"a2a_config": null,
"a2a_refresh_interval": 300,
"max_tokens": 16384,
"temperature": 0.7,
"top_p": 1.0,
"timeout": 32400.0,
"tool_timeout": 32400.0,
"parallel_tool_calls": true,
"seed": null,
"presence_penalty": 0.0,
"frequency_penalty": 0.0,
"logit_bias": null,
"stop_sequences": null,
"extra_headers": null,
"extra_body": null,
"cognitive_scheduler_enabled": true,
"max_concurrent_agents": 5,
"agent_token_quota": 100000,
"preemption_threshold_pct": 0.85,
"agent_policies_path": null,
"permissions_signing_key": null,
"specialist_registry_path": null,
"homeostatic_downgrade_enabled": true,
"adversarial_verification": false,
"maintenance_token_budget": 0,
"maintenance_priority": "LOW",
"watchdog_patterns": [
"pyproject.toml",
"mcp_config.json",
"requirements*.txt"
],
"custom_skills_directory": null,
"skill_types": null,
"chat_models": [
{
"id": "qwen/qwen3.5-9b",
"provider": "openai",
"base_url": "http://vllm.arpa/v1",
"supports_json": false,
"vision": true,
"reasoning": true,
"tools_enabled": true,
"parallel_instances": 3,
"context_window": 256000,
"intelligence_level": "normal",
"can_route": true,
"can_kg": true
}
],
"embedding_models": [
{
"id": "text-embedding-nomic-embed-text-v2-moe",
"provider": "openai",
"base_url": "http://vllm-embed.arpa/v1",
"parallel_instances": 4,
"chunk_size": 768
}
],
"workspace_path": "/home/apps/workspace",
"agent_utilities_config_dir": "~/.config/agent-utilities"
}
Note: JSON does not support comments. The JSON key names correspond exactly to their uppercase environment variable overrides (e.g.
default_agent_name→DEFAULT_AGENT_NAME).
For comprehensive definitions and capabilities of specific variables, see the Configuration Guide and Local Secret Storage Guide.
Installation
Install via pip:
pip install agent-utilities
To install with all optional dependencies (including MCP servers, UI, and external graph backends):
pip install "agent-utilities[all]"
For more details, see the Installation Guide.
Quick Start
You can quickly launch the graph-os MCP server:
uv run graph-os
Or start the standalone agent from your code:
from agent_utilities.core.config import config
from agent_utilities.agent.factory import create_agent
# Configuration is automatically loaded from config.json
agent = create_agent(name="MyAgent")
response = agent.run_sync("Analyze the knowledge graph for recent updates.")
print(response.data)
For a comprehensive walkthrough, see the Quick Start Guide.
📚 Guides & Tutorials
For detailed tutorials, installation options, and configuration guides, refer to the docs/guides/ directory:
- Quick Start
- Installation Guide
- Bare-metal, pip packages, Docker
- Configuration & Environment Variables
- Multi-tiered LLM setup, Models Config
- Local Secret Storage (Vault & SQLite)
- Creating an Agent
- Building MCP Servers & API Wrappers
- API Documentation & Swagger
Documentation
Comprehensive system documentation is available in the docs/ directory:
New to the project? Start with the Concept Overview Map to get oriented.
Core References
| Guide | Description |
|---|---|
| Overview Map | The Concept Galaxy — canonical concepts (see the Concept Map above for the authoritative count), query lifecycle, concept index |
| Concept Map | Canonical concept registry (single source of truth) |
| C4 Architecture | System context, container, and component diagrams |
| Evolution Pipeline | Assimilation governance, wire-or-discard heuristic, 4-phase pipeline |
Pillar Deep-Dives
| Pillar | Guide |
|---|---|
| Graph Orchestration | docs/pillars/1_graph_orchestration.md |
| Epistemic Knowledge Graph | docs/pillars/2_epistemic_knowledge_graph.md |
| Agentic Harness Engineering | docs/pillars/3_agentic_harness_engineering.md |
| Ecosystem & Peripherals | docs/pillars/4_ecosystem_peripherals.md |
| Agent OS Infrastructure | docs/pillars/5_agent_os_infrastructure.md |
Contributing
Contributions are welcome. Please follow these guidelines:
- Fork the repository and create a feature branch.
- Write tests for new functionality — all tests must include assertions.
- Follow existing patterns — use the established Pydantic models, structured prompts, and concept markers.
- Run the test suite before submitting:
uv run pytest tests/ -q.Note: All tests are strictly bounded by a 60-second timeout via
pytest-timeout. Any test that sleeps or hangs indefinitely will fail automatically. Ensure you don't usetime.sleepwithout bounds. - Update documentation in
docs/if your changes affect public APIs.
See AGENTS.md for project-specific conventions and architecture rules.
License
This project is licensed under the terms specified in the LICENSE file.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agent_utilities-0.41.0.tar.gz.
File metadata
- Download URL: agent_utilities-0.41.0.tar.gz
- Upload date:
- Size: 2.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
34d2c1c5a127c452357dca7edf0adffa8e2eccdac2b09d225d95d3e41d699733
|
|
| MD5 |
fbec82e894ca8d91367050e8a050335a
|
|
| BLAKE2b-256 |
90f895a26172d940714dacfbe046e17a14843388b7940402e1551a3f0b84a65e
|
File details
Details for the file agent_utilities-0.41.0-py3-none-any.whl.
File metadata
- Download URL: agent_utilities-0.41.0-py3-none-any.whl
- Upload date:
- Size: 2.9 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a95a9734bfcca2d8949b7ec0222a84a94626b802aa670c5a2a61580aa5931b8c
|
|
| MD5 |
7a3369603718c2d2b5325195f979f216
|
|
| BLAKE2b-256 |
856d9439192cec08b80c8e13da0f6351cdba751a4ccd233e17ac7a5fb2d3939a
|