Skip to main content

Universal configurable AI agent framework — production-grade, YAML-driven, open-source ready.

Project description

koboi-agent

Configurable AI agent framework. YAML-driven config, async Python 3.10+, multi-provider LLM (OpenAI, Anthropic, Cloudflare).

Features

  • Multi-provider LLM: OpenAI, Anthropic, Cloudflare Workers AI
  • YAML-driven config with ${ENV_VAR} interpolation
  • Built-in tools: calculator, filesystem, shell, web search, memory, git, subagent, task
  • Hook lifecycle: 15 event types for logging, guardrails, telemetry
  • RAG pipeline: chunking (fixed/sentence/paragraph/semantic), retrieval (keyword/semantic/hybrid), augmentation
  • Guardrails: input/output validation, rate limiting, approval workflows, policy engine
  • Multi-agent orchestration: keyword/LLM/hybrid routing, sequential/parallel execution
  • Context management: truncation, smart truncation, key facts, sliding window
  • MCP client (stdio + HTTP) and server support
  • Evaluation: BFCL, GAIA, SWE-bench, RAGAS, DeepEval scorers
  • Terminal UI (Textual): chat, command palette, diff view, session management

Quickstart

Install

pip install -e ".[dev,tui]"

Set your API key

cp .env.example .env
# Edit .env and set OPENAI_API_KEY

Run the CLI

koboi chat configs/simple_chat.yaml

Run programmatically

import asyncio
from koboi import KoboiAgent

async def main():
    async with KoboiAgent.from_config("configs/simple_chat.yaml") as agent:
        result = await agent.run("What is 2 + 2?")
        print(result.content)

asyncio.run(main())

Configuration

Agents are configured via YAML. Key sections:

agent:
  name: "my-agent"
  system_prompt: "You are helpful."
  max_iterations: 10
  mode: "chat"  # chat | plan | act | auto

llm:
  provider: "openai"        # openai | anthropic | cloudflare
  model: "gpt-4o-mini"
  api_key: "${OPENAI_API_KEY}"
  base_url: "${OPENAI_BASE_URL:}"

tools:
  builtin: [calculator, web_search, memory_store, memory_recall]
  custom:
    - module: "my_tools"

context:
  strategy: "sliding_window"  # noop | truncation | smart_truncation | key_facts | sliding_window
  max_context_tokens: 8000

rag:
  enabled: true
  chunker: "paragraph"       # fixed | sentence | paragraph
  retriever: "keyword"       # keyword | semantic
  top_k: 3
  documents:
    - path: "./data/sample/product_catalog.md"

guardrails:
  input:
    max_length: 10000
  rate_limit:
    max_calls_per_minute: 20

harness:
  doom_loop:
    consecutive_identical_threshold: 3
  telemetry: true
  carryover: true

See configs/ for full examples and .claude/skills/yaml-config.md for the complete schema.

Testing

pytest                        # all tests
pytest tests/test_config.py   # single file
pytest -k "hook"              # by keyword
pytest --cov=koboi            # with coverage

Examples

examples/ contains 28 numbered scripts covering every feature:

Range Features
01-04 Basic chat and tool use
05-08 Context management and RAG
09-10 MCP client/server
11-14 Policy, hooks, skills, custom tools
15-16 Multi-agent orchestration
17 Anthropic provider
18-20 Harness (telemetry, doom loop, carryover)
21-24 Evaluation, production setup, SWE-bench, config-driven orchestration
25-28 Subagent delegation, task management, benchmarks, custom RAG

Run any example:

python examples/01_simple_chat.py              # automatic mode
python examples/01_simple_chat.py -m interactive  # interactive mode

Architecture

For a detailed architecture overview (agent loop lifecycle, hook system, tool pipeline, extension points), see docs/architecture.md.

KoboiAgent (facade.py) is the single entry point. It assembles:

  • AgentCore (loop.py) -- async agent loop
  • RetryClient (client.py) -- LLM HTTP transport with retry
  • ToolRegistry (tools/) -- tool registration and execution
  • HookChain (hooks/) -- lifecycle event dispatch (15 events)
  • ContextManager (context/) -- context window strategies
  • AugmentationStrategy (rag/) -- RAG pipeline
  • Guardrails (guardrails/) -- input/output validation
  • PolicyEngine (harness/) -- rule-based tool filtering
  • SkillRegistry (skills/) -- skill discovery
  • ModeManager (modes.py) -- chat/plan/act/auto modes
  • TrustDatabase (trust.py) -- graduated permissions
  • Orchestrator (orchestration/) -- multi-agent coordination
  • SubAgentManager (subagent.py) -- parallel sub-agent delegation
  • MCP clients (mcp/) -- external tool servers

All subsystems are configured from a single YAML file via Config (config.py).

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

koboi_agent-0.1.1.tar.gz (350.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

koboi_agent-0.1.1-py3-none-any.whl (270.3 kB view details)

Uploaded Python 3

File details

Details for the file koboi_agent-0.1.1.tar.gz.

File metadata

  • Download URL: koboi_agent-0.1.1.tar.gz
  • Upload date:
  • Size: 350.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for koboi_agent-0.1.1.tar.gz
Algorithm Hash digest
SHA256 b66fe906063801e7819cff29b07cff53d96a93da5dad790b3b1017e19ec06357
MD5 0ce05251d2bf0e24717334ef76047598
BLAKE2b-256 d7d36a6af634bdf51d94fd73e50f7a43a8109c07bcdcaa3dd9fbc78cb95f3ae7

See more details on using hashes here.

File details

Details for the file koboi_agent-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: koboi_agent-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 270.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for koboi_agent-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ac8ec9f6a9fc708e9f2a346d1f88ae594721617c3434cf705eeca760ee5b4107
MD5 84575d56e1640cd14f4fe77b5f773015
BLAKE2b-256 196fa72a8e08b72a32a46816ecc9c431ea895f7cbc6656ca8298299048a45abf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page