Local-first repair loop for debugging and improving AI agents.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

kayba

These details have not been verified by PyPI

Project links

Homepage

Project description

Kyoko

Kyoko is a fully local system for measuring, debugging, and improving AI agents.

Add telemetry, run your agent, and Kyoko shows where performance breaks across runs. It groups recurring failures into evidence-backed issues, lets Codex or Claude Code draft fixes, and only applies changes after checks and evals pass.

It is built around the manual workflow developers already use: inspect traces, understand the failure, patch the prompt, context, or harness, rerun evals, and decide what ships. Kyoko makes that workflow repeatable while keeping you in control.

Everything stays local by default: traces, issues, proposals, evals, database, and dashboard. For analysis and fix drafting, Kyoko can use the coding-agent CLI you already have, like Codex or Claude Code, so there is no separate Kyoko model API key or hosted service.

Why Kyoko

Finds the failures that repeat across runs. Kyoko looks across runs, groups recurring problems into evidence-backed issues, and shows where each one happened.
Turns issues into fixes. Accepted issues become proposed changes to your agent context, skills, or harness.
Measures whether fixes worked. Kyoko reruns failing traces, runs deterministic checks, and compares eval results before applying a fix.
Keeps the developer in control. Review every issue, proposal, and apply decision manually, or automate only the parts that pass the gate.
Uses the tools you already have. Codex, Claude Code, OpenClaw, Hermes, or a generic command can analyze evidence and draft fixes through existing CLI auth.
Runs locally by default. SQLite, loopback dashboard, local traces, local proposals, and explicit external calls.
Connects to real agent stacks. OTLP/GenAI, Python and TypeScript SDKs, importers, JSON CLI, dashboard, and MCP.

The loop

        ┌─────────────────┐           ┌─────────────────┐
        │  1. Analyse     │ ───────▶ │  2. Issues      │
        │  traces in      │           │  recurring      │
        │                 │           │  failures       │
        └─────────────────┘           └─────────────────┘
                 ▲                            │
                 │ measure                    │ accept
                 │                            ▼
        ┌─────────────────┐  ┌──────┐ ┌─────────────────┐
        │  4. Evals       │◀─┤ gate ├─│  3. Proposals   │
        │  failure rate   │  └──────┘ │  fixes          │
        │                 │   apply   │                 │
        └─────────────────┘           └─────────────────┘

Kyoko keeps the repair loop explicit. Every step creates something you can inspect in the dashboard or CLI.

Analyse: Kyoko reads real traces from your agent and looks across runs for repeated behavior: tool mistakes, missing context, policy drift, brittle routing, bad handoffs, or eval failures.
Issues: recurring failures become evidence-backed issues with category, severity, occurrence count, and links to the spans where they happened.
Proposals: accepted issues become concrete fixes to your agent context, skills, evals, or harness. The fix stays reviewable before it can apply.
Evals: Kyoko reruns failing traces, runs deterministic checks, and compares eval results so the gate can decide whether the fix worked.

The gate is the control point. It applies a fix only when checks, replay evidence, autonomy policy, and human locks allow it.

Run it your way. The same loop, the same gate. You pick the autonomy level:

Human-in-the-loop: Kyoko surfaces issues and drafts fixes, and you review and approve each change before it applies.
Fully autonomous: the policy auto-applies any change that clears replay, evals, and human locks, and parks anything that doesn't for you to look at.

Quick demo

Try Kyoko without wiring up an agent. The demo creates a local database, loads bundled fixture runs, and serves the dashboard.

pipx install kyoko
kyoko demo --db /tmp/kyoko-demo.db --json
kyoko serve --db /tmp/kyoko-demo.db

Open http://127.0.0.1:8765.

Requires Python 3.12 or newer. No live model, framework adapter, or replay server is needed for the demo.

Get started

From the root of your agent project (needs Python 3.12+):

pipx install kyoko
kyoko project-bootstrap
kyoko serve

Open http://127.0.0.1:8765. pip install kyoko and uv tool install kyoko work too; see docs/INSTALL.md.

Bootstrap writes a local .kyoko/ workspace: database, scaffolds, MCP config, and operator presets. Every later kyoko command finds that database automatically, so no --db flags are needed inside your project.

Then wire up telemetry. This is the step that makes everything else work: Kyoko can only find and fix what it can see. The easiest way is to let your coding agent do the wiring:

kyoko install-skill   # then run /kyoko-instrument in your coding agent

This installs the bundled /kyoko-instrument skill into .claude/skills/, where Claude Code picks it up automatically; for Codex, Cursor, or other agents, kyoko install-skill --print prints the same playbook to paste in. The skill finds your agent's entry point, records one real run, and verifies it shows up in Kyoko.

To connect your agent over MCP instead, or to wire telemetry by hand (Python or TypeScript SDK, OTLP, importers), see Getting Started.

What you get

Run capture: Python SDK, TypeScript SDK, generated source adapters, OTLP/GenAI JSON, Hermes import, and OpenClaw import.
Issue queue: recurring failures grouped into evidence-backed issues with category, severity, occurrence count, and span links.
Fix proposals: accepted issues become validated LearningProposal records for context, skills, evals, or harness changes.
Verification: bounded replay, deterministic checks, and eval comparison before a fix can apply.
Operator path: Codex, Claude, OpenClaw, Hermes, or a generic command can analyze evidence and draft fixes through existing CLI auth.
Control surfaces: local dashboard, JSON-everywhere CLI, and stdio MCP server, all sharing the same gated apply path.

Area	Supported paths
Source telemetry	Python SDK, TypeScript SDK, generated source adapters, OTLP/GenAI JSON, Hermes import, OpenClaw import
Replay	External replay commands, managed HTTP replay servers, generated replay scaffolds
Operator agents	Codex, Claude Code, OpenClaw, Hermes, generic command adapters, local presets
Agent clients	Dashboard, JSON CLI, stdio MCP server
Framework scaffolds	Generic Python/TypeScript, LangGraph, Pydantic AI, OpenAI Agents, CrewAI, Hermes, OpenClaw, AI SDK

See docs/INTEGRATIONS.md and examples/README.md.

The gate and local boundary

Every behavior-changing path (operator output, imports, MCP tools, and kyoko improve) flows through one gate:

Validate the structured proposal.
Resolve the evidence it references.
Generate or select checks.
Run bounded replay and deterministic checks.
Evaluate the autonomy policy.
Enforce human locks on protected targets.
Apply context or harness changes only if the gate allows it.

Operator agents can analyze evidence and draft fixes; they do not directly mutate Kyoko state. Context writes update Kyoko-managed skills and delivery rules. Harness writes create reviewable patch transactions against an explicit workspace root.

Replay server URLs are loopback-only unless you pass --allow-remote-server. Evidence exported to prompts, MCP, API, or bundles is redacted by default. See docs/SECURITY.md and docs/ARCHITECTURE.md.

Documentation

Getting Started: demo, project bootstrap, telemetry, inspection, and the repair loop.
Install: install paths, verification, data location, and common setup fixes.
Integrations: source adapters, replay adapters, operator agents, MCP, and SDKs.
CLI Reference: grouped command reference.
Architecture: runtime model, data model, and the gate.
Security: local data, loopback serving, tokens, redaction, and write boundaries.
Scope: what v0 is and is not.
Development: tests, dashboard bundle, release smoke, and contract artifacts.

Specs, schemas, fixtures, and design decisions live under docs/ as reference contracts.

Contributing

Issues and pull requests are welcome. See CONTRIBUTING.md for local setup, the test and validation gates, and how to submit a change. To report a security vulnerability, follow SECURITY.md rather than opening a public issue.

Repository layout

kyoko/              Python import package, CLI runtime, dashboard/API, bundled assets
frontend/           React/Vite dashboard source
sdk/typescript/     Dependency-free TypeScript telemetry SDK
examples/           Source and replay hook examples
scripts/            Installer, release smoke, fixture and artifact helpers
tests/              Python unittest suite and CLI contract tests
docs/               User docs plus specs, schemas, fixtures, and decisions

License

Apache-2.0. See LICENSE.

Built by Kayba and the open-source community.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

kayba

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.1.2

Jun 10, 2026

0.1.1

Jun 9, 2026

0.1.0

Jun 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kyoko-0.1.2.tar.gz (2.9 MB view details)

Uploaded Jun 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kyoko-0.1.2-py3-none-any.whl (2.3 MB view details)

Uploaded Jun 10, 2026 Python 3

File details

Details for the file kyoko-0.1.2.tar.gz.

File metadata

Download URL: kyoko-0.1.2.tar.gz
Upload date: Jun 10, 2026
Size: 2.9 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for kyoko-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`03281643fa56bc204be6aa43bf2c0707ffaccf6cd4a6e9f5f722264e84b7df0d`
MD5	`d6ccf72597f4c9a562060795bdc6aa10`
BLAKE2b-256	`96a6653d8386aac60a3d5c3c14236f77c4a51252e9ac54cd116bd582f1bc0f99`

See more details on using hashes here.

Provenance

The following attestation bundles were made for kyoko-0.1.2.tar.gz:

Publisher: release.yml on kayba-ai/Kyoko

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kyoko-0.1.2.tar.gz
- Subject digest: 03281643fa56bc204be6aa43bf2c0707ffaccf6cd4a6e9f5f722264e84b7df0d
- Sigstore transparency entry: 1783114204
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: kayba-ai/Kyoko@9b29b0c8e727d16be9e6a501503555c8a7eb0523
- Branch / Tag: refs/tags/v0.1.2
- Owner: https://github.com/kayba-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@9b29b0c8e727d16be9e6a501503555c8a7eb0523
- Trigger Event: push

File details

Details for the file kyoko-0.1.2-py3-none-any.whl.

File metadata

Download URL: kyoko-0.1.2-py3-none-any.whl
Upload date: Jun 10, 2026
Size: 2.3 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for kyoko-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a998d29566953fe5be25833bde44e19eeb5bea219b530f13f75d51f2e6c3192c`
MD5	`42fa13a4c49ace37b3c586d3b589384b`
BLAKE2b-256	`2cdfa0a979ffef45d262dcb543e0f53b6bd3ec27d1508fd19cc8c190d6ffd91b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for kyoko-0.1.2-py3-none-any.whl:

Publisher: release.yml on kayba-ai/Kyoko

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kyoko-0.1.2-py3-none-any.whl
- Subject digest: a998d29566953fe5be25833bde44e19eeb5bea219b530f13f75d51f2e6c3192c
- Sigstore transparency entry: 1783114986
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: kayba-ai/Kyoko@9b29b0c8e727d16be9e6a501503555c8a7eb0523
- Branch / Tag: refs/tags/v0.1.2
- Owner: https://github.com/kayba-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@9b29b0c8e727d16be9e6a501503555c8a7eb0523
- Trigger Event: push

kyoko 0.1.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Kyoko

Why Kyoko

The loop

Quick demo

Get started

What you get

The gate and local boundary

Documentation

Contributing

Repository layout

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance