Build self-improving AI agents that learn from experience

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

kayba

These details have not been verified by PyPI

Project links

Homepage

Project description

Agentic Context Engine (ACE)

GitHub stars

[!TIP]

Try our hosted solution for free at kayba.ai: automated agent self-improvement from your terminal. CLI + dashboard that analyzes traces, surfaces failures, and ships improvements directly from Claude Code, Codex, and more.

AI agents don't learn from experience. They repeat the same mistakes every session, forget what worked, and ignore what failed. ACE adds a persistent learning loop that makes them better over time.

The agent claims a seahorse emoji exists. ACE reflects on the error, and on the next attempt, the agent responds correctly — without human intervention.

Proven Results

Metric	Result	Context
2x consistency	Doubles pass^4 on Tau2 airline benchmark	15 learned strategies, no reward signals
49% token reduction	Browser automation costs cut nearly in half	10-run learning curve
$1.50 learning cost	Claude Code translated 14k lines to TypeScript	Zero build errors, all tests passing

Quick Start

uv add ace-framework

Option A — Interactive setup (recommended):

ace setup            # Walks you through model selection, API keys, and connection validation

Option B — Manual configuration:

export OPENAI_API_KEY="your-key"    # or ANTHROPIC_API_KEY, or any of 100+ supported providers

Then use it:

from ace import ACELiteLLM

agent = ACELiteLLM(model="gpt-4o-mini")

# First attempt — the agent may hallucinate
answer = agent.ask("Is there a seahorse emoji?")

# Feed a correction — ACE extracts a strategy and updates the Skillbook
agent.learn_from_feedback("There is no seahorse emoji in Unicode.")

# Subsequent calls benefit from the learned strategy
answer = agent.ask("Is there a seahorse emoji?")

# Inspect what the agent has learned
print(agent.get_strategies())

No fine-tuning, no training data, no vector database.

-> Quick Start Guide | -> Setup Guide | -> Hosted API: Where Do Traces Come From?

How It Works

ACE maintains a Skillbook — a persistent collection of strategies that evolves with every task. Three specialized roles manage the learning loop:

Role	Responsibility
Agent	Executes tasks, enhanced with Skillbook strategies
Reflector	Analyzes execution traces to extract what worked and what failed
SkillManager	Curates the Skillbook — adds, refines, and removes strategies

The Recursive Reflector is the key innovation: instead of summarizing traces in a single pass, it writes and executes Python code in a sandboxed environment to programmatically search for patterns, isolate errors, and iterate until it finds actionable insights.

flowchart LR
    Skillbook[(Skillbook)]
    Start([Task]) --> Agent[Agent]
    Agent <--> Environment[Environment]
    Environment -- Trace --> Reflector[Reflector]
    Reflector --> SkillManager[SkillManager]
    SkillManager -- Updates --> Skillbook
    Skillbook -. Strategies .-> Agent

All roles are backed by PydanticAI agents with structured output validation. PydanticAI routes to 100+ LLM providers through its LiteLLM integration, with native support for OpenAI, Anthropic, Google, Bedrock, Groq, and more.

Based on the ACE paper (Stanford & SambaNova) and Dynamic Cheatsheet.

Runners

Runner	Class	Description
LiteLLM	`ACELiteLLM`	Batteries-included agent with `.ask()`, `.learn()`, `.save()` — accepts any LiteLLM model string
Core	`ACE`	Full learning loop with batch epochs and evaluation
Trace Analyser	`TraceAnalyser`	Learn from pre-recorded traces without re-running tasks
browser-use	`BrowserUse`	Browser automation that improves with each run
LangChain	`LangChain`	Wrap any LangChain chain or agent with learning
Claude Code	`ClaudeCode`	Claude Code CLI tasks with learning

uv add 'ace-framework[browser-use]'    # Browser automation
uv add 'ace-framework[langchain]'      # LangChain
uv add 'ace-framework[logfire]'        # Observability (auto-instruments PydanticAI)
uv add 'ace-framework[mcp]'            # MCP server for IDE integration
uv add 'ace-framework[deduplication]'  # Embedding-based skill deduplication

Have existing agent logs? Extract strategies from them directly:

from ace import ACELiteLLM

agent = ACELiteLLM(model="gpt-4o-mini")
agent.learn_from_traces(your_existing_traces)
print(agent.get_strategies())

-> Examples

Benchmarks

Tau2 — Multi-Step Agentic Tasks

tau2-bench by Sierra Research: airline domain tasks requiring tool use and policy adherence. Claude Haiku 4.5 agent, strategies learned on the train split with no reward signals, evaluated on the held-out test split.

Tau2 Benchmark — ACE doubles consistency at pass^4

pass^k = probability all k independent attempts succeed. ACE doubles consistency at pass^4 with 15 learned strategies.

Claude Code — Autonomous Translation

ACE + Claude Code translated this library from Python to TypeScript with zero supervision:

Metric	Result
Duration	~4 hours
Commits	119
Lines written	~14,000
Build errors	0
Tests	All passing
Learning cost	~$1.50

Pipeline Architecture

ACE is built on a composable pipeline engine. Each step declares what it requires and what it produces:

AgentStep -> EvaluateStep -> ReflectStep -> UpdateStep -> DeduplicateStep

Use learning_tail() for the standard learning sequence, or compose custom pipelines:

from ace import Pipeline, AgentStep, EvaluateStep, learning_tail

steps = [AgentStep(agent, skillbook), EvaluateStep(env)] + learning_tail(reflector, skill_manager, skillbook)
pipeline = Pipeline(steps)

The pipeline engine (pipeline/) is framework-agnostic with requires/provides contracts, immutable context, and error isolation. See Pipeline Design and Architecture.

CLI

Command	Description
`ace setup`	Interactive setup — model selection, API keys, connection validation
`ace models <query>`	Search available models with pricing
`ace validate <model>`	Test a model connection
`ace config`	Show current configuration
`kayba`	Cloud CLI — upload traces, fetch insights, manage prompts
`ace-mcp`	MCP server for IDE integration

Documentation

Full Documentation — Guides, API reference, examples
Quick Start — 5-minute setup
Setup Guide — Configuration and providers
Hosted API Guide — Hosted CLI, trace upload, prompt install
Architecture — Core concepts and system design
Code Reference — Implementations, API, usage examples
Design Decisions — Rejected alternatives and rationale
Pipeline Engine — Step composition and context flow
Examples — Runnable demos
Changelog — Version history

Contributing

Contributions are welcome. See Contributing Guidelines.

Built by Kayba and the open-source community.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

kayba

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.12.0

May 7, 2026

0.10.0

Apr 13, 2026

0.9.7

Apr 11, 2026

0.9.6

Apr 11, 2026

0.9.5

Apr 11, 2026

0.9.4

Apr 11, 2026

0.9.3

Apr 1, 2026

0.9.2

Mar 31, 2026

0.9.1

Mar 26, 2026

0.9.0

Mar 26, 2026

0.8.9

Mar 18, 2026

0.8.8

Mar 17, 2026

0.8.7

Mar 17, 2026

0.8.6

Mar 12, 2026

0.8.5

Mar 4, 2026

0.8.4

Feb 27, 2026

0.8.3

Feb 21, 2026

0.8.2

Feb 18, 2026

0.8.1

Feb 18, 2026

0.8.0

Feb 17, 2026

0.7.3

Feb 4, 2026

0.7.2

Jan 26, 2026

0.7.1

Dec 8, 2025

0.7.0

Dec 4, 2025

0.6.0

Nov 29, 2025

0.5.1

Nov 25, 2025

0.5.0

Nov 20, 2025

0.4.0

Nov 8, 2025

0.2.0

Oct 16, 2025

0.1.0

Oct 15, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ace_framework-0.12.0.tar.gz (211.2 kB view details)

Uploaded May 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ace_framework-0.12.0-py3-none-any.whl (216.1 kB view details)

Uploaded May 7, 2026 Python 3

File details

Details for the file ace_framework-0.12.0.tar.gz.

File metadata

Download URL: ace_framework-0.12.0.tar.gz
Upload date: May 7, 2026
Size: 211.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ace_framework-0.12.0.tar.gz
Algorithm	Hash digest
SHA256	`8313d779c5b05995301a276b31eb27f3f5d4d017db160ae0e9b5a0fc2218be66`
MD5	`a3bda211c8c1fac5939ab41866828ea5`
BLAKE2b-256	`bf33689d81127a2c0003f8e83f9dffeb92a7d7de443cd8ded725c81f3002f9b9`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ace_framework-0.12.0.tar.gz:

Publisher: publish.yml on kayba-ai/agentic-context-engine

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ace_framework-0.12.0.tar.gz
- Subject digest: 8313d779c5b05995301a276b31eb27f3f5d4d017db160ae0e9b5a0fc2218be66
- Sigstore transparency entry: 1463054323
- Sigstore integration time: May 7, 2026
Source repository:
- Permalink: kayba-ai/agentic-context-engine@300efc010cc7fc66863a69a75f1dc43a999ca6ca
- Branch / Tag: refs/tags/v0.12.0
- Owner: https://github.com/kayba-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@300efc010cc7fc66863a69a75f1dc43a999ca6ca
- Trigger Event: release

File details

Details for the file ace_framework-0.12.0-py3-none-any.whl.

File metadata

Download URL: ace_framework-0.12.0-py3-none-any.whl
Upload date: May 7, 2026
Size: 216.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ace_framework-0.12.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e81a3443f67a5345cf5094258bcf9ea7125a327d68a49eb1c47d8aa526141d19`
MD5	`2a62e939e59b76e6f93d139388d7fbe8`
BLAKE2b-256	`029321fbf095fd9f2a54584e57ef75fe06af14d64a2bc94842ccd9fe2e8e5ea8`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ace_framework-0.12.0-py3-none-any.whl:

Publisher: publish.yml on kayba-ai/agentic-context-engine

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ace_framework-0.12.0-py3-none-any.whl
- Subject digest: e81a3443f67a5345cf5094258bcf9ea7125a327d68a49eb1c47d8aa526141d19
- Sigstore transparency entry: 1463054336
- Sigstore integration time: May 7, 2026
Source repository:
- Permalink: kayba-ai/agentic-context-engine@300efc010cc7fc66863a69a75f1dc43a999ca6ca
- Branch / Tag: refs/tags/v0.12.0
- Owner: https://github.com/kayba-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@300efc010cc7fc66863a69a75f1dc43a999ca6ca
- Trigger Event: release

ace-framework 0.12.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Agentic Context Engine (ACE)

Try our hosted solution for free at kayba.ai: automated agent self-improvement from your terminal. CLI + dashboard that analyzes traces, surfaces failures, and ships improvements directly from Claude Code, Codex, and more.

Proven Results

Quick Start

How It Works

Runners

Benchmarks

Tau2 — Multi-Step Agentic Tasks

Claude Code — Autonomous Translation

Pipeline Architecture

CLI

Documentation

Contributing

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance