Cognitive continuity infrastructure for long-lived AI agents — cross-model state reconstruction, semantic recall, cognitive compression

These details have not been verified by PyPI

Project links

Project description

AgentKeeper

Cognitive continuity infrastructure for long-lived AI agents.

AgentKeeper reconstructs persistent cognitive state across model switches, crashes, restarts, and constrained contexts. Built for agents that must survive longer than a single context window.

Why AgentKeeper exists

Agents don't fail because they forget facts. They fail because they lose cognitive continuity — their state, priorities, identity, and decision context drift the moment the underlying model changes, the context window fills, or the process restarts.

AgentKeeper treats this as a systems problem, not a memory problem.

It provides:

A Cognitive Reconstruction Engine that rebuilds an agent's state for the target model, every call.
A memory hierarchy (working / episodic / semantic / archival) with importance-aware retention.
Semantic recall based on embeddings — meaning, not keywords.
Cognitive compression — decay, consolidation, contradiction arbitration.
Identity persistence — principles and hard constraints that survive every form of compression.
Cross-model translation — XML for Claude, sections for GPT-4, narrative for Gemini, minimal for local models.

Continuity, not just memory.

Install

pip install agentkeeper

Optional extras (only install what you need):

pip install 'agentkeeper[anthropic]'    # Claude
pip install 'agentkeeper[openai]'       # GPT models + OpenAI embeddings
pip install 'agentkeeper[gemini]'       # Google Gemini
pip install 'agentkeeper[semantic]'     # Local embeddings (sentence-transformers)
pip install 'agentkeeper[all]'          # everything

No external infrastructure required. Storage defaults to local SQLite. Vendor-agnostic by design.

60-second tour

import agentkeeper

# 1. Create an agent and define its persistent identity
agent = agentkeeper.create(agent_id="aria", provider="anthropic")
agent.set_identity(
    name="Aria",
    role="EU insurance broker copilot",
    principles=["never share PII without explicit consent"],
    constraints=["EU data residency only"],
)

# 2. Teach it about the world
agent.principle("always confirm budget changes in writing")
agent.fact("client: Acme Corporation", importance=0.95)
agent.event("contract signed", when="2026-05-15")
agent.remember("favourite colour: blue")  # tier inferred automatically

# 3. Ask — context is reconstructed for the target model
response = agent.ask("What do we know about the Acme deal?")

# 4. Switch providers — memory and identity survive
agent.switch_provider("openai").save()
response = agent.ask("Same question, different model.")

Architecture

       ┌──────────────────────────────────────────────────────────────┐
       │                  AgentKeeper Public API                       │
       │  agent.remember() · agent.recall() · agent.ask()              │
       │  agent.compress() · agent.set_identity() · agent.save()       │
       └────────────────────────────┬──────────────────────────────────┘
                                    │
       ┌────────────────────────────▼──────────────────────────────────┐
       │           Cognitive Reconstruction Engine (CRE)               │
       │  Identity injection · importance ranking · semantic boost     │
       │  Token budget · profile-driven rendering                      │
       └─┬────────────┬────────────┬────────────┬─────────────────────┘
         │            │            │            │
   ┌─────▼─────┐ ┌───▼────────┐ ┌─▼─────────┐ ┌▼──────────────┐
   │ Memory    │ │ Semantic   │ │ Cognitive │ │ Cross-Model    │
   │ Hierarchy │ │ Recall     │ │ Compress  │ │ Translation    │
   │           │ │            │ │           │ │                │
   │ working   │ │ embeddings │ │ decay     │ │ XML / sections │
   │ episodic  │ │ vector     │ │ consol.   │ │ narrative      │
   │ semantic  │ │ index      │ │ contradic │ │ minimal        │
   │ archival  │ │            │ │           │ │                │
   └───────────┘ └────────────┘ └───────────┘ └────────────────┘
                                    │
                      ┌─────────────▼─────────────┐
                      │   Storage (SQLite-first)  │
                      │   Vendor-agnostic         │
                      └───────────────────────────┘

Every layer is interchangeable and opt-in. The base CRE works with no embeddings, no compression, no profile customisation. Layers stack as you need them.

Cognitive continuity in five primitives

1. Identity that never erodes

agent.set_identity(
    name="Aria",
    role="EU broker copilot",
    principles=["never share PII"],
    constraints=["EU data residency only"],
)

Identity is injected into every reconstructed context, regardless of token budget. It survives compression, model switches, and restarts. Principles are protected — never decayed, never consolidated, never flagged.

2. Memory organised by tier

agent.fact("budget: 50k EUR")                      # semantic (stable)
agent.event("contract signed", when="2026-05-15")  # episodic (time-anchored)
agent.principle("always confirm changes")          # protected, identity-level
agent.remember("favourite colour: blue")           # tier inferred automatically

Tiers shape narration in reconstructed prompts and drive retention policy under compression.

3. Semantic recall

results = agent.recall("money allocated to the project", top_k=5)
for fact, score in results:
    print(f"{score:.2f}  {fact.content}")

Pluggable embedding providers: local sentence-transformers (default, free, no lock-in), OpenAI, or your own. Recall biases context reconstruction toward facts that actually matter for the current question.

4. Cognitive compression

report = agent.compress()
# CompressionReport(
#   decayed_facts=12,
#   consolidation={'clusters_found': 3, 'facts_removed': 7, ...},
#   contradictions={'pairs_found': 2, 'resolutions': 2},
#   facts_before=120, facts_after=102,
# )

Three independent passes:

Decay — exponential half-life on unused non-critical facts. Critical and protected facts are immortal.
Consolidation — embedding-based clustering merges near-duplicates. Optional LLM-backed synthesiser.
Contradiction arbitration — key-value divergence and polarity-opposition detection. Deterministic winner (critical > importance > recency). Loser is flagged, not deleted.

5. Cross-model translation

The same cognitive state, four formats:

Provider	Format	Why
Anthropic	XML	Claude excels with structured `<agent_identity>` and `<memory>` blocks
OpenAI	Sections	GPT family responds well to labelled sections (`AGENT IDENTITY`, `MEMORY`, `CURRENT TASK`)
Gemini	Narrative	Long-context model — prefers prose framing
Ollama	Minimal	Small/local models — aggressive compression, terse tokens

Custom providers? Register your own:

from agentkeeper import CognitiveProfile, PromptFormat, register_profile

register_profile(CognitiveProfile(
    provider="my-llm",
    format=PromptFormat.SECTIONS,
    effective_context_tokens=10_000,
))

Async API

import asyncio
import agentkeeper

async def main() -> None:
    agent = agentkeeper.create_async(agent_id="aria", provider="anthropic")
    agent.set_identity(name="Aria", role="copilot")
    agent.fact("budget: 50k", importance=0.95)

    # Parallel asks across providers
    answers = await asyncio.gather(
        agent.ask("status?", provider="anthropic"),
        agent.ask("status?", provider="openai"),
    )

asyncio.run(main())

Sync and async agents share storage — you can save with one and load with the other.

Production-grade

Type-safe: py.typed marker shipped, mypy-strict friendly.
Typed exceptions: AgentKeeperError root, with subclasses for every failure mode (provider down, agent missing, retriable network errors).
Structured logging: namespaced under agentkeeper.*, opt-in.
Retries: built-in exponential backoff for transient provider errors via with_retry / with_async_retry.
Tested: 267 tests, CI on Python 3.10 / 3.11 / 3.12.

Configuration

Environment variables (all optional):

Variable	Default	Purpose
`OPENAI_API_KEY`	—	Required for OpenAI provider
`ANTHROPIC_API_KEY`	—	Required for Anthropic provider
`GEMINI_API_KEY`	—	Required for Gemini provider
`OPENAI_MODEL`	`gpt-4-turbo`	OpenAI model name
`ANTHROPIC_MODEL`	`claude-sonnet-4-5-20250929`	Anthropic model name
`OLLAMA_HOST`	`http://localhost:11434`	Ollama server URL
`AGENTKEEPER_DB`	`agentkeeper.db`	SQLite path
`AGENTKEEPER_EMBEDDING_PROVIDER`	auto (sentence-transformers > openai > mock)	Override embedding backend

Roadmap (v1.x)

v1.1 — Persistent vector index (sqlite-vec) so the recaller survives restarts without rebuild.
v1.2 — Async LLM-backed synthesiser in compression.
v1.3 — Pluggable storage backends (Postgres, encrypted).
v1.4 — AgentKeeper Cloud (managed sync). Stays optional; OSS remains feature-complete.

Featured on

AgentKeeper was vouched for in March 2026 by:

Shruti Codes ✓
Chidanand Tripathi ✓ (80K-view thread)
Kayvon Jafarzadeh ✓
Leonard Rodman ✓
Martin Szerment ✓
Bespoke AI Solutions Inc ✓
@grok (xAI)

If you've shipped agents that needed to survive across model changes, you understand why this matters. Thank you to everyone who saw the project early.

Contributing

Issues, ideas, and PRs welcome. See CONTRIBUTING.md.

License

MIT. See LICENSE.

Built by

ThinkLanceAI — cognitive infrastructure for AI systems. Need this in production with custom integrations? hello@thinklanceai.com.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.1.2

May 20, 2026

1.1.1

May 20, 2026

This version

1.1.0

May 19, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentkeeper_ai-1.1.0.tar.gz (120.3 kB view details)

Uploaded May 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentkeeper_ai-1.1.0-py3-none-any.whl (109.1 kB view details)

Uploaded May 19, 2026 Python 3

File details

Details for the file agentkeeper_ai-1.1.0.tar.gz.

File metadata

Download URL: agentkeeper_ai-1.1.0.tar.gz
Upload date: May 19, 2026
Size: 120.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for agentkeeper_ai-1.1.0.tar.gz
Algorithm	Hash digest
SHA256	`468801becf950c3319cfd6aa24e385a4c58edc67b84b94a413f9f678eb949103`
MD5	`f7647ee1c408533654e0c590cd734be0`
BLAKE2b-256	`89f4002be478226cb18c729674110b4e5e04064bddf3115db587d32b7d0a7a24`

See more details on using hashes here.

File details

Details for the file agentkeeper_ai-1.1.0-py3-none-any.whl.

File metadata

Download URL: agentkeeper_ai-1.1.0-py3-none-any.whl
Upload date: May 19, 2026
Size: 109.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for agentkeeper_ai-1.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`89fec40782035e781c0d53ab0fe1fa72f6c316a4570f505ae6af2d7855f3e446`
MD5	`e3b53f873f55c2d5f23a40cb2d98dcf4`
BLAKE2b-256	`6e577cf393c29aa5bdbd906f2d4b9a31f540a4ca29718f80b7b40d009bcce75b`

See more details on using hashes here.

agentkeeper-ai 1.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AgentKeeper

Why AgentKeeper exists

Install

60-second tour

Architecture

Cognitive continuity in five primitives

1. Identity that never erodes

2. Memory organised by tier

3. Semantic recall

4. Cognitive compression

5. Cross-model translation

Async API

Production-grade

Configuration

Roadmap (v1.x)

Featured on

Contributing

License

Built by

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes