Skip to main content

Persistent two-tier memory for AI agents. Short-term markdown + long-term SQLite with semantic search.

Project description

AgentRecall

PyPI Python License: MIT

Persistent two-tier memory for AI agents. Battle-tested across 134 sessions with 10 agent roles.

Short-term (markdown files, always loaded) + Long-term (SQLite + OpenAI embeddings, searched on-demand).

Install

pip install agentrecall-memory

Zero required dependencies. SQLite is Python stdlib.

Optional semantic search:

pip install agentrecall-memory[embeddings]
export OPENAI_API_KEY="sk-..."

Quick Start

CLI

# Initialize
agentrecall init

# Store a memory (auto-dedup via cosine similarity >0.92)
agentrecall store coder gotchas "kamal app exec spawns new container, use docker exec"
agentrecall store social exhausted_stories "blue-green deploy order loss" --tags deploy,sqlite

# Search (semantic + keyword fallback)
agentrecall search coder gotchas "kamal file not found"

# List categories
agentrecall list coder

# Check health
agentrecall check --all

Python API

from agentrecall import MemoryStore, MemorySearch

# Store
store = MemoryStore()
store.store("coder", "gotchas", "kamal spawns new container", tags=["kamal", "docker"])

# Search
search = MemorySearch()
results = search.search("coder", "gotchas", "kamal file not found")
for text in results:
    print(text)

# List categories
categories = store.list_categories("coder")

How It Works

Two-Tier Design

Short-term (memory/<role>.md) Long-term (SQLite + embeddings)
Active learnings, mistakes, feedback Growing lists (exhausted topics, defect patterns)
Max 80 lines, pruned regularly Unlimited entries, never pruned
Read in full at session start Searched on-demand per query

Semantic Dedup

Every store call embeds the text via OpenAI text-embedding-3-small and checks cosine similarity against all existing entries in the same role/category. Similarity > 0.92 blocks the store (raises DuplicateError).

Without an API key, falls back to exact text matching.

Search

  1. Embed the query
  2. Compute cosine similarity against all entries with embeddings
  3. Return entries above threshold (0.75), sorted by similarity
  4. If no embedding matches: keyword fallback (>=50% keyword match)
  5. No API key: keyword-only search

Graceful Degradation

Works fully offline without an OpenAI API key:

  • Store: exact text dedup (case-insensitive)
  • Search: keyword matching (>=50% of query words must appear)

Agent Skills

Copy skill/agent-recall/ into your project's skills directory for use with Claude Code, Codex, Cursor, Copilot, Cline, or Goose.

cp -r skill/agent-recall/ .claude/skills/agent-recall/

Configuration

Environment variables:

Variable Default Description
AGENT_RECALL_HOME ~/.agentrecall Memory storage directory
OPENAI_API_KEY (none) OpenAI API key for embeddings
UT_OPENAI_API_KEY (none) Preferred over OPENAI_API_KEY

CLI Reference

agentrecall store <role> <category> "text" [--tags t1,t2] [--db path]
agentrecall search <role> <category> "query" [--db path]
agentrecall list <role> [--db path]
agentrecall check [--fix] [--long-term] [--all] [--dir path] [--db path]
agentrecall init [--dir path]
agentrecall migrate [--dry-run] [--rebuild] [--dir path] [--db path]

Exit codes: 0 = success/found, 1 = not-found/validation-fail, 2 = input error.

Migration from JSONL

If you have existing JSONL memory files:

agentrecall migrate --dir /path/to/memory/
agentrecall migrate --rebuild  # Re-embed entries missing embeddings

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_cerebro-0.1.0.tar.gz (23.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agent_cerebro-0.1.0-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file agent_cerebro-0.1.0.tar.gz.

File metadata

  • Download URL: agent_cerebro-0.1.0.tar.gz
  • Upload date:
  • Size: 23.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for agent_cerebro-0.1.0.tar.gz
Algorithm Hash digest
SHA256 847e9a39bcffd1bd5746e8fbf6af86f0346ed458a674655d7791d5aa0ac64f84
MD5 68ce7dc679da814d4ed3a179e0f990c8
BLAKE2b-256 decb2001331aea8e52b1deb1a866344750ac489c3f76589e1e0ed8dca32455cf

See more details on using hashes here.

File details

Details for the file agent_cerebro-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: agent_cerebro-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 19.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for agent_cerebro-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f3fd18b43deef41459d8a8037b978971c80dbd1b7ae808c1e33f7f8ffee1dfd4
MD5 9ebe7cb9d89558386792f39eef320c15
BLAKE2b-256 ef182ed7e9975d3d16b3334b94cdb4a29e3ed7930a9fff154adba53078f4aae3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page