Lore — Memory That Works Without Code Changes

Cross-agent semantic memory with knowledge graphs, fact extraction, and MCP integration.
Install a hook. Your agent remembers.
Lore auto-injects relevant memories into your agent's context before every response. No SDK calls. No agent cooperation. No code changes. Just a hook and a server.
```text
User: "What API rate limits should I use?"

── Lore hook fires (20ms) ──────────────────────────────
🧠 Relevant memories:
- [0.82] Stripe API returns 429 after 100 req/min — use exponential backoff
- [0.71] Our internal API rate limit is 500 req/min per API key
────────────────────────────────────────────────────────

Agent sees memories + prompt → responds with full context
```
Your agent didn't call anything. Lore queried itself, found relevant memories, and injected them — all before the agent saw the message.
Why Not MCP Tools?
MCP memory tools (including Lore's own 20 MCP tools) rely on the agent choosing to call recall(). In practice, agents rarely do. Memory becomes write-only — a fancy notebook nobody reads.
Auto-retrieval fixes this structurally. Every message triggers a semantic search. Relevant memories appear in context automatically. The agent doesn't need to know Lore exists.
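Under the hood this is plain semantic retrieval: embed the prompt, score every stored memory against it, and keep the top hits above a threshold. A toy sketch with hand-made three-dimensional "embeddings" (illustrative only; a real deployment uses ONNX or pgvector embeddings):

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

def retrieve(query_vec, memories, limit=5, min_score=0.3):
    """Score every memory against the query; keep the best matches above min_score."""
    scored = [(cosine(query_vec, vec), text) for text, vec in memories]
    scored = [(score, text) for score, text in scored if score >= min_score]
    scored.sort(reverse=True)
    return scored[:limit]

memories = [
    ("Stripe API returns 429 after 100 req/min", [0.9, 0.1, 0.2]),
    ("Team standup is at 9:30am",                [0.1, 0.9, 0.1]),
]
query = [0.8, 0.2, 0.3]  # pretend embedding of "What API rate limits should I use?"
hits = retrieve(query, memories)  # highest score first
```

The `limit` and `min_score` parameters mirror the query string the hook below sends to `/v1/retrieve`.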
Quick Start — Claude Code (2 minutes)
1. Start the Lore server
```shell
pip install lore-sdk
lore serve   # starts on port 8765, SQLite by default
```
2. Add the hook
Create ~/.claude/hooks/lore-retrieve.sh:
```bash
#!/bin/bash
# Auto-inject Lore memories into every Claude Code prompt

INPUT=$(cat)
PROMPT=$(echo "$INPUT" | jq -r '.prompt // empty')

# Skip short/empty prompts
if [ -z "$PROMPT" ] || [ "${#PROMPT}" -lt 10 ]; then
  exit 0
fi

ENCODED=$(printf '%s' "$PROMPT" | jq -sRr @uri)
RESPONSE=$(curl -s --max-time 2 \
  "http://localhost:8765/v1/retrieve?query=${ENCODED}&limit=5&min_score=0.3&format=markdown" \
  -H "Authorization: Bearer ${LORE_API_KEY}" 2>/dev/null)

COUNT=$(echo "$RESPONSE" | jq -r '.count // 0' 2>/dev/null)
COUNT=${COUNT:-0}   # treat a missing or unparseable response as zero results

if [ "$COUNT" -gt 0 ]; then
  FORMATTED=$(echo "$RESPONSE" | jq -r '.formatted // empty' 2>/dev/null)
  jq -n --arg ctx "🧠 Relevant memories from Lore:
$FORMATTED" '{
    hookSpecificOutput: {
      hookEventName: "UserPromptSubmit",
      additionalContext: $ctx
    }
  }'
fi
```

Make it executable:

```shell
chmod +x ~/.claude/hooks/lore-retrieve.sh
```
Add to ~/.claude/settings.json:
```json
{
  "hooks": {
    "UserPromptSubmit": [
      {
        "hooks": [
          {
            "type": "command",
            "command": "~/.claude/hooks/lore-retrieve.sh"
          }
        ]
      }
    ]
  }
}
```
3. Done
Every prompt you type in Claude Code now gets relevant memories injected automatically. Store memories via MCP tools, the SDK, or the REST API — they'll surface when relevant.
How It Works
```text
┌─────────────┐      ┌──────────────┐      ┌──────────────┐
│ User types  │────▶│ Hook fires   │────▶│ Lore server  │
│ a prompt    │      │ (pre-agent)  │      │ /v1/retrieve │
└─────────────┘      └──────┬───────┘      └──────┬───────┘
                            │                     │
              memories injected         semantic search
              into agent context        (pgvector/ONNX)
                            │                     │
                     ┌──────▼───────┐      ┌──────▼────────┐
                     │ Agent sees   │      │ Top-K results │
                     │ prompt +     │◀────│ scored &      │
                     │ memories     │      │ formatted     │
                     └──────────────┘      └───────────────┘
```
Key properties:
- ~20ms latency (warm) — a local call, far faster than a round-trip to a remote API
- Fail-open — if Lore is slow or down, the agent responds normally
- Per-message — every prompt gets fresh context, not just session start
- No agent cooperation — the agent doesn't know Lore exists
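The fail-open property can be sketched as a wrapper that converts every failure mode into "no context" rather than an error. This is a sketch, not Lore's client code; it assumes the `{count, formatted}` response shape used by the hook script above and reads `LORE_API_KEY` from the environment:

```python
import json
import os
import urllib.parse
import urllib.request

def retrieve_or_nothing(prompt, base="http://localhost:8765", timeout=2.0):
    """Fail-open retrieval: any error or slow response yields empty context."""
    query = urllib.parse.urlencode(
        {"query": prompt, "limit": 5, "min_score": 0.3, "format": "markdown"}
    )
    req = urllib.request.Request(
        f"{base}/v1/retrieve?{query}",
        headers={"Authorization": f"Bearer {os.environ.get('LORE_API_KEY', '')}"},
    )
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            body = json.load(resp)
        return body.get("formatted", "") if body.get("count", 0) else ""
    except (OSError, ValueError):
        return ""  # Lore down, slow, or garbled: the agent proceeds without memories
```

If this function returns an empty string, the caller simply sends the prompt unmodified, which is exactly the behavior the hook script above exhibits.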
Supported Runtimes
| Runtime | Hook Type | Status |
|---|---|---|
| Claude Code | `UserPromptSubmit` | ✅ Ready |
| OpenClaw | `message:preprocessed` | ✅ Ready |
| Any HTTP client | `GET /v1/retrieve` | ✅ Ready |
The /v1/retrieve endpoint works with any system that can make an HTTP call before sending a prompt to an LLM. If your runtime supports pre-prompt hooks, Lore plugs right in.
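For a runtime without built-in hooks, the integration reduces to one function that runs before the model call. A wiring sketch with stub retriever and model (both lambdas are placeholders, not real clients):

```python
def with_memories(prompt, retrieve, llm):
    """Generic pre-prompt hook: fetch context, prepend it, then call the model."""
    context = retrieve(prompt)  # e.g. an HTTP GET to /v1/retrieve
    if context:
        prompt = f"Relevant memories:\n{context}\n\n{prompt}"
    return llm(prompt)

# Stub retriever and an echo "model" to show the wiring:
reply = with_memories(
    "What rate limit should I use?",
    retrieve=lambda q: "- [0.82] Stripe returns 429 after 100 req/min",
    llm=lambda p: p,
)
```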
Storing Memories
Memories flow in automatically or manually:
Auto-ingest (conversations → memories)
Hook into your agent's message stream. Lore extracts facts, preferences, and decisions from conversations automatically.
MCP Tools (20 tools)
"Remember that our API uses REST with rate limits at 100 req/min"
Your agent calls remember() via MCP. Works with Claude Desktop, Cursor, VS Code, Windsurf, ChatGPT, Cline, and Claude Code.
REST API
```shell
curl -X POST http://localhost:8765/v1/memories \
  -H "Authorization: Bearer $LORE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"content": "API rate limit is 100 req/min", "tags": ["api"]}'
```
SDK
```python
from lore import Lore

lore = Lore()  # zero config — local SQLite, built-in embeddings
lore.remember("API rate limit is 100 req/min", tags=["api"])
```
Comparison
| Feature | Lore | Mem0 | Zep |
|---|---|---|---|
| Auto-retrieval hooks | Yes | No | No |
| No code changes needed | Yes | No | No |
| Local-first (no server) | Yes | No | No |
| MCP native (20 tools) | Yes | No | No |
| Knowledge graph | Yes | Yes* | Yes |
| Fact extraction | Yes | No | No |
| Auto-consolidation | Yes | No | Yes |
| Memory tiers + decay | Yes | No | Yes |
| Self-hosted | Yes | Partial | Partial |
| No external DB required | Yes | No** | No |
* Mem0 requires Neo4j for graph features. ** Mem0 requires Qdrant or Redis.
The difference: Mem0 and Zep are SDKs — you write the search() → inject → add() code yourself. Lore is a runtime plugin — install a hook, memories flow automatically.
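For contrast, here is the search → inject → add boilerplate that SDK-style memory pushes into every agent loop. `MemoryClient` is a stub (keyword matching standing in for vector search), not Mem0's or Zep's actual API:

```python
class MemoryClient:
    """Stub standing in for an SDK-style memory client."""
    def __init__(self):
        self._items = []

    def search(self, query):
        # Naive keyword match in place of real vector search
        return [m for m in self._items if any(w in m for w in query.split())]

    def add(self, text):
        self._items.append(text)

def handle_turn(mem, llm, user_msg):
    hits = mem.search(user_msg)            # 1. you call search() yourself
    context = "\n".join(hits)              # 2. you inject results yourself
    reply = llm(f"{context}\n\n{user_msg}")
    mem.add(user_msg)                      # 3. you write back yourself
    return reply
```

With a hook-based design, all three numbered steps happen outside the agent loop.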
Installation
Local (SQLite, zero config)
```shell
pip install lore-sdk
lore serve
```
Everything runs locally. ONNX embeddings ship with the package. No API keys needed.
Docker (Postgres + pgvector)
```shell
docker compose up -d
```
Starts Postgres with pgvector and the Lore HTTP server on port 8765.
```shell
# Production
cp .env.example .env   # edit POSTGRES_PASSWORD and LORE_ROOT_KEY
docker compose -f docker-compose.prod.yml up -d
```
npm (TypeScript SDK)
```shell
npm install lore-sdk
```
Features
Auto-Retrieval (v0.8.3)
GET /v1/retrieve · runtime hooks · fail-open design
The headline feature. Semantic search + formatted output designed for prompt injection. Supports XML, Markdown, and raw JSON formats. Configurable score threshold, result limit, and timeout.
Memory Management
remember · recall · forget · list_memories · stats · upvote · downvote
Core operations with semantic search, tier-based TTL (working/short/long), importance scoring with temporal decay, and automatic PII redaction.
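The README does not spell out the scoring formula, but exponential decay with a half-life plus per-tier TTLs is a reasonable mental model for "importance scoring with temporal decay". The values below are illustrative, not Lore's defaults:

```python
import math

TIER_TTL_DAYS = {"working": 1, "short": 30, "long": 365}  # illustrative TTLs

def effective_importance(base, age_days, half_life_days=30.0):
    """Importance halves every half_life_days, so stale memories rank lower."""
    return base * math.exp(-math.log(2) * age_days / half_life_days)

def expired(tier, age_days):
    """A memory past its tier's TTL is eligible for eviction."""
    return age_days > TIER_TTL_DAYS[tier]
```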
Knowledge Graph
graph_query · entity_map · related
Entities and relationships extracted from memories, with hop-by-hop traversal. Graph-enhanced recall surfaces connected memories that pure vector search misses.
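Hop-by-hop traversal can be sketched as breadth-first search with a hop budget. The toy graph and `related()` helper below are illustrative, not Lore's `related` tool:

```python
from collections import deque

GRAPH = {  # toy entity graph: entity -> directly related entities
    "stripe": ["payments", "rate-limit"],
    "payments": ["checkout"],
    "rate-limit": ["backoff"],
}

def related(entity, hops=2):
    """Breadth-first traversal: every entity reachable within `hops` edges."""
    seen, frontier = {entity}, deque([(entity, 0)])
    while frontier:
        node, depth = frontier.popleft()
        if depth == hops:
            continue
        for neighbor in GRAPH.get(node, []):
            if neighbor not in seen:
                seen.add(neighbor)
                frontier.append((neighbor, depth + 1))
    return seen - {entity}
```

This is why graph-enhanced recall can surface "backoff" for a query about Stripe even when the two never co-occur in a single memory.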
Fact Extraction & Conflicts
extract_facts · list_facts · conflicts
Atomic (subject, predicate, object) triples extracted from text. Automatic conflict detection when new facts contradict old ones.
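One minimal way to model the triples and the conflict rule described above (`Fact` and `conflicts()` are illustrative, not Lore's internal types):

```python
from typing import NamedTuple

class Fact(NamedTuple):
    subject: str
    predicate: str
    obj: str

def conflicts(new, existing):
    """Same subject and predicate with a different object is a candidate conflict."""
    return [f for f in existing
            if (f.subject, f.predicate) == (new.subject, new.predicate)
            and f.obj != new.obj]

old_facts = [Fact("api", "rate_limit", "100 req/min")]
new_fact = Fact("api", "rate_limit", "500 req/min")
```

Here `conflicts(new_fact, old_facts)` flags the stale rate limit; an identical restatement of an existing fact flags nothing.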
Intelligence Pipeline
classify · enrich · consolidate
LLM-powered classification, metadata enrichment, and memory consolidation. All opt-in — requires an LLM API key.
Temporal Queries (v0.7.0)
on_this_day · verbatim recall · temporal filters
Surface memories from the same day across years. Return original words. Filter by date ranges.
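The `on_this_day` semantics can be sketched as a month/day match across other years (illustrative helper, not Lore's API):

```python
from datetime import date

def on_this_day(memories, today):
    """Memories written on today's month/day in a different year."""
    return [text for text, d in memories
            if (d.month, d.day) == (today.month, today.day) and d.year != today.year]

memories = [
    ("shipped v1.0", date(2023, 6, 1)),
    ("daily standup", date(2024, 5, 2)),
]
```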
Import/Export
ingest · as_prompt · check_freshness · github_sync
Webhook-style ingestion, LLM-formatted export, staleness detection, and GitHub sync.
MCP Setup Guides
| Client | Guide |
|---|---|
| Claude Desktop | docs/setup-claude-desktop.md |
| Claude Code | docs/setup-claude-code.md |
| Cursor | docs/setup-cursor.md |
| VS Code (Copilot) | docs/setup-vscode.md |
| Windsurf | docs/setup-windsurf.md |
| ChatGPT | docs/setup-chatgpt.md |
| Cline | docs/setup-cline.md |
API Reference
Performance
| Operation | Latency |
|---|---|
| `/v1/retrieve` (warm) | ~20ms |
| `remember()` (no LLM) | < 100ms |
| `recall()` (100 memories) | < 50ms |
| `recall()` (10K memories) | < 200ms |
| `recall()` (graph-enhanced) | < 500ms |
| Embedding (500 words) | < 200ms |
Architecture
```mermaid
graph LR
    A[Agent Runtime] -->|hook| B["Retrieve API"]
    B --> C["Embedder (ONNX)"]
    C --> D[pgvector search]
    D --> E[Score + Format]
    E -->|context| A
    F[Conversations] -->|auto-ingest| G["Memories API"]
    G --> H[Lore SDK]
    H --> I["Store (SQLite / Postgres)"]
    H --> J["LLM Pipeline (optional)"]
```
Examples
See examples/ for runnable scripts:
- `full_pipeline.py` — remember, recall, tiers, prompt export
- `mcp_tool_tour.py` — tour of all 20 MCP tools
- `webhook_ingestion.py` — ingest with source tracking
- `consolidation_demo.py` — memory consolidation
Contributing
```shell
git clone https://github.com/agentkitai/lore.git
cd lore
pip install -e ".[dev,mcp,enrichment]"
pytest
```
License
MIT
File details

Details for the file lore_sdk-0.9.1.tar.gz.

File metadata

- Download URL: lore_sdk-0.9.1.tar.gz
- Size: 799.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

| Algorithm | Hash digest |
|---|---|
| SHA256 | `dcedac10bb2fea3b65373a2b515aec45dc10cee1d9c44b2fc2e3cdf98604b6be` |
| MD5 | `e9cf57e2caec9c2f40c521e04dfb2260` |
| BLAKE2b-256 | `73b4a0a6b400f6c7304d41d6861cef54d0fa0403210740cc593b5b4cad4af49d` |
File details

Details for the file lore_sdk-0.9.1-py3-none-any.whl.

File metadata

- Download URL: lore_sdk-0.9.1-py3-none-any.whl
- Size: 184.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

| Algorithm | Hash digest |
|---|---|
| SHA256 | `0ea6ae21822043dd74d72c25136849930b62ea42c84f75204eaabd44d179fceb` |
| MD5 | `869fce5159b4a3a4298c3fc2f37cbd23` |
| BLAKE2b-256 | `6370ac3b67b592cb10eb2a47b353238ff6dda3f96513c36d8d1516e460ba4b89` |