Skip to main content

Local-first AI memory built to forget. Depth-axis architecture with verifiable purge receipts.

Project description

Lethe

Lethe is more than agent memory. It's the first AI memory built to forget.

PyPI tests python MCP license



Every memory framework right now is racing the same direction.

Mem0 promises perfect recall. MemPalace promises verbatim retention. Letta hands an LLM the whole context and asks it to manage itself. The benchmarks they all compete on — recall@K, hit-rate, MRR — measure one thing: how rarely does your agent lose a fact?

But agents in production don't die of losing facts. They die of keeping them. The password rotated three months ago, still suggested. The customer who exercised right-to-deletion, still in the recommender. The job title wrong since 2023 because the supersede never landed. The OTP from last Tuesday, permanently embedded next to a real preference. Memory systems fail by overgrowth, not by attrition. And no one is benchmarking that side.

The Greeks had a name for the missing operation.

Lethe (Λήθη) — one of the five rivers of Hades. Souls drank from it before reincarnation, leaving the former life behind.

Its opposite is Mnemosyne, memory. The Greek word for truthἀλήθεια / aletheia — is a- (un-) + lethe. Truth is un-forgetting.

Memory is what survives Lethe. Truth is what survives memory.


The model

In the myth, Lethe is a river — surface, current, bed. Everything in the water has a depth. A leaf floats; a stone sinks; some things are weighed down enough to disappear.

Graph stores answer what is connected to what. Vector stores answer what is semantically similar. Neither answers the question agent memory actually faces: how deep is this fact, right now?

We built the simplest mental model that fits: every fact has one number — depth. Every operation is a force on it.

depth     state                          how it got there
─────────────────────────────────────────────────────────────
+∞        pinned, immune to gravity      .pin()
= 1.0     just inscribed, on surface     .inscribe()
∈ (0, 1)  sinking under gravity          .consolidate()
= 0       submerged, present but mute    .surrender(mode="release")
< 0       erased from disk               .surrender(mode="purge")
─────────────────────────────────────────────────────────────

No weight. No alive flag. No superseded_at column. One number, one axis, one mental model.


Benchmarks

Two axes. The conventional one: can a memory system find a fact when you need it? The one we propose: can it let go of a fact when you ask? Most frameworks score on the first; Lethe scores on both.

LongMemEval-S — retrieval (the conventional axis)

500 questions on MemPalace's own evaluation methodology, same all-MiniLM-L6-v2 embedder, zero API calls.

System R@1 R@5 R@10 Wall
MemPalace (raw) 80.6% 96.6% 98.2% 12 min
Lethe v1 85.4% 97.4% 99.0% 14 min

Lethe leads at every K; the gap is 6× wider at R@1 than at R@5 (+4.8 pp vs +0.8 pp). A single depth axis beats a palace of wings, rooms, and drawers — most clearly where it matters: at #1.

ForgetEval — forgetting (the axis we propose)

1000 generated cases across five families — supersession, decay, amnesia, purge, drift — each probing one structural property a memory system must exhibit to be safe in production. Pass / fail is exact substring matching on top-k recall, no LLM judge, deterministic. Full methodology: docs/forgeteval.md.

System super decay amnesia purge drift Overall
Lethe v1 100% 100% 98% 100% 99% 99.3% (993 / 1000)
Mem0 (2.0.2) 100% 100% 70% 75% 100% 88.8%
MemPalace 0% 0% 0% 0% 0% 0% (no forgetting primitives)

Mem0 ties on supersession / decay / drift but breaks at the precision operations: forgetting one entity without bleeding into near-neighbors (amnesia 70%) and deleting by identifier without over-pruning siblings (purge 75%). MemPalace's zeros are not a benchmark failure — they are an honest report that the library was built without supersede, release, or purge.

In production this maps to real failures. 70% amnesia means three in ten "forget this user" requests leave fragments reachable to other queries — a GDPR liability and a stale-context bug. 75% purge means one in four deletions either miss the target or take a neighbor with them — the silent delete-by-similarity failure that bricks compliance audits. MemPalace's 0% is the opposite failure: a GDPR Article 17 right-to-be-forgotten request becomes a manual data-migration project, not a one-line API call.

Reproduce: py bench/forgeteval/run.py --adapter {lethe|mem0|mempalace} --scale 200

ForgetEval is downstream of the depth model — and the depth model is downstream of ForgetEval. A failing purge_gdpr case in early runs forced recall(lexical=True) into the core as a first-class primitive. Both tables above reflect that loop.


Architecture

  • One physical axis: depth. Every state — pinned, surfaced, sinking, submerged, erased — is a numeric region. No status flags.
  • Single SQLite file. Three sub-tables (memory, memory_vec, memory_fts) keyed by shared rowid; plus an append-only event log and a supersession edge table. No external services.
  • Two retrieval primitives. recall(query) is RRF-blended vec + BM25; recall(query, lexical=True) is pure BM25. Purge uses the second — deleting alice@acme.io is a lexical lookup by identifier, not a semantic search for "similar customers."
  • Verifiable forgetting. Every signed purge returns an Ed25519-signed receipt anchored to a Merkle root over the event log. Tamper with any past event afterwards → receipt fails verification. No other open-source memory framework can produce this proof because none of them keep the log to anchor to.
  • Time-travel built in. recall(query, at=T) reconstructs depth state at any past timestamp from the event log.

Quickstart

pip install "pylethe[embed,crypto,mcp]"

The PyPI distribution name is pylethe (the lethe slot was already taken on PyPI by an unrelated package); the import remains from lethe import Lethe.

Library:

from lethe import Lethe

agent = Lethe("./agent.db")
mid = agent.inscribe("Alice works at Anthropic.")

agent.surrender(mid, mode="release")            # depth → 0
agent.surrender({"old": mid, "new": "Alice now at OpenAI."},
                mode="supersede")               # old sinks, new surfaces
agent.surrender(mid, mode="purge")              # erased from disk
agent.pin(mid)                                  # depth → +∞

CLI — one subcommand per primitive:

lethe inscribe "Alice works at Anthropic."
lethe recall "Where does Alice work?"
lethe supersede 1 --new "Alice now at OpenAI."
lethe blame "Alice's job"
lethe consolidate
lethe ingest ~/notes                       # batch: *.md *.txt *.rst

# Verifiable purge
lethe keygen
lethe --db agent.db purge --signed 42      # emits receipt JSON
lethe verify-receipt receipt.json --db agent.db --db-check

DB defaults to ~/.lethe/agent.db. Pass --json on any subcommand for machine-readable output.

MCP — eleven tools exposed over stdio (every core operation plus signed-purge receipts). Add to Claude Desktop / Claude Code / Cursor:

{
  "mcpServers": {
    "lethe": {
      "command": "python",
      "args": ["-m", "lethe.mcp_server"],
      "env": {"LETHE_DB": "/absolute/path/to/agent.db"}
    }
  }
}

Recipes

Runnable cookbook in recipes/ for the common patterns: OTP TTL, GDPR purge with cryptographic receipt, belief revision via supersession, pinning user preferences, and time-travel debugging. Each recipe is a self-contained ~40-line script that runs without fastembed — python recipes/02_gdpr_purge_receipt.py.


Status

v1.0.0-alpha. Core implemented and tested:

$ pytest tests
14 passed in 0.65s

Roadmap (next):

  • Human-curated adversarial ForgetEval. Substring traps, prefix collisions, paraphrase chains. Template-generated 1000-case is the floor, not the ceiling.
  • Receipt-verification benchmark family. Does the system produce auditable proof of deletion? A new ForgetEval axis no other framework even attempts.
  • Adaptive consolidation policies. consolidate() uses one fixed decay law; we want per-domain policies — financial records decay slower than chat memory.
  • Production-density distractor corpora. Synthetic office-trivia fillers replaced with real long-form text (Wikipedia, code, emails) for a tougher recall environment.

Star History

Star History Chart

License

MIT — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pylethe-1.0.0a0.tar.gz (34.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pylethe-1.0.0a0-py3-none-any.whl (30.6 kB view details)

Uploaded Python 3

File details

Details for the file pylethe-1.0.0a0.tar.gz.

File metadata

  • Download URL: pylethe-1.0.0a0.tar.gz
  • Upload date:
  • Size: 34.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pylethe-1.0.0a0.tar.gz
Algorithm Hash digest
SHA256 2bc72378e06bb893242f820ff078fe287d532ff8d2bea6ee0b88b54cae657582
MD5 72af2e933f4b1c984fbe24d2cd8c1924
BLAKE2b-256 f0241a447c4f7d52d73b7076465775bdb66019fad54e6f9905103abd38f1011b

See more details on using hashes here.

Provenance

The following attestation bundles were made for pylethe-1.0.0a0.tar.gz:

Publisher: release.yml on deeplethe/lethe

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pylethe-1.0.0a0-py3-none-any.whl.

File metadata

  • Download URL: pylethe-1.0.0a0-py3-none-any.whl
  • Upload date:
  • Size: 30.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pylethe-1.0.0a0-py3-none-any.whl
Algorithm Hash digest
SHA256 8fbc4963a1b9bd82dc3e3f79ea5c2a7d1ca6f7edc4e42844b7cb7876576d7bac
MD5 db037c413a59a9dd633c3964cd191c63
BLAKE2b-256 3b40d149ed01deb78dac3d991b0e1afef3836e3ce238b4f4631c09a27fc1524e

See more details on using hashes here.

Provenance

The following attestation bundles were made for pylethe-1.0.0a0-py3-none-any.whl:

Publisher: release.yml on deeplethe/lethe

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page