Skip to main content

Typed, policy-aware, evolving memory layer for AI agents.

Project description

TypedMemory

Long-term memory and reflection for AI agents. Persistent, evolving, context-aware — improves agent behavior over time.

CI PyPI Python License: MIT

📦 PyPI · 📚 Docs · 🏷️ Releases · 📝 Changelog

AI agents start believing their own hallucinations.

They contradict themselves silently, overwrite past decisions with no audit trail, and never resolve goals.

TypedMemory makes that visible. Structured knowledge, conflict policies, evolution over time.

5 lines for an agent

from typedmem import AgentMemory

mem = AgentMemory(profile="personal", path="agent.db")

mem.remember("User wants to learn Rust by year end")
mem.remember("User lives in Tokyo")

hits = mem.recall("what is the user trying to learn?")
#   → [ScoredMemory(content="User wants to learn Rust...", score=0.78)]

report = mem.reflect()
#   → AgentMemoryReflection(contradictions=[], drift_records=[], ...)

Four verbs over the whole pipeline: remember (extract + store), recall (semantic retrieval), reflect (run the evolver pipeline), forget (explicit delete).

The contradiction-detection moment

$ pip install typedmem

$ typedmem --profile engineering_design add \
    "SQLite handles our single-writer load fine" --type risk --subject storage
$ typedmem --profile engineering_design add \
    "SQLite blocks under concurrent writes"     --type risk --subject storage

$ typedmem --profile engineering_design contradictions

1 contradiction cluster(s):

cluster 1 (2 memories):
  [risk] [storage] SQLite handles our load fine
  [risk] [storage] SQLite blocks under concurrent writes

Two memories cross-linked by the FLAG policy. Both still in the store — no silent overwrite, no lost audit trail. Run typedmem history <id> on either to see exactly when and why the state changed.

Want the 30-second no-flags version? examples/DEMO.md — 5 lines, 4 sentences become 3 typed memories, one preference silently REPLACEs another.

Want the before-vs-after agent story? examples/agent_loop_demo.py — same 5 user utterances, with and without TypedMemory.

What makes TypedMemory different

Most systems store memory. TypedMemory evolves it.

  • Contradictions get surfaced — the FLAG policy cross-links conflicting memories so you can see both sides, not just the last write
  • Preferences get tracked — every REPLACE writes to replace_log; a drift detector flags unstable preferences before they corrupt your agent's behavior
  • Goals get resolved — when an event arrives that semantically matches an active goal, the goal flips to resolved (with a one-level undo)
  • Stale memories get summarized — non-destructively: originals are kept, a new summary memory links back via metadata["summarizes"]
  • Every action leaves an audit trailEvolutionRecord per change, written into the affected memory's metadata["evolution_history"]

Memory becomes a living knowledge layer, not a log.

Use cases

  • Debugging hallucinating agents. When an agent flips its story, you usually have no record of how it got there. With TypedMemory, every state change writes an EvolutionRecord into the affected memory's evolution_historytypedmem history <id> shows you exactly when it changed, what policy fired, and what the previous content was. Contradictions don't disappear under the new write; they get flagged so you can see both sides.
  • Multi-document research / RAG with provenance. Each fact carries a list of structured Source entries (document_id, chunk_id, span, authority). Two papers reporting the same outcome reinforce a single memory instead of producing duplicates. Citations come out of the store automatically.
  • Long-running personal assistants. Preferences with REPLACE write to replace_log; the PreferenceDriftDetector surfaces unstable preferences before they make your agent inconsistent. Stale events get non-destructively summarized into facts.
  • Design-doc agents. Decisions use SUPERSEDE — old decisions stay in the store with superseded_by pointing forward, so you keep the audit trail without polluting the active view.
  • Multi-tenant agents (legal + medical + customer-success on one machine). workspace namespaces every memory; no cross-domain collisions.

How it works

                              ┌──────────────────┐
                              │  DomainProfile   │  ← schema: which types,
                              │  TypeSpec × N    │     which policies,
                              │  prompt + rules  │     which validations
                              └────────┬─────────┘
                                       │
       text ──► Extractor ──► Memory ──┴──► MemoryStore ──► Retriever
                                            │
                                            ▼
                                         Evolver
                              (contradictions, drift, goals,
                               non-destructive summarization)

Every memory has a type (claim, decision, observation, …), a confidence, a structured source, a lifecycle policy, and a workspace — not a string in a vector database. Memories know how to update themselves on conflict, how to decay over time, and how to be summarized.

Zero runtime dependencies. Stdlib only. LLM clients, YAML profile loading, and richer embedders are optional extras.

Why this exists

Most "AI memory" libraries are wrappers around a vector database. That works for "remember what the user said," but it falls apart the moment you want an agent to:

  • track who said what, in which document, at which span (provenance)
  • handle the same fact from three sources without storing it three times (reinforcement)
  • recognize that a new decision supersedes the old one without losing the audit trail
  • summarize stale events without throwing away the originals
  • isolate legal memory from medical memory on the same machine
  • flag contradictions instead of silently overwriting them

TypedMemory handles these as first-class concepts, not bolt-ons.

Install

pip install typedmem                       # default install, zero deps
pip install 'typedmem[anthropic]'          # + AnthropicClient
pip install 'typedmem[openai]'             # + OpenAIClient
pip install 'typedmem[yaml]'               # + DomainProfile.from_yaml()
pip install 'typedmem[all]'

Python 3.10+.

60-second demo: an engineering design agent

import json
from typedmem import (
    DomainProfile, FakeClient, LLMExtractor, SQLiteMemoryStore,
)

profile = DomainProfile.builtin("engineering_design")
store = SQLiteMemoryStore.for_profile(profile, "design.db")

# Pretend the LLM extracted these from your design docs.
extractor = LLMExtractor(client=FakeClient([
    json.dumps([
        {"type": "decision", "content": "Use SQLite for storage",
         "subject": "storage_backend", "confidence": 0.9,
         "source": {"document_id": "design_v1.md"}},
        {"type": "risk", "content": "SQLite is single-writer",
         "subject": "storage_backend", "confidence": 0.8,
         "source": {"document_id": "design_v1.md"}},
    ]),
    json.dumps([
        {"type": "decision", "content": "Switch to PostgreSQL for concurrent writes",
         "subject": "storage_backend", "confidence": 0.9,
         "source": {"document_id": "design_v2.md"}},
        {"type": "risk", "content": "Postgres adds an external service",
         "subject": "storage_backend", "confidence": 0.85,
         "source": {"document_id": "design_v2.md"}},
    ]),
]), profile=profile)

for snippet in ("v1 text", "v2 text"):
    for m in extractor.extract(snippet):
        store.add(m)

# decision → SUPERSEDE: old preserved, new active.
print(store.by_type("decision"))                       # → just PostgreSQL
print(store.by_type("decision", include_superseded=True))  # → both

# risk → FLAG: two risks on the same subject get cross-linked.
for cluster in store.contradictions():
    for m in cluster:
        print(m.content)                                # → both risks

See examples/engineering_design_demo.py for the full version with audit trail and source provenance, or run:

typedmem profiles
typedmem --profile engineering_design add "..." --document-id design_v3.md
typedmem --profile engineering_design list --type decision
typedmem evolve --evolver contradictions

The mental model

Layer What it gives you Examples
Memory Typed object with content + confidence + workspace + sources + status Memory(type="claim", content=..., sources=[Source(...)])
Source Structured provenance with hashable identity (document_id, chunk_id, span) — dedup key for REINFORCE
workspace Namespace on every memory One agent, multiple corpora, zero cross-contamination
ConflictPolicy What to do when a new memory hits the same (workspace, type, subject) slot REPLACE · KEEP_BOTH · SUPERSEDE · REINFORCE · FLAG · IGNORE
DomainProfile Schema for a domain: which types, what policy each obeys, what's required engineering_design · research_paper · legal · medical_literature · personal · …
Evolver Reads memories (not text); produces audit-trailed actions ContradictionSurfacer · PreferenceDriftDetector · GoalResolver · SummaryEvolver

Built-in profiles

Profile Types Notable policies
core fact, note, goal, task, event Shared primitives all other profiles can opt into
personal + preference, observation preference → REPLACE (60d decay)
child_development + observation (tagged), milestone, concern observation tags: language/motor/emotional/cognitive/social
research_paper + claim, method, evidence, limitation, open_question evidence → REINFORCE (multiple papers corroborate)
engineering_design + decision, constraint, risk, assumption, todo decision → SUPERSEDE, risk → FLAG
legal + obligation, exception, deadline, definition, citation definition → SUPERSEDE
medical_literature + finding, population, intervention, outcome, limitation outcome → REINFORCE across studies

Custom profiles via Python dataclass, JSON, or YAML.

Storage

Three backends, one ABC:

Store Persistence Notes
InMemoryStore None Default; fastest
JSONLMemoryStore Append-only file Last-write-wins; tombstones; compact() rewrites
SQLiteMemoryStore SQLite file Indexed on (workspace, type, subject); persists embeddings; auto-migrates v0.2 → v0.4 schemas
from typedmem import SQLiteMemoryStore, DomainProfile

store = SQLiteMemoryStore.for_profile(
    DomainProfile.builtin("research_paper"),
    path="papers.db",
)

Retrieval

from typedmem import HashingEmbeddingProvider, Retriever

retriever = Retriever(store, embedder=HashingEmbeddingProvider())
hits = retriever.relevant(
    "blood pressure reduction",
    types=["evidence"],
    workspace="cardiology",
)

relevant() blends three signals: semantic (cosine), recency (exponential decay), confidence (with type-specific half-life). Without an embedder, falls back to token overlap.

Evolution

Evolvers read stored memories and produce auditable actions.

from typedmem import (
    ContradictionSurfacer, PreferenceDriftDetector,
    GoalResolver, SummaryEvolver,
    HashingEmbeddingProvider, AnthropicClient,
)

# 1. Pure read: walk the FLAG graph.
for cluster in store.contradictions():
    print(f"{len(cluster)} memories cross-link as contradictions")

# 2. Annotation: catch unstable preferences.
PreferenceDriftDetector(min_replaces=3, window_days=30).evolve(store)

# 3. Safe match: dry-run first, then commit.
embedder = HashingEmbeddingProvider()
plan = GoalResolver(embedder, threshold=0.85).evolve(store, dry_run=True)
print(plan.summary())
GoalResolver(embedder, threshold=0.85).evolve(store)            # commit

# 4. Non-destructive summary of stale events.
SummaryEvolver(AnthropicClient(), min_cluster_size=3).evolve(store)
# Originals untouched; new memory links via metadata["summarizes"].

Every action emits an EvolutionRecord (evolver, action, input_ids, output_ids, reason, timestamp) and gets appended to each affected memory's metadata["evolution_history"]. No black-box mutations.

CLI

typedmem profiles                                            # list built-in domain profiles
typedmem --profile research_paper add "..." --document-id paper.pdf
typedmem --profile engineering_design list --type decision
typedmem search "blood pressure" --type evidence
typedmem evolve --evolver contradictions
typedmem evolve --evolver goals --apply --threshold 0.9      # dry-run by default
typedmem history MEMORY_ID                                   # audit trail for one memory
typedmem workspaces

Default store: ~/.typedmem/memories.db (override with --store path.db or --store path.jsonl).

Status & roadmap

v0.4 is the first public release.

  • v0.5 sentence-transformer embedder, profile composition (extends), destructive compaction (MemoryStore.compact_summaries())
  • v0.6 hybrid BM25+semantic retrieval, query DSL, observability hooks

What TypedMemory doesn't do and doesn't plan to:

  • ship document chunkers / loaders — define the ingest() seam, bring your own (unstructured, langchain, plain regex)
  • ship its own vector DB — the abstraction is ready for one, but brute-force cosine wins under ~50k memories
  • pull network dependencies into the default install — every provider is an opt-in extra

License

MIT — see LICENSE.

Contributing

Issues and PRs welcome. Please run pytest and the demos in examples/ before opening a PR; CI runs them on Python 3.10/3.11/3.12.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

typedmem-0.5.0.tar.gz (69.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

typedmem-0.5.0-py3-none-any.whl (58.7 kB view details)

Uploaded Python 3

File details

Details for the file typedmem-0.5.0.tar.gz.

File metadata

  • Download URL: typedmem-0.5.0.tar.gz
  • Upload date:
  • Size: 69.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for typedmem-0.5.0.tar.gz
Algorithm Hash digest
SHA256 a4179cd2ddad450084b2f4a5b9bf9c6bc9cacde7d4287d4862bc9d22c4a463cc
MD5 6a3783ede34c3aa572745e9479c83107
BLAKE2b-256 44427df1a7a229bbccd0141de7797efd2dfd978db90b2566e25c29758657df09

See more details on using hashes here.

Provenance

The following attestation bundles were made for typedmem-0.5.0.tar.gz:

Publisher: release.yml on canis-minor/typedmem

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file typedmem-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: typedmem-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 58.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for typedmem-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e4d64c20b17e301557d884af660aef4042833095c0c20493bcd3e712984ac1ac
MD5 85639c8027cc0d0e8e6fb5320e57a251
BLAKE2b-256 5f4d4df1a046eb1ad2be62632b70ac44afd44509936e1b90a909e4205f4ce4ff

See more details on using hashes here.

Provenance

The following attestation bundles were made for typedmem-0.5.0-py3-none-any.whl:

Publisher: release.yml on canis-minor/typedmem

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page