Skip to main content

Persistent memory + emergent identity engine for any LLM

Project description

kore-mind

Persistent memory + emergent identity engine for any LLM.

One file = one mind. SQLite-based. Zero config. Zero external dependencies. Runtime-agnostic.

Part of kore-stack — the complete cognitive middleware for LLMs. pip install kore-stack for the full stack, or install individually:

Install

pip install kore-mind          # just the memory engine
pip install kore-stack         # full stack: mind + bridge + SC routing

Usage

from kore_mind import Mind

mind = Mind("agent.db")

# Register experiences
mind.experience("User works on complexity theory proofs")
mind.experience("User prefers direct, concise answers")

# Recall relevant memories
memories = mind.recall("proof techniques")

# Reflect: decay old memories, consolidate, update identity
identity = mind.reflect()
print(identity.summary)

# Forget: explicit pruning
mind.forget(threshold=0.1)

Core concepts

  • Memory has a lifecycle: salience decays over time. Unused memories fade. Accessed memories strengthen.
  • Identity is emergent: not configured, but computed from accumulated memories.
  • reflect() is the key operation: decay + consolidation + identity update.

API

Method Description
experience(text) Something happened. Record it.
recall(query) What's relevant now?
reflect(fn) Consolidate. Decay. Evolve.
identity() Who am I now?
forget(threshold) Explicit pruning.
scoped(source) Filtered view per user. Same DB.
traces() Query operation traces.

Semantic Search (v0.3)

Built-in embedding providers — semantic recall works with one line:

from kore_mind import Mind, numpy_embed

# Zero-dependency option (numpy only, no external service)
mind = Mind("agent.db", embed_fn=numpy_embed())

mind.experience("me gusta el café por la mañana")
mind.experience("Python es un lenguaje de programación")

# Finds "café" even searching for "bebidas calientes"
results = mind.recall("bebidas calientes")

Three providers available:

from kore_mind.embeddings import numpy_embed, ollama_embed, openai_embed

# 1. numpy_embed — zero dependencies, deterministic, fast
mind = Mind("agent.db", embed_fn=numpy_embed())

# 2. ollama_embed — local Ollama server (falls back to numpy if unavailable)
mind = Mind("agent.db", embed_fn=ollama_embed())

# 3. openai_embed — cloud, max quality (requires API key)
mind = Mind("agent.db", embed_fn=openai_embed(api_key="sk-..."))

Ollama optimizations (v0.3.1)

ollama_embed() now includes connection reuse, LRU cache, batch embedding, and fallback warnings:

embed = ollama_embed(model="nomic-embed-text", cache_size=512)

# Single embeddings — cached automatically (same text = 0 HTTP calls)
vec = embed("some text")

# Batch embedding — one HTTP call for all uncached texts
vectors = embed.batch(["text one", "text two", "text three"])

# Fallback to numpy emits RuntimeWarning (dimension mismatch: 256d vs 768d)

v0.2 Features

Per-user filtering

Each user gets their own "mind" — same database, different context.

# Option 1: default source
mind = Mind("agent.db", default_source="carlos")
mind.experience("Likes Python")  # automatically tagged to carlos
mind.recall("Python")            # only carlos's memories

# Option 2: scoped view
alice = mind.scoped("alice")
alice.experience("Prefers Rust")
alice.recall()  # only alice's memories

Observability

Full tracing of every operation. Zero overhead when disabled (default).

mind = Mind("agent.db", enable_traces=True)

mind.experience("Something happened")
mind.recall("what happened")

# Query traces
traces = mind.traces(operation="recall")
for t in traces:
    print(f"{t.operation} took {t.duration_ms:.1f}ms")

# Filter by source
traces = mind.traces(source="carlos", limit=50)

Smart Cache (storage layer)

Hash-based cache with TTL, per-user isolation, and hit counting. Used by kore-bridge for token savings.

from kore_mind.models import CacheEntry

entry = CacheEntry(
    query="What is P vs NP?",
    response="It's an open problem...",
    query_hash="a1b2c3d4",
    source="carlos",
    ttl=3600.0,
)
mind._storage.save_cache_entry(entry)
found = mind._storage.find_cache_by_hash("a1b2c3d4", source="carlos")

Rate Limiting (storage layer)

Query logging with temporal window counting. Used by kore-bridge for cognitive rate limiting.

Models

Model Description
Memory A memory with lifecycle (salience, decay, tags, embedding)
Identity Emergent identity (traits, summary, relationships)
MemoryType episodic, semantic, procedural
Trace Operation trace (operation, duration, source, metadata)
CacheEntry Cache entry (query, response, hash, TTL, hit count)

Backward compatibility

All new parameters have defaults that preserve v0.1 behavior:

# This works exactly the same as v0.1
mind = Mind("agent.db")
mind.experience("fact")
mind.recall("query")

Part of kore-stack

Package What it does
kore-mind (this) Memory, identity, traces, cache storage
kore-bridge LLM integration, cache logic, rate limiting, A/B testing
sc-router Query routing by Selector Complexity theory
kore-stack All of the above, one install: pip install kore-stack

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kore_mind-0.4.0.tar.gz (27.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kore_mind-0.4.0-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file kore_mind-0.4.0.tar.gz.

File metadata

  • Download URL: kore_mind-0.4.0.tar.gz
  • Upload date:
  • Size: 27.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for kore_mind-0.4.0.tar.gz
Algorithm Hash digest
SHA256 708ed0e44d74e4998eb96dacb182e9a836b41ddf515e22ff08d59a32daf8499b
MD5 0c7a852ec4e5c7adc6a10b283f4a216b
BLAKE2b-256 e5efde32c5122930b1d0fdc885831d2d138861c41faaab6e17ab1c6563291ba1

See more details on using hashes here.

File details

Details for the file kore_mind-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: kore_mind-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 19.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for kore_mind-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 03c94e6ef22206474b4d16e335901a3b3a70c78fa67165d6db95423d1b3a3b80
MD5 3d922919f577945b8ab6100383344fe9
BLAKE2b-256 287ce20be0be24d586a0dbf65b347aa4221d9a9d4c866c427518f24d9d26cdfe

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page