Persistent memory + emergent identity engine for any LLM
Project description
kore-mind
Persistent memory + emergent identity engine for any LLM.
One file = one mind. SQLite-based. Zero config. Zero external dependencies. Runtime-agnostic.
Part of kore-stack — the complete cognitive middleware for LLMs. pip install kore-stack for the full stack, or install individually:
Install
pip install kore-mind # just the memory engine
pip install kore-stack # full stack: mind + bridge + SC routing
Usage
from kore_mind import Mind
mind = Mind("agent.db")
# Register experiences
mind.experience("User works on complexity theory proofs")
mind.experience("User prefers direct, concise answers")
# Recall relevant memories
memories = mind.recall("proof techniques")
# Reflect: decay old memories, consolidate, update identity
identity = mind.reflect()
print(identity.summary)
# Forget: explicit pruning
mind.forget(threshold=0.1)
Core concepts
- Memory has a lifecycle: salience decays over time. Unused memories fade. Accessed memories strengthen.
- Identity is emergent: not configured, but computed from accumulated memories.
- reflect() is the key operation: decay + consolidation + identity update.
API
| Method | Description |
|---|---|
experience(text) |
Something happened. Record it. |
recall(query) |
What's relevant now? |
reflect(fn) |
Consolidate. Decay. Evolve. |
identity() |
Who am I now? |
forget(threshold) |
Explicit pruning. |
scoped(source) |
Filtered view per user. Same DB. |
traces() |
Query operation traces. |
Semantic Search (v0.3)
Built-in embedding providers — semantic recall works with one line:
from kore_mind import Mind, numpy_embed
# Zero-dependency option (numpy only, no external service)
mind = Mind("agent.db", embed_fn=numpy_embed())
mind.experience("me gusta el café por la mañana")
mind.experience("Python es un lenguaje de programación")
# Finds "café" even searching for "bebidas calientes"
results = mind.recall("bebidas calientes")
Three providers available:
from kore_mind.embeddings import numpy_embed, ollama_embed, openai_embed
# 1. numpy_embed — zero dependencies, deterministic, fast
mind = Mind("agent.db", embed_fn=numpy_embed())
# 2. ollama_embed — local Ollama server (falls back to numpy if unavailable)
mind = Mind("agent.db", embed_fn=ollama_embed())
# 3. openai_embed — cloud, max quality (requires API key)
mind = Mind("agent.db", embed_fn=openai_embed(api_key="sk-..."))
Ollama optimizations (v0.3.1)
ollama_embed() now includes connection reuse, LRU cache, batch embedding, and fallback warnings:
embed = ollama_embed(model="nomic-embed-text", cache_size=512)
# Single embeddings — cached automatically (same text = 0 HTTP calls)
vec = embed("some text")
# Batch embedding — one HTTP call for all uncached texts
vectors = embed.batch(["text one", "text two", "text three"])
# Fallback to numpy emits RuntimeWarning (dimension mismatch: 256d vs 768d)
v0.2 Features
Per-user filtering
Each user gets their own "mind" — same database, different context.
# Option 1: default source
mind = Mind("agent.db", default_source="carlos")
mind.experience("Likes Python") # automatically tagged to carlos
mind.recall("Python") # only carlos's memories
# Option 2: scoped view
alice = mind.scoped("alice")
alice.experience("Prefers Rust")
alice.recall() # only alice's memories
Observability
Full tracing of every operation. Zero overhead when disabled (default).
mind = Mind("agent.db", enable_traces=True)
mind.experience("Something happened")
mind.recall("what happened")
# Query traces
traces = mind.traces(operation="recall")
for t in traces:
print(f"{t.operation} took {t.duration_ms:.1f}ms")
# Filter by source
traces = mind.traces(source="carlos", limit=50)
Smart Cache (storage layer)
Hash-based cache with TTL, per-user isolation, and hit counting. Used by kore-bridge for token savings.
from kore_mind.models import CacheEntry
entry = CacheEntry(
query="What is P vs NP?",
response="It's an open problem...",
query_hash="a1b2c3d4",
source="carlos",
ttl=3600.0,
)
mind._storage.save_cache_entry(entry)
found = mind._storage.find_cache_by_hash("a1b2c3d4", source="carlos")
Rate Limiting (storage layer)
Query logging with temporal window counting. Used by kore-bridge for cognitive rate limiting.
Models
| Model | Description |
|---|---|
Memory |
A memory with lifecycle (salience, decay, tags, embedding) |
Identity |
Emergent identity (traits, summary, relationships) |
MemoryType |
episodic, semantic, procedural |
Trace |
Operation trace (operation, duration, source, metadata) |
CacheEntry |
Cache entry (query, response, hash, TTL, hit count) |
Backward compatibility
All new parameters have defaults that preserve v0.1 behavior:
# This works exactly the same as v0.1
mind = Mind("agent.db")
mind.experience("fact")
mind.recall("query")
Part of kore-stack
| Package | What it does |
|---|---|
| kore-mind (this) | Memory, identity, traces, cache storage |
| kore-bridge | LLM integration, cache logic, rate limiting, A/B testing |
| sc-router | Query routing by Selector Complexity theory |
| kore-stack | All of the above, one install: pip install kore-stack |
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file kore_mind-0.4.0.tar.gz.
File metadata
- Download URL: kore_mind-0.4.0.tar.gz
- Upload date:
- Size: 27.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
708ed0e44d74e4998eb96dacb182e9a836b41ddf515e22ff08d59a32daf8499b
|
|
| MD5 |
0c7a852ec4e5c7adc6a10b283f4a216b
|
|
| BLAKE2b-256 |
e5efde32c5122930b1d0fdc885831d2d138861c41faaab6e17ab1c6563291ba1
|
File details
Details for the file kore_mind-0.4.0-py3-none-any.whl.
File metadata
- Download URL: kore_mind-0.4.0-py3-none-any.whl
- Upload date:
- Size: 19.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
03c94e6ef22206474b4d16e335901a3b3a70c78fa67165d6db95423d1b3a3b80
|
|
| MD5 |
3d922919f577945b8ab6100383344fe9
|
|
| BLAKE2b-256 |
287ce20be0be24d586a0dbf65b347aa4221d9a9d4c866c427518f24d9d26cdfe
|