Skip to main content

Persistent memory + emergent identity engine for any LLM

Project description

kore-mind

Persistent memory + emergent identity engine for any LLM.

One file = one mind. SQLite-based. Zero config. Zero external dependencies. Runtime-agnostic.

Part of kore-stack — the complete cognitive middleware for LLMs. pip install kore-stack for the full stack, or install individually:

Install

pip install kore-mind          # just the memory engine
pip install kore-stack         # full stack: mind + bridge + SC routing

Usage

from kore_mind import Mind

mind = Mind("agent.db")

# Register experiences
mind.experience("User works on complexity theory proofs")
mind.experience("User prefers direct, concise answers")

# Recall relevant memories
memories = mind.recall("proof techniques")

# Reflect: decay old memories, consolidate, update identity
identity = mind.reflect()
print(identity.summary)

# Forget: explicit pruning
mind.forget(threshold=0.1)

Core concepts

  • Memory has a lifecycle: salience decays over time. Unused memories fade. Accessed memories strengthen.
  • Identity is emergent: not configured, but computed from accumulated memories.
  • reflect() is the key operation: decay + consolidation + identity update.

API

Method Description
experience(text) Something happened. Record it.
recall(query) What's relevant now?
reflect(fn) Consolidate. Decay. Evolve.
identity() Who am I now?
forget(threshold) Explicit pruning.
scoped(source) Filtered view per user. Same DB.
traces() Query operation traces.

Semantic Search (v0.3)

Built-in embedding providers — semantic recall works with one line:

from kore_mind import Mind, numpy_embed

# Zero-dependency option (numpy only, no external service)
mind = Mind("agent.db", embed_fn=numpy_embed())

mind.experience("me gusta el café por la mañana")
mind.experience("Python es un lenguaje de programación")

# Finds "café" even searching for "bebidas calientes"
results = mind.recall("bebidas calientes")

Three providers available:

from kore_mind.embeddings import numpy_embed, ollama_embed, openai_embed

# 1. numpy_embed — zero dependencies, deterministic, fast
mind = Mind("agent.db", embed_fn=numpy_embed())

# 2. ollama_embed — local Ollama server (falls back to numpy if unavailable)
mind = Mind("agent.db", embed_fn=ollama_embed())

# 3. openai_embed — cloud, max quality (requires API key)
mind = Mind("agent.db", embed_fn=openai_embed(api_key="sk-..."))

Ollama optimizations (v0.3.1)

ollama_embed() now includes connection reuse, LRU cache, batch embedding, and fallback warnings:

embed = ollama_embed(model="nomic-embed-text", cache_size=512)

# Single embeddings — cached automatically (same text = 0 HTTP calls)
vec = embed("some text")

# Batch embedding — one HTTP call for all uncached texts
vectors = embed.batch(["text one", "text two", "text three"])

# Fallback to numpy emits RuntimeWarning (dimension mismatch: 256d vs 768d)

v0.2 Features

Per-user filtering

Each user gets their own "mind" — same database, different context.

# Option 1: default source
mind = Mind("agent.db", default_source="carlos")
mind.experience("Likes Python")  # automatically tagged to carlos
mind.recall("Python")            # only carlos's memories

# Option 2: scoped view
alice = mind.scoped("alice")
alice.experience("Prefers Rust")
alice.recall()  # only alice's memories

Observability

Full tracing of every operation. Zero overhead when disabled (default).

mind = Mind("agent.db", enable_traces=True)

mind.experience("Something happened")
mind.recall("what happened")

# Query traces
traces = mind.traces(operation="recall")
for t in traces:
    print(f"{t.operation} took {t.duration_ms:.1f}ms")

# Filter by source
traces = mind.traces(source="carlos", limit=50)

Smart Cache (storage layer)

Hash-based cache with TTL, per-user isolation, and hit counting. Used by kore-bridge for token savings.

from kore_mind.models import CacheEntry

entry = CacheEntry(
    query="What is P vs NP?",
    response="It's an open problem...",
    query_hash="a1b2c3d4",
    source="carlos",
    ttl=3600.0,
)
mind._storage.save_cache_entry(entry)
found = mind._storage.find_cache_by_hash("a1b2c3d4", source="carlos")

Rate Limiting (storage layer)

Query logging with temporal window counting. Used by kore-bridge for cognitive rate limiting.

Models

Model Description
Memory A memory with lifecycle (salience, decay, tags, embedding)
Identity Emergent identity (traits, summary, relationships)
MemoryType episodic, semantic, procedural
Trace Operation trace (operation, duration, source, metadata)
CacheEntry Cache entry (query, response, hash, TTL, hit count)

Backward compatibility

All new parameters have defaults that preserve v0.1 behavior:

# This works exactly the same as v0.1
mind = Mind("agent.db")
mind.experience("fact")
mind.recall("query")

Part of kore-stack

Package What it does
kore-mind (this) Memory, identity, traces, cache storage
kore-bridge LLM integration, cache logic, rate limiting, A/B testing
sc-router Query routing by Selector Complexity theory
kore-stack All of the above, one install: pip install kore-stack

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kore_mind-0.3.1.tar.gz (23.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kore_mind-0.3.1-py3-none-any.whl (17.4 kB view details)

Uploaded Python 3

File details

Details for the file kore_mind-0.3.1.tar.gz.

File metadata

  • Download URL: kore_mind-0.3.1.tar.gz
  • Upload date:
  • Size: 23.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for kore_mind-0.3.1.tar.gz
Algorithm Hash digest
SHA256 6d582df113ad3cbf93d2ff5128caf7f6434af15aa65839023447e043ac39fe9c
MD5 f3b76a820c37b707e8709e542f71fc9b
BLAKE2b-256 684aaa22bf9e26cf00ffd9fe31314736546dfcbe1f7860206e6a9f6e6df0b0d7

See more details on using hashes here.

File details

Details for the file kore_mind-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: kore_mind-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 17.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for kore_mind-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 a990a1ca90117ccf8703d7e67cd77ce7e6711083206835e717e8a6dcc2d4cc5c
MD5 95a9a3b5b7171681936ac04bb7196b4f
BLAKE2b-256 c41a4fdba4340c656ec451aaa3c6f5b8dba0af11696d4a591c783045ff5f91cc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page