Online associative memory for LLMs

Project description

MDA — Modular Dynamic Architecture

Online associative memory for LLMs. Learns during inference. No backpropagation. No GPU required.

What is MDA?

Large language models can reason but cannot remember. RAG partially addresses this — but it cannot update during a conversation, and it cannot learn from the conversation itself.

MDA fills precisely these gaps.

It encodes knowledge as 256-dimensional Holographic Distributed Representations (HDRs), connects concepts through a sparse synapse graph, and retrieves context by activating entity networks — not by text-chunk similarity search. New knowledge is integrated immediately, without rebuilding any index.

MDA is not a RAG replacement. It is the persistent learning layer that RAG and LLMs are missing.

Key Properties

Token-free — no tokenizer, no vocabulary
Attention-free — no transformer encoder required
Online learning — learns during inference via the Oja rule
No catastrophic forgetting — entities are independent; new knowledge never overwrites old
CPU-only — runs entirely on numpy, no GPU required
Model-agnostic — works with Ollama, OpenAI, Anthropic, or any LLM

Benchmark Results

Evaluated against a strong RAG baseline (bge-large-en-v1.5 + ChromaDB, top-6 retrieval) across 80 questions spanning 8 cognitive categories:

Category	RAG	MDA	Δ
ATOMIC_RECALL	100%	85%	−15%
MULTI_HOP	90%	90%	0%
CROSS_DOCUMENT	80%	70%	−10%
REASONING	70%	90%	+20%
INCREMENTAL_LEARNING	0%	60%	+60%
NOISE_RESISTANCE	100%	100%	0%
MEMORY_COMPRESSION	90%	70%	−20%
BOUNDARY	100%	100%	0%
OVERALL	78.8%	83.1%	+4.3%

MDA uses 3.1× less context per query than RAG while achieving higher overall accuracy.

Long-context retention (200 turns): RAG 0% — MDA 92%.

Quick Start

pip install mda-memory

Or clone for development:

git clone https://github.com/Rangle2/mda
cd mda
pip install -e .

Basic Usage

from mda import MDA

memory = MDA()

# Teach facts
memory.learn("The capital of Veloria is Aranthos.")
memory.learn("Aranthos was founded by Queen Seraphel in 412 AE.")

# Retrieve context — returns memory string ready for LLM injection
context = memory.context_for("Who founded the capital?")
print(context)
# → [MEMORY] Aranthos was founded by Queen Seraphel in 412 AE.

With Ollama

# Start Ollama with any model
ollama pull qwen3:4b

# Run MDA CLI
mda --model qwen3:4b

With Anthropic (Claude)

# Set your API key
export ANTHROPIC_API_KEY=your_key_here

# Run MDA CLI with Claude
mda --model claude-haiku-4-5-20251001 --provider anthropic

Running Benchmarks

# Main benchmark — Ollama
python benchmark/benchmark.py --model qwen3:4b

# Main benchmark — Anthropic
python benchmark/benchmark.py --model claude-haiku-4-5-20251001 --provider anthropic

# Long-context retention benchmark — Ollama
python benchmark/long_context_benchmark.py --model qwen3:4b

# Long-context retention benchmark — Anthropic
python benchmark/long_context_benchmark.py --model claude-haiku-4-5-20251001 --provider anthropic

Project Structure

mda/
├── mda/
│   ├── __init__.py        # Public API — exposes MDA class
│   ├── mda.py             # MDA class implementation
│   ├── core/
│   │   ├── bind.py        # HDR ops: bind, unbind, normalize, cosine
│   │   ├── encoder.py     # HolisticEncoder: text → 256-dim vector
│   │   ├── entity.py      # Entity: v, r, h, W, neurons, synapses, senses
│   │   ├── neuron.py      # Neuron (Oja rule), Synapse (Hebbian)
│   │   └── registry.py    # EntityRegistry: surface → Entity lookup
│   ├── inference/
│   │   ├── associative.py # AssociativeChain: multi-hop traversal
│   │   ├── broca.py       # BrocaModule: W-hybrid context scoring
│   │   ├── reasoning.py   # ReasoningEngine: cross-entity inference
│   │   └── memory.py      # ConversationMemory: multi-turn context
│   ├── training/
│   │   └── checkpoint.py  # save/load: float32, atomic write
│   └── integrations/
│       ├── engine.py      # MDAEngine: LLM bridge (Ollama / Anthropic)
│       └── cli.py         # Interactive CLI (Rich + prompt_toolkit)
├── benchmark/
│   ├── benchmark.py       # 80-question main benchmark
│   └── long_context_benchmark.py  # 200-turn retention benchmark
├── tests/                 # pytest test suite
└── pyproject.toml

How It Works

Entity & W Matrix

Every concept is an Entity with a 256-dim identity vector v and a lazy-initialized Hebbian weight matrix W. W is None until the entity is first activated — memory overhead is proportional to usage.

Online Learning (Oja Rule)

ΔW = η(yxᵀ − y²W)

where x is the query vector and y = Wx. No backpropagation. No gradient descent. Runs in O(d²) per entity per turn.

AssociativeChain

Query → origin entity → breadth-first synapse traversal (depth 3) → context assembly. Dynamic inhibition prevents context flooding.

BrocaModule

Facts are scored with a W-hybrid formula:

score = 0.35·s_query + 0.45·s_W + 0.20·s_sense

Facts below 0.20 are suppressed.

Running Tests

pytest tests/

Roadmap

GPU acceleration — PyTorch tensor ops, parallel entity traversal
Low-rank W — W ≈ A×B factorization for higher-dimensional HDRs
mda.cloud API — persistent memory as a service
MDA + RAG hybrid — offline corpus retrieval + online conversational learning
Real-world corpus evaluation — beyond fictional benchmark domains
Independent benchmark — community-constructed evaluation set

License

SSPL 1.0 — free for research and personal use. Commercial use requires a separate agreement.

For commercial licensing: mert@kairfy.com

Project details

Release history Release notifications | RSS feed

This version

0.1.2

Apr 26, 2026

0.1.1

Apr 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mda_memory-0.1.2.tar.gz (78.4 kB view details)

Uploaded Apr 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mda_memory-0.1.2-py3-none-any.whl (67.0 kB view details)

Uploaded Apr 26, 2026 Python 3

File details

Details for the file mda_memory-0.1.2.tar.gz.

File metadata

Download URL: mda_memory-0.1.2.tar.gz
Upload date: Apr 26, 2026
Size: 78.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mda_memory-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`2436dcfaecbf3f580326c4d1327e05edd53d63bbccb56778833ed3df4c2c4f06`
MD5	`879174a9880d3a4e96072384591c1142`
BLAKE2b-256	`f8f7bef44a32bd302cd5e59bdc81d8a21355456b047f011a07c4540dfa0d2655`

See more details on using hashes here.

File details

Details for the file mda_memory-0.1.2-py3-none-any.whl.

File metadata

Download URL: mda_memory-0.1.2-py3-none-any.whl
Upload date: Apr 26, 2026
Size: 67.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mda_memory-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e80ed01a0258992b690f8c9b7b3ffd05bd568f18507e83293479780015237f9c`
MD5	`119dba18c201d8984f5086364448fa08`
BLAKE2b-256	`7e17012d26ec2a15747b531e3dcac249d9d22669336626404b725cc0dccb47a7`

See more details on using hashes here.

mda-memory 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

MDA — Modular Dynamic Architecture

What is MDA?

Key Properties

Benchmark Results

Quick Start

Basic Usage

With Ollama

With Anthropic (Claude)

Running Benchmarks

Project Structure

How It Works

Entity & W Matrix

Online Learning (Oja Rule)

AssociativeChain

BrocaModule

Running Tests

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes