Persistent dynamic memory engine with vector search and wave-field re-ranking

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

caspian

These details have not been verified by PyPI

Project description

WaveMind is persistent dynamic memory for AI agents: vector search first, wave-field priority second, SQLite as the source of truth.

Python License

Terminal Demo

From a cloned repository:

$ python examples/demo.py
✓ Remembered: "Andrey is a trader who tracks market breakouts."
✓ Remembered: "Andrey prefers short practical answers about AI agents."

Query: "Andrey trader agent"
→ Result 1 (0.54): "Andrey is a trader who tracks market breakouts."
→ Result 2 (0.30): "Andrey prefers short practical answers about AI agents."

The demo is offline, keyless, and uses the built-in hash encoder.

Quick Start

Install from PyPI and create your first local memory:

python -m pip install wavemind
wavemind remember "Andrey is a trader" --namespace demo
wavemind query "trader" --namespace demo

What happens here:

remember writes the text and its vector pattern into a local SQLite database.
By default, the database file is wavemind.sqlite3 in your current working directory.
--namespace demo keeps this memory separate from other users, agents, or projects.
query reads from the same SQLite file and returns the closest remembered texts.

Optional Embeddings

For sentence-transformer embeddings:

python -m pip install "wavemind[sentence]"
wavemind --encoder sentence remember "Andrey is a trader" --namespace demo
wavemind --encoder sentence query "What does Andrey do?" --namespace demo

Data Location

For an explicit database path, put global options before the command:

wavemind --db ./agent_memory.sqlite3 remember "Andrey is a trader" --namespace demo
wavemind --db ./agent_memory.sqlite3 query "trader" --namespace demo

HTTP API

Run the local FastAPI server:

wavemind --db ./agent_memory.sqlite3 serve --host 127.0.0.1 --port 8000

Store and query memory over HTTP:

curl -X POST http://127.0.0.1:8000/remember -H "Content-Type: application/json" -d "{\"text\":\"Andrey is a trader\",\"namespace\":\"demo\"}"
curl -X POST http://127.0.0.1:8000/query -H "Content-Type: application/json" -d "{\"query\":\"trader\",\"namespace\":\"demo\",\"top_k\":1}"

Install From Source

For contributors installing from a local clone:

git clone https://github.com/CaspianG/wavemind.git
cd wavemind
python -m pip install -e ".[sentence]"

One-file setup scripts are also included in the repository:

sh install.sh

install.bat

LangChain Memory

Install the optional integration:

pip install "wavemind[langchain]"

Use WaveMind as a drop-in LangChain memory object:

from wavemind.integrations.langchain import WaveMindMemory

memory = WaveMindMemory(db_path="agent_memory.sqlite3")
# Replace: memory = ConversationBufferMemory()

Offline runnable example from a cloned repository:

python examples/langchain_memory.py

Why Dynamic Memory

WaveMind is not positioned as "a faster Chroma." Chroma, Qdrant, Pinecone, and Weaviate are vector databases: they store embeddings and return nearest neighbors. That is the right tool for many static RAG workloads.

WaveMind is an agent memory layer. It still uses vector search first, but then applies memory-specific signals that a plain vector store does not model by default:

memory behavior	Why it matters for agents	WaveMind mechanism
Hot memories	Facts recalled repeatedly should become easier to recall again.	Wave-field hotness and priority updates.
Aging memories	Old low-value facts should fade instead of competing forever.	TTL and decay-aware scoring.
Scoped memory	One user, agent, workspace, or project should not leak into another.	Namespaces and tags.
Explicit forgetting	Agents need deletion, privacy cleanup, and correction workflows.	`forget()` plus SQLite persistence.
Stable restart behavior	A memory system must survive process restarts.	SQLite source of truth, reloadable indexes.
Vector plus memory rank	Semantic similarity is necessary but not sufficient for long-running agents.	k-NN candidates first, wave field as re-ranker.

The current Chroma benchmark below is intentionally conservative: it compares static retrieval on the same facts and the same hash embeddings. That benchmark is useful, but it does not exercise WaveMind's main product thesis: memory that changes over time as an agent recalls, reinforces, ages, and forgets information.

The benchmark that should decide whether WaveMind is worth using is a dynamic agent-memory benchmark:

scenario	What should happen
A user repeats a preference many times.	WaveMind should rank it higher than equally similar but unused facts.
A fact expires via TTL.	WaveMind should suppress it without requiring manual vector cleanup.
A user corrects an old fact.	WaveMind should prefer the newer or reinforced memory.
A query is ambiguous across namespaces.	WaveMind should return only the scoped user's memory.
A long conversation has many irrelevant facts.	WaveMind should preserve useful recall instead of treating all vectors equally.

In short: static vector search answers "what is nearest?" Agent memory also asks "what is still relevant, reinforced, scoped, and allowed to be remembered?"

Benchmark

Real Russian sentences from Tatoeba, 50 one-word queries, NumPy exact index.

metric	hash	sentence-transformers
precision@1	1.00	1.00
precision@3	1.00	1.00
avg query	0.49 ms	52.84 ms

Capacity check with the hash encoder:

memories	precision@1	precision@3	avg query
200	1.00	1.00	0.49 ms
1000	0.88	1.00	1.50 ms
5000	0.72	0.88	5.68 ms

Run locally from a cloned repository:

python benchmarks/ru_sentences_benchmark.py --sentences 200 --queries 50 --encoder hash --index numpy
python benchmarks/ru_sentences_benchmark.py --sentences 200 --queries 50 --encoder sentence --index numpy

Agent-memory benchmark against Chroma:

200 Russian user facts, 50 natural-language questions, same precomputed HashingTextEncoder embeddings for WaveMind and Chroma. Full machine-readable result: benchmarks/agent_memory_results.json.

This is a static retrieval benchmark. It measures baseline ranking and latency, not hotness, TTL, repeated recall, or memory aging.

engine	precision@1	precision@3	avg latency
WaveMind	0.82	0.90	2.25 ms
Chroma	0.82	0.88	0.93 ms

Run locally from a cloned repository:

pip install -e ".[bench]"
python benchmarks/agent_memory_benchmark.py --engines wavemind chroma --facts 200 --queries 50

Comparison

feature	WaveMind	Chroma	Qdrant
Primary role	Agent memory engine	Embedding database	Production vector database
Local SQLite persistence	Yes	Yes	No, separate service/storage
HTTP API	FastAPI included	Included	Included
Dynamic memory priority	Wave-field hotness, TTL, priority	Metadata/filter driven	Payload/filter driven
Built-in forgetting	TTL and explicit forget	Manual delete/filtering	Manual delete/filtering
Best fit	Small to medium agent memory with dynamic recall	Local RAG apps and prototypes	Large-scale vector search
Scale target today	Up to 1000 optimal on NumPy, FAISS recommended beyond 5000	Larger than WaveMind local mode	Production scale

WaveMind is not trying to replace dedicated vector databases at scale. The intended product gap is dynamic priority: frequently used memories can become hotter while old or low-priority memories fade. For static RAG over large document collections, use a mature vector database. For agent memory that needs persistence, scoped recall, TTL, forgetting, and reinforcement, WaveMind is designed to sit above or beside the vector index.

Known Limitations

Optimal capacity on the current NumPy exact index is up to 1000 records.
At 5000 records, one-word precision@1 is currently 0.72 with the hash encoder; many misses are ambiguous queries where another sentence containing the same word ranks first.
For N > 5000, use the FAISS backend with --index faiss or another production vector index.
sentence-transformers/paraphrase-multilingual-mpnet-base-v2 requires about 420 MB of model files and measured about 53 ms per query on the benchmark machine.
The Chroma comparison currently uses shared precomputed hash embeddings to isolate retrieval/ranking behavior; semantic model comparisons should be run separately.
In the 200-fact agent benchmark, Chroma is faster on average while WaveMind is slightly higher at precision@3.
The current public benchmark does not yet prove the dynamic-memory advantage. The next benchmark must test hotness, TTL, corrections, namespace isolation, and repeated recall.

Roadmap

FAISS-first production index path with persisted index rebuilds.
Dynamic agent-memory benchmark against Chroma/Qdrant: hotness, TTL, stale-fact suppression, corrections, and namespace isolation.
Expand the agent-memory benchmark to sentence-transformers, FAISS, Chroma default embeddings, and Qdrant.
Better semantic query expansion for short and ambiguous queries.
Namespace quotas, backups, and daemon hardening for SaaS use.
Webhook on recall for agent runtimes.
OHLCV pattern-memory experiments for market research and backtests.

License

MIT. See LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

caspian

These details have not been verified by PyPI

Release history Release notifications | RSS feed

2.2.1

Jul 1, 2026

2.2.0

Jul 1, 2026

2.1.1

Jul 1, 2026

2.1.0

Jun 30, 2026

2.0.5

Jun 18, 2026

This version

2.0.4

Jun 18, 2026

2.0.3

Jun 18, 2026

2.0.2

Jun 18, 2026

2.0.1

Jun 18, 2026

2.0.0

Jun 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wavemind-2.0.4.tar.gz (32.8 kB view details)

Uploaded Jun 18, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

wavemind-2.0.4-py3-none-any.whl (24.9 kB view details)

Uploaded Jun 18, 2026 Python 3

File details

Details for the file wavemind-2.0.4.tar.gz.

File metadata

Download URL: wavemind-2.0.4.tar.gz
Upload date: Jun 18, 2026
Size: 32.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for wavemind-2.0.4.tar.gz
Algorithm	Hash digest
SHA256	`38f08c73a60d104f286416e2e4a864242ecbabaf22e15243a697e4ecc7791f8b`
MD5	`626438730f086c16ae34abd743ed71ce`
BLAKE2b-256	`69694b03a1444fe521b11d2a7ab74aa8ec678d2e261e57747b379eb3fd98daac`

See more details on using hashes here.

Provenance

The following attestation bundles were made for wavemind-2.0.4.tar.gz:

Publisher: publish.yml on CaspianG/wavemind

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: wavemind-2.0.4.tar.gz
- Subject digest: 38f08c73a60d104f286416e2e4a864242ecbabaf22e15243a697e4ecc7791f8b
- Sigstore transparency entry: 1860644573
- Sigstore integration time: Jun 18, 2026
Source repository:
- Permalink: CaspianG/wavemind@84e7cba18d3e1b9ce97df8a64af05306ab8b453f
- Branch / Tag: refs/tags/v2.0.4
- Owner: https://github.com/CaspianG
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@84e7cba18d3e1b9ce97df8a64af05306ab8b453f
- Trigger Event: push

File details

Details for the file wavemind-2.0.4-py3-none-any.whl.

File metadata

Download URL: wavemind-2.0.4-py3-none-any.whl
Upload date: Jun 18, 2026
Size: 24.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for wavemind-2.0.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`161ae98c6259078ce60441be7c04a7ebbb56cde5a7e0ec1b74e232d8a0321b26`
MD5	`19e8809c7c46b24aa787b498719fdbe6`
BLAKE2b-256	`9397d322f9d3bc68c8313f69fc6aaa8123742a83e584f92e66ddf3a074033fb1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for wavemind-2.0.4-py3-none-any.whl:

Publisher: publish.yml on CaspianG/wavemind

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: wavemind-2.0.4-py3-none-any.whl
- Subject digest: 161ae98c6259078ce60441be7c04a7ebbb56cde5a7e0ec1b74e232d8a0321b26
- Sigstore transparency entry: 1860644700
- Sigstore integration time: Jun 18, 2026
Source repository:
- Permalink: CaspianG/wavemind@84e7cba18d3e1b9ce97df8a64af05306ab8b453f
- Branch / Tag: refs/tags/v2.0.4
- Owner: https://github.com/CaspianG
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@84e7cba18d3e1b9ce97df8a64af05306ab8b453f
- Trigger Event: push

wavemind 2.0.4

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Project description

WaveMind is persistent dynamic memory for AI agents: vector search first, wave-field priority second, SQLite as the source of truth.

Terminal Demo

Quick Start

Optional Embeddings

Data Location

HTTP API

Install From Source

LangChain Memory

Why Dynamic Memory

Benchmark

Comparison

Known Limitations

Roadmap

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance