MCP server that gives LLMs persistent graph-structured memory
Persistent, structured memory for AI agents: typically uses fewer tokens than chunk-based retrieval, often 2-4× fewer on factual lookups.
Your LLM remembers facts, decisions, and context across every conversation, backed by a real knowledge graph.
Why waggle-mcp?
waggle-mcp is a local-first memory layer built on a persistent knowledge graph that your AI can read and write through any MCP-compatible client (Claude Desktop, Cursor, Codex, Antigravity, etc.).
| Stuffed context | Structured retrieval |
|---|---|
| Huge prompts every session | Compact subgraph retrieved at query time |
| Session-local memory | Persistent multi-session memory |
| Flat notes and chunks | Typed nodes and edges: decisions, reasons, contradictions |
| "What changed?" requires replaying logs | Temporal queries and diffs are first-class |
Waggle often uses materially fewer tokens than naive chunked retrieval on factual lookups, while graph-traversal queries intentionally spend more context to include reasoning chains such as updates, contradictions, and dependencies.
Quick start
pip install waggle-mcp
waggle-mcp init
# Restart your MCP client. Done.
init detects your MCP client, writes its config, and creates the local database directory. Default mode is local SQLite with on-device embeddings. Antigravity and manual configuration details are in docs/reference.md.
Manual MCP setup examples for Codex, Claude Code, Cursor, and Antigravity are in docs/reference.md.
Using It In MCP Clients
Once Waggle is installed in an MCP client, people normally do not run waggle-mcp commands by hand during everyday use. They talk to the agent normally, and the agent uses Waggle's MCP tools to store and retrieve memory.
Codex
Typical pattern:
- You work in a normal Codex thread.
- Codex calls `observe_conversation`, `store_node`, `store_edge`, `query_graph`, or `prime_context` when memory is useful.
- On a later task, Codex can pull back the connected subgraph instead of relying on the current chat window alone.
Example:
- You say: "Remember that we chose PostgreSQL because MySQL replication was painful."
- Codex stores that as structured memory.
- Days later you ask: "What did we decide about the database?"
- Codex can call `query_graph` and recover the earlier decision plus its reason.
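The store-then-recover round trip above can be sketched with a tiny relational graph store. This is illustrative only: the table layout, node kinds, and matching logic are hypothetical, not Waggle's actual schema or tool signatures.

```python
import sqlite3

# Minimal in-memory graph: typed nodes plus typed edges.
db = sqlite3.connect(":memory:")
db.executescript("""
CREATE TABLE nodes (id INTEGER PRIMARY KEY, kind TEXT, text TEXT);
CREATE TABLE edges (src INTEGER, dst INTEGER, rel TEXT);
""")

# Session 1: the agent stores a decision and its reason, then links them.
decision = db.execute(
    "INSERT INTO nodes (kind, text) VALUES ('decision', ?)",
    ("Chose PostgreSQL over MySQL",),
).lastrowid
reason = db.execute(
    "INSERT INTO nodes (kind, text) VALUES ('reason', ?)",
    ("MySQL replication painful",),
).lastrowid
db.execute("INSERT INTO edges VALUES (?, ?, 'depends_on')", (decision, reason))

# Session 2: a later query pulls the decision plus its linked reason.
row = db.execute("""
    SELECT d.text, r.text FROM nodes d
    JOIN edges e ON e.src = d.id AND e.rel = 'depends_on'
    JOIN nodes r ON r.id = e.dst
    WHERE d.kind = 'decision' AND d.text LIKE '%PostgreSQL%'
""").fetchone()
print(row)  # ('Chose PostgreSQL over MySQL', 'MySQL replication painful')
```

The point of the edge is that the reason travels with the decision: a single query recovers both, with no chat history in context.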
Claude Code
Typical pattern:
- You configure Waggle as an MCP server in Claude Code.
- Claude Code uses Waggle tools to persist decisions, preferences, architecture notes, and project state across sessions.
- `prime_context` and `export_context_bundle` are useful when starting a fresh task or handing context to another model.
Cursor
Typical pattern:
- Cursor uses Waggle over MCP while you work in the editor.
- Facts and decisions can be saved as graph memory instead of getting lost in past chats.
- Later questions like "why did we change this?" or "what superseded this decision?" can be answered from connected nodes and edges.
Antigravity
Typical pattern:
- Antigravity can use Waggle as its persistent memory backend through MCP.
- Conversation turns can be extracted with `observe_conversation`.
- Linked context can be exported with `export_context_bundle` or edited through the Markdown vault workflow.
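The Markdown vault round trip can be sketched as one file per node, with wiki-style links standing in for edges. The file layout, link syntax, and function names below are assumptions for illustration, not Waggle's actual vault format:

```python
import tempfile
from pathlib import Path

def export_node(vault: Path, node_id: str, text: str, links: list[str]) -> Path:
    """Write one node as a Markdown file; each link becomes a [[wiki-link]] line."""
    body = text + "\n\n" + "\n".join(f"[[{t}]]" for t in links) + "\n"
    path = vault / f"{node_id}.md"
    path.write_text(body, encoding="utf-8")
    return path

def import_node(path: Path) -> tuple[str, list[str]]:
    """Read a node file back: first line is the text, [[...]] lines are links."""
    lines = path.read_text(encoding="utf-8").splitlines()
    links = [l[2:-2] for l in lines if l.startswith("[[") and l.endswith("]]")]
    return lines[0], links

vault = Path(tempfile.mkdtemp())
p = export_node(vault, "decision-001", "Chose PostgreSQL over MySQL", ["reason-001"])
assert import_node(p) == ("Chose PostgreSQL over MySQL", ["reason-001"])
```

A format like this is what makes Obsidian-style editing possible: the files are plain Markdown, so edits made in a vault editor survive the import step.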
For a built-in CLI explanation of the feature surface, run:
waggle-mcp features
See it in action
Session 1 — April 10
User: Let's use PostgreSQL. MySQL replication has been painful.
Agent: [calls observe_conversation()]
→ stores decision node: "Chose PostgreSQL over MySQL"
→ stores reason node: "MySQL replication painful"
→ links them with a depends_on edge
Session 2 — April 12 (fresh context window, no history)
User: What did we decide about the database?
Agent: [calls query_graph("database decision")]
→ retrieves the decision node + linked reason from April 10
"You decided on PostgreSQL on April 10. The reason recorded was
that MySQL replication had been painful."
Session 3 — April 14
User: Actually, let's reconsider — the team is more familiar with MySQL.
Agent: [calls store_node() + store_edge(new_node → old_node, "contradicts")]
→ both positions are preserved, and the contradiction is explicit
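The contradiction step in Session 3 boils down to keeping both nodes and adding an explicit edge between them. A minimal sketch, with hypothetical names that are not Waggle's API:

```python
# Both positions survive; the contradiction is an explicit, queryable edge.
nodes = {
    1: {"kind": "decision", "text": "Chose PostgreSQL over MySQL"},
    2: {"kind": "decision", "text": "Reconsider MySQL: team familiarity"},
}
edges = [(2, 1, "contradicts")]  # new node contradicts the old one

def contradicted_by(node_id):
    """Return the texts of nodes this node explicitly contradicts."""
    return [nodes[dst]["text"] for src, dst, rel in edges
            if src == node_id and rel == "contradicts"]

print(contradicted_by(2))  # ['Chose PostgreSQL over MySQL']
```

Because nothing is overwritten, a later query can surface both the current position and the decision it superseded.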
Key Features
- Automatic Extraction: `observe_conversation` ingests facts into the graph without manual schema work.
- Portable Context: `export_context_bundle` generates Markdown/JSON context packs for another AI.
- Vault Round-trip: `export_markdown_vault` / `import_markdown_vault` for Obsidian-style node editing.
- Conflict Resolution: `list_conflicts` / `resolve_conflict` to manage contradictions without losing history.
- Deterministic Fallback: stable SHA-256 hashing for reliable, reproducible offline operation when transformer models are unavailable.
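The deterministic fallback in the last bullet can be illustrated in a few lines: hash the text with SHA-256 and unpack the digest into a fixed-width vector, so the same input always maps to the same vector even with no model available. This is a sketch of the idea, not Waggle's actual embedding code:

```python
import hashlib
import struct

def fallback_embedding(text: str, dim: int = 8) -> list[float]:
    """Deterministic pseudo-embedding derived from SHA-256 digests."""
    vec = []
    counter = 0
    while len(vec) < dim:
        # Salt with a counter so we can extend to any dimension.
        digest = hashlib.sha256(f"{counter}:{text}".encode()).digest()
        # Unpack 4-byte big-endian unsigned ints, scaled into [0, 1).
        for (value,) in struct.iter_unpack(">I", digest):
            vec.append(value / 2**32)
            if len(vec) == dim:
                break
        counter += 1
    return vec

# Same input always yields the same vector; different inputs diverge.
assert fallback_embedding("postgres") == fallback_embedding("postgres")
assert fallback_embedding("postgres") != fallback_embedding("mysql")
```

Such hash vectors carry no semantic similarity, but they are stable and reproducible, which is what matters for offline exact-match retrieval and tests.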
Benchmarks & Verification
Waggle performance is verified against checked-in fixtures and automated regression tests.
Project Fixtures
| Area | Corpus | Result |
|---|---|---|
| Extraction | 25-case deterministic fixture | 100.0% |
| Retrieval | 18-query retrieval fixture | 83.3% Hit@k |
| Query stress | 40 adversarial retrieval-only cases | 97.5% Hit@k, 97.5% exact support |
| Deduplication | 22 cases (semi-semantic) | 0 false merges at the selected threshold; 77.3% overall due to conservative false negatives |
| Automated tests | Infrastructure & logic | 91 passed |
External Benchmarks
| Benchmark | Coverage | Metric | Status |
|---|---|---|---|
| LongMemEval | 500 questions | 81.6% R@5 (held-out, deterministic) | Verified |
- LongMemEval note: the checked-in full-split `97.4% R@5` result is useful as a retrieval ceiling on the saved benchmark setup, but the held-out `81.6%` split is the more honest number for generalization.
- Deduplication: zero false-positive merges across the threshold sweep. Accuracy limited by conservative similarity bounds.
- Comparative benchmark note: The comparative Waggle-vs-RAG corpus is still evolving. For current per-family/token numbers, use the checked-in artifact index in tests/artifacts/README.md rather than this top-level summary.
Detailed benchmark artifacts and the new Benchmark Methodology guide provide full traceability.
Known Limitations
- Best on structured recall, weaker on benchmark-style answer synthesis: Waggle is strongest when the problem is "retrieve the right facts and relationships" rather than "emit one benchmark-formatted final answer from memory."
- Edges matter: isolated `store_node` writes do not create graph context by themselves. Connected context comes from `store_edge`, `observe_conversation`, `decompose_and_store`, and automatic contradiction/update detection.
- Graph retrieval is not always the cheapest mode: factual lookups are often much cheaper than chunked RAG, but graph-expansion queries intentionally use more tokens to carry reasoning context.
- Deduplication is conservative by design: the system prefers missed merges over unsafe merges, which protects correctness but leaves some semantically similar duplicates unmerged.
- README numbers are intentionally narrow: only the most stable benchmark claims are summarized here; per-family and evolving comparative numbers live in the artifact docs instead.
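The conservative-deduplication trade-off can be made concrete with a cosine-similarity gate: with a high merge threshold, near-identical vectors merge while merely similar ones do not. The threshold value and function names below are hypothetical, not Waggle's tuned parameters:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

MERGE_THRESHOLD = 0.95  # hypothetical; set high on purpose

def should_merge(vec_a, vec_b):
    # Prefer a missed merge over an unsafe one: only merge near-duplicates.
    return cosine(vec_a, vec_b) >= MERGE_THRESHOLD

exact = [1.0, 0.0, 0.2]
close = [0.8, 0.6, 0.2]   # related, but below the bar

print(should_merge(exact, exact))  # True
print(should_merge(exact, close))  # False
```

Lowering the threshold would catch more duplicates at the cost of occasionally merging distinct facts, which is exactly the failure mode the conservative setting avoids.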
For operational details, scaling considerations, tool-level behavior, and the full MCP feature surface, see docs/reference.md.
Reference & Docs
Detailed reference material lives in external documentation:
- docs/reference.md: Environment variables, admin commands, Docker setup, and full tool surface.
- deploy/kubernetes/README.md: Production deployment.
- docs/runbooks/: Operations and troubleshooting.
- tests/artifacts/README.md: Benchmark artifacts and traceability.
License
MIT — see LICENSE.
File details
Details for the file waggle_mcp-0.1.7.tar.gz.
File metadata
- Download URL: waggle_mcp-0.1.7.tar.gz
- Upload date:
- Size: 128.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | `b87c795cb6555ff7a10f221f910bad98dd045b7cc54e523540ca38f252f5d5d8` |
| MD5 | `8d6525a868ccc46a8903a4d4f431233b` |
| BLAKE2b-256 | `f8726ee8a87e1a1ef2623b9497086162fd108a59e594c3fa503523dae018b6ce` |
File details
Details for the file waggle_mcp-0.1.7-py3-none-any.whl.
File metadata
- Download URL: waggle_mcp-0.1.7-py3-none-any.whl
- Upload date:
- Size: 115.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | `8ee75e3c7df0995808277ce1aabd4a8f992b3c202d7d0fcf959b6b5e06266125` |
| MD5 | `75cf459669ac56896e3225312d5d7bbe` |
| BLAKE2b-256 | `55cc6a29192b73245dac095cf0ca29fde5ab11e53b4eeaae9de6558b9afac413` |