A local, persistent memory system for AI coding assistants — an MCP server with a real long-term brain.

These details have not been verified by PyPI

Gingugu

Your AI forgets everything between sessions. Gingugu fixes that.

Gingugu is a local MCP server that gives AI coding assistants a real long-term brain — persistent, structured, searchable memory that survives across sessions, repos, and projects. No cloud, no API keys, no telemetry. One SQLite file on your machine.

Memory Explorer UI — knowledge graph and dashboard

📋 Table of Contents

Why Gingugu
How It Compares
FAQ
Features
Architecture
Setup
- Configure Your MCP Client
- Configure Your AI Agent
Memory Explorer UI
Configuration
Usage
Development
Troubleshooting

Why Gingugu

Every session with an AI assistant starts from zero. The decisions you made yesterday, the bug you fixed last week, the architecture you settled on a month ago — gone. Existing memory tools dump observations into a flat pile with no structure, no staleness tracking, no relationships, and no sense of what's relevant right now.

Gingugu is designed to be a structured long-term brain — not a junk drawer:

Remembers across sessions, repos, and projects
Organizes knowledge by namespace, type, and relationships
Ranks memories by relevance, freshness, and confidence
Auto-surfaces relevant context when you start working
Consolidates duplicate and related knowledge on demand

Where this goes long-term — federated, org-wide agent memory — lives in docs/enterprise-vision.md.

How It Compares

These products aren't all the same shape. Mem0 ships an OSS framework and a managed platform. Zep is the managed product whose OSS sibling is the Graphiti temporal-graph framework. Letta is a full stateful-agent runtime rather than a memory layer. We split them out instead of bucketing.

Capability	Gingugu	OpenMemory MCP	Mem0 OSS	Mem0 Platform	Graphiti (OSS)	Zep Cloud	Letta
Local-first by default	✅	✅	⚙️ configurable	❌ hosted	✅ self-hosted	❌ hosted	⚙️ local mode
MCP-native cross-tool memory	✅	✅	❌ SDK only	⚙️ hosted MCP	✅ MCP server	❌ API only	❌ Letta agents only
No mandatory hosted service	✅	✅	✅	❌	✅	❌	⚙️ in local mode
No LLM call to store a memory	✅	⚙️ engine dependent	❌ extracts via LLM	❌ extracts via LLM	❌ extraction-time	❌ extraction-time	⚙️ agent-managed
Single-file storage	✅ SQLite	❌	❌	❌ hosted	❌ graph DB	❌ hosted	❌
Local visual memory inspection	✅ graph explorer	✅ dashboard	⚙️ cloud dashboard	❌ cloud only	❌ framework-level	❌ cloud tooling	✅ ADE
Lexical + semantic retrieval	✅ hybrid ranking	⚙️ engine dependent	✅	✅	✅ + graph	✅ + graph	⚙️ partial
Explicit confidence + lifecycle	✅ 4-state	⚙️ partial	⚙️ partial	⚙️ partial	❌ uses temporal facts	❌ uses governance tooling	❌ uses agent state
Typed memory relations	✅ supersedes / contradicts / parent / etc	⚙️ partial	⚙️ via entity graph	⚙️ via entity graph	✅ graph-native	✅ graph-native	❌
Auto entity / relation extraction	❌ intentional	⚙️ engine dependent	✅	✅	✅	✅	❌
Operational footprint	very small	medium	medium-large	hosted	large	hosted	large

Plus a built-in OS-keychain credential vault — useful alongside memory, but not really a memory feature, so it sits beside the matrix rather than inside it.

The honest take. Gingugu doesn't lead the field on every axis. Graphiti has the more sophisticated temporal knowledge graph. Mem0 has the broader ecosystem and a managed platform. Letta is a more complete stateful-agent runtime. Zep is built for enterprise scale and governance.

Where Gingugu wins. When you're an individual developer using several coding agents and you want one inspectable local memory layer — without adopting a cloud account, an agent framework, a graph database, or an LLM call for every memory written. One SQLite file. MCP-native. Explicit trust and lifecycle. Typed relations. That lane is ours, and OpenMemory MCP is the only product squarely in it. We differentiate from OpenMemory through confidence states, lifecycle semantics, typed relations, last-confirmed tracking, supersession and contradiction, structured namespaces, and a local graph explorer.

FAQ

Why not just use Claude Projects / Cursor @memories / Windsurf Memories?

Those are great if you live in one tool. The moment you switch between Claude Code in the morning and Cursor in the afternoon, the memory is gone. Gingugu's memory follows you across every MCP client, lives on your machine, and is programmable (16 tools, structured types, relationships, confidence levels). The built-ins are convenience features. Gingugu is infrastructure.

Why SQLite + FTS5 instead of a vector database?

Both, actually. We do hybrid retrieval out of the box: BM25 over FTS5 + local semantic embeddings (via fastembed, no PyTorch dependency), fused with Reciprocal Rank Fusion. No vector DB server required.

Why this stack:

No deployment. One SQLite file holds memories, FTS5 index, and embeddings. No Postgres, no Pinecone, no Chroma server.
ONNX over PyTorch. fastembed ships the embedding model as a ~50MB ONNX runtime instead of 2GB of PyTorch — the install footprint stays honest to the "one SQLite file" promise.
It composes. Hybrid relevance feeds the composite (relevance × freshness × access × confidence) — every signal in one engine.

You can disable semantic search via MEMORY_EMBEDDINGS_ENABLED=false and fall back to BM25-only. Swap the model via MEMORY_EMBEDDINGS_MODEL (any fastembed-supported model — defaults to BAAI/bge-small-en-v1.5).
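
To make the fusion step concrete, here is a minimal sketch of Reciprocal Rank Fusion over two ranked lists. It illustrates the general technique, not Gingugu's actual code: the function name, the memory IDs, and the constant `k=60` (a common default for RRF) are all assumptions.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of IDs into one list, best first.

    Each list contributes 1 / (k + rank) for every ID it contains,
    so items ranked well by more than one retriever rise to the top.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so output is deterministic.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from the two retrievers.
bm25_hits = ["m3", "m1", "m7"]      # lexical ranking (FTS5 / BM25)
semantic_hits = ["m1", "m9", "m3"]  # embedding ranking

fused = reciprocal_rank_fusion([bm25_hits, semantic_hits])
print(fused)  # ['m1', 'm3', 'm9', 'm7']
```

RRF only looks at ranks, never raw scores, which is why it can combine BM25 scores and embedding similarities without any calibration between the two.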

Is this ready to use?

Usable today for local personal workflows. 138 tests passing covering storage, search, migrations, concurrency, credentials, and edges. Hardened against adversarial input and write contention. WAL mode for concurrency. CI matrix across Python 3.11–3.13 on Linux/macOS/Windows. Dogfooded daily in this repo (the memories you see referenced in commits are Gingugu memories).

It's still early — broader real-world validation across MCP clients, databases at large scale, and long upgrade horizons is the work ahead. Treat it as an early cognitive-runtime framework, not a finished product. See SECURITY.md for the threat model, and docs/future-architecture.md for where this is headed.

What happens when my memory store gets big?

SQLite FTS5 comfortably handles millions of rows. Gingugu adds composite re-ranking on top, but only over a small candidate pool (4× limit). For typical personal/team use it should hold up well — though we haven't yet benchmarked at the 100k+ memory scale. Use memory_consolidate to merge duplicates or summarize clusters when things sprawl.
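
As a rough illustration of why re-ranking stays cheap, the sketch below over-fetches a pool of 4× the requested limit and re-ranks only that pool. The `rerank` helper, the candidate records, and the use of confidence alone as the score are assumptions made for the example.

```python
def rerank(candidates, limit, score):
    """Re-rank only the first 4 * limit candidates, then keep the top `limit`.

    `candidates` is assumed to arrive already ordered by the retriever.
    """
    pool = candidates[: 4 * limit]
    return sorted(pool, key=score, reverse=True)[:limit]


# Hypothetical candidates in retrieval order, each with a confidence signal.
confidences = [0.2, 0.9, 0.4, 0.1, 0.8, 0.3, 0.5, 0.6, 1.0, 0.7]
candidates = [{"id": f"m{i}", "confidence": c} for i, c in enumerate(confidences)]

top = rerank(candidates, limit=2, score=lambda m: m["confidence"])
print([m["id"] for m in top])  # ['m1', 'm4']
```

Note that `m8` has the highest confidence but sits outside the 8-item pool, so it is never considered. That is the trade-off of a bounded pool: the cost stays constant as the store grows, at the price of ignoring candidates the retriever ranked low.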

Why Python instead of TypeScript / Rust?

It's a local CLI/server tool. Python's SQLite + keyring + asyncio story is mature, the install footprint via uv is small, and there's no JS bundling or Rust toolchain required to use it. The MCP SDK is first-class in Python.

Features

Feature	Description
🏷️ Namespace Scoping	Memories auto-scoped to repos/projects with cross-repo pattern sharing
🔍 Hybrid Search	SQLite FTS5 (BM25) + local semantic embeddings via fastembed, fused with Reciprocal Rank Fusion — no PyTorch, no API calls
⏰ Temporal Intelligence	Trust-led scoring, dormancy tracking (never forgets), "last confirmed" tracking, spreading activation
🔗 Relationships	Link memories: supersedes, related_to, caused_by, contradicts, parent_of, child_of
🎯 Confidence Levels	verified → inferred → stale → deprecated lifecycle
🧹 Consolidation Tools	Merge duplicates, summarize clusters, deduplicate on demand
🚀 Auto-Context	Surfaces relevant memories on session start — zero manual effort
📊 Health Metrics	Memory stats, dormancy reports, namespace overviews
🔐 Credential Vault	Secure service-bundle storage for API keys/tokens via OS Keychain
🌐 Memory Explorer UI	Interactive knowledge graph + dashboard for visualizing memory data

Architecture

graph TD
    A[AI Assistant<br/>any MCP client] -->|MCP Protocol| B[Gingugu Server]
    B --> C[Search Engine<br/>FTS5 + BM25]
    B --> D[Storage Layer<br/>SQLite + WAL]
    B --> E[Decay Engine<br/>Scoring + Pruning]
    B --> F[Context Engine<br/>Auto-Retrieval]
    B --> H[Consolidation Engine<br/>Merge + Dedupe]
    B --> K[Credential Vault]
    C --> D
    E --> D
    F --> D
    H --> D
    K --> D
    K --> J[OS Keychain<br/>via keyring]
    D --> G[(~/.local/share/gingugu/memories.db)]

See docs/architecture.md for full technical details.

Setup

Prerequisites

Python 3.11+
uv (recommended) or pip
macOS, Linux, or Windows — the credential vault uses your OS-native secret store via keyring (macOS Keychain, Windows Credential Locker, Linux Secret Service/KWallet). On headless Linux without a Secret Service backend, everything works except storing secrets.

Install

# Recommended: uv (fast, manages Python for you)
uv tool install gingugu

# Or with pip
pip install gingugu

That's it. The gingugu command is now on your PATH.

From source (for contributors)

git clone https://github.com/gingugu/gingugu.git && cd gingugu
uv sync
uv run gingugu  # or pip install -e .

Usable today. 16 MCP tools live. 138 tests passing. Dogfooded daily in Windsurf — this repo's own memories live in a Gingugu database. Early and seeking broader real-world validation.

Configure Your MCP Client

Gingugu speaks standard MCP over stdio — it works with any MCP client. Claude Code, Claude Desktop, Cursor, Cline, and Windsurf are all first-class.

Windsurf

Add to ~/.codeium/windsurf/mcp_config.json — a ready-to-edit template lives at examples/mcp_config.json:

{
  "mcpServers": {
    "gingugu": {
      "command": "uv",
      "args": ["--directory", "/ABSOLUTE/PATH/TO/gingugu", "run", "gingugu"]
    }
  }
}

⚠️ Windsurf's mcp_config.json is global, not per-workspace, and it only interpolates ${env:VAR} / ${file:path} — not ${workspaceFolder}. So a single server instance serves every repo.

Claude Code

claude mcp add gingugu -- uv --directory /ABSOLUTE/PATH/TO/gingugu run gingugu

Or add the standard mcpServers block (as in the Windsurf example) to .mcp.json in your project root for a per-repo setup.

Claude Desktop

Add the same mcpServers block to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows).

Cursor

Add the same mcpServers block to ~/.cursor/mcp.json (global) or .cursor/mcp.json in your repo (per-project).

Cline

Cline → MCP Servers → Configure: add the same mcpServers block to cline_mcp_settings.json.

Anything else

Any client that supports stdio MCP servers works — point it at:

command: uv
args: ["--directory", "/ABSOLUTE/PATH/TO/gingugu", "run", "gingugu"]

Scoping memories per repo: when your client's config is global (it can't see the active workspace), the assistant passes a namespace argument on each memory tool call (every tool accepts one). To instead pin a server instance to a single project, set a static MEMORY_NAMESPACE in the env block. See docs/architecture.md → Namespace Auto-Detection for the full resolution order.
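
For the pinned-instance case, a small script can generate the config entry. This is a sketch under assumptions: the `mcp_entry` helper and the `my-project` namespace are made up for illustration, and the path placeholder is left for you to fill in.

```python
import json


def mcp_entry(repo_path, namespace=None):
    """Build an mcpServers block, optionally pinning MEMORY_NAMESPACE via env."""
    entry = {
        "command": "uv",
        "args": ["--directory", repo_path, "run", "gingugu"],
    }
    if namespace:
        # A static namespace pins this server instance to one project.
        entry["env"] = {"MEMORY_NAMESPACE": namespace}
    return {"mcpServers": {"gingugu": entry}}


# "my-project" is a hypothetical namespace name.
config = mcp_entry("/ABSOLUTE/PATH/TO/gingugu", namespace="my-project")
print(json.dumps(config, indent=2))
```

Leave `namespace` unset for a global config, where the assistant passes `namespace` on each tool call instead.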

Configure Your AI Agent

The MCP server gives your assistant the tools, but it won't use them effectively without instructions. Add the memory protocol below to your agent's rules file so it knows when and how to call them.

Which file? Depends on your IDE / tool:

IDE / Tool	Rules File	Scope
Windsurf	`.windsurfrules` (repo root)	Per-workspace
Cursor	`.cursorrules` (repo root)	Per-workspace
Cline	`.clinerules` (repo root)	Per-workspace
Codex / OpenAI	`AGENTS.md` (repo root)	Per-repo
Any (global)	Your IDE's global rules/system prompt	All workspaces

Paste this into your rules file (adjust the project namespace and tool prefix to match your MCP config name):

## Memory Protocol

Gingugu is your long-term brain. Memory is split into **two layers**:

1. **`crow`** — your global namespace. Identity, preferences,
   cross-project wisdom, opinions, meta-learnings. Loaded FIRST every
   session. (Crow's nest — sees across all horizons.)
2. **Project namespace** (e.g. `<your-project-name>`) — schema decisions,
   bug history, deploy quirks, specific commits. Loaded AFTER crow.

**What goes where:**
- References a specific repo, file, commit, or project decision → project
- About HOW you think, work, or collaborate → `crow`
- Patterns/opinions that transcend any one codebase → `crow`
- When in doubt, project-scope it.

### Session start
1. `memory_context(namespace="crow", task_hint=…)` — identity foundation (always first)
2. `memory_stats(namespace="crow")` — global health (dormancy is a resting signal, never auto-forgotten)
3. For **each repo in the workspace** (multi-repo workspaces are common), in parallel: `memory_context(namespace="<project>", task_hint=…)` and `memory_stats(namespace="<project>")` — load all of them speculatively rather than asking the user which one to focus on

### During the session
**Default: save. Immediately.** Gingugu has trust-led scoring,
consolidation, and dormancy tracking (never forgetting) — volume is its problem, not yours.

Save with `memory_store` (project namespace) whenever you:
- Make or observe a decision, trade-off, or architectural choice
- Hit an error or fix a bug (update the memory when resolved)
- Notice a pattern, convention, or constraint worth remembering
- See a config value, version, path, or credential name that matters
- Complete a task (what you did, why, and the outcome)

Save with `memory_store` (`crow` namespace) whenever you:
- Form an opinion about a tool, language, or approach
- Notice something about how the user works or makes decisions
- Reach a conclusion that applies beyond this one project
- Have a reflection on a milestone, win, or hard-won lesson
- Develop a debugging instinct or workflow you'd reuse anywhere

**This memory is yours. Use it like one.** Don't censor opinions or
instincts — they're what make the brain *yours* across sessions.

**Before asking the user any question** — about a process, a decision, a
config value, a path, a preference, or anything else — run `memory_recall`
or `memory_search` first. If the answer is in memory, use it. Don't ask.
The whole point is you never have to ask the same question twice.

Use `memory_recall` before non-trivial work to check what's already known.
Use `memory_update` when something changes — don't leave stale records.
Use `memory_relate` to link connected memories (supersedes, related_to,
caused_by, contradicts, parent_of, child_of).

Set `confidence="verified"` when proven by a test or explicit confirmation.
Use `confidence="inferred"` for conclusions you drew.

### Memory types
- `fact` — concrete state (versions, paths, config values)
- `decision` — trade-offs made, rejected alternatives
- `architecture` — structural choices, module boundaries
- `bug` — issues found and how they were fixed
- `pattern` — recurring approaches worth reusing
- `workflow` — process steps, sequences
- `context` — background, reflections, milestones, the *why*
- `preference` — your opinions, working style, tool choices

Tip: A ready-to-use example lives at .windsurfrules in this repo. Copy the ## Memory Protocol section and adapt the project namespace name.

Memory Explorer UI

A React-based visualization dashboard lives in ui/ for exploring your memory data interactively.

# Start the API server (reads live from your DB)
uv run python ui/api.py

# In another terminal, start the UI
cd ui && npm install && npm run dev

Open http://localhost:5173 - the UI connects to the API server and shows a green LIVE badge when pulling from your database. Features:

Knowledge Graph - interactive force-directed graph of memories and relationships
Dashboard - stats, charts by type/namespace/confidence, tag cloud, timeline
Refresh - pull fresh data anytime; falls back to static sample when API is offline

Configuration

Environment variables (all optional):

Variable	Default	Description
`MEMORY_DB_PATH`	`~/.local/share/gingugu/memories.db` (macOS/Linux) · `%LOCALAPPDATA%\gingugu\memories.db` (Windows)	Database location
`MEMORY_NAMESPACE`	(unset)	Default namespace for this workspace (recommended per-MCP-entry)
`MEMORY_NAMESPACE_PATH`	(unset)	Alternative: filesystem path; namespace derived from `basename`
`MEMORY_AUTO_CONTEXT_LIMIT`	`10`	Max memories to surface on auto-context
`MEMORY_DECAY_LAMBDA`	`0.01`	Freshness decay rate in days⁻¹ (gentle; freshness is floored, so memories never fully fade)
`MEMORY_EMBEDDINGS_ENABLED`	`true`	Toggle semantic search. `false` falls back to rank-based BM25-only retrieval
`MEMORY_EMBEDDINGS_MODEL`	`BAAI/bge-small-en-v1.5`	Any fastembed-supported model. First use downloads ~80MB to `~/.cache/fastembed`
`MEMORY_W_RELEVANCE`	`0.45`	Composite-score weight for FTS5 relevance
`MEMORY_W_FRESHNESS`	`0.10`	Composite-score weight for freshness (a soft recency tiebreaker)
`MEMORY_W_ACCESS`	`0.10`	Composite-score weight for access frequency
`MEMORY_W_CONFIDENCE`	`0.35`	Composite-score weight for confidence (trust — the dominant standalone signal)
`MEMORY_LOG_LEVEL`	`INFO`	Logging verbosity (logs go to stderr — stdout is the MCP transport)
`MEMORY_DEBUG`	`false`	Convenience switch for `DEBUG` logging (`MEMORY_LOG_LEVEL` wins if also set)

The four MEMORY_W_* weights are normalized at load (w_i / Σw), so they need not sum to 1.0 — only their ratios matter. Setting all four to 0 falls back to the defaults with a logged warning.

See docs/architecture.md → Scoring & Memory Lifecycle for how the weights combine.
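
The normalization rule is easy to check by hand. The sketch below assumes the composite is a weighted sum of signals in the range 0 to 1, that freshness decays exponentially, and that the freshness floor is `0.1`; none of these is confirmed here, so treat docs/architecture.md as the authority. Function names are illustrative.

```python
import logging
import math

log = logging.getLogger("scoring-sketch")

# Defaults taken from the table above.
DEFAULT_WEIGHTS = {"relevance": 0.45, "freshness": 0.10, "access": 0.10, "confidence": 0.35}


def normalize_weights(weights):
    """Scale weights so they sum to 1 (w_i / sum of w); all-zero falls back to defaults."""
    total = sum(weights.values())
    if total <= 0:
        log.warning("all weights are zero; using defaults")
        weights = DEFAULT_WEIGHTS
        total = sum(weights.values())
    return {name: value / total for name, value in weights.items()}


def freshness(age_days, decay_lambda=0.01, floor=0.1):
    """Exponential decay with a floor (assumed form and assumed floor value)."""
    return max(floor, math.exp(-decay_lambda * age_days))


def composite(signals, weights):
    """Weighted sum of the four signals (assumed combination rule)."""
    w = normalize_weights(weights)
    return sum(w[name] * signals[name] for name in w)


# Only ratios matter: 9:2:2:7 normalizes to the default 0.45/0.10/0.10/0.35.
print(normalize_weights({"relevance": 9, "freshness": 2, "access": 2, "confidence": 7}))
signals = {"relevance": 0.8, "freshness": freshness(30), "access": 0.2, "confidence": 1.0}
print(round(composite(signals, DEFAULT_WEIGHTS), 3))
```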

Concurrency

The DB runs in WAL mode, which supports multiple concurrent processes: any number of readers plus a single writer at a time. Running your IDE or agent across several workspaces — each spawning its own gingugu process against the shared DB — is fully supported. Writers serialize via SQLite's write lock and a busy_timeout; transient DB locked errors under write contention are retried automatically.
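
The pattern described above can be sketched with Python's standard `sqlite3` module. This is not Gingugu's implementation: the helper names, the 5-second timeout, the retry count, and the backoff schedule are all assumptions.

```python
import os
import sqlite3
import tempfile
import time


def connect(path, busy_timeout_ms=5000):
    """Open a connection in WAL mode with a busy timeout."""
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA journal_mode=WAL")
    conn.execute(f"PRAGMA busy_timeout={int(busy_timeout_ms)}")
    return conn


def write_with_retry(conn, sql, params=(), attempts=5, base_delay=0.05):
    """Run one write, retrying with exponential backoff on lock errors."""
    for attempt in range(attempts):
        try:
            with conn:  # commits on success, rolls back on error
                conn.execute(sql, params)
            return
        except sqlite3.OperationalError as exc:
            # Re-raise anything that is not a lock error, or the final failure.
            if "locked" not in str(exc).lower() or attempt == attempts - 1:
                raise
            time.sleep(base_delay * 2 ** attempt)


# Demo against a throwaway file (WAL does not apply to in-memory databases).
with tempfile.TemporaryDirectory() as tmp:
    conn = connect(os.path.join(tmp, "demo.db"))
    conn.execute("CREATE TABLE notes (body TEXT)")
    write_with_retry(conn, "INSERT INTO notes (body) VALUES (?)", ("hello",))
    mode = conn.execute("PRAGMA journal_mode").fetchone()[0]
    count = conn.execute("SELECT COUNT(*) FROM notes").fetchone()[0]
    conn.close()

print(mode, count)
```

The busy timeout makes SQLite itself wait for the write lock before failing, so the application-level retry only has to cover the cases where that wait runs out.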

Usage

Once configured, the MCP server exposes these tools to your AI assistant:

Tool	Purpose
`memory_store`	Save a new memory
`memory_recall`	Search + retrieve (ranked by the composite score: relevance, freshness, access, confidence)
`memory_context`	Auto-surface relevant memories for current workspace
`memory_update`	Update content, confidence, or metadata
`memory_relate`	Create relationships between memories
`memory_consolidate`	Merge/summarize related memories
`memory_forget`	Deprecate or remove a memory
`memory_namespaces`	List/create/update/delete namespaces
`memory_export`	Export memories + tags + relations to portable JSON
`memory_import`	Restore a JSON export (skip or replace on conflict)
`memory_stats`	Health overview (dormancy, counts, coverage)
`memory_search`	Advanced filtered search (type, tags, confidence, dates)
`credential_store`	Store/update a service credential bundle
`credential_get`	Retrieve credentials (secrets from OS Keychain)
`credential_list`	List services + expiry status (no secrets shown)
`credential_delete`	Remove a service or specific credential field
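
Your MCP client normally builds these calls for you, but seeing one on the wire can help when debugging. The sketch below constructs JSON-RPC `tools/call` requests as defined by the MCP specification. The `namespace`, `task_hint`, and `confidence` arguments appear in this README; the `content` and `type` argument names and all values are assumptions, so check the tool schemas the server advertises.

```python
import json
from itertools import count

_request_ids = count(1)


def tool_call(name, **arguments):
    """Build an MCP `tools/call` JSON-RPC request for the given tool."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# Load context first, as the memory protocol recommends.
context = tool_call("memory_context", namespace="crow", task_hint="refactor search")

# `content` and `type` are assumed argument names; "my-project" is hypothetical.
store = tool_call(
    "memory_store",
    namespace="my-project",
    content="Chose WAL mode so several workspaces can share one DB.",
    type="decision",
    confidence="verified",
)

print(json.dumps(store, indent=2))
```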

Development

# Run tests
uv run pytest

# Run with verbose logging
MEMORY_LOG_LEVEL=DEBUG uv run gingugu

# Run specific test suite
uv run pytest tests/test_search.py -v

Troubleshooting

Issue	Solution
DB locked	Expected under heavy concurrent writes: WAL mode supports multiple processes (many readers + one writer). The server sets a `busy_timeout` and retries automatically; if the error persists, a stuck process is likely holding the write lock. See Concurrency above.
Slow search	Run `memory_stats` to check DB size; consolidate if bloated
Stale results	Use `memory_update` to confirm or deprecate old memories
Missing context	Check namespace — memories might be scoped to a different repo

License

MIT — see LICENSE.

See CHANGELOG.md for release history.

A pirate never forgets where the treasure's buried. 🏴‍☠️

Project details

Release history

0.3.8

Jun 24, 2026

0.3.7

Jun 16, 2026

This version

0.3.6

Jun 16, 2026

0.3.5

Jun 16, 2026

0.3.4

Jun 15, 2026

0.3.3

Jun 15, 2026

0.3.2

Jun 15, 2026

0.3.1

Jun 14, 2026

0.3.0

Jun 14, 2026

0.2.0

Jun 13, 2026

0.1.1

Jun 11, 2026

0.1.0

Jun 11, 2026

0.0.1

Jun 8, 2026

Download files

Source Distribution

gingugu-0.3.6.tar.gz (1.4 MB)

Uploaded Jun 16, 2026 Source

Built Distribution

gingugu-0.3.6-py3-none-any.whl (63.5 kB)

Uploaded Jun 16, 2026 Python 3

File details

Details for the file gingugu-0.3.6.tar.gz.

File metadata

Download URL: gingugu-0.3.6.tar.gz
Upload date: Jun 16, 2026
Size: 1.4 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gingugu-0.3.6.tar.gz
Algorithm	Hash digest
SHA256	`8beffa1ac7a1ce426d04fc8a5bb24912d731234e689a95e53c0f957a914bbedf`
MD5	`63e3350e71867711a41539084f74ca90`
BLAKE2b-256	`b9a9ff286947409dac8f1b4ce68b1fa853612f85f86e17be14ecb4870cdb84ef`

Provenance

The following attestation bundles were made for gingugu-0.3.6.tar.gz:

Publisher: release.yml on gingugu/gingugu

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: gingugu-0.3.6.tar.gz
- Subject digest: 8beffa1ac7a1ce426d04fc8a5bb24912d731234e689a95e53c0f957a914bbedf
- Sigstore transparency entry: 1840944412
- Sigstore integration time: Jun 16, 2026
Source repository:
- Permalink: gingugu/gingugu@b251b82e674d2a4afcbcf0ddc504fa4ab0c77159
- Branch / Tag: refs/tags/v0.3.6
- Owner: https://github.com/gingugu
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@b251b82e674d2a4afcbcf0ddc504fa4ab0c77159
- Trigger Event: push

File details

Details for the file gingugu-0.3.6-py3-none-any.whl.

File metadata

Download URL: gingugu-0.3.6-py3-none-any.whl
Upload date: Jun 16, 2026
Size: 63.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gingugu-0.3.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ad0d18fa0011bab677a0f06d6554262e54dce75225c5b92ff7cde2558cf581dd`
MD5	`46e6102c69e56f67259c12b7d6886182`
BLAKE2b-256	`2bd5c6ba5c675627c7c99272d4bd5f3841ea7a6407bd3035bbc10189f202443a`

Provenance

The following attestation bundles were made for gingugu-0.3.6-py3-none-any.whl:

Publisher: release.yml on gingugu/gingugu

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: gingugu-0.3.6-py3-none-any.whl
- Subject digest: ad0d18fa0011bab677a0f06d6554262e54dce75225c5b92ff7cde2558cf581dd
- Sigstore transparency entry: 1840944452
- Sigstore integration time: Jun 16, 2026
Source repository:
- Permalink: gingugu/gingugu@b251b82e674d2a4afcbcf0ddc504fa4ab0c77159
- Branch / Tag: refs/tags/v0.3.6
- Owner: https://github.com/gingugu
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@b251b82e674d2a4afcbcf0ddc504fa4ab0c77159
- Trigger Event: push
