Local Codex memory manager with an official Python SDK-based MCP server.
Project description
Breathing Memory
Breathing Memory is a local memory support system for coding agents. It runs as a stdio MCP server through the official Python MCP SDK, stores memory in SQLite, and isolates memory by project so one installation can be reused across repositories without mixing contexts.
Overview
Breathing Memory keeps collaboration context that an agent should remember but a repository should not need to encode everywhere.
- local stdio MCP server
- SQLite storage under user app-data, isolated by project
- fragment-centric public model built around
anchorandfragment - text-first retrieval today, with a public search surface already aligned for later semantic retrieval work
- dynamic
working / holdingmaintenance with a compression backend that uses a supported coding agent without polluting normal conversation history
Supported Clients
- Codex
Installation
The intended long-term user path is:
pip install 'breathing-memory[semantic]'
breathing-memory install-codex
breathing-memory install-codex registers the breathing-memory MCP server with the currently supported client, pins that registration to a stable project identity for the current repository, and creates or updates the managed Breathing Memory block in the current repository's AGENTS.md.
The default path is the user-level Codex config. If you want repository-local Codex config instead, choose it explicitly with breathing-memory install-codex --codex-config repo.
Published package:
- recommended:
pip install 'breathing-memory[semantic]' - minimal
super_liteinstall:pip install breathing-memory - contributor setup and unreleased local work: docs/dev-guide.md
Quickstart
Recommended first run:
python3 -m venv .venv
. .venv/bin/activate
pip install 'breathing-memory[semantic]'
breathing-memory doctor
breathing-memory install-codex
Useful commands:
breathing-memory doctor: inspect installation, active project identity, DB path selection, Codex registration state, and effective retrieval modebreathing-memory serve: start the stdio MCP serverbreathing-memory inspect-memory --json: inspect current memory state
Codex registration targets:
breathing-memory install-codex: write to the user-level Codex configbreathing-memory install-codex --codex-config repo: write to.codex/config.tomlin the current repository
How Memory Works
Breathing Memory does not auto-capture the full client conversation by itself. The supported operating path is explicit MCP use by the calling agent.
The basic flow is:
- Check
memory_recentbefore persisting immediately repeated agent / user turns - If there is an unremembered final agent answer from the previous turn, save it first with
memory_remember(actor="agent") - Save the current user message with
memory_remember(actor="user") - Search before an answer with
memory_search - Record feedback with
memory_feedbackwhen the user clearly confirms or corrects remembered information
Key points:
- one user utterance becomes one fragment
- one final user-facing agent answer is normally remembered on the next user turn
- commentary is not remembered
- use
memory_recentas a caller-side first check beforememory_rememberwhen you suspect an immediately repeated save - track which retrieved fragments materially informed the final answer and pass them in
source_fragment_ids - if the final answer materially used remembered fragments, pass those ids in
source_fragment_ids - use
memory_feedbackonly when the user's evaluation can be attributed safely - edits are modeled as forks rather than overwrites
- duplicate deferred agent capture for the same reply target and content is suppressed
- user duplicate checks are caller-side and should use
memory_recentrather than engine-side suppression - archived runtime files such as
archived_sessions/*.jsonlare not the primary capture path - if no later user turn arrives, the final agent answer may remain unremembered
Current MCP tools:
memory_remembermemory_searchmemory_fetchmemory_recentmemory_feedbackmemory_stats
memory_search keeps the default response compact. When debugging retrieval, callers can opt in to per-result diagnostics with include_diagnostics=true.
Runtime Notes
Breathing Memory stores data under the user app-data directory resolved by platformdirs, then separates memory by project identity. The exact SQLite path can be inspected with breathing-memory doctor.
For Codex installs, install-codex now pins the MCP registration to a stable project identity derived from the repository at install time, so the live MCP server does not drift with VSCode or Codex internal working directories. doctor prefers that registration-derived identity when it is available, so its reported DB path matches the live MCP target rather than the shell's current directory.
If you already have remembered data under an older unpinned Codex registration, migration is manual by design. Move the SQLite database yourself if you want to keep that history; Breathing Memory does not auto-discover or auto-merge old databases.
The user-facing setup is intentionally framed as two paths: super_lite with no extra semantic dependencies, and default through the optional semantic extra. Runtime auto still has an internal lite fallback when embeddings are available but HNSW support is unavailable, but that fallback is treated as an implementation detail rather than as a primary setup target. When semantic retrieval encounters live fragments with missing embeddings, Breathing Memory backfills those vectors before continuing. If default search finds a missing or invalid ANN index, it attempts repair first, waits briefly for conflicting rebuild work, and returns a structured status when the caller should decide whether to retry or fall back.
breathing-memory doctor reports both the configured retrieval mode and the effective runtime mode, along with whether the default app-data location is writable, where Codex registration was found, whether the registration uses a PATH command or an absolute path, and whether HNSW support and index readiness are available. After installing breathing-memory[semantic], you can verify whether auto can target the HNSW-backed path and whether the index is already ready or still needs repair.
breathing-memory install-codex also prints the effective retrieval mode in its post-install summary, so the semantic state is visible even before the first MCP conversation.
The current compression backend invokes a supported coding agent without leaving normal conversation history. In the current supported setup, that path uses Codex through codex exec --ephemeral.
Further Reading
- docs/user-guide.md: installation, runtime operation, storage behavior, and MCP tool usage
- docs/dev-guide.md: contributor-oriented setup and repository layout
- docs/spec.md: normative behavior and implementation-facing rules
- docs/design-rationale.md: adopted design choices and the reasons behind them
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file breathing_memory-0.5.5.tar.gz.
File metadata
- Download URL: breathing_memory-0.5.5.tar.gz
- Upload date:
- Size: 62.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0cd83abbea9594e115cca1607c3a1402f6bdcfc60d89dfebb5b796fcc645239c
|
|
| MD5 |
551ab4a736783f874b2c684208fb2496
|
|
| BLAKE2b-256 |
106ab4ca97365ad2137666c4e755fcf1a58d0f46d0b1434dfad703c45dbe2254
|
Provenance
The following attestation bundles were made for breathing_memory-0.5.5.tar.gz:
Publisher:
publish.yml on KazinaG/breathing_memory
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
breathing_memory-0.5.5.tar.gz -
Subject digest:
0cd83abbea9594e115cca1607c3a1402f6bdcfc60d89dfebb5b796fcc645239c - Sigstore transparency entry: 1203319920
- Sigstore integration time:
-
Permalink:
KazinaG/breathing_memory@083af32e9a0e4cf15921aa2af362b1b58bcf6f81 -
Branch / Tag:
refs/tags/v0.5.5 - Owner: https://github.com/KazinaG
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@083af32e9a0e4cf15921aa2af362b1b58bcf6f81 -
Trigger Event:
push
-
Statement type:
File details
Details for the file breathing_memory-0.5.5-py3-none-any.whl.
File metadata
- Download URL: breathing_memory-0.5.5-py3-none-any.whl
- Upload date:
- Size: 42.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
abd151f27a6ee45648cadb013e17631d16ca2b255ec6861fcb0b04d3b5d6a37f
|
|
| MD5 |
320f26f57bc66f69195723d5074a6a56
|
|
| BLAKE2b-256 |
d45373f0effe0795d9d30021d76353e78cf50286b9d97ea52fdbab3280e6cc64
|
Provenance
The following attestation bundles were made for breathing_memory-0.5.5-py3-none-any.whl:
Publisher:
publish.yml on KazinaG/breathing_memory
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
breathing_memory-0.5.5-py3-none-any.whl -
Subject digest:
abd151f27a6ee45648cadb013e17631d16ca2b255ec6861fcb0b04d3b5d6a37f - Sigstore transparency entry: 1203319923
- Sigstore integration time:
-
Permalink:
KazinaG/breathing_memory@083af32e9a0e4cf15921aa2af362b1b58bcf6f81 -
Branch / Tag:
refs/tags/v0.5.5 - Owner: https://github.com/KazinaG
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@083af32e9a0e4cf15921aa2af362b1b58bcf6f81 -
Trigger Event:
push
-
Statement type: