Local Codex memory manager with an official Python SDK-based MCP server.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Breathing Memory

Breathing Memory is a local memory support system for coding agents. It runs as a stdio MCP server through the official Python MCP SDK, stores memory in SQLite, and isolates memory by project so one installation can be reused across repositories without mixing contexts.

Overview

Breathing Memory keeps collaboration context that an agent should remember but a repository should not need to encode everywhere.

local stdio MCP server
SQLite storage under user app-data, isolated by project
fragment-centric public model built around anchor and fragment
text-first retrieval today, with a public search surface already aligned for later semantic retrieval work
dynamic working / holding maintenance with a compression backend that uses a supported coding agent without polluting normal conversation history

Supported Clients

Codex

Installation

The intended long-term user path is:

pip install 'breathing-memory[semantic]'
breathing-memory install-codex

breathing-memory install-codex registers the breathing-memory MCP server with the currently supported client, pins that registration to a stable project identity for the current repository, and creates or updates the managed Breathing Memory block in the current repository's AGENTS.md. The default path is the user-level Codex config. If you want repository-local Codex config instead, choose it explicitly with breathing-memory install-codex --codex-config repo.

Published package:

recommended: pip install 'breathing-memory[semantic]'
minimal super_lite install: pip install breathing-memory
contributor setup and unreleased local work: docs/dev-guide.md

Quickstart

Recommended first run:

python3 -m venv .venv
. .venv/bin/activate
pip install 'breathing-memory[semantic]'
breathing-memory doctor
breathing-memory install-codex

Useful commands:

breathing-memory doctor: inspect installation, active project identity, DB path selection, Codex registration state, and effective retrieval mode
breathing-memory serve: start the stdio MCP server
breathing-memory warmup: eagerly load the semantic embedding backend for the current environment
breathing-memory inspect-memory --json: inspect current memory state

Codex registration targets:

breathing-memory install-codex: write to the user-level Codex config
breathing-memory install-codex --codex-config repo: write to .codex/config.toml in the current repository

Environment-specific setup:

normal local environment
- pip install 'breathing-memory[semantic]'
- breathing-memory doctor
- breathing-memory install-codex
repository-local Codex config, including DevContainer workflows that track .codex/config.toml
- pip install 'breathing-memory[semantic]'
- breathing-memory doctor
- breathing-memory install-codex --codex-config repo
slim containers where native build dependencies may be missing
- install Python headers and compiler toolchain first
- then run pip install 'breathing-memory[semantic]'
- confirm the effective mode with breathing-memory doctor
minimal lexical-only install
- pip install breathing-memory
- breathing-memory install-codex
environments where you want to preload the semantic model before a session
- pip install 'breathing-memory[semantic]'
- breathing-memory warmup

How Memory Works

Breathing Memory does not auto-capture the full client conversation by itself. The supported operating path is explicit MCP use by the calling agent.

The basic flow is:

Check memory_recent before persisting immediately repeated agent / user turns
If there is an unremembered final agent answer from the previous turn, save it first with memory_remember(actor="agent")
Save the current user message with memory_remember(actor="user")
Search before an answer with memory_search
Record feedback with memory_feedback when the user clearly confirms or corrects remembered information

Key points:

one user utterance becomes one fragment
one final user-facing agent answer is normally remembered on the next user turn
commentary is not remembered
use memory_recent as a caller-side first check before memory_remember when you suspect an immediately repeated save
track which retrieved fragments materially informed the final answer and pass them in source_fragment_ids
if the final answer materially used remembered fragments, pass those ids in source_fragment_ids
use memory_feedback only when the user's evaluation can be attributed safely
edits are modeled as forks rather than overwrites
duplicate deferred agent capture for the same reply target and content is suppressed
user duplicate checks are caller-side and should use memory_recent rather than engine-side suppression
archived runtime files such as archived_sessions/*.jsonl are not the primary capture path
if no later user turn arrives, the final agent answer may remain unremembered

Current MCP tools:

memory_remember
memory_search
memory_read_active_collaboration_policy
memory_fetch
memory_recent
memory_feedback
memory_stats

memory_search keeps the default response compact. When debugging retrieval, callers can opt in to per-result diagnostics with include_diagnostics=true.

Runtime Notes

Breathing Memory stores data under the user app-data directory resolved by platformdirs, then separates memory by project identity. The exact SQLite path can be inspected with breathing-memory doctor.

For Codex installs, install-codex now pins the MCP registration to a stable project identity derived from the repository at install time, so the live MCP server does not drift with VSCode or Codex internal working directories. doctor prefers that registration-derived identity when it is available, so its reported DB path matches the live MCP target rather than the shell's current directory.

If you already have remembered data under an older unpinned Codex registration, migration is manual by design. Move the SQLite database yourself if you want to keep that history; Breathing Memory does not auto-discover or auto-merge old databases.

The user-facing setup is intentionally framed as two paths: super_lite with no extra semantic dependencies, and default through the optional semantic extra. Runtime auto still has an internal lite fallback when embeddings are available but HNSW support is unavailable, but that fallback is treated as an implementation detail rather than as a primary setup target. When semantic retrieval encounters live fragments with missing embeddings, Breathing Memory backfills those vectors before continuing. If default search finds a missing or invalid ANN index, it attempts repair first, waits briefly for conflicting rebuild work, and returns a structured status when the caller should decide whether to retry or fall back.

breathing-memory serve keeps the MCP handshake path light. If the semantic backend is available, the server starts a best-effort background warmup only after the MCP session is live. The first semantic call may still need to wait if that warmup has not finished yet, and semantic calls can still fail if model import or download fails. breathing-memory warmup exists when you want to preload that backend explicitly before a session.

breathing-memory doctor reports both the configured retrieval mode and the effective runtime mode, along with whether the default app-data location is writable, where Codex registration was found, whether the registration uses a PATH command or an absolute path, and whether HNSW support and index readiness are available. After installing breathing-memory[semantic], you can verify whether auto can target the HNSW-backed path and whether the index is already ready or still needs repair. breathing-memory install-codex also prints the effective retrieval mode in its post-install summary, so the semantic state is visible even before the first MCP conversation.

The current compression backend invokes a supported coding agent without leaving normal conversation history. In the current supported setup, that path uses Codex through codex exec --ephemeral.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Kazina

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.7.1

Apr 4, 2026

0.7.0

Apr 4, 2026

0.6.0

Apr 4, 2026

This version

0.5.7

Apr 3, 2026

0.5.6

Apr 2, 2026

0.5.5

Mar 31, 2026

0.5.4

Mar 30, 2026

0.5.3

Mar 30, 2026

0.5.2

Mar 29, 2026

0.5.1

Mar 29, 2026

0.5.0

Mar 29, 2026

0.4.0

Mar 28, 2026

0.3.0

Mar 28, 2026

0.2.0

Mar 28, 2026

0.1.0

Mar 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

breathing_memory-0.5.7.tar.gz (65.0 kB view details)

Uploaded Apr 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

breathing_memory-0.5.7-py3-none-any.whl (43.7 kB view details)

Uploaded Apr 3, 2026 Python 3

File details

Details for the file breathing_memory-0.5.7.tar.gz.

File metadata

Download URL: breathing_memory-0.5.7.tar.gz
Upload date: Apr 3, 2026
Size: 65.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for breathing_memory-0.5.7.tar.gz
Algorithm	Hash digest
SHA256	`7848c722731a4c425b4952c74a935c8de120fe3aa1637772ea7da4b507c250b0`
MD5	`b00fa85e03097d70c4a7c03b86075d62`
BLAKE2b-256	`23a42f9801c0659dbde498d2f419606001f43af8b6e72f5ec96bc5f4078d0a5c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for breathing_memory-0.5.7.tar.gz:

Publisher: publish.yml on KazinaG/breathing_memory

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: breathing_memory-0.5.7.tar.gz
- Subject digest: 7848c722731a4c425b4952c74a935c8de120fe3aa1637772ea7da4b507c250b0
- Sigstore transparency entry: 1224341527
- Sigstore integration time: Apr 3, 2026
Source repository:
- Permalink: KazinaG/breathing_memory@76b8a40fcd14e0404e910bb2266f5b99ae1f38a9
- Branch / Tag: refs/tags/v0.5.7
- Owner: https://github.com/KazinaG
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@76b8a40fcd14e0404e910bb2266f5b99ae1f38a9
- Trigger Event: push

File details

Details for the file breathing_memory-0.5.7-py3-none-any.whl.

File metadata

Download URL: breathing_memory-0.5.7-py3-none-any.whl
Upload date: Apr 3, 2026
Size: 43.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for breathing_memory-0.5.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1857fbfaac3ad753a720c09834de1becc675a961221e98f5cff3c29f780a3f4b`
MD5	`175d796fb76546286a19ab8d8b0875e8`
BLAKE2b-256	`bbf4cba9eb63355b94b66e76cd693102af67bdd8edba2cd83117a65098da6c9d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for breathing_memory-0.5.7-py3-none-any.whl:

Publisher: publish.yml on KazinaG/breathing_memory

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: breathing_memory-0.5.7-py3-none-any.whl
- Subject digest: 1857fbfaac3ad753a720c09834de1becc675a961221e98f5cff3c29f780a3f4b
- Sigstore transparency entry: 1224341528
- Sigstore integration time: Apr 3, 2026
Source repository:
- Permalink: KazinaG/breathing_memory@76b8a40fcd14e0404e910bb2266f5b99ae1f38a9
- Branch / Tag: refs/tags/v0.5.7
- Owner: https://github.com/KazinaG
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@76b8a40fcd14e0404e910bb2266f5b99ae1f38a9
- Trigger Event: push

breathing-memory 0.5.7

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Breathing Memory

Overview

Supported Clients

Installation

Quickstart

How Memory Works

Runtime Notes

Further Reading

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance