memor-cli

Measured memory for coding agents. Fire and forget — no API keys needed.

These details have not been verified by PyPI

Project links

Project description

                                                _
 _ __ ___   ___ _ __ ___   ___  _ __       __ _(_)
| '_ ` _ \ / _ \ '_ ` _ \ / _ \| '__|____ / _` | |
| | | | | |  __/ | | | | | (_) | | |_____| (_| | |
|_| |_| |_|\___|_| |_| |_|\___/|_|        \__,_|_|

  Measured memory for coding agents.

Automatic background memory for Claude Code, Cursor, Codex, and Copilot. Fire and forget — no API keys needed.

Memor watches your coding sessions, extracts decisions and patterns, and recalls relevant context on every prompt. Works with Claude Code, Cursor, OpenAI Codex CLI, and GitHub Copilot CLI. Zero configuration. One install. Your agent remembers everything.

Quick Start

# Install globally (recommended)
pipx install memor-cli

# Install the hook + download embedding model (~60MB)
memor install-hook                  # interactive — pick Claude Code, Codex, or Copilot
memor install-hook --agent claude   # or pass directly

# Start as a background service (macOS/Linux)
memor service install

# Or run in the foreground
memor daemon

That's it. Every prompt now gets automatic context recall. memor service install also starts the dashboard as a background service, so it's already live at http://localhost:8420 (and is recycled whenever you stop/restart/uninstall the service). To run it in the foreground instead:

memor dashboard
# Opens http://localhost:8420

Alternative install: pip install memor-cli works too — just make sure ~/.local/bin is on your PATH so the memor command is available.

How It Works

  You type a prompt (Claude Code / Codex / Copilot)
      |
      v
  Hook fires — auto-detects which agent
      |
      v
  Embed query locally (model2vec, ~2ms)
      |
      v
  Hybrid retrieval: dense vectors + lexical BM25, fused (RRF)
      |
      v
  Relevance gate drops off-topic matches (inject nothing if nothing fits)
      |
      v
  Rank: similarity + recency + kind weight + quality
      |
      v
  Inject relevant context into prompt
      |
      v
  Your agent sees past decisions, bugfixes,
  architecture choices — without you re-explaining

Supported Agents

Agent	Hook protocol	Config location	Install
Claude Code	`UserPromptSubmit` + `additionalContext`	`~/.claude/settings.json`	`memor install-hook --agent claude`
Codex CLI	`UserPromptSubmit` + `additionalContext`	`~/.codex/hooks/hooks.json`	`memor install-hook --agent codex`
Copilot CLI	`userPromptSubmitted` + `additionalContext`	`~/.copilot/hooks/memor.json`	`memor install-hook --agent copilot`
Cursor	`beforeSubmitPrompt` + `additionalContext`	`~/.claude/settings.json` (loaded as Claude user hooks)	automatic — covered by the Claude install

A single memor-hook binary auto-detects which agent is calling it — no separate entry points needed. Cursor loads the same Claude user hooks, so installing for Claude Code covers Cursor too. The dashboard tracks recalls per agent so you can see usage across all your environments.

Note: Cloud-hosted agents (Codex cloud, Copilot cloud agent) run in remote sandboxes and cannot reach local hooks. MCP server support for sandboxed agents is planned (#26).

Two background processes:

Daemon — polls ~/.claude/projects/ for transcripts, embeds chunks, runs distillation, analyzes feedback (positive and negative), promotes cross-project patterns to global scope, compacts duplicates, auto-compacts the vector index when bloated, tracks session-level token usage. All local.
Hook — fires on every prompt, recalls relevant memories, injects them as context. Sub-15ms. Works across Claude Code, Cursor, Codex, and Copilot.

No API keys required. Embeddings run locally via model2vec (potion-base-8M, 256-dim). Vectors stored in sqlite-vec. Everything runs on your machine.

Hybrid Retrieval

Memor retrieves over two channels and fuses them, so it catches both semantic matches and exact terms:

Dense — local vector similarity (model2vec) for semantic recall.
Lexical — SQLite FTS5 / BM25 over the raw text, to recover exact identifiers, error strings, and API names that static embeddings blur together.

The two rankings are combined with Reciprocal Rank Fusion (RRF). A relevance gate drops anti-correlated (off-topic) candidates before ranking, so an unrelated prompt injects nothing rather than the least-bad guess. The lexical channel only activates when the dense channel finds the query on-topic, preventing generic words from pulling in noise.

Tunable via MEMOR_MIN_SIMILARITY (relevance floor, default 0.0) and MEMOR_MAX_TOKENS (injection budget, default 1500).

Scoring

Surviving candidates are ranked by four signals:

Signal	Weight	How it works
Semantic similarity	50%	Dense + lexical relevance, fused via RRF
Recency	25%	Exponential decay with 14-day half-life — recent decisions rank higher
Kind weight	15%	Distilled memories (1.3x) rank above raw session chunks (1.0x)
Quality	10%	Bayesian score from implicit feedback — memories the agent actually uses rank higher

This means a relevant decision from yesterday beats a vaguely-related chunk from a month ago — even if the raw embedding similarity is similar.

Feedback Loop

Memor tracks whether recalled memories actually get used by the agent — and whether they actively hurt. After each session, the daemon analyzes the transcript in both directions:

Positive signal — n-gram overlap or semantic similarity between recalled content and the agent's response. Memories that consistently prove useful get quality boosts.
Negative signal — user rejection ("no that's wrong", "we switched to X") or assistant contradiction ("however, looking at the current code, we actually use Y"). Memories that get corrected receive a quality penalty, making them less likely to be recalled next time.

The quality formula is Bayesian: (uses - negatives + 1) / (recalls + 2). One correction weighs as much as one positive use, so harmful memories drop fast. Memories never recalled in 30+ days get automatically deactivated. Near-duplicate memories are compacted into one.

What Gets Stored

Kind	Source	Description
`session_chunk`	Daemon auto-ingest	Filtered turns from Claude Code transcripts
`memory`	Extractive distillation	Key decisions, patterns, bugfixes per session

Memories are automatically classified as decision, bugfix, lesson, snippet, or generic extract based on content patterns. The daemon runs a signal filter that keeps decisions, bugfixes, lessons, and code rationale while skipping noise (tool calls, file listings, boilerplate).

Global Memories

Some patterns aren't project-specific — they're yours. "Always use type hints." "Structure FastAPI apps with a routes/ directory." "Prefer composition over inheritance."

Memor detects these automatically. When the same pattern appears in 3 or more projects (measured by embedding similarity), the daemon promotes it to a _global scope:

Global memories are recalled everywhere — they show up in every project's search results alongside project-specific memories.
Source duplicates are deactivated — the per-project copies get superseded by the single global version, reducing clutter.
No manual tagging — promotion is fully automatic, based on cross-project clustering.

This means your coding habits and preferences follow you into new projects from the first prompt, without you having to re-explain anything.

Dashboard

memor dashboard

Dark fintech-inspired UI showing:

Hero metrics — total memories, recall count, avg latency, coverage — with sparkline bars
Agent breakdown — per-agent recall stats (Claude, Cursor, Codex, Copilot) with hit rates
Daily recall activity — stacked bar chart of hits vs misses over time
Session efficiency — real token savings measured from API usage data (avg tokens/turn with vs without recall)
Per-project breakdown — artifact counts, token totals, last activity
Recent recalls — every hook event with agent badge, scores, latency, and status

Commands

memor help                           Print the full manual
memor install-hook                   Install hook + download model (interactive agent picker)
  --agent claude|codex|copilot       Choose agent directly
memor daemon                         Auto-ingest + distill (background watcher)
memor dashboard                      Web dashboard on localhost:8420
memor version                        Print installed version
memor service install                Run daemon + dashboard as background services (launchd/systemd)
  --no-dashboard                     Install only the daemon
memor service restart                Restart both services (use after `pipx upgrade`)
memor service stop                   Stop both background services
memor service uninstall              Remove both background services
memor service status                 Show daemon + dashboard status
  (dashboard port: set MEMOR_DASHBOARD_PORT, default 8420)
memor query <text>                   Search memories from the CLI
memor reingest                       Wipe DB and re-ingest everything
memor reingest --project <name>      Re-ingest only one project
memor forget-stale                   Deactivate memories unused for 30+ days
memor compact                        Rebuild vector index, reclaim wasted space
memor scan                           Audit DB for leaked secrets
memor scan --purge                   Redact secrets in place
memor setup-model                    Download/retry the embedding model
memor ingest-cc <file>               Ingest a single transcript
memor ingest-project <dir>           Bulk ingest a project directory
memor ingest-doc <file>              Ingest a markdown document
memor distill --project <name>       Run distillation manually
memor eval <cases.json>              Run eval suite
memor eval-counterfactual --project  Win/tie/loss vs no-memory baseline
memor bench-embed --project <name>   Compare embedding models

Architecture

memor/
+-- types.py              Core dataclasses: Artifact, Scope, Hit, RetrievalTrace
+-- interfaces.py         Protocols: Embedder, LLM, MemoryStore
+-- cli.py                Typer CLI entry point
+-- daemon.py             Auto-ingest + auto-distill + compaction watcher
+-- project.py            Git-root project resolver (filesystem-aware)
+-- recall.py             Shared recall core (used by hook + skill)
+-- redact.py             Secret detection and redaction at ingest
+-- feedback.py           Feedback analyzer (positive usage + negative signals)
+-- global_memories.py    Cross-project promotion to _global scope
|
+-- retrieve/
|   +-- retriever.py      Hybrid retrieval (dense + BM25, RRF) + relevance gate + scoring
|
+-- store/
|   +-- sqlite_store.py   SQLite + sqlite-vec + FTS5 (WAL mode, dimension safety)
|
+-- embed/
|   +-- local.py          model2vec (potion-base-8M, 256-dim, ~60MB)
|   +-- api.py            OpenAI-compatible embedding API (optional)
|   +-- fake.py           Deterministic SHA-256 embedder (tests)
|
+-- service.py            Background service management (launchd/systemd)
+-- dashboard/
|   +-- server.py         FastAPI dashboard backend
|   +-- static/index.html Self-contained dashboard (no CDN deps)
|
+-- distill/
|   +-- extractive.py     TF-IDF + clustering + auto-classification
|   +-- distiller.py      Extractive + optional LLM abstractive
|
+-- eval/
    +-- runner.py          4-baseline eval runner
    +-- judge.py           LLM-as-judge evaluation
    +-- embed_benchmark.py Embedding model comparison

memor/hook_cli.py          Hook entry point — auto-detects Claude/Cursor/Codex/Copilot
memor/hook_server.py       Hook server with agent detection + response formatting
skill/recall.py            Standalone recall script

Security

Nothing leaves your machine. In the default configuration:

No telemetry, no analytics, no phone-home. Zero outbound network calls.
Embeddings run locally via model2vec static token embeddings — no inference runtime, no GPU (one-time model download from HuggingFace — no user data sent).
Hook transport is a Unix socket (~/.memor/hook.sock), not a network port.
Dashboard binds localhost only.

The only optional network paths are the LLM-based abstractive distiller (requires explicitly setting ANTHROPIC_API_KEY) and the API embedding backend — both off by default.

Secret redaction

Memor automatically redacts secrets at ingest, before anything is embedded or stored:

API keys (AWS AKIA..., OpenAI sk-..., Anthropic sk-ant-..., GitHub ghp_..., Stripe, Slack)
JWTs, PEM private key blocks
Connection strings (postgres://, mongodb://, redis://, etc.)
.env-style assignments (DB_PASSWORD=..., API_KEY=...)
High-entropy tokens (Shannon entropy > 4.0, length > 20)

Redacted content is replaced with [REDACTED] in place, preserving surrounding context. To audit and clean an existing database: memor scan (audit) or memor scan --purge (redact in place).

Contradiction handling

When a new memory contradicts an older one in the same project (detected via replacement cues like "switched from X to Y", "no longer", "ripped out"), the older memory is automatically deactivated. This prevents stale decisions from being recalled and misleading the agent.

Local storage

The memory database (~/.memor/memor.db) is stored as plaintext SQLite on disk. For at-rest protection, we recommend enabling OS-level full-disk encryption (FileVault on macOS, LUKS on Linux) which covers all local files with zero performance overhead.

Development

git clone https://github.com/bnimit/memor-ai.git
cd memor-ai
python3 -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"

pytest  # 304 tests

License

MIT. See LICENSE for the full text.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.9.0

Jun 24, 2026

0.8.0

Jun 12, 2026

0.7.1

Jun 11, 2026

0.7.0

Jun 10, 2026

0.6.1

Jun 10, 2026

0.6.0

Jun 9, 2026

0.5.1

Jun 9, 2026

0.5.0

Jun 9, 2026

0.4.1

Jun 8, 2026

0.4.0

Jun 7, 2026

0.3.0

Jun 5, 2026

0.2.0

Jun 5, 2026

0.1.1

Jun 4, 2026

0.1.0

Jun 4, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

memor_cli-0.9.0.tar.gz (117.3 kB view details)

Uploaded Jun 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

memor_cli-0.9.0-py3-none-any.whl (88.2 kB view details)

Uploaded Jun 24, 2026 Python 3

File details

Details for the file memor_cli-0.9.0.tar.gz.

File metadata

Download URL: memor_cli-0.9.0.tar.gz
Upload date: Jun 24, 2026
Size: 117.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for memor_cli-0.9.0.tar.gz
Algorithm	Hash digest
SHA256	`95a74474f9201727fae446cd12116b3b0cd7d0d5f763854aa825c814679f5eea`
MD5	`24496fd671467990a3f5a1084852d138`
BLAKE2b-256	`de98ec4c696f965faffd7e5b2c26842d7c24d392ec6b97882f022459430d0df0`

See more details on using hashes here.

File details

Details for the file memor_cli-0.9.0-py3-none-any.whl.

File metadata

Download URL: memor_cli-0.9.0-py3-none-any.whl
Upload date: Jun 24, 2026
Size: 88.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for memor_cli-0.9.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7c4bac67f06056ffb3ae5f4f8ddc1b08fc29930f1994c52c7f0961f12a7d1194`
MD5	`4f1f22aa842ca53ed9b2c17fe6165698`
BLAKE2b-256	`f0d86af7afb6702262c830d2ea559aebb9b68a0bfa4390649b8928b68eb0eba0`

See more details on using hashes here.

memor-cli 0.9.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Quick Start

How It Works

Supported Agents

Hybrid Retrieval

Scoring

Feedback Loop

What Gets Stored

Global Memories

Dashboard

Commands

Architecture

Security

Secret redaction

Contradiction handling

Local storage

Development

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes