Local-first agent memory for Claude Code: episodic + semantic memory in one SQLite file.

These details have not been verified by PyPI

Project description

snowpack

The snowpack is the season's memory — every storm recorded as a layer.

Local-first agent memory for Claude Code. Snowpack ingests Claude Code session transcripts into episodic memory (what happened across sessions) and semantic memory (durable facts, entities, relationships), all in a single SQLite file with vector + keyword search. The agent reaches it through an ordinary CLI — no MCP server, no daemon, no infrastructure.

Status

Core pipeline implemented (episodic + semantic memory, hybrid retrieval, telemetry, distillation). See docs/adr/ADR-001-memory-architecture.md for the architecture and decision record, docs/hooks.md for ingestion hook setup, and docs/claude-md-snippet.md for the agent-facing usage docs.

Quick start

# 1. Install
pip install snowpack       # or: uv tool install snowpack

# 2. Wire everything up (idempotent, re-runnable, prompts before writing)
snowpack setup

snowpack setup checks Ollama (printing install/pull commands if it's down — it's a soft requirement, see "Embeddings" below), creates ~/.snowpack/snowpack.db, merges the ingestion + compaction-survival hooks into ~/.claude/settings.json (timestamped backup first), installs the memory snippet into ~/.claude/CLAUDE.md between managed markers, and adds the snowpack permission allowlist. --dry-run shows the diffs first, --check is a doctor that audits every integration point, and --uninstall removes exactly what setup added.

# 3. Use it
snowpack probe "auth decisions"   # hybrid retrieval (vector + keyword + recency)

(Ingestion runs out-of-band via the installed hooks; snowpack obs ingest also works manually.)

Embeddings: Ollama setup and choosing a model

Vector search needs a local embedding model served by Ollama. It is a soft requirement: without it snowpack still works — ingest stores episodes un-embedded, probe degrades to keyword + recency search, and the next ingest after Ollama comes up backfills the missing vectors automatically.

Install and run Ollama

# macOS
brew install ollama        # or download the app from https://ollama.com

# Linux
curl -fsSL https://ollama.com/install.sh | sh

# start the server (the desktop app does this automatically)
ollama serve

# fetch the default embedding model (~270 MB)
ollama pull nomic-embed-text

Prefer it sandboxed? A hardened Docker setup (localhost-only API, dropped capabilities, isolated model storage) ships in docker/docker-compose.yml:

docker compose -f docker/docker-compose.yml up -d
docker compose -f docker/docker-compose.yml exec ollama ollama pull nomic-embed-text

See docs/ollama-docker.md for GPU setup and the macOS caveat (containers can't use Apple Silicon's GPU — native Ollama is faster there).

Verify it's answering:

curl -s http://localhost:11434/api/embed \
  -d '{"model": "nomic-embed-text", "input": ["hello"]}' | head -c 120

If Ollama runs somewhere other than localhost:11434 (a container, another machine), point snowpack at it with SNOWPACK_OLLAMA_URL:

export SNOWPACK_OLLAMA_URL=http://gpu-box:11434

Choosing the embedding model

The model is fixed per database at snowpack init, because the vector tables are created with that model's output dimension (vec0 columns are fixed-width):

snowpack init                                  # nomic-embed-text (768-d)
snowpack init --model mxbai-embed-large       # higher quality, 1024-d
snowpack init --model all-minilm              # smaller/faster, 384-d

You normally don't pass --dim: init asks the running Ollama what dimension the model actually produces (and refuses a --dim that contradicts it). If Ollama isn't running, init falls back to a built-in table for common models (nomic-embed-text, mxbai-embed-large, all-minilm, snowflake-arctic-embed, bge-m3) — for anything else, either start Ollama first or pass --dim explicitly.

The configured model, dimension, and task prefixes are recorded in the database (meta table) and used for every subsequent embed, so you never specify the model again after init — obs ingest and probe read it from the database. To see what a database was initialized with:

sqlite3 ~/.snowpack/snowpack.db "SELECT * FROM meta"

Changing models later means re-embedding everything: until snowpack reindex ships, that is rm ~/.snowpack/snowpack.db, snowpack init --model <new>, snowpack obs ingest (episodes re-ingest from the transcripts, but extracted facts and telemetry are lost — export first if you care).

CLI surface

Command	Purpose
`snowpack setup`	One-command onboarding: hooks, CLAUDE.md, permissions, db (`--check` doctor, `--dry-run`, `--uninstall`)
`snowpack init`	Create and configure the database
`snowpack obs ingest`	Ingest new transcript exchanges (incremental, idempotent)
`snowpack obs extract`	Extract durable facts from episodes (API-assisted)
`snowpack obs list`	List recent episodes
`snowpack probe "query"`	Hybrid retrieval (vector + keyword + graph + recency) with telemetry
`snowpack feedback`	Mark retrieved memories as used — trains ranking
`snowpack stash`	Working-memory checkpoint per project
`snowpack resume`	Re-injection payload for SessionStart hooks (compaction survival)
`snowpack stats`	Telemetry overview; `--refresh` recomputes usefulness
`snowpack sinter`	Mine repeated corrections into CLAUDE.md candidates
`snowpack entity merge`	Point a duplicate entity at its canonical form
`snowpack pit`	Local web UI: entity graph + telemetry dashboard

The pit (web UI)

snowpack pit            # serves http://127.0.0.1:8617 and opens the browser

A read-only, single-page UI over the same SQLite file (no extra dependencies, no build step; the graph library is vendored so it works offline):

Graph tab — entities as nodes, facts as edges. Visual weights are real telemetry, not decoration: node size = usage, edge width = retrieval frequency, color = staleness, and dead gray = never retrieved — your pruning candidates at a glance. Click through node → fact → provenance episode; toggle superseded facts; search to highlight.
Stats tab — totals, retrieval latency, channel win-rate (vector vs keyword vs graph — how to rebalance fusion weights), zero-result queries (gap detection), most/least-used facts, persistent weak layers, and recent retrievals expandable to per-result channels/scores/used flags.

The server binds 127.0.0.1 only and never mutates user data (the one write is recomputing derived usefulness scores on demand). Full guide — including how to read the visual encoding and troubleshooting — in docs/pit.md; stack decisions in docs/adr/ADR-002-pit-ui.md.

Documentation map

docs/adr/ — architecture decision records (ADR-001 core, ADR-002 pit UI, ADR-003 pre-Phase-2 hardening program)
docs/plans/ — point-in-time implementation plans approved before each build round, with outcomes
docs/pit.md — running and reading the pit UI
docs/hooks.md — out-of-band ingestion hooks
docs/ollama-docker.md — sandboxed Ollama
docs/claude-md-snippet.md — agent-facing usage docs for CLAUDE.md
docs/releasing.md — publishing wheels to PyPI (trusted publishing)

Roadmap

The agent-memory market is crowded with cloud-first offerings (Mem0, Zep, Letta). Snowpack takes the opposite entry: a local-first core that syncs up when you want it to — local-first is the foundation later phases build on, not a stage to discard.

Phase 1 — local dev tool (now). Everything in this repo: single SQLite file, CLI + hooks integration, telemetry from day one. Goal: prove retrieval quality and accumulate the usage data later tuning depends on.
Phase 2 — local-first + sync. The SQLite file stays the on-device source of truth; optional sync to a hosted backend adds multi-device use, backup, and selective team sharing. The integration surface broadens beyond Claude Code: MCP server plus a language-agnostic SDK/HTTP API.
Phase 3 — hosted platform. A managed, multi-tenant memory service covering all four memory types (episodic, semantic, working, procedural). Self-hosting stays a first-class path.

Full reasoning, the fixed-vs-provisional decision table, and migration risks live in docs/adr/ADR-001-memory-architecture.md ("Phasing & evolution").

Fact extraction needs an API key (ANTHROPIC_API_KEY, OPENAI_API_KEY, or SNOWPACK_EXTRACTION_API_KEY) and defaults to Anthropic's OpenAI-compatible endpoint; override with SNOWPACK_EXTRACTION_BASE_URL / _MODEL. Keys are read from the environment only and never stored.

Development

uv sync
uv run pytest
uv run ruff check

Demo data

To try the full surface without real transcripts (and without touching ~/.snowpack), seed a sandboxed demo — synthetic transcripts for two fake projects, pre-extracted facts (including a superseded pair), and probe telemetry:

uv run python scripts/seed_demo.py        # creates ~/.snowpack-demo
export SNOWPACK_DB=~/.snowpack-demo/snowpack.db
export SNOWPACK_CLAUDE_PROJECTS=~/.snowpack-demo/projects
snowpack probe "what did we decide about auth" --all-projects
snowpack stats
snowpack pit

It works without Ollama (probe degrades to keyword+recency, exactly as in real use); with Ollama running the same script embeds everything.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.15

Jun 13, 2026

0.1.14

Jun 12, 2026

0.1.13

Jun 12, 2026

0.1.12

Jun 12, 2026

0.1.11

Jun 12, 2026

0.1.10

Jun 12, 2026

0.1.9

Jun 12, 2026

0.1.5

Jun 11, 2026

0.1.4

Jun 11, 2026

0.1.3

Jun 11, 2026

0.1.2

Jun 11, 2026

This version

0.1.1

Jun 11, 2026

0.1.0

Jun 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

snowpack-0.1.1.tar.gz (190.1 kB view details)

Uploaded Jun 11, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

snowpack-0.1.1-py3-none-any.whl (125.2 kB view details)

Uploaded Jun 11, 2026 Python 3

File details

Details for the file snowpack-0.1.1.tar.gz.

File metadata

Download URL: snowpack-0.1.1.tar.gz
Upload date: Jun 11, 2026
Size: 190.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for snowpack-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`a30dd9fc68b755c90ff619911e8d99f90c99355909bacddfe9411ca3dceafbed`
MD5	`cbaa28061db7e75dfb096f7ef6f40d08`
BLAKE2b-256	`bcc78941ae2b455e8a412727858587a8b7ba4cdc4ad2971cecf81070f741d622`

See more details on using hashes here.

Provenance

The following attestation bundles were made for snowpack-0.1.1.tar.gz:

Publisher: publish.yml on davidkelly-snoday/snowpack

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: snowpack-0.1.1.tar.gz
- Subject digest: a30dd9fc68b755c90ff619911e8d99f90c99355909bacddfe9411ca3dceafbed
- Sigstore transparency entry: 1788257394
- Sigstore integration time: Jun 11, 2026
Source repository:
- Permalink: davidkelly-snoday/snowpack@61626272a899e48a24c7de687ed977923eba9038
- Branch / Tag: refs/tags/v0.1.1
- Owner: https://github.com/davidkelly-snoday
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@61626272a899e48a24c7de687ed977923eba9038
- Trigger Event: push

File details

Details for the file snowpack-0.1.1-py3-none-any.whl.

File metadata

Download URL: snowpack-0.1.1-py3-none-any.whl
Upload date: Jun 11, 2026
Size: 125.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for snowpack-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bbbc3522f9a3c78d025934bb909a9d837cb9f617494fa314af394b220fbce307`
MD5	`ca7f76aebfe9af4f10071668608df75e`
BLAKE2b-256	`a49a6e9fdd709bc93ac125f05ba87339f77c01b25edcc1aaa055989bb490ae64`

See more details on using hashes here.

Provenance

The following attestation bundles were made for snowpack-0.1.1-py3-none-any.whl:

Publisher: publish.yml on davidkelly-snoday/snowpack

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: snowpack-0.1.1-py3-none-any.whl
- Subject digest: bbbc3522f9a3c78d025934bb909a9d837cb9f617494fa314af394b220fbce307
- Sigstore transparency entry: 1788257511
- Sigstore integration time: Jun 11, 2026
Source repository:
- Permalink: davidkelly-snoday/snowpack@61626272a899e48a24c7de687ed977923eba9038
- Branch / Tag: refs/tags/v0.1.1
- Owner: https://github.com/davidkelly-snoday
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@61626272a899e48a24c7de687ed977923eba9038
- Trigger Event: push

snowpack 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

snowpack

Status

Quick start

Embeddings: Ollama setup and choosing a model

Install and run Ollama

Choosing the embedding model

CLI surface

The pit (web UI)

Documentation map

Roadmap

Development

Demo data

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance