Skip to main content

Incremental, crash-resilient re-mining wrapper for mempalace — mine only what's new since last run, or a single session, without losing history or creating duplicates.

Project description

mempalace-refresh

Incremental, crash-resilient re-mining wrapper for mempalace.

Mempalace's built-in mine command is skip-if-filed by design — once a file has any drawer in the palace, mine never re-visits it. That's correct for static files, but wrong for live-appending Claude Code session logs that grow every message, across compactions, for days.

mempalace-refresh makes mempalace properly incremental:

  • Only mines what's new since the last run (per-file mtime tracking)
  • Subprocess per file — a ChromaDB segfault on file N doesn't nuke the batch
  • Additive — upsert semantics + stable drawer IDs mean re-mining never deletes or duplicates; old chunks no-op, new chunks are appended
  • source_file metadata stays correct — points at the real .jsonl, not a tmp path
  • Targeted — mine just one session with ONLY <uuid>
  • Fail-loud on API drift — if a mempalace update renames what we monkey-patch, the script exits 99 with a clear message

Install

pip install mempalace-refresh

Requires mempalace 3.2.x installed.

Usage

mempalace-refresh              # catch up: mine everything new since last run
mempalace-refresh STATUS       # per-file change status
mempalace-refresh ONLY <uuid>  # mine a single session (substring match)
mempalace-refresh RESET        # wipe state (next run re-mines everything)
mempalace-refresh REPAIR       # delegates to `mempalace repair --yes`

State lives at ~/.cache/mempalace-refresh/state.json. Nothing else is stored — the palace itself is mempalace's.

Environment

  • PROJECTS_DIR — override ~/.claude/projects/ (Claude Code session logs)
  • MEMPALACE_PALACE — override ~/.mempalace

How it works (short)

  1. Each .jsonl under PROJECTS_DIR is tracked by mtime in state.
  2. Changed files are mined one at a time via a fresh Python subprocess.
  3. The subprocess monkey-patches mempalace.palace.file_already_mined to bypass mempalace's skip-check, and mempalace.convo_miner.scan_convos to feed it exactly one file.
  4. Mempalace then scans the real .jsonl, runs its regex-based general_extractor, and upserts drawers. Because drawer IDs are hash(source_file + chunk_index):
    • Chunks that already existed → upsert updates metadata in place, no-op
    • Chunks for newly appended content → new IDs → genuinely new drawers
  5. State is committed per file so any later crash loses zero progress.

Result: your palace reflects every session exactly as it would if you'd mined each one from the start, plus incremental additions for all subsequent growth.

License

MIT. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mempalace_refresh-0.2.2-py3-none-any.whl (11.2 kB view details)

Uploaded Python 3

File details

Details for the file mempalace_refresh-0.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for mempalace_refresh-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 01041be0b2708449593e3f76a9abb048ff536428f57d21ccdd6bf7e03664e17f
MD5 cae37650d92e5724759f4c266506cb80
BLAKE2b-256 ec54a18c921847cad88f247185bd89bd9335c694c4cc14da3fb0d67d636ca455

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page