Skip to main content

Persistent memory for AI coding agents. Local SQLite + UserPromptSubmit hook — matched beliefs in the prompt before the model sees it. Deterministic, auditable, no embeddings, no cloud.

Project description

A figure of shimmering cloud rising from a dark sea, weaving threads of light into a constellation of beliefs

aelfrice

Your AI stops forgetting. Set up once. Stays out of the way.

No cloud. No account. No telemetry.

PyPI Python License CI

You correct your agent. "Got it," it says. Next session, same mistake.

aelfrice runs in the background and stops the amnesia. Write a rule once and every relevant prompt thereafter ships with that rule already attached — before the model sees your message. No rules-file chain to maintain, no cross-references for the agent to skip; the matched beliefs are in the prompt, not in a file the agent is supposed to consult.

For developers using AI coding agents — first-class via the UserPromptSubmit hook for hosts that expose it; any MCP host via the included stdio server. Local-only by design — embeddings, vector RAG, and cloud sync are explicitly out of scope. See Philosophy for the trade-off.

Install

uv tool install aelfrice    # requires uv — https://docs.astral.sh/uv/
aelf setup                  # wire the UserPromptSubmit hook into your agent
aelf onboard .              # one-shot project scan: filesystem, git log, code structure
aelf lock "never push directly to main; use scripts/publish.sh"

That's it. The next prompt that mentions "push" already has the rule. From here on out aelfrice is invisible — no command to remember to run, no file to keep updated.


What you'll see

You type a message in your agent. aelfrice's hook fires before the model sees it and prepends matched beliefs as an <aelfrice-memory> block:

<aelfrice-memory>
The following are retrieved beliefs from the local memory store. ...
<belief id="a1f3c2d0" lock="user">never push directly to main; use scripts/publish.sh</belief>
<belief id="91e02d3c" lock="user">commits must be SSH-signed with ~/.ssh/id_ed25519</belief>
<belief id="77c01b2a">the publish script runs the release checks before tagging</belief>
</aelfrice-memory>

push the release

The model reads the whole thing as one message. Your rules arrive every relevant time, not when the agent decides to check a file.


What it does for you

  • Stops the AI forgetting your rules. Lock a rule once with aelf lock "..." — it comes back attached to every prompt after that, in every new session. You don't have to remind the AI; aelfrice does.
  • The AI can't skip it. What aelfrice remembers is in the prompt before the model sees it — not stored in a file the AI is supposed to check on its own. The reminding happens for the AI, not by you.
  • Nothing to maintain. Passive capture runs in the background: every turn is logged and ingested, successful git commit messages too. Your memory grows while you work, without you typing aelf at all.
  • Stays on your computer, leaves on command. One SQLite file on your machine. No cloud account. No telemetry. If you stop trusting aelfrice, aelf uninstall removes it cleanly in one command (--archive encrypts the DB to a file first).

Why not just a rules file?

A rules file is advice the agent may read; aelfrice is context the model has already read. And by Leonard Lin's bar, "a vector store with a similarity query" is not a memory system either — a memory system has to answer who wrote this, when, via what ingress, what supersedes it, and how do I take it back. aelfrice meets the four pillars (provenance, write gates, conflict handling, reversibility) directly. The side-by-side against hand-maintained rules files and vector stores lives in COMPARISON.md.


Day-to-day

After aelf setup you rarely type aelf again. The everyday surface:

aelf onboard .                      # once per project — scan and ingest
aelf lock "never push to main"      # add a permanent rule
aelf locked                          # see what rules are active
aelf search "push to main"           # check what the agent will see
aelf status                          # quick health summary
aelf setup / aelf doctor            # initial install + verification
aelf feed                            # read the belief-write event log (v3.5+)
aelf stale --older-than 90 --cold-for 30   # surface forgotten beliefs (v3.5+)
aelf review --generate               # weekly keep / remove / lock checkpoint (v3.5+)

aelf --help shows the everyday surface; aelf --help --advanced lists the rest. Full reference: COMMANDS. The same operations are exposed as MCP tools and /aelf:* slash commands — same library underneath. See MCP and SLASH_COMMANDS.


How it works

Three retrieval lanes run on every prompt (a fourth, BFS graph expansion, is opt-in), the best matches get prepended to your prompt, and the model reads the lot as one message:

L0: locked beliefs   -> rules you marked permanent (always returned, never trimmed)
L2.5: entity index   -> deterministic NER-extracted entity lookup, exact + stem match
L1: FTS5 keyword     -> SQLite full-text search, BM25 + posterior-weighted rerank
L3: graph walk       -> typed-edge BFS from the L0+L2.5+L1 seed set (SUPPORTS, CONTRADICTS,
                        SUPERSEDES, DERIVED_FROM, ...) — opt-in: [retrieval] bfs_enabled = true

L0 always ships. L1, L2.5, and (when enabled) L3 are budget-trimmed against the merged candidate set in score-descending order; locked beliefs win every overflow. Default budget: 1,500 tokens per hook-injected prompt (the aelf search / library retrieve() default is 2,400). A separate structural-HRR lane (Plate-FFT bind/probe) routes queries that parse as structural markers in the retrieve_v2 API; ordinary prompts never touch it.

Lock count is the operator's baseline-context budget knob. If you lock 200 things, every session opens with all 200, by design. Everything non-locked is BM25-ranked and budget-trimmed. The first prompt of a new session carries one extra block — a <session-start> sub-block listing all locks plus load-bearing unlocked beliefs (corroboration ≥ 2, or posterior mean ≥ ⅔ with α+β ≥ 4); subsequent prompts in the same session skip it.

Bench evidence on the labelled query-strategy corpus measured +0.2851 absolute NDCG@k (+94.8%) versus the v1.4 raw-BM25 baseline (v3.0 30-row corpus, 2026-05-12) at +0.96 ms p99 over legacy-bm25 (re-measured 2026-05-26; gate budget +5 ms delta; see tests/bench_gate/test_query_strategy.py). Full lane wiring, composition, and federation peer DBs: ARCHITECTURE § Retrieval.


Memory model

Every belief carries a (α, β) Beta-Bernoulli posterior: α / (α+β) is the confidence; α + β is how much evidence backs that confidence. New beliefs sit at low evidence (high variance, retrievable but discounted); locked beliefs short-circuit decay and pin as ground truth.

You run It stores
aelf lock "never commit .env files" Permanent rule. Returned on every retrieval.
aelf onboard . Walks the project — git log, prose headings, code structure — and ingests structural facts as agent_inferred beliefs.
aelf feedback <id> used α += 1. Strengthens the belief's posterior.
aelf feedback <id> harmful β += 1. Weakens it. Locks resist passive feedback by design — change with aelf unlock / aelf delete.
aelf promote <id> Flips origin from agent_inferred to user_validated. With --to-scope <SCOPE>, also moves federation visibility (project / global / shared:<name>).
/aelf:wonder <topic> Researches the topic and writes the findings as speculative phantoms; /aelf:reason <topic> can then walk them.
(passive — no command) Default-on auto-capture: every prompt/response turn is logged and ingested at compaction; successful git commit events are ingested too. Opt out per-hook via aelf setup --no-transcript-ingest, --no-commit-ingest, --no-session-start, --no-stop-hook, --no-sessionstart-recap, --no-search-tool, --no-search-tool-bash, --no-pre-issue-guard — see INSTALL § default-on hooks.

Each belief has an origin column tying it to the action that wrote it — one of user_stated, user_corrected, user_validated, user_transcript, agent_inferred, agent_remembered, document_recent, speculative, unknown. The store is a single SQLite file; open it in any browser, nothing is hidden.


Reasoning surfaces

Two slash commands let the agent reach back into the belief graph mid-turn, beyond the auto-injected retrieval block. They pair: /aelf:wonder grows the graph by researching; /aelf:reason walks the enriched graph for structured verdicts.

/aelf:wonder <topic> — the research surface. Given a topic, aelfrice runs gap analysis on what the store already knows, generates 2–6 orthogonal research axes (always-on domain_research + internal_gap_analysis; conditional contradiction_resolution / uncertainty_deep_dive / coverage_extension), has the host agent fan out one research task per axis to research and write up findings, then persists the merged research as new speculative beliefs via wonder_ingest. Those phantoms sit in the graph at low evidence — discoverable by retrieval and by the next /aelf:reason <topic> — until you promote them with aelf promote (or lock the underlying statement, which auto-promotes a matching phantom). Agent-count shorthand quick 2-agent / deep 4-agent is recognised in the query string.

/aelf:reason <query> — the structured-walk surface. Walks the belief graph from BM25-seeded starting points and emits a typed reasoning trace: hops with edge-type breadcrumbs, a VERDICT (SUFFICIENT / PARTIAL / UNCERTAIN / INSUFFICIENT / CONTRADICTORY), IMPASSES (typed gaps, ties, or constraint failures), and SUGGESTED UPDATES(belief_id, direction, note) rows that map straight to aelf feedback so the conclusion closes the loop on the beliefs that fed it. Each impasse is dispatched by the host agent to a role-tagged worker (Verifier / Gap-filler / Fork-resolver). Peer hops in foreign federation scopes are annotated [scope:<name>].

The pair-rhythm is the point: /aelf:wonder adds fresh thinking to the graph, then /aelf:reason draws conclusions across it. Both surfaces are deterministic in the aelfrice layer (verdict classification, impasse derivation, axis generation, suggested-update mapping). The only LLM calls happen when the host agent dispatches one worker per impasse or research axis — and those calls run under the host's own credentials, not aelfrice's. Specs: COMMANDS § wonder, COMMANDS § reason.


What you get for free

Running in the background. No action required after aelf setup.

  • Passive capture. Eight default-on hooks: UserPromptSubmit retrieval, four-event transcript-ingest, PostToolUse:Bash commit-ingest, SessionStart locked-belief injection (with a belief-write recap line, v3.5+), Stop lock-prompt, PreToolUse:Grep|Glob memory-first search, PreToolUse:Bash memory-first search, and the PreToolUse:Bash issue-dup guard. Session activity flows into the belief graph without you typing aelf at all; opt out per-hook — see INSTALL § default-on hooks.
  • Determinism. SQLite + a deterministic numeric stack (numpy / scipy / snowballstemmer — no GPU, no network). No embeddings, no learned re-rankers, no LLM in the retrieval path. Every result traces to the action that wrote it.
  • Local-only. SQLite at <git-common-dir>/aelfrice/memory.db. By default, the only outbound call aelfrice itself makes is the update notifier — a TTL-gated, read-only GET to https://pypi.org/pypi/aelfrice/json (disable with AELF_NO_UPDATE_CHECK=1). No telemetry; no accounts. The memory/retrieval path never touches the network. (LLM dispatches in /aelf:wonder / /aelf:reason flows do reach the network — under the host agent's credentials, not aelfrice's; the retrieval path stays local.) Per-project isolation by construction. Read-only cross-project federation via knowledge_deps.json — peer DBs are opened read-only, foreign-id mutations are rejected at the API surface. See PRIVACY.md.
  • Removable. aelf uninstall --archive backup.aenc encrypts the DB to a file, then deletes it. Or --purge for a full wipe.

Obsidian export

If you already live in Obsidian, aelf export-obsidian <vault-path> emits the belief graph as one Markdown note per belief under <vault>/aelfrice/. Typed edges land in YAML front-matter for Dataview; the same edges appear in the note body as wikilinks so the graph view has something to draw. The export is one-way (DB → vault): SQLite stays the source of truth, and the <vault>/aelfrice/ subdirectory is wiped and rewritten on each run.

Scopes: --scope all (everything, capped by --max-notes), --scope recent (newest first), --scope query "<text>" (BM25 seeds + N-hop neighbourhood). Default cap is 500 notes; the hard ceiling is 5000 unless --force is passed.

Two structural limits, shipped with the feature: Obsidian's built-in graph view chokes around a few thousand nodes (bound the export with --scope query / --max-notes, or use aelf graph for query-anchored visualization at any store size), and the graph view is untyped — edge types are preserved in YAML front-matter and queryable via Dataview, but the graph view will not show them.


Status

Latest stable: v3.5.1 (2026-06-10). Per-entry detail in CHANGELOG § 3.5.1. Per-version history: docs/concepts/ROADMAP.md. Known limits: docs/user/LIMITATIONS.md.

OSSInsight

Reproducibility


Documentation

Citation

@software{aelfrice2026,
  author = {robotrocketscience},
  title  = {aelfrice: deterministic Bayesian memory for AI coding agents},
  year   = {2026},
  url    = {https://github.com/robotrocketscience/aelfrice},
  license = {MIT}
}

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aelfrice-3.6.0.tar.gz (7.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aelfrice-3.6.0-py3-none-any.whl (687.5 kB view details)

Uploaded Python 3

File details

Details for the file aelfrice-3.6.0.tar.gz.

File metadata

  • Download URL: aelfrice-3.6.0.tar.gz
  • Upload date:
  • Size: 7.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aelfrice-3.6.0.tar.gz
Algorithm Hash digest
SHA256 b988cc50cb3bb9f82b811d3c5c4fe96a031547b7f1e318c36b65e12deb206231
MD5 788a13fff5812734694b435ecbb393f8
BLAKE2b-256 deeb1b54dda77a1dcb53302566189deb1b1763babe9ec161ca3e19e8fb13e42d

See more details on using hashes here.

Provenance

The following attestation bundles were made for aelfrice-3.6.0.tar.gz:

Publisher: publish.yml on robotrocketscience/aelfrice

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file aelfrice-3.6.0-py3-none-any.whl.

File metadata

  • Download URL: aelfrice-3.6.0-py3-none-any.whl
  • Upload date:
  • Size: 687.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aelfrice-3.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6770ff2871f93cd0944c285ab66b7b0ccffd8c3e1e77fa49310b3a164f075144
MD5 ad94bd21e955fd3760a889d05f5f595a
BLAKE2b-256 9c0b0439d58a71de2b731e107113866d2489e0f776385b293f8e2898dafb638a

See more details on using hashes here.

Provenance

The following attestation bundles were made for aelfrice-3.6.0-py3-none-any.whl:

Publisher: publish.yml on robotrocketscience/aelfrice

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page