A pluggable, embeddable reconsolidating skill memory engine for AI agents
Project description
etchmem
Where agents grow up — not where they take notes.
Modern AI agents are built from skills. Every time a skill runs, it leaves a trail: results, feedback, corrections, surprises. That trail is experience — and experience is how an agent learns how the world actually works.
etchmem is memory for that maturation. Not a scratchpad for the current session. Not a user profile to personalize replies. Not a long chat log you grep when context runs out. It is the layer where raw observations turn into durable understanding — how things work, what users react to, what failed last time — and keep sharpening every time the agent draws on it again.
Over time, knowledge compounds. The agent stops merely following scenarios and starts carrying a worldview shaped by use.
pip install etchmem
What it is
etchmem gives your agent a persistent memory that matures through living. You deposit raw observations — facts, outcomes, feedback — scoped to a skill or left general. You retrieve with natural-language queries. Periodically you consolidate, and the engine synthesizes scattered signal into compact knowledge articles, rewriting anything that has been recalled against fresh context.
It is not a vector database and not an agent runtime. It sits above ChromaDB, which handles all storage and embedding, and exposes exactly three methods.
How it works
Memory is kept in three tiers:
- Relational — raw, append-only records with a TTL. Fresh signal lives here.
- Buffer — a working space for deposits and recall-events, waiting for the next consolidation run.
- Injected — synthesized knowledge articles, the primary search target. Only current knowledge is kept; superseded articles are hard-deleted.
The key idea is reconsolidation: every recall() call emits a recall-event into the buffer. When you later call consolidate(), the engine detects which injected articles were used, checks whether fresh signal changes anything, and rewrites them if needed. Knowledge that gets recalled stays current as a side-effect of being recalled. Knowledge nobody asks about simply rests. This is how a skill — or the agent as a whole — accumulates experience instead of starting from zero each run.
Embeddings are computed locally by ChromaDB's built-in model — no external embedding API. Synthesis (the rewrite step inside consolidate) calls an LLM via OPENAI_API_KEY or ANTHROPIC_API_KEY.
API
from etchmem import Engine
engine = Engine() # persists to .etchmem/ in the current directory
remember(data, hint=None, skill=None, metadata=None)
Deposits a text record into the relational tier. No LLM call — cheap.
data— the text to store.hint— optional float 0–1, your importance signal.skill— optional scope name; lets you filter recall by agent skill.metadata— arbitrary dict (source URL, tags, …).
recall(query, skill=None, top_k=None, hint=None) → list[SearchResult]
Retrieves relevant knowledge and emits a recall-event for future reconsolidation. Results are blended from injected (primary) and relational (fresh signal), ranked by a composite score.
consolidate(num_records="all", method="LIFO") → dict
Runs the consolidation worker: clusters the buffer, forms new injected articles, reconsolidates recalled articles against fresh context, hard-deletes superseded ones. Returns a summary dict with counts (formed, reconsolidated, dropped, kept, flushed, superseded). Requires an LLM API key.
Hello world
import os
from etchmem import Engine
os.environ["ANTHROPIC_API_KEY"] = "sk-ant-..." # or OPENAI_API_KEY
engine = Engine()
# Add some facts
engine.remember("The Eiffel Tower is 330 metres tall including its antenna.")
engine.remember("The tower was completed in 1889 and was originally intended to be temporary.")
engine.remember("It receives about 7 million visitors per year, making it the world's most visited paid monument.")
# Retrieve relevant knowledge
results = engine.recall("How tall is the Eiffel Tower?")
for r in results:
print(r.score, r.content)
# Consolidate — synthesizes the raw deposits into a compact knowledge article
summary = engine.consolidate()
print(summary)
# {'formed': 1, 'reconsolidated': 0, 'dropped': 0, 'kept': 0, 'flushed': 3, 'superseded': 0}
# Subsequent recalls now hit the synthesized article,
# and any new signal deposited before the next consolidate()
# will be blended in during reconsolidation.
results = engine.recall("Eiffel Tower visitors")
print(results[0].content)
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file etchmem-0.1.5.tar.gz.
File metadata
- Download URL: etchmem-0.1.5.tar.gz
- Upload date:
- Size: 41.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2e705c8c0e906722ebd1e508ea44e975e0938cd4fc6a10b42d13df53365e25b0
|
|
| MD5 |
78189618278e057878d98415cacaa0da
|
|
| BLAKE2b-256 |
f632429c3266b0fca52300f7c2732e60dcc22755d919c6fa68be84df58802d0e
|
File details
Details for the file etchmem-0.1.5-py3-none-any.whl.
File metadata
- Download URL: etchmem-0.1.5-py3-none-any.whl
- Upload date:
- Size: 36.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1fd8bcbe934c6f7bbdc5605737ec9a5e7d9fb7a5087e8d31fb9ee85ab44b8506
|
|
| MD5 |
4b5260f507760aa1893305b8df8c0452
|
|
| BLAKE2b-256 |
222621975c50105cd0c3ae9cd21f082723a485a1d0cef5ad7c04855e95c2ece4
|