EverAlgo agent memory: AgentCase / AgentSkill extractors and AgentBoundaryDetector facade.
Project description
everalgo-agent-memory
Agent-side memory products for EverAlgo — AgentCaseExtractor distils an agent trajectory MemCell into one AgentCase; AgentSkillExtractor maintains a cluster's reusable skill set from accumulated cases; AgentProfileExtractor proposes precision-first section-level patches to the agent's injected config files (SOUL.md / AGENTS.md). AgentBoundaryDetector handles boundary detection over mixed ConversationItem trajectories (chat + tool calls).
See the umbrella project: EverAlgo monorepo and the architecture document at docs/concepts/architecture.md.
Install
pip install everalgo-agent-memory
# Auto-pulls: everalgo-core, everalgo-boundary, everalgo-clustering
What this distribution provides
| Symbol | Role |
|---|---|
AgentBoundaryDetector |
Boundary detection on agent trajectories (filter → detect → remap for mixed ConversationItem lists) |
AgentCaseExtractor |
Distils one agent-trajectory MemCell into [] | [AgentCase] (11-step pipeline) |
AgentSkillExtractor |
Aggregates one new AgentCase into incremental skill operations for a cluster; returns add / update / retire entries |
AgentProfileExtractor |
Screens one trajectory MemCell for durable agent-config signals (four-gate, default noop) and returns validated section-level SOUL.md / AGENTS.md patches + unified diffs |
Quick start
import asyncio
import json
from everalgo.agent_memory.case import AgentCaseExtractor
from everalgo.llm.types import ChatResponse
from everalgo.testing.fake_llm import FakeLLMClient
from everalgo.types import AgentCase, ChatMessage, MemCell, ToolCall, ToolCallFunction, ToolCallRequest, ToolCallResult
_CASE_JSON = json.dumps({
"task_intent": "Search for Python async retry libraries",
"approach": "1. Search. 2. Filter. 3. Summarise.",
"quality_score": 0.82,
"key_insight": "Use tenacity AsyncRetrying for native async back-off.",
})
async def main() -> None:
fake = FakeLLMClient(responses=[ChatResponse(content=_CASE_JSON, model="fake")])
mc = MemCell(
items=[
ChatMessage(id="u1", role="user", content="Best async retry libs?", timestamp=1_700_000_000_000, sender_id="user"),
ToolCallRequest(tool_calls=[ToolCall(id="c1", function=ToolCallFunction(name="web.search", arguments='{}'))], timestamp=1_700_000_000_100, sender_id="assistant"),
ToolCallResult(tool_call_id="c1", content="Found: tenacity.", timestamp=1_700_000_000_200),
ToolCallRequest(tool_calls=[ToolCall(id="c2", function=ToolCallFunction(name="web.search", arguments='{}'))], timestamp=1_700_000_000_300, sender_id="assistant"),
ToolCallResult(tool_call_id="c2", content="tenacity supports async.", timestamp=1_700_000_000_400),
ChatMessage(id="a1", role="assistant", content="Use tenacity AsyncRetrying.", timestamp=1_700_000_000_500, sender_id="assistant"),
],
timestamp=1_700_000_000_500,
)
cases: list[AgentCase] = await AgentCaseExtractor(llm=fake).aextract(mc)
if cases:
print(cases[0].task_intent)
asyncio.run(main())
See examples/04_agent_memory_case.py for the full runnable example.
API surface
class AgentBoundaryDetector:
def __init__(self, *, llm: LLMClient) -> None: ...
async def adetect(
self, items: list[ConversationItem], *, is_final: bool = False, prompt: str | None = None
) -> DetectionResult: ...
class AgentCaseExtractor:
def __init__(self, *, llm: LLMClient) -> None: ...
async def aextract(
self, memcell: MemCell, *,
prompt_filter: str | None = None,
prompt_compress: str | None = None,
prompt_tool_pre_compress: str | None = None,
) -> list[AgentCase]: ... # length 0 (filtered) or 1
class AgentSkillExtractor:
def __init__(self, *, llm: LLMClient) -> None: ...
async def aextract(
self,
case: AgentCase,
*,
existing_relevant_skills: Sequence[AgentSkill],
supporting_cases: Sequence[AgentCase],
prompt_success: str | None = None,
prompt_failure: str | None = None,
prompt_maturity: str | None = None,
skip_quality_threshold: float = 0.2,
skip_maturity_scoring: bool = True,
maturity_threshold: float = 0.6,
retire_confidence: float = 0.1,
failure_quality_threshold: float = 0.5,
max_description_tokens: int = 400,
max_content_tokens: int = 5000,
maturity_trivial_change_ratio: float = 0.2,
maturity_reeval_change_ratio: float = 0.4,
) -> list[AgentSkill]: ...
class AgentProfileExtractor:
def __init__(
self, *, llm: LLMClient,
min_recurrence: int = 2, # implicit signals must recur this many times across sessions (gate 3)
max_file_tokens: int = 8000, # anti-bloat budget for each patched file
) -> None: ...
async def aextract(
self,
memcell: MemCell,
*,
soul_md: str,
agents_md: str,
pending_signals: Sequence[AgentProfileSignal] = (),
prompt: str | None = None,
) -> AgentProfileUpdate: ...
All class methods have a sync bridge: extractor.extract(...) is async_to_sync(aextract).
AgentCaseExtractor pipeline
The extractor runs an 11-step pipeline: strip-before-first-user → structural pre-filter → heuristic trim → over-size bail → LLM filter (skipped when ≥2 tool rounds) → tool pre-compress → LLM compress → parse → validate → build AgentCase. Returns [] when the trajectory is filtered out; returns [AgentCase] on success.
AgentSkillExtractor return contract
Pass the new AgentCase and the pre-filtered existing_relevant_skills (e.g. top-K cosine from the caller's store) plus the associated supporting_cases (cases referenced by existing skills). The caller decodes add / update / retire by checking whether skill.id is already in existing_relevant_skills and whether skill.confidence < retire_confidence.
Cases with quality_score < skip_quality_threshold (default 0.2) short-circuit to [] without calling the LLM.
AgentSkill.cluster_id is always "" on extraction — the caller stamps the cluster identity after persisting.
AgentProfileExtractor contract
SOUL.md / AGENTS.md are injected into every future system prompt and are self-reinforcing, so the operator is precision-first: it defaults to noop and a candidate signal must pass all four gates (persistence, directedness, evidence strength, novelty) before any patch is proposed. Routing follows one line: SOUL = who the agent is and how it speaks; AGENTS = global rules the agent must obey when acting; everything else (user facts, future intents, one-off task parameters, task solutions) is rejected at the gate. A single LLM call emits each candidate's gate verdicts together with its proposed patch; the prompt sees the full chat view (user + assistant text turns, tool traffic excluded) for context, but config authority stays user-only — every candidate must quote the user verbatim and the quote is re-checked in code against user messages alone. The gates are re-enforced in code.
The returned AgentProfileUpdate carries section-level patches (never whole-file rewrites — human edits are preserved), soul_diff / agents_diff unified diffs, and new_soul_md / new_agents_md with all patches applied. Patches with is_conflict=True override an existing rule (the user changed their mind); they are applied like any other patch, and the flag lets the caller route them through a user-confirmation step or surface them in debugging. signals are implicit below-gate observations: persist them and pass them back as pending_signals on later runs so recurring corrections accumulate toward min_recurrence.
Every LLM proposal is re-validated in code: the evidence quote must appear verbatim in the user messages, a modify old_text must match the file exactly once, an add must not duplicate existing content, and a patch that would push a file past max_file_tokens is dropped.
Customising prompts
import everalgo.agent_memory.prompts.case_filter as _cf
_cf.AGENT_CASE_FILTER_PROMPT = my_custom_filter_prompt # global override
Or per-call: pass prompt_filter= / prompt_compress= / prompt_tool_pre_compress= to aextract.
Related distributions
everalgo-boundary—detect_boundariesprimitive used byAgentBoundaryDetectoreveralgo-clustering—cluster_by_llmfor skill-cluster assignmenteveralgo-rank—CaseRanker/SkillRankerfor read-time ranking
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file everalgo_agent_memory-0.3.1.tar.gz.
File metadata
- Download URL: everalgo_agent_memory-0.3.1.tar.gz
- Upload date:
- Size: 71.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ff89fd0608530440bb21123eb76b9b5ade4d171c9c04f6407cf038b84eb6df15
|
|
| MD5 |
df78979422d8da3ef64de6443122272a
|
|
| BLAKE2b-256 |
ec946eead80b95d0c26e750563e18b097c710ae29467d14f719efb2d72ed12df
|
File details
Details for the file everalgo_agent_memory-0.3.1-py3-none-any.whl.
File metadata
- Download URL: everalgo_agent_memory-0.3.1-py3-none-any.whl
- Upload date:
- Size: 54.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4e8c2cf06ec4699199de5c769aa7c5ac30fdf61086526edbbd0b68bee5ec9219
|
|
| MD5 |
143a2586325becd29053c47ad3ef560b
|
|
| BLAKE2b-256 |
d82e3d892f31ecfbac8f127f4664877bed6a5ad74bd8c101e106ab583a5e5ca2
|