Edge-first long-term memory retrieval for multi-session agent conversations.

Project description

EdgeMem

EdgeMem is an edge-first long-term memory retrieval prototype for multi-session agent conversations. It builds lightweight local memory indexes from dialogue turns, routes questions through timeline and graph structure, and optionally calls an LLM only for final answer generation and evaluation.

Highlights

Edge-first ingestion: local entity extraction, keyword extraction, time parsing, and JSON persistence.
Structured memory retrieval: timeline retrieval, hypergraph routing, tri-graph diffusion, optional embedding retrieval, and score fusion.
LoCoMo and LongMemEval-S support: reproducible CLIs for long-term memory QA retrieval and generation.
Explainable evidence: every retrieved snippet includes source, score, session, speaker, and timestamp metadata.
Standalone visual demo: docs/edgemem_visual_demo.html demonstrates graph-routed memory vs summary-only memory.

Project Layout

edgemem_op/
  edgemem/                    # Core Python package
    cli/                      # answer/evaluate/baseline CLIs
    data_models.py            # MemoryNode, HyperEdge, HyperGraph, Evidence
    timeline.py               # Timeline retriever
    hypergraph.py             # Hypergraph memory and routing
    trigraph.py               # Entity/keyword-turn-session graph retriever
    embedding_retriever.py    # Optional embedding retriever
    generator.py              # Evidence-to-answer prompt
    eval_judge.py             # LLM-as-judge evaluation
  edgemem_memory_plugin/      # Embeddable memory plugin facade
  scripts/                    # Analysis and demo trace utilities
  docs/                       # Method notes, formulation, visual demo
  evaluation/data/locomo/     # Place LoCoMo data here locally; not committed
  longmemeval/...             # Place LongMemEval-S cleaned data here locally; not committed
  examples/traces/            # Exported retrieval trace example
  requirements.txt            # Minimal runtime dependencies
  pyproject.toml              # Package metadata

Installation

cd edgemem_op
python -m pip install -r requirements.txt
python -m spacy download en_core_web_sm

If you want to use a larger spaCy model:

python -m spacy download en_core_web_trf
export EDGEMEM_SPACY_MODEL=en_core_web_trf

Configuration

EdgeMem reads OpenAI-compatible API settings from environment variables:

export LLM_API_KEY="your-key"
export LLM_BASE_URL="https://api.openai.com/v1"  # optional
export LLM_MODEL="gpt-4o-mini"

export EMB_API_KEY="$LLM_API_KEY"                # only needed for embed/triembed
export EMB_BASE_URL="$LLM_BASE_URL"
export EMB_MODEL="text-embedding-3-small"

You can also copy my_config.example.py to my_config.py for local development, but do not commit my_config.py.

Python Memory Plugin

EdgeMem can be embedded directly as a long-term memory plugin:

from edgemem_memory_plugin import EdgeMemMemoryPlugin, MemoryEvent, MemoryQuery

memory = EdgeMemMemoryPlugin()
memory.write_event(
    MemoryEvent(
        user_id="demo_user",
        session_id="session_1",
        text="I care about turn-level R@5 for LongMemEval.",
    )
)

hits = memory.retrieve(
    MemoryQuery(
        user_id="demo_user",
        query="What metric do I care about?",
        top_k=3,
        mode="fused",
        rerank="trigraph",
    )
)
print(memory.format_context(hits))

Run the included example:

python edgemem_memory_plugin/examples/basic_usage.py

Quick Start

Package smoke tests after installation:

python -m spacy download en_core_web_sm
edgemem-memory-demo
edgemem-smoke-longmemeval --help
edgemem-smoke-locomo --help

If the spaCy model download is blocked, the package can still run basic smoke tests with spaCy's blank English fallback, but NER quality will be lower.

Timeline-only baseline:

python -m edgemem.cli.answer \
  --dataset evaluation/data/locomo/locomo10.json \
  --mode timeline \
  --question "When did Caroline go to the LGBTQ support group?"

Fused timeline + tri-graph retrieval:

python -m edgemem.cli.answer \
  --dataset evaluation/data/locomo/locomo10.json \
  --mode fused \
  --rerank trigraph \
  --context-window 1 \
  --question "When did Caroline go to the LGBTQ support group?"

Run the LoCoMo evaluation:

python -m edgemem.cli.evaluate \
  --dataset evaluation/data/locomo/locomo10.json \
  --mode fused \
  --rerank trigraph \
  --run-name locomo_all_trigraph_context \
  --max-workers 10 \
  --context-window 1 \
  --verbose

Run LoCoMo retrieval-only detailed metrics without answer generation:

python -m edgemem.cli.evaluate_locomo_retrieval \
  --dataset evaluation/data/locomo/locomo10.json \
  --mode fused \
  --rerank trigraph \
  --top-k 5 \
  --ks 1,3,5,10 \
  --context-window 1 \
  --run-name locomo_retrieval_detailed \
  --verbose

This writes turn/session/context retrieval metrics to results/locomo_retrieval_summary_<run-name>.json.

Convenience smoke test wrapper:

edgemem-smoke-locomo \
  --dataset evaluation/data/locomo/locomo10.json \
  --limit 5

Add --with-llm plus LLM_API_KEY/LLM_BASE_URL/LLM_MODEL to generate answers and run LLM-as-judge.

Run EdgeMem retrieval on LongMemEval-S cleaned:

python -m edgemem.cli.evaluate_longmemeval \
  --dataset longmemeval/longmemeval-data-cleaned/data/longmemeval_s_cleaned.json \
  --mode fused \
  --rerank trigraph \
  --retrieval-top-k 50 \
  --generation-top-k 5 \
  --context-window 1 \
  --run-name longmemeval_s_cleaned_edgemem \
  --verbose

This command evaluates retrieval without calling an LLM. Add --generate to also write results/longmemeval_hypotheses_<run-name>.jsonl with {question_id, hypothesis} lines for the official LongMemEval QA evaluator.

Convenience smoke test wrapper:

edgemem-smoke-longmemeval \
  --dataset longmemeval/longmemeval-data-cleaned/data/longmemeval_s_cleaned.json \
  --limit 3

Dataset pages:

LoCoMo: https://github.com/snap-stanford/locomo
LongMemEval: https://github.com/xiaowu0162/LongMemEval

Visual Demo

Open this file directly in a browser:

docs/edgemem_visual_demo.html

To export a real retrieval trace from the current code:

python scripts/export_edgemem_trace.py \
  --dataset evaluation/data/locomo/locomo10.json \
  --conversation-index 0 \
  --qa-index 0 \
  --top-k 5 \
  --max-nodes 80 \
  --output examples/traces/edgemem_trace_demo.json

Notes for Open Source Release

my_config.py, .env, caches, generated results, benchmark datasets, and real API keys are intentionally ignored.
Download LoCoMo and LongMemEval data separately before running benchmark CLIs.
The core non-embedding retrieval path requires no LLM during ingestion or retrieval.
LLM calls are used by generator.py, eval_judge.py, embedding retrieval, and scripts that explicitly rejudge results.

Project details

Release history Release notifications | RSS feed

0.1.3

May 12, 2026

This version

0.1.2

May 12, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edgemem-0.1.2.tar.gz (44.6 kB view details)

Uploaded May 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

edgemem-0.1.2-py3-none-any.whl (58.1 kB view details)

Uploaded May 12, 2026 Python 3

File details

Details for the file edgemem-0.1.2.tar.gz.

File metadata

Download URL: edgemem-0.1.2.tar.gz
Upload date: May 12, 2026
Size: 44.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for edgemem-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`22f61f7418f0e3c31cb3839c10add38cda8f25b4a7f40b0d44874b1ab123a54d`
MD5	`d7a88df27facb177fcadd63dec55dac4`
BLAKE2b-256	`416e8c5b35087ec1bfe9128826224508cd104445550c4d72a7a98d422c6b9d1f`

See more details on using hashes here.

File details

Details for the file edgemem-0.1.2-py3-none-any.whl.

File metadata

Download URL: edgemem-0.1.2-py3-none-any.whl
Upload date: May 12, 2026
Size: 58.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for edgemem-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4c4b0e68bfcf8d5d64ef4943bed1e805272081da20cf65fe5f7c1c84228c8a94`
MD5	`65146e9dd596b2d1fd3613068ee8e378`
BLAKE2b-256	`7deb82aa4a5d99728272be0b1484838a44752b5852e29002068f4d173cbc10f9`

See more details on using hashes here.

edgemem 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

EdgeMem

Highlights

Project Layout

Installation

Configuration

Python Memory Plugin

Quick Start

Visual Demo

Notes for Open Source Release

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes