Fragmented memory system for Hermes Agent — automatically retrieve relevant memory fragments and inject them into conversation context

These details have not been verified by PyPI

Project description

Fragmented Memory Plugin for Hermes Agent

The Fragmented Memory system automatically retrieves relevant memory fragments and injects them into the conversation context for each dialogue.

User: "How did we set up that React project structure last time?"
                      ↓
         Fragmented Memory System     ← Redis + RediSearch
                      ↓
    ┌─────────────────────────────────────┐
    │  [1] User prefers TypeScript + Vite    │
    │  [2] Previous projects used pinia state management │
    │  [3] Backend suggested using .NET 10 implementation       │
    └─────────────────────────────────────┘
                      ↓
         Model directly uses fragments to answer

Features

Semantic Splitting — auto-split conversations into standalone fragments
BM25 Full-Text Search — works out of the box with no external API
Optional Vector Search — KNN via RediSearch (OpenAI / DashScope embedder)
Time Decay — newer fragments rank higher (60-day half-life configurable)
Sentiment Weighting — emotional fragments get priority
User Feedback — mark fragments useful/useless to improve ranking
Hot Topic Boost — frequently discussed topics rank higher
Full Memory Injection — each fragment traces back to its complete original text, shown inline as "(full memory: ...)"
Associative Recall — after retrieving a fragment's full memory, the system searches again to find more related fragments
Entity Extraction — auto-tags fragments with entities (people, places, crypto tickers, domain terms) at store time; searched alongside content text for higher recall
Entity Co-occurrence — auto-track which entities appear together, expand search to co-occurring entities for associative recall ("BTC" → also finds fragments mentioning "缠论")
Domain Dictionary — jieba user dictionary auto-generated from fragment corpus + synonym table, loaded on /new for better Chinese tokenization
Workflow Lock — set fragmented:workflow_lock in Redis to globally disable memory retrieval (e.g. during automated workflows)
Skip Patterns — define skip lists (via file) to avoid searching on trivial queries like "ok", "got it"
Auto Sync — every turn is archived automatically, memory tool writes are synced
Search-Time Expiry — invalid_at field in index: set a timestamp and the fragment is filtered out at search time (no data loss, can be reverted)
Auto Maintenance — consolidation (keyword clustering + LLM summarization) + selective forgetting (multi-dimension low-value detection) run every 2h to keep storage tidy

Design Philosophy: Brain-like Memory

Fragmented Memory takes a different approach. It's modeled after how the human brain actually remembers:

Brain Mechanism	Implementation
Forgetting Curve	Time decay (60-day half-life) — old memories fade naturally
Emotion Deepens Memory	Emotional weight boost — intense moments stick
Repetition Reinforces	Attention tracking + hot topic scoring
Correction Works	Correction detection — user says "no", the wrong memory is suppressed
Use It or Lose It	Feedback reinforcement (frag_memory_feedback)
Association & Analogy	Synonym discovery (Jaccard co-occurrence statistics) — "deploy" ↔ "release"
Entity Association	Entity co-occurrence tracking — fragments mentioning "BTC" also recall "halving" without being synonyms
Fragmented Storage	Split conversations into atomic pieces, not full transcripts
Fragment Lineage	Each fragment links back to its full original text — "fragment A reminds me of full conversation B"
Associative Recall	Search a fragment → trace to full memory → search again for more related fragments
Entity Tagging	Like the brain tagging memories with people/places/things — auto-extracted entities searched alongside content
Sleep Consolidation	Background maintenance every 2h: keyword-based clustering + LLM summarization + synonym discovery at 3 AM via cron
Context Isolation	agent_id tagging — different identities, separate memories
Fuzzy but Enough	BM25 full-text search — doesn't need an exact match to recall

No vector database. No embedding API calls. No LLM inference for memory operations. Just pure statistical methods running on Redis + RediSearch — the same techniques the brain uses: frequency, recency, emotional salience, association, and feedback.

Requirements

Python 3.10+
Hermes Agent 0.12+ — provides MemoryProvider interface
Redis 7+ — with RediSearch module (v2.6+)
jieba — Chinese tokenization (auto-installed)
Embedding API (optional) — OpenAI / DashScope / any compatible /v1/embeddings service

Installation

pip install fragmented-memory

Or install directly from GitHub:

pip install git+https://github.com/j-zly/fragmented-memory.git

Configuration

Configuration precedence (high to low): Environment variables > JSON config file > config.yaml inline > defaults

1. Configuration Methods

There are three ways to configure Fragmented Memory, listed in order of priority:

Environment Variables (Highest precedence)
Set environment variables like FRAGMENTED_REDIS_HOST, FRAGMENTED_REDIS_PASSWORD, etc.
JSON Config File (~/.config/fragmented-memory/config.json)
A complete JSON configuration file for all settings.
Code Defaults (Lowest precedence)
Default values defined in the code.

2. Complete Configuration Example

Here's a comprehensive example of the configuration file ~/.config/fragmented-memory/config.json with all available options:

{
  // Redis 连接配置
  "redis_host": "127.0.0.1",
  "redis_port": 6379,
  "redis_password": "",
  
  // 搜索相关配置
  "top_k": 5,
  "candidate_k": 10,
  "bm25_limit": 10,
  "tag_filter": "",

  // 跳过检索配置
  "skip_min_length": 2,
  "skip_patterns_file": "~/.config/fragmented-memory/skip_patterns.txt",

  // 时间衰减配置
  "decay_half_days": 60,
  "hot_topic_decay_half_days": 30,
  
  // 排序权重配置
  "sentiment_boost_positive": 1.5,
  "sentiment_boost_negative": 1.3,
  "feedback_positive_boost": 1.3,
  "feedback_negative_penalty": 0.5,
  "hot_topic_boost": 1.2,
  
  // 注意力机制配置
  "attention_boost_max": 1.5,
  "attention_base_increment": 2.0,
  "attention_emotion_factor": 1.5,
  
  // 嵌入配置（可选）
  "embedder": {
    "provider": "dashscope",
    "api_key": "sk-xxx",
    "base_url": "https://dashscope.aliyuncs.com/compatible-mode/v1",
    "model": "text-embedding-v2"
  },
  
  // 自动维护配置
  "consolidate_min_group": 2,
  "consolidate_max_age_hours": 72,
  "forget_max_age_days": 30,
  "full_max_age_days": 60,
  "forget_dry_run": false,   // false = actually delete low-value fragments

  // 代理隔离配置
  "agent_id": "main-brain",
  "is_primary": true,

  // 同义词发现配置
  "synonym_min_word_freq": 10,
  "synonym_jaccard_threshold": 0.5,
  "synonym_min_co_occurrence": 3,

  // 实体共现配置
  "entity_cooc_top_n": 3,
  "entity_cooc_min_count": 2,
  
  // 情感强度因子
  "emotion_intensity_factor": 0.4
}

Note: Redis password compatibility: leave empty for no authentication, or provide password to automatically send AUTH command.

3. Environment Variables Reference

Environment Variable	Corresponding Config Item	Description
`FRAGMENTED_REDIS_HOST`	`redis_host`	Redis server host
`FRAGMENTED_REDIS_PORT`	`redis_port`	Redis server port
`FRAGMENTED_REDIS_PASSWORD`	`redis_password`	Redis password for authentication
`FRAGMENTED_TOP_K`	`top_k`	Number of final fragments returned
`FRAGMENTED_CANDIDATE_K`	`candidate_k`	Candidate fragments count (for KNN)
`FRAGMENTED_BM25_LIMIT`	`bm25_limit`	BM25 search candidate count
`FRAGMENTED_TAG_FILTER`	`tag_filter`	Tag filtering (comma-separated)
`FRAGMENTED_DECAY_HALF_DAYS`	`decay_half_days`	Time decay half-life (days)
`FRAGMENTED_HOT_TOPIC_DECAY_HALF_DAYS`	`hot_topic_decay_half_days`	Hot topic time decay half-life (days)
`FRAGMENTED_EMBED_CACHE_TTL`	`embed_cache_ttl`	Embedding cache TTL (seconds)
`FRAGMENTED_EMBEDDER`	`embedder.provider`	Embedding provider (`openai`, `dashscope`)
`FRAGMENTED_EMBEDDER_URL`	`embedder.base_url`	Embedding API endpoint
`FRAGMENTED_EMBEDDER_MODEL`	`embedder.model`	Embedding model name
`FRAGMENTED_CONSOLIDATE_MIN_GROUP`	`consolidate_min_group`	Minimum fragments to trigger consolidation
`FRAGMENTED_CONSOLIDATE_MAX_AGE_HOURS`	`consolidate_max_age_hours`	Minimum age (hours) before fragments can be consolidated
`FRAGMENTED_FORGET_MAX_AGE_DAYS`	`forget_max_age_days`	Number of days before fragments might be forgotten
`FRAGMENTED_FULL_MAX_AGE_DAYS`	`full_max_age_days`	Number of days before full memories might be forgotten
`FRAGMENTED_FORGET_DRY_RUN`	`forget_dry_run`	Safe mode: `true` = count only, `false` = actually delete
`FRAGMENTED_EMOTION_INTENSITY_FACTOR`	`emotion_intensity_factor`	Emotion intensity → weight coefficient (0=disabled, 1=max)

Note: Redis password is compatible with empty value (no auth) or password provided for AUTH command.
Note: Changes to config.json take effect immediately without restarting (just send /new).

4. Create Redis Index (first-time usage)

The code will auto-create (ensure_index()), or execute manually:

redis-cli FT.CREATE idx:memories ON HASH PREFIX 1 "memory:frag:" SCHEMA \
    content TEXT WEIGHT 1 \
    tags TAG SEPARATOR "," \
    category TAG SEPARATOR "," \
    source TEXT WEIGHT 1 \
    created TEXT WEIGHT 0 \
    fragment_type TAG SEPARATOR "," \
    invalid_at TAG SEPARATOR "," \
    entities TAG SEPARATOR "," \
    embed_bin VECTOR FLAT 6 TYPE FLOAT32 DIM 1536 DISTANCE_METRIC COSINE

Dimension (DIM) is dynamically adjusted based on the embedding model used, default 1536. For Docker: docker run -d --name redis-stack -p 6379:6379 redis/redis-stack:latest

5. Hermes Configuration

Enable in ~/.hermes/config.yaml:

memory:
  provider: fragmented

If embedder is not configured, only BM25 full-text search mode will be used.

Also supports environment variable configuration (highest precedence):

export FRAGMENTED_REDIS_HOST=127.0.0.1
export FRAGMENTED_REDIS_PORT=6379
export FRAGMENTED_TOP_K=5
export FRAGMENTED_EMBEDDER=dashscope
export FRAGMENTED_EMBEDDER_MODEL=text-embedding-v2
export OPENAI_API_KEY=sk-xxx        # embedder API key

6. Workflow Lock

Temporarily disable memory retrieval during automated workflows (like batch processing):

# Lock (3600s TTL)
redis-cli SET fragmented:workflow_lock 1 EX 3600

# Unlock
redis-cli DEL fragmented:workflow_lock

7. Skip Patterns File

Create a file (one pattern per line, # for comments):

# ~/.config/fragmented-memory/skip_patterns.txt
好的
嗯
对
是
哦
可以
没错
ok
okay
yes
yeah

Then reference it in config.json:

{
  "skip_min_length": 2,
  "skip_patterns_file": "~/.config/fragmented-memory/skip_patterns.txt"
}

8. Restart Gateway

# For CLI mode, restart session is sufficient
# For Gateway mode, restart the process

Configuration Reference

Config Item	Environment Variable	Default Value	Description
`redis_host`	`FRAGMENTED_REDIS_HOST`	`127.0.0.1`	Redis address
`redis_port`	`FRAGMENTED_REDIS_PORT`	`6379`	Redis port
`top_k`	`FRAGMENTED_TOP_K`	`5`	Number of final fragments returned
`candidate_k`	`FRAGMENTED_CANDIDATE_K`	`10`	Candidate fragments count (for KNN)
`tag_filter`	`FRAGMENTED_TAG_FILTER`	`""`	Tag filtering (comma-separated)
`bm25_limit`	`FRAGMENTED_BM25_LIMIT`	`10`	BM25 search candidate count
`decay_half_days`	`FRAGMENTED_DECAY_HALF_DAYS`	`60`	Time decay half-life (days)
`embed_cache_ttl`	`FRAGMENTED_EMBED_CACHE_TTL`	`3600`	Embedding cache TTL (seconds)
`sentiment_boost_positive`	—	`1.5`	Positive fragment weight multiplier
`sentiment_boost_negative`	—	`1.3`	Negative fragment weight multiplier
`feedback_positive_boost`	—	`1.3`	Positive feedback bonus weight
`feedback_negative_penalty`	—	`0.5`	Negative feedback penalty coefficient
`hot_topic_boost`	—	`1.2`	Hot topic weighting multiplier
`embedder.provider`	`FRAGMENTED_EMBEDDER`	`openai`	`openai` / `dashscope`
`embedder.api_key`	`OPENAI_API_KEY`	—	Embedding API key
`embedder.base_url`	`FRAGMENTED_EMBEDDER_URL`	`https://api.openai.com/v1`	API endpoint
`embedder.model`	`FRAGMENTED_EMBEDDER_MODEL`	`text-embedding-3-small`	Embedding model name
`consolidate_min_group`	—	`2`	Minimum fragments to trigger consolidation
`consolidate_max_age_hours`	—	`72`	Minimum age (hours) before fragments can be consolidated
`forget_max_age_days`	—	`30`	Number of days before fragments might be forgotten
`full_max_age_days`	—	`60`	Number of days before full memories (memory:full:*) might be forgotten
`forget_dry_run`	—	`true`	Safe mode for forgetting: `true` = count only, `false` = actually delete
`agent_id`	—	`""`	Agent identity tag for memory isolation (e.g. `"main-brain"`)
`is_primary`	—	`false`	When `true`, agent sees all fragments; `false` = only tagged ones
`hot_topic_decay_half_days`	—	`30`	Hot topic time decay half-life (days)
`emotion_intensity_factor`	—	`0.4`	Emotion intensity → weight coefficient (0=disabled, 1=max)
`skip_min_length`	—	`2`	Minimum query length to trigger search
`skip_patterns_file`	—	`""`	Path to file containing skip patterns (one per line, # for comments)
`attention_boost_max`	—	`1.5`	Maximum attention weighting value
`attention_base_increment`	—	`2.0`	Base attention increment per mention
`attention_emotion_factor`	—	`1.5`	Emotion intensity amplification factor for attention
`synonym_min_word_freq`	—	`10`	Minimum fragment frequency for word to be considered
`synonym_jaccard_threshold`	—	`0.5`	Jaccard similarity threshold for synonym detection
`synonym_min_co_occurrence`	—	`3`	Minimum co-occurrence count for synonym detection
`entity_cooc_top_n`	—	`3`	Number of co-occurring entities to expand search with
`entity_cooc_min_count`	—	`2`	Minimum co-occurrence count for entity association

sentiment_*, feedback_*, hot_topic_* and other ranking weight parameters currently only support configuration through JSON config file, not environment variables. Set to 1.0 to disable the effect of that dimension.

Embedding Models and Dimensions

Model	Dimensions
OpenAI text-embedding-3-small	1536
OpenAI text-embedding-3-large	3072
OpenAI text-embedding-ada-002	1536
DashScope text-embedding-v2	1536
DashScope text-embedding-v3	1024

Dimensions are automatically detected, switching models doesn't require reconfiguration.

Synonym Table

Stored in Redis Hash fragmented:synonyms, expanded at search time to improve recall:

redis-cli HSET fragmented:synonyms setup '["install","configure","deploy","setup"]'
redis-cli HSET fragmented:synonyms fix '["fix","modify","correct","repair","solve"]'

Verification

Check logs after startup:

Memory provider 'fragmented' registered (0 tools)
fragmented: connected (session=xxx, top_k=5, tag_filter=(none))
fragmented: BM25-only mode (no embedder configured)

Architecture

┌────────────────────────────────────────────────────────┐
│                    User sends message                  │
└──────────────────┬─────────────────────────────────────┘
                   │
         ┌─────────▼─────────┐
         │   prefetch()       │  ← Automatically triggered on every user message
         │   ↓                │
         │  Workflow Lock?    │  ← Checks fragmented:workflow_lock
         │   ↓                │
         │  Skip patterns?    │  ← Length / exact match against skip list
         │   ↓                │
         │  BM25 Full-Text Search │  ← Default, zero cost, triggered by user message keywords
         │  (KNN Vector search) │  ← Optional (needs embedder)
         │  Entity co-occurrence │  ← Expand query with co-occurring entities
         │   ↓                │
         │  Six-dimensional Re-ranking │  ← Similarity × Time decay
         │                    │    × Emotion × Feedback × Hot Topic × Attention
         │   ↓                │
         │  Full Memory Injection     │  ← Top 3 fragments → trace full text → show inline
         │  Associative Recall        │  ← Full text → search again → dedup append
         │   ↓                │
         │  Top N Injected into Context   │  ← Search results injected as code block
         └─────────┬─────────┘
                   │
         ┌─────────▼─────────┐
         │   Model Response   │  ← Fragments can be used as reference
         └───────────────────┘
                   │
         ┌─────────▼─────────┐
         │   sync_turn()      │  ← Automatically archive at end of conversation
         │   Store Full Text  │  ← memory:full:{hash} for fragment lineage
         │   ↓                │
         │   Smart Sentence Splitting      │  ← Protect abbreviations/numbers/quotes
         │   Entity Extraction        │  ← jieba + regex → entities TAG field
         │   Entity Co-occurrence     │  ← Track entity pairs in ZSET
         │   Attention Tracking        │  ← Extract keywords and increase attention score
         │   ↓                │
         │   Stored in Redis        │  ← Available for next retrieval
         └───────────────────┘
                   │
         ┌─────────▼─────────┐
         │   [cron] Every 2h     │  ← Background maintenance
         │   ① Multi-level Consolidation  │  ← Same topic → keyword clustering → LLM → level+1
         │   ② Selective Forgetting  │  ← Age>30d + no feedback + low emotion + low attention → actual delete
         └───────────────────┘

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

2.1.0

Jun 12, 2026

2.0.2

Jun 8, 2026

2.0.1

Jun 7, 2026

2.0.0

Jun 7, 2026

1.0.0

Jun 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fragmented_memory-2.1.0-py3-none-any.whl (51.3 kB view details)

Uploaded Jun 12, 2026 Python 3

File details

Details for the file fragmented_memory-2.1.0-py3-none-any.whl.

File metadata

Download URL: fragmented_memory-2.1.0-py3-none-any.whl
Upload date: Jun 12, 2026
Size: 51.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for fragmented_memory-2.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9f30045ef0b577609a217ec1bb9ae15241694a8c7805f8e5b9e011e34ee4f66f`
MD5	`c004ba87b2770f4b485e96f07e8e0686`
BLAKE2b-256	`730384cb2684155df12fc07737890419e538f8422adc41a1f0fdc47cf13b3288`

See more details on using hashes here.

fragmented-memory 2.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Fragmented Memory Plugin for Hermes Agent

Features

Design Philosophy: Brain-like Memory

Requirements

Installation

Configuration

1. Configuration Methods

2. Complete Configuration Example

3. Environment Variables Reference

4. Create Redis Index (first-time usage)

5. Hermes Configuration

6. Workflow Lock

7. Skip Patterns File

8. Restart Gateway

Configuration Reference

Embedding Models and Dimensions

Synonym Table

Verification

Architecture

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes