Skip to main content

Persistent semantic memory system for OpenCode sessions

Project description

opencode-memory

Persistent semantic memory system for OpenCode AI assistant sessions.

Overview

opencode-memory is an MCP (Model Context Protocol) server that provides long-term memory capabilities for OpenCode sessions. It automatically learns from your workflow, stores decisions and procedures, and provides relevant context when needed.

Key benefits:

  • Remember decisions, blockers, procedures, and facts across sessions
  • Semantic search finds relevant memories even with different wording
  • Session coordination prevents conflicts when running multiple OpenCode instances
  • Project-scoped directives for per-project instructions
  • Zero manual effort - learns passively from your workflow

Features

  • Hybrid retrieval: Combines full-text search (FTS5) with vector similarity search
  • Instant storage: Memories stored immediately, embeddings computed async
  • Session coordination: Tracks active sessions, supports item claiming to prevent conflicts
  • Contextual directives: Global + project-specific instructions loaded at boot
  • Entity tracking: Links memories to MRs, issues, epics (!123, #456, &789)
  • GitLab enrichment: Fetches metadata for entities from GitLab API
  • Memory age: All outputs show age to identify potentially outdated information

Installation

Quick Install (Recommended)

Works on both Linux and macOS:

curl -fsSL https://gitlab.com/ghavenga/opencode-memory/-/raw/master/scripts/setup.sh | bash

This will:

  • Clone the repository to ~/.local/share/opencode-memory-install
  • Create a virtual environment and install dependencies
  • Download the embedding model (~90MB, one-time, runs locally)
  • Set up a background service:
    • Linux: systemd user service
    • macOS: launchd LaunchAgent
  • Configure OpenCode integration
  • Create ~/.config/opencode/AGENTS.md with memory bootstrap
  • Optionally bootstrap core directives (requires confirmation due to LLM costs)

Cost Information

Default background processing is free - The memory daemon uses:

  • Local embedding model (sentence-transformers, no API calls)
  • Pattern-based extraction (regex, no LLM)
  • Session summaries (stores conversations as-is, no LLM processing)

LLM-based knowledge extraction is OFF by default - There is an optional feature that uses opencode run to analyze old conversations and extract procedures, decisions, and facts. This:

  • Runs every 6 hours, processing up to 50 conversations per cycle
  • Makes LLM API calls for each conversation (significant cost potential)
  • Is disabled by default to avoid unexpected charges

To enable LLM extraction (if you want it), add to ~/.config/opencode-memory/config.toml:

[ingestion]
llm_extraction = true

Optional directive bootstrapping - The setup script can create initial directives using opencode inline (~4 LLM API calls). This prompts for confirmation and defaults to 'no' when running non-interactively.

To skip directive bootstrapping entirely:

SKIP_BOOTSTRAP=1 curl -fsSL .../setup.sh | bash

From Source (Manual)

# Clone the repository
git clone https://gitlab.com/ghavenga/opencode-memory.git
cd opencode-memory

# Create virtual environment
python -m venv .venv
source .venv/bin/activate

# Install dependencies
pip install -e ".[dev]"

# Download the embedding model (runs once, ~90MB)
python -c "from sentence_transformers import SentenceTransformer; SentenceTransformer('all-MiniLM-L6-v2')"

From PyPI

pip install opencode-semantic-memory

Quick Start

1. Start the Memory Server

The recommended setup runs a shared HTTP server that all OpenCode sessions connect to:

# Start manually
source .venv/bin/activate
python -m opencode_memory.http_server

Or install as a background service (auto-starts on login):

Linux (systemd):

# Copy and customize the service file
cp opencode-memory.service ~/.config/systemd/user/

# Edit to match your paths
nano ~/.config/systemd/user/opencode-memory.service

# Enable and start
systemctl --user daemon-reload
systemctl --user enable --now opencode-memory

# Check status
systemctl --user status opencode-memory

macOS (launchd):

# Copy the plist file
cp com.opencode.memory.plist ~/Library/LaunchAgents/

# Edit to match your paths (replace $HOME with actual path)
nano ~/Library/LaunchAgents/com.opencode.memory.plist

# Create log directory
mkdir -p ~/.local/state/opencode-memory

# Load the service
launchctl load ~/Library/LaunchAgents/com.opencode.memory.plist

# Check status
launchctl list | grep opencode

2. Configure OpenCode

Add to ~/.config/opencode/opencode.json:

{
  "mcp": {
    "memory": {
      "type": "remote",
      "url": "http://localhost:9824/mcp"
    }
  }
}

3. Configure the Agent

Add to ~/.config/opencode/AGENTS.md:

# Agent Instructions

At session start, always call `memory_get_boot_context` to load prior context, active sessions, and blockers.

## Memory System

The memory system provides persistent context across sessions.

### Proactive Usage

Call `memory_remember` when:
- You make a decision about approach
- You hit a blocker
- You learn how to do something
- You discover an interesting fact

Call `memory_recall(query)` before:
- Using an unfamiliar API
- Making assumptions about how something works

Call `memory_get_context(entity_ref)` before working on any MR/issue/epic.

Categories: decision, blocker, procedure, fact, event, directive

Configuration

Create ~/.config/opencode-memory/config.toml:

[identity]
user = "your-gitlab-username"  # Optional, auto-detected from git
instance = "gitlab.com"

[boot]
identity = true
active_sessions = true
hot_items = true
unresolved_blockers = true
recent_decisions = false
max_hot_items = 5

[ingestion]
watch_paths = ["~/.local/share/opencode/opencode.db"]
db_poll_interval = 30
llm_extraction = false
working_directory = "/path/to/your/projects"

[storage]
path = "~/.local/share/opencode-memory"

MCP Tools

Note: Tools are registered with the memory_ prefix when accessed through the "memory" MCP server. For example, recall becomes memory_recall in OpenCode sessions.

Core Memory Tools

Tool Description
memory_recall(query, limit?, project?, category?, compact?) Semantic search across memories
memory_remember(content, category, what?, why?, learned?, entities?, project?) Store a memory
memory_get_context(entity_ref) Get all memories for !MR, #issue, or &epic
memory_get_boot_context() Load startup context (identity, blockers, directives)
memory_get_linked_memories(memory_id, link_types?) Get memories linked to a specific memory

Session Coordination

Tool Description
memory_session_start(session_id, working_on?) Register session
memory_session_end(session_id, summary?) End session with summary
memory_session_heartbeat(session_id) Keep session alive
memory_get_active_sessions() List active sessions
memory_claim_item(session_id, item_ref) Claim exclusive ownership
memory_release_item(session_id, item_ref) Release claimed item

Memory Management

Tool Description
memory_resolve_blocker(memory_id) Mark blocker as resolved
memory_unresolve_blocker(memory_id) Reopen a blocker
memory_archive_memory(memory_id, reason?) Archive outdated memory (soft delete)
memory_delete_memory(memory_id, also_delete_vector?) Permanently delete a memory
memory_edit_memory(memory_id, content?, what?, why?, learned?) Edit memory content or metadata
memory_bulk_archive(memory_ids?, category?, older_than_days?, reason) Archive multiple memories
memory_consolidate_memory(days_stale?, project?) Find stale/duplicate memories
memory_search_history(query, category?, limit?) Search with category filter

Backup & Transfer

Tool Description
memory_export_memories(output_path?, project?, categories?, since_days?) Export to JSON
memory_import_memories(input_path, dry_run?, skip_duplicates?) Import from JSON

Utilities

Tool Description
memory_ingest_file(file_path) Manually ingest a markdown file
memory_enrich_entity(entity_ref, project?) Fetch GitLab metadata
memory_bootstrap_memory(path?) Scan project files for initial facts
memory_log_session(summary, learnings?, entities?) Log session summary
memory_memory_status() Check system health and queue status

Memory Categories

Category Use For
decision Architectural choices, design decisions
blocker Obstacles preventing progress
procedure How-to knowledge, workflows
fact Project-specific information
event Significant occurrences
directive Always-on instructions (global or project-scoped)
plan Long-term goals and strategies
idea Future possibilities, deferred considerations
conversation Full conversation content (auto-generated)

Storage

All data is stored locally:

  • SQLite: ~/.local/share/opencode-memory/memory.db - Memories, entities, sessions, FTS index
  • LanceDB: ~/.local/share/opencode-memory/vectors/ - Vector embeddings for semantic search

The daemon automatically:

  • Cleans up old resolved blockers (>90 days)
  • Archives old conversations (>180 days)
  • Compacts LanceDB versions (keeps last 10)

Environment Variables

Variable Description
GITLAB_TOKEN Enable GitLab entity enrichment
HF_HUB_OFFLINE=1 Prevent model downloads (use cached)
TRANSFORMERS_OFFLINE=1 Prevent model downloads
OPENCODE_MEMORY_HOST HTTP server bind address (default: 127.0.0.1)
OPENCODE_MEMORY_PORT HTTP server port (default: 9824)
OPENCODE_MEMORY_API_KEY Optional API key for authentication
OPENCODE_MEMORY_RATE_LIMIT Requests per minute per client (default: 60)

Development

# Activate environment
source .venv/bin/activate

# Run tests
python -m pytest tests/ -v

# Lint
ruff check src/

# Type check
mypy src/

# Check memory stats
python -m opencode_memory.cli stats

Architecture

┌─────────────────────────────────────────────────────────────────┐
│                      opencode-memory                            │
├─────────────────────────────────────────────────────────────────┤
│  MCP Server (HTTP/stdio)                                        │
│  └── 25+ tools for memory management                            │
├─────────────────────────────────────────────────────────────────┤
│  Background Daemon                                              │
│  ├── OpenCode DB Observer (polls for new sessions)              │
│  ├── Entity Enrichment (GitLab API)                             │
│  └── Cleanup (archives old memories, compacts vectors)          │
├─────────────────────────────────────────────────────────────────┤
│  Storage Layer                                                  │
│  ├── SQLite + FTS5 (memories, entities, sessions)               │
│  └── LanceDB (vector embeddings)                                │
├─────────────────────────────────────────────────────────────────┤
│  Embedding Engine                                               │
│  └── all-MiniLM-L6-v2 (384-dim, runs in subprocess)             │
└─────────────────────────────────────────────────────────────────┘

Operations

Log Rotation

When running as a systemd service, logs are managed by journald. To configure log rotation:

# Check current log size
journalctl --user -u opencode-memory --disk-usage

# Set max journal size (add to ~/.config/systemd/user.conf or override)
# Or create ~/.config/systemd/journald.conf.d/opencode-memory.conf:
cat > ~/.config/systemd/journald.conf.d/opencode-memory.conf << 'EOF'
[Journal]
SystemMaxUse=100M
MaxRetentionSec=7d
EOF

# Or manually vacuum old logs
journalctl --user --vacuum-time=7d
journalctl --user --vacuum-size=100M

For file-based logging, configure rotation in the systemd service:

[Service]
StandardOutput=append:/var/log/opencode-memory/server.log
StandardError=append:/var/log/opencode-memory/error.log

Then use logrotate (/etc/logrotate.d/opencode-memory):

/var/log/opencode-memory/*.log {
    daily
    rotate 7
    compress
    delaycompress
    missingok
    notifempty
    create 0640 $USER $USER
}

Monitoring

The HTTP server exposes several endpoints:

Endpoint Description
/health Basic health check (returns 200 if healthy, 503 if degraded)
/stats Detailed statistics (memories, cache, queue, links)
/metrics Prometheus-format metrics

Example monitoring with curl:

# Health check (for load balancers/monitoring)
curl -s http://localhost:9824/health | jq .

# Full stats
curl -s http://localhost:9824/stats | jq .

# Prometheus metrics
curl -s http://localhost:9824/metrics

CLI Tools

# Show comprehensive statistics
python -m opencode_memory.cli stats

# Ingest markdown files
python -m opencode_memory.cli ingest /path/to/notes --recursive

# Archive old memories
python -m opencode_memory.cli cleanup --dry-run

# Enrich entities with GitLab metadata
python -m opencode_memory.cli enrich --limit 100

Troubleshooting

Check service status

Linux:

systemctl --user status opencode-memory
journalctl --user -u opencode-memory -f

macOS:

launchctl list | grep opencode
tail -f ~/.local/state/opencode-memory/server.log
tail -f ~/.local/state/opencode-memory/server.error.log

Check memory system health

Use memory_status tool or:

python -c "
from opencode_memory.server import MemoryServer
import json
server = MemoryServer(enable_daemon=False)
print(json.dumps(server._get_status(), indent=2))
"

Prometheus metrics

The HTTP server exposes metrics at /metrics:

curl http://localhost:9824/metrics

Backup and restore

# Export all memories
memory_export_memories(output_path="/backup/memories.json")

# Import on new machine
memory_import_memories(input_path="/backup/memories.json", dry_run=True)  # Preview
memory_import_memories(input_path="/backup/memories.json")  # Actually import

Reset memory database

Linux:

rm -rf ~/.local/share/opencode-memory/
systemctl --user restart opencode-memory

macOS:

rm -rf ~/.local/share/opencode-memory/
launchctl unload ~/Library/LaunchAgents/com.opencode.memory.plist
launchctl load ~/Library/LaunchAgents/com.opencode.memory.plist

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

opencode_semantic_memory-0.1.0.tar.gz (11.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

opencode_semantic_memory-0.1.0-py3-none-any.whl (100.9 kB view details)

Uploaded Python 3

File details

Details for the file opencode_semantic_memory-0.1.0.tar.gz.

File metadata

  • Download URL: opencode_semantic_memory-0.1.0.tar.gz
  • Upload date:
  • Size: 11.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for opencode_semantic_memory-0.1.0.tar.gz
Algorithm Hash digest
SHA256 e73f36714766b7e6bc1dcb344d270089de4196df92dddd0b476b59eb1ef2892c
MD5 7a2b70ccdb4583e894a370ce692cbfca
BLAKE2b-256 dd149640c5f056f6d620e7957e45a87ce033cd2906a81c3514868bd63ddb9d1a

See more details on using hashes here.

File details

Details for the file opencode_semantic_memory-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for opencode_semantic_memory-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7ea5b22baa25dd1adc0adb5f41519ec247d04cf9589a3b6dd016c4caa03a4378
MD5 cdbaaf53a5074dee15fcabf9cc4911fe
BLAKE2b-256 86ce8cc4178c3f21baea0651eec26565d468c66603798115cb836133c98aa522

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page