Unified AI memory layer for coding assistants

These details have not been verified by PyPI

Project links

Project description

Oghma

Persistent memory for AI coding assistants.

Every time you close a session with Claude Code, Codex, or OpenCode, the discoveries you made disappear. That workaround you found for sqlite-vec? Gone. The API quirk that cost you an hour? Forgotten. Next week, you'll hit the same wall again.

Oghma watches your transcript files, extracts the non-obvious learnings via LLM, deduplicates against what it already knows, and makes everything searchable — keyword, semantic, or hybrid.

How it works

┌─────────────┐     ┌───────────┐     ┌──────────┐     ┌────────────┐
│ Transcripts │────▶│  Extract  │────▶│  Dedup   │────▶│   Store    │
│ (JSONL)     │     │  (LLM)    │     │ (cosine) │     │ (SQLite)   │
└─────────────┘     └───────────┘     └──────────┘     └────────────┘
  Claude Code            │                                    │
  Codex                  │                                    ▼
  OpenCode          Categories:                    ┌──────────────────┐
                    - gotcha                       │  Search (MCP)    │
                    - learning                     │  keyword / vec / │
                    - workflow                     │  hybrid (RRF)    │
                    - preference                   └──────────────────┘
                    - project_context

A background daemon polls for new/changed transcripts, sends chunks to an LLM for extraction, embeds the results, checks for semantic duplicates, and stores what's genuinely new. Your AI assistant queries this via MCP — so it remembers what you've learned across every session and every tool.

Features

Multi-tool extraction — Parses transcripts from Claude Code, Codex, OpenCode (and OpenClaw)
LLM-powered filtering — Configurable model with a tuned prompt that extracts gotchas and workarounds while filtering noise like "the user prefers Python"
Hybrid search — SQLite FTS5 + sqlite-vec fused via Reciprocal Rank Fusion with recency boost
Inline dedup — New memories are checked against existing embeddings before insertion. Duplicates never enter the DB.
MCP server — Plug into Claude Code, Cursor, or any MCP-compatible client
Maintenance CLI — Semantic dedup, noise purge, staleness pruning, memory promotion
Export — Markdown or JSON, grouped by category, date, or source

Quick start

# Install
pip install oghma

# Or from source
git clone https://github.com/terry-li-hm/oghma.git
cd oghma
pip install -e ".[dedup]"

# Set API keys
export OPENAI_API_KEY=sk-...          # for embeddings
export OPENROUTER_API_KEY=sk-or-...   # if using OpenRouter models for extraction

# Initialize and start
oghma init          # creates ~/.oghma/config.yaml
oghma start         # background daemon

Edit ~/.oghma/config.yaml to configure your extraction model, tool paths, and embedding settings.

MCP server

Add to your Claude Code config (~/.claude.json):

{
  "mcpServers": {
    "oghma": {
      "command": "uvx",
      "args": ["--from", "oghma", "oghma-mcp"]
    }
  }
}

This exposes five tools to your AI assistant:

Tool	Description
`oghma_search`	Search memories (keyword, vector, or hybrid)
`oghma_get`	Fetch a specific memory by ID
`oghma_add`	Add a memory directly (with auto-dedup)
`oghma_stats`	Memory counts by category and source
`oghma_categories`	List categories with counts

CLI reference

oghma init                  Create default config
oghma start [--foreground]  Start the extraction daemon
oghma stop                  Stop the daemon
oghma status [--json]       Daemon status and DB stats
oghma stats                 Memory counts by category/source

oghma search <query>        Search memories
  --mode keyword|vector|hybrid
  --category, --tool, --status, --limit

oghma dedup                 Find and remove semantic duplicates
oghma purge-noise           Remove memories matching noise patterns
oghma prune-stale           Delete memories older than N days
  --max-age-days 90
  --source-tool <name>

oghma promote <id>          Promote a memory to 'promoted' category
oghma export                Export to markdown or JSON
  --format, --group-by, --category

oghma validate-config       Check config for errors
oghma migrate-embeddings    Backfill embeddings for existing memories

All destructive commands default to --dry-run. Pass --execute to apply.

Configuration

~/.oghma/config.yaml:

daemon:
  poll_interval: 300          # seconds between checks
  min_messages: 6             # skip trivial sessions

extraction:
  model: google/gemini-3-flash-preview   # or gpt-4o-mini, deepseek/deepseek-chat, etc.
  confidence_threshold: 0.7
  dedup_threshold: 0.92       # cosine similarity — higher = stricter
  categories:
    - learning
    - preference
    - project_context
    - gotcha
    - workflow
    - promoted

embedding:
  provider: openai
  model: text-embedding-3-small
  dimensions: 1536

tools:
  claude_code:
    enabled: true
    paths:
      - ~/.claude/projects/-Users-*/*.jsonl
  codex:
    enabled: true
    paths:
      - ~/.codex/sessions/**/rollout-*.jsonl
  opencode:
    enabled: true
    paths:
      - ~/.local/share/opencode/storage/message/ses_*

Extraction models

Oghma supports any OpenAI or OpenRouter model:

Model	Provider	Quality	Cost
google/gemini-3-flash-preview	OpenRouter	Excellent	~$1.50/M tokens
gpt-4o-mini	OpenAI	Good	~$0.30/M tokens
deepseek/deepseek-chat-v3-0324	OpenRouter	Good	~$0.14/M tokens

Search modes

Mode	Engine	Best for
keyword	SQLite FTS5	Exact term matching, fast
vector	sqlite-vec (cosine similarity)	Conceptual/semantic search
hybrid	RRF fusion of both + recency boost	Best overall relevance

oghma search "async patterns" --mode hybrid --limit 20

How extraction works

The daemon sends conversation chunks to an LLM with a prompt engineered to extract only actionable insights:

Extracted: Tool gotchas, bug workarounds, API quirks, architecture decisions, error solutions, workflow patterns.

Filtered: Setup facts ("uses Python 3.12"), config restatements, assistant narration ("The AI suggested..."), trivially obvious observations.

Each memory gets a confidence score and a category. Post-extraction, regex noise patterns catch stragglers. Pre-insertion, embedding similarity catches duplicates. The result: your database grows with genuine insights, not noise.

Maintenance

# Recommended: run weekly via cron
oghma dedup --threshold 0.92 --execute
oghma purge-noise --execute

# Prune old memories from a retired tool
oghma prune-stale --max-age-days 90 --source-tool openclaw --execute

# Promote a frequently-useful memory
oghma promote 739

Adding a custom parser

Implement a parser with can_parse() and parse() methods:

from oghma.parsers import Message

class MyToolParser:
    def can_parse(self, file_path: Path) -> bool:
        return ".mytool" in str(file_path)

    def parse(self, file_path: Path) -> list[Message]:
        # Return list of Message(role="user"|"assistant", content="...")
        ...

Requirements

Python 3.10+
SQLite with FTS5 (included in most distributions)
sqlite-vec for vector search (optional, recommended)
OpenAI API key for embeddings
LLM API key for extraction (OpenAI or OpenRouter)

Environment variables

Variable	Required	Description
`OPENAI_API_KEY`	Yes	For embeddings (text-embedding-3-small)
`OPENROUTER_API_KEY`	If using OpenRouter	For Gemini, DeepSeek, etc.
`OGHMA_DB_PATH`	No	Override database path
`OGHMA_EXTRACTION_MODEL`	No	Override extraction model
`OGHMA_LOG_LEVEL`	No	DEBUG / INFO / WARNING / ERROR

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.6.3

Feb 24, 2026

0.6.2

Feb 11, 2026

0.6.1

Feb 11, 2026

0.6.0

Feb 11, 2026

This version

0.5.1

Feb 10, 2026

0.5.0

Feb 9, 2026

0.4.0

Feb 6, 2026

0.3.0

Feb 5, 2026

0.0.1

Feb 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oghma-0.5.1.tar.gz (53.5 kB view details)

Uploaded Feb 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

oghma-0.5.1-py3-none-any.whl (40.6 kB view details)

Uploaded Feb 10, 2026 Python 3

File details

Details for the file oghma-0.5.1.tar.gz.

File metadata

Download URL: oghma-0.5.1.tar.gz
Upload date: Feb 10, 2026
Size: 53.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for oghma-0.5.1.tar.gz
Algorithm	Hash digest
SHA256	`96450f12e9b4131259a06b8378ef826aabee752adde562719fab47b8d1fc9758`
MD5	`9e80e536ae3b73723ac24eb14d20334b`
BLAKE2b-256	`1f41c54b4f44a41265d383f2040ab6923c8228c971a4906a229a0840fb5e3bcd`

See more details on using hashes here.

File details

Details for the file oghma-0.5.1-py3-none-any.whl.

File metadata

Download URL: oghma-0.5.1-py3-none-any.whl
Upload date: Feb 10, 2026
Size: 40.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for oghma-0.5.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ebb42a3088486791502555682f037aa821a56c92c2795b5932b6f96f3c75819d`
MD5	`f7246604212c40a943feba1048c0241f`
BLAKE2b-256	`5a88a0c9f91967c455df3d45d1546ef98a85236f5b3c6b8bf6adb43450d78c0c`

See more details on using hashes here.

oghma 0.5.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Oghma

How it works

Features

Quick start

MCP server

CLI reference

Configuration

Extraction models

Search modes

How extraction works

Maintenance

Adding a custom parser

Requirements

Environment variables

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes