Unified AI memory layer for coding assistants

These details have not been verified by PyPI

Project links

Project description

Oghma

Persistent memory for AI coding assistants.

Oghma hums in the background — watching your coding sessions, extracting technical gotchas and workarounds via LLM, and making them searchable for when you vaguely remember solving something three months ago but forgot how.

It's a safety net for hard-won discoveries, not a knowledge base or personal wiki. For structured notes and preferences, use your own docs. For "what was that sqlite-vec trick again?" — search Oghma.

How it works

┌─────────────┐     ┌───────────┐     ┌──────────┐     ┌────────────┐
│ Transcripts │────▶│  Extract  │────▶│  Dedup   │────▶│   Store    │
│ (JSONL)     │     │  (LLM)    │     │ (cosine) │     │ (SQLite)   │
└─────────────┘     └───────────┘     └──────────┘     └────────────┘
  Claude Code            │                                    │
  Codex                  │                                    ▼
  OpenCode          Categories:                    ┌──────────────────┐
                    - gotcha                       │  Search (MCP)    │
                    - learning                     │  keyword / vec / │
                    - workflow                     │  hybrid (RRF)    │
                    - preference                   └──────────────────┘
                    - project_context

A background daemon polls for new/changed transcripts, sends chunks to an LLM for extraction, embeds the results, checks for semantic duplicates, and stores what's genuinely new. Your AI assistant queries this via MCP — so it remembers what you've learned across every session and every tool.

Features

Multi-tool extraction — Parses transcripts from Claude Code, Codex, OpenCode (and OpenClaw)
LLM-powered filtering — Configurable model with a tuned prompt that extracts gotchas and workarounds while filtering noise like "the user prefers Python"
Hybrid search — SQLite FTS5 + sqlite-vec fused via Reciprocal Rank Fusion with recency boost
Inline dedup — New memories are checked against existing embeddings before insertion. Duplicates never enter the DB.
MCP server — Plug into Claude Code, Cursor, or any MCP-compatible client
Maintenance CLI — Semantic dedup, noise purge, staleness pruning, memory promotion
Export — Markdown or JSON, grouped by category, date, or source

Quick start

# Install
pip install oghma

# Or from source
git clone https://github.com/terry-li-hm/oghma.git
cd oghma
pip install -e ".[dedup]"

# Set API keys
export OPENAI_API_KEY=sk-...          # for embeddings
export OPENROUTER_API_KEY=sk-or-...   # if using OpenRouter models for extraction

# Initialize and start
oghma init          # creates ~/.oghma/config.yaml
oghma start         # background daemon

Edit ~/.oghma/config.yaml to configure your extraction model, tool paths, and embedding settings.

Integration

Two ways to connect Oghma to your AI assistant:

Option A: Claude Code skill (recommended)

Zero token overhead — the skill is only loaded when invoked, not on every turn.

mkdir -p ~/.claude/skills/oghma
cp integrations/claude-code/SKILL.md ~/.claude/skills/oghma/SKILL.md

Your assistant will use oghma search via CLI when it needs to recall past learnings.

Option B: MCP server

Works with any MCP client (Claude Code, Cursor, Windsurf, etc.). Costs ~350 tokens/turn for tool schemas.

Add to your Claude Code config (~/.claude.json):

{
  "mcpServers": {
    "oghma": {
      "command": "uvx",
      "args": ["--from", "oghma", "oghma-mcp"]
    }
  }
}

This exposes four tools to your AI assistant:

Tool	Description
`oghma_search`	Search memories (keyword, vector, or hybrid)
`oghma_get`	Fetch a specific memory by ID
`oghma_stats`	Memory counts by category and source
`oghma_categories`	List categories with counts

CLI reference

oghma init                  Create default config
oghma start [--foreground]  Start the extraction daemon
oghma stop                  Stop the daemon
oghma status [--json]       Daemon status and DB stats
oghma stats                 Memory counts by category/source

oghma search <query>        Search memories
  --mode keyword|vector|hybrid
  --category, --tool, --status, --limit

oghma dedup                 Find and remove semantic duplicates
oghma purge-noise           Remove memories matching noise patterns
oghma prune-stale           Delete memories older than N days
  --max-age-days 90
  --source-tool <name>

oghma promote <id>          Promote a memory to 'promoted' category
oghma export                Export to markdown or JSON
  --format, --group-by, --category

oghma validate-config       Check config for errors
oghma migrate-embeddings    Backfill embeddings for existing memories

All destructive commands default to --dry-run. Pass --execute to apply.

Configuration

~/.oghma/config.yaml:

daemon:
  poll_interval: 300          # seconds between checks
  min_messages: 6             # skip trivial sessions

extraction:
  model: google/gemini-3-flash-preview   # or gpt-4o-mini, deepseek/deepseek-chat, etc.
  confidence_threshold: 0.7
  dedup_threshold: 0.92       # cosine similarity — higher = stricter
  categories:
    - learning
    - preference
    - project_context
    - gotcha
    - workflow
    - promoted

embedding:
  provider: openai
  model: text-embedding-3-small
  dimensions: 1536

tools:
  claude_code:
    enabled: true
    paths:
      - ~/.claude/projects/-Users-*/*.jsonl
  codex:
    enabled: true
    paths:
      - ~/.codex/sessions/**/rollout-*.jsonl
  opencode:
    enabled: true
    paths:
      - ~/.local/share/opencode/storage/message/ses_*

Extraction models

Oghma supports any OpenAI or OpenRouter model:

Model	Provider	Quality	Cost
google/gemini-3-flash-preview	OpenRouter	Excellent	~$1.50/M tokens
gpt-4o-mini	OpenAI	Good	~$0.30/M tokens
deepseek/deepseek-chat-v3-0324	OpenRouter	Good	~$0.14/M tokens

Search modes

Mode	Engine	Best for
keyword	SQLite FTS5	Exact term matching, fast
vector	sqlite-vec (cosine similarity)	Conceptual/semantic search
hybrid	RRF fusion of both + recency boost	Best overall relevance

oghma search "async patterns" --mode hybrid --limit 20

How memories enter the database

Memories arrive through two paths:

Path	How	`source_tool`	Best for
Daemon extraction	Background daemon processes transcripts via LLM	`claude_code`, `codex`, `opencode`	Catching things you'd forget to note
Manual addition	`oghma_add` via MCP or CLI	`manual`	Curated insights you know are valuable

Daemon extraction

The daemon sends conversation chunks to an LLM with a prompt engineered to extract only actionable insights:

Extracted: Tool gotchas, bug workarounds, API quirks, architecture decisions, error solutions, workflow patterns.

Filtered: Setup facts ("uses Python 3.12"), config restatements, assistant narration ("The AI suggested..."), trivially obvious observations.

Each memory gets a confidence score and a category. Post-extraction, regex noise patterns catch stragglers. Pre-insertion, embedding similarity catches duplicates. The result: your database grows with genuine insights, not noise.

Manual addition

You can add memories directly via the CLI (oghma add — coming soon). Use this for curated, high-confidence insights — not as a general notepad. For personal preferences and stable facts, a structured note (e.g., in your knowledge base) is usually a better fit.

Maintenance

# Recommended: run weekly via cron
oghma dedup --threshold 0.92 --execute
oghma purge-noise --execute

# Prune old memories from a retired tool
oghma prune-stale --max-age-days 90 --source-tool openclaw --execute

# Promote a frequently-useful memory
oghma promote 739

Adding a custom parser

Implement a parser with can_parse() and parse() methods:

from oghma.parsers import Message

class MyToolParser:
    def can_parse(self, file_path: Path) -> bool:
        return ".mytool" in str(file_path)

    def parse(self, file_path: Path) -> list[Message]:
        # Return list of Message(role="user"|"assistant", content="...")
        ...

Requirements

Python 3.10+
SQLite with FTS5 (included in most distributions)
sqlite-vec for vector search (optional, recommended)
OpenAI API key for embeddings
LLM API key for extraction (OpenAI or OpenRouter)

Environment variables

Variable	Required	Description
`OPENAI_API_KEY`	Yes	For embeddings (text-embedding-3-small)
`OPENROUTER_API_KEY`	If using OpenRouter	For Gemini, DeepSeek, etc.
`OGHMA_DB_PATH`	No	Override database path
`OGHMA_EXTRACTION_MODEL`	No	Override extraction model
`OGHMA_LOG_LEVEL`	No	DEBUG / INFO / WARNING / ERROR

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.6.3

Feb 24, 2026

0.6.2

Feb 11, 2026

This version

0.6.1

Feb 11, 2026

0.6.0

Feb 11, 2026

0.5.1

Feb 10, 2026

0.5.0

Feb 9, 2026

0.4.0

Feb 6, 2026

0.3.0

Feb 5, 2026

0.0.1

Feb 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oghma-0.6.1.tar.gz (54.4 kB view details)

Uploaded Feb 11, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

oghma-0.6.1-py3-none-any.whl (40.9 kB view details)

Uploaded Feb 11, 2026 Python 3

File details

Details for the file oghma-0.6.1.tar.gz.

File metadata

Download URL: oghma-0.6.1.tar.gz
Upload date: Feb 11, 2026
Size: 54.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.2

File hashes

Hashes for oghma-0.6.1.tar.gz
Algorithm	Hash digest
SHA256	`a850750b6dec8af29345c8bbcdb2d353bb4acbbc97373073477ec8f66022130b`
MD5	`62d9817f6b1c74cd45181a2120442ec3`
BLAKE2b-256	`031c4c2dc442502a9445dcb16e1d7439e516787debbf87a8ca6abe6277277b28`

See more details on using hashes here.

File details

Details for the file oghma-0.6.1-py3-none-any.whl.

File metadata

Download URL: oghma-0.6.1-py3-none-any.whl
Upload date: Feb 11, 2026
Size: 40.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.2

File hashes

Hashes for oghma-0.6.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8762e9af66cece7bc8d4e1abf619126683eb0f5e5c6d562835cecac657f338f2`
MD5	`5bfcda4619b9131b2529c7b5d57de9db`
BLAKE2b-256	`7caaf513aead3f5507ed0cfbe797908a97d0d40536e810e53134375ec1300f74`

See more details on using hashes here.

oghma 0.6.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Oghma

How it works

Features

Quick start

Integration

Option A: Claude Code skill (recommended)

Option B: MCP server

CLI reference

Configuration

Extraction models

Search modes

How memories enter the database

Daemon extraction

Manual addition

Maintenance

Adding a custom parser

Requirements

Environment variables

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes