Skip to main content

A deterministic, high-precision code intelligence MCP server

Project description

code-memory

A deterministic, high-precision code intelligence layer exposed as a Model Context Protocol (MCP) server.

code-memory gives your AI coding assistant structured access to your codebase through three focused pathways — eliminating context-window bloat and vague "search everything" queries.

Supported Languages

Full AST Support (Tree-sitter)

These languages have structural parsing with symbol extraction (functions, classes, methods, etc.):

Language Extensions
Python .py
JavaScript .js, .jsx
TypeScript .ts, .tsx
Java .java
Go .go
Rust .rs
C .c, .h
C++ .cpp, .hpp, .cc, .cxx
Ruby .rb
Kotlin .kt, .kts

Fallback Support (Whole-file Indexing)

These file types are indexed as complete units for BM25 and semantic search:

Category Extensions
C# .cs
Swift .swift
Scala .scala
Lua .lua
Shell .sh, .bash, .zsh
Config .yaml, .yml, .toml, .json
Web .html, .css, .scss
Database .sql
Docs .md, .txt

Note: Files and directories matching patterns in your .gitignore are automatically skipped during indexing. This excludes build artifacts, dependencies, and other generated files.

Architecture: Progressive Disclosure

Instead of a single monolithic search, code-memory routes queries through three purpose-built tools:

Question Type Tool Data Source
"Where / What / How?" — find definitions, references, structure, semantic search search_code BM25 + Dense Vector (SQLite vec)
"Architecture / Patterns" — understand architecture, explain workflows search_docs Semantic / Fuzzy
"Who / Why?" — debug regressions, understand intent search_history Git + BM25 + Dense Vector (SQLite vec)
"Setup / Prepare" — index parsing & embedding generation index_codebase AST Parser + sentence-transformers

This forces the LLM to pick the right retrieval strategy before any data is fetched.

Installation

From PyPI (Recommended)

# Install with pip
pip install code-memory

# Or with uvx (for MCP hosts)
uvx code-memory

From Source

# Clone the repo
git clone https://github.com/kapillamba4/code-memory.git
cd code-memory

# Install dependencies
uv sync

# Run the MCP server (stdio transport)
uv run mcp run server.py

Quickstart

Prerequisites

  • Python ≥ 3.13
  • uv package manager (recommended) or pip

Install & Run

# Install from PyPI
pip install code-memory

# Or run directly with uvx
uvx code-memory

Development

# Run with the MCP Inspector for interactive debugging
uv run mcp dev server.py

# Run tests
uv run pytest tests/ -v

# Lint and format
uv run ruff check .
uv run ruff format .

# Build package
uv build

Configure Your MCP Host

Gemini CLI / Gemini Code Assist

Add to your MCP settings (e.g. ~/.gemini/settings.json):

{
  "mcpServers": {
    "code-memory": {
      "command": "uvx",
      "args": ["code-memory"]
    }
  }
}

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows):

{
  "mcpServers": {
    "code-memory": {
      "command": "uvx",
      "args": ["code-memory"]
    }
  }
}

Claude Code (CLI)

Add to .mcp.json in your project root or ~/.mcp.json for global access:

{
  "mcpServers": {
    "code-memory": {
      "command": "uvx",
      "args": ["code-memory"]
    }
  }
}

VS Code (Copilot / Continue)

Add to .vscode/mcp.json in your workspace:

{
  "servers": {
    "code-memory": {
      "command": "uvx",
      "args": ["code-memory"]
    }
  }
}

Configuration

Environment Variables

Variable Description Default
CODE_MEMORY_LOG_LEVEL Logging verbosity (DEBUG, INFO, WARNING, ERROR) INFO

Example:

CODE_MEMORY_LOG_LEVEL=DEBUG uvx code-memory

Tools

index_codebase

Indexes or re-indexes source files and documentation in the given directory. Run this before using search_code or search_docs to ensure the database is up to date. Uses tree-sitter for language-agnostic structural extraction and generates dense vector embeddings using sentence-transformers (runs locally, in-process) for semantic search.

index_codebase(directory=".")

search_code

Perform semantic search and find structural code definitions, locate where functions/classes are defined, or map out dependency references (call graphs). Uses hybrid retrieval (BM25 + vector embeddings) to find exact matches and semantic similarities.

search_code(query="parse python files", search_type="definition")
search_code(query="how do we establish the database connection", search_type="references")
search_code(query="src/auth/", search_type="file_structure")

search_docs

Understand the codebase conceptually — how things work, architectural patterns, SOPs. Searches markdown documentation, READMEs, and docstrings extracted from code.

search_docs(query="how does the authentication flow work?")
search_docs(query="installation instructions", top_k=5)

search_history

Debug regressions and understand developer intent through Git history.

search_history(query="fix login timeout", search_type="commits")
search_history(query="src/auth/login.py", search_type="file_history", target_file="src/auth/login.py")
search_history(query="server.py", search_type="blame", target_file="server.py", line_start=1, line_end=20)

Project Structure

code-memory/
├── server.py          # MCP server entry point (FastMCP)
├── db.py              # SQLite database layer with sqlite-vec
├── parser.py          # Tree-sitter-based code parser
├── doc_parser.py      # Markdown documentation parser
├── queries.py         # Hybrid retrieval query layer
├── git_search.py      # Git history search module
├── errors.py          # Custom exception hierarchy
├── validation.py      # Input validation functions
├── logging_config.py  # Structured logging configuration
├── tests/             # Test suite
├── pyproject.toml     # Project metadata & dependencies
└── prompts/           # Milestone prompt engineering files

Troubleshooting

"Git repository not found" error

Make sure you're running search_history from within a git repository. The tool searches upward from the current directory to find .git.

Empty search results

Run index_codebase(directory=".") first to index your code and documentation. The index is stored locally in code_memory.db.

Slow indexing

Indexing generates embeddings using a local sentence-transformers model. The first run downloads the model (~90MB). Subsequent runs are faster.

Embedding model errors

Ensure you have enough disk space and memory. The all-MiniLM-L6-v2 model requires ~500MB RAM when loaded.

Roadmap

  • Milestone 1 — Project scaffolding & MCP protocol wiring
  • Milestone 2 — Implement search_code with AST parsing + SQLite + sqlite-vec
  • Milestone 3 — Implement search_history with Git integration
  • Milestone 4 — Implement search_docs with semantic search
  • Milestone 5 — Production hardening & packaging

Contributing

See CONTRIBUTING.md for development setup and guidelines.

Changelog

See CHANGELOG.md for version history.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

code_memory-1.0.9.tar.gz (168.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

code_memory-1.0.9-py3-none-any.whl (192.8 kB view details)

Uploaded Python 3

File details

Details for the file code_memory-1.0.9.tar.gz.

File metadata

  • Download URL: code_memory-1.0.9.tar.gz
  • Upload date:
  • Size: 168.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.4 {"installer":{"name":"uv","version":"0.10.4","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for code_memory-1.0.9.tar.gz
Algorithm Hash digest
SHA256 2fa19f698a1d850799eb7089db483722beb55a3fc94ce1d2f3641f9e9ba1efc9
MD5 967429845c72e51aaa7c9226bcc7e8ec
BLAKE2b-256 773994271b19320824c06e014164e2c0d0c89cba7b035839147bb2e4811fc21c

See more details on using hashes here.

File details

Details for the file code_memory-1.0.9-py3-none-any.whl.

File metadata

  • Download URL: code_memory-1.0.9-py3-none-any.whl
  • Upload date:
  • Size: 192.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.4 {"installer":{"name":"uv","version":"0.10.4","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for code_memory-1.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 f8704056ee1dd146b06fd79172e8c2aa62928bb1dedf927497a6d6b981391fc7
MD5 f598785a399d6d6a1d6a3a61fd664943
BLAKE2b-256 84e7ce6176935a80375718720ddb4be65229f13cad0021890a483db97e16ca76

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page