A deterministic, high-precision code intelligence MCP server

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

kapillamba4

These details have not been verified by PyPI

Project description

code-memory

A deterministic, high-precision code intelligence layer exposed as a Model Context Protocol (MCP) server.

No API key required: runs entirely locally with sentence-transformers
1 min setup: run uvx code-memory and you're ready
Cuts token usage by about 50%: precise code retrieval instead of dumping entire files

If you like this project, please star code-memory on GitHub!

Why code-memory?

Finding the right context from a large codebase is expensive, inaccurate, and limited by context windows. Dumping files into prompts wastes tokens, and LLMs lose track of the actual task as context fills up.

Instead of manually hunting with grep/find or dumping raw file text, code-memory runs semantic searches against a locally indexed codebase. Inspired by claude-context, but designed from the ground up for large-scale local search.

Supported Languages

Full AST Support (structural parsing with symbol extraction): Python, JavaScript/TypeScript, Java, Go, Rust, C/C++, Ruby, Kotlin

Fallback Support (whole-file indexing): C#, Swift, Scala, Lua, Shell, Config (yaml/toml/json), Web (html/css), SQL, Markdown

Files matching .gitignore patterns are automatically skipped.
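The two support tiers above can be pictured as a lookup on the file extension. The sketch below is illustrative only: the extension sets and the `support_tier` helper are assumptions made for this example, not the project's actual mapping.

```python
from pathlib import Path
from typing import Optional

# Hypothetical extension sets derived from the language lists above;
# the real project may recognize a different set of suffixes.
AST_EXTENSIONS = {
    ".py", ".js", ".jsx", ".ts", ".tsx", ".java", ".go", ".rs",
    ".c", ".h", ".cpp", ".hpp", ".rb", ".kt",
}
FALLBACK_EXTENSIONS = {
    ".cs", ".swift", ".scala", ".lua", ".sh", ".yaml", ".yml",
    ".toml", ".json", ".html", ".css", ".sql", ".md",
}


def support_tier(path: str) -> Optional[str]:
    """Return "ast", "fallback", or None for files outside both lists."""
    suffix = Path(path).suffix.lower()
    if suffix in AST_EXTENSIONS:
        return "ast"
    if suffix in FALLBACK_EXTENSIONS:
        return "fallback"
    return None
```

A file in the "ast" tier would get symbol-level entries, while a "fallback" file would be indexed as a single unit.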

Architecture: Progressive Disclosure

Instead of a single monolithic search, code-memory routes queries through three purpose-built search tools, plus a fourth tool that builds the index they read from:

Question Type	Tool	Data Source
"Where / What / How?" — find definitions, references, structure, semantic search	`search_code`	BM25 + Dense Vector (SQLite vec)
"Architecture / Patterns" — understand architecture, explain workflows	`search_docs`	Semantic / Fuzzy
"Who / Why?" — debug regressions, understand intent	`search_history`	Git + BM25 + Dense Vector (SQLite vec)
"Setup / Prepare" — index parsing & embedding generation	`index_codebase`	AST Parser + `sentence-transformers`

This forces the LLM to pick the right retrieval strategy before any data is fetched.

Installation

From PyPI (Recommended)

# Install with pip
pip install code-memory

# Or with uvx (for MCP hosts)
uvx code-memory

From Source

# Clone the repo
git clone https://github.com/kapillamba4/code-memory.git
cd code-memory

# Install dependencies
uv sync

# Run the MCP server (stdio transport)
uv run mcp run server.py

Pre-built Binaries (Standalone)

Download standalone executables from GitHub Releases — no Python installation required.

Platform	Architecture	File
Linux	x86_64	`code-memory-linux-x86_64`
macOS	x86_64 (Intel)	`code-memory-macos-x86_64`
macOS	ARM64 (Apple Silicon)	`code-memory-macos-arm64`
Windows	x86_64	`code-memory-windows-x86_64.exe`

# Linux/macOS: Download and make executable
chmod +x code-memory-*
./code-memory-*

# Windows: Run directly
code-memory-windows-x86_64.exe

Note: The first run will download the embedding model (~600MB) to ~/.cache/huggingface/. Subsequent runs use the cached model.

Quickstart

Prerequisites

Python ≥ 3.13
uv package manager (recommended) or pip

Install & Run

# Install from PyPI
pip install code-memory

# Or run directly with uvx
uvx code-memory

Development

# Run with the MCP Inspector for interactive debugging
uv run mcp dev server.py

# Run tests
uv run pytest tests/ -v

# Lint and format
uv run ruff check .
uv run ruff format .

# Build package
uv build

# Build standalone binary (requires pyinstaller)
pip install pyinstaller
pyinstaller --clean code-memory.spec
# Binary output: dist/code-memory

Configure Your MCP Host

You can use either uvx (requires Python) or the standalone binary (no dependencies).

Using uvx (Python required)

Gemini CLI / Gemini Code Assist

Add to your MCP settings (e.g. ~/.gemini/settings.json):

{
  "mcpServers": {
    "code-memory": {
      "command": "uvx",
      "args": ["code-memory"]
    }
  }
}

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows):

{
  "mcpServers": {
    "code-memory": {
      "command": "uvx",
      "args": ["code-memory"]
    }
  }
}

Claude Code (CLI)

Add to .mcp.json in your project root or ~/.mcp.json for global access:

{
  "mcpServers": {
    "code-memory": {
      "command": "uvx",
      "args": ["code-memory"]
    }
  }
}

VS Code (Copilot / Continue)

Add to .vscode/mcp.json in your workspace:

{
  "servers": {
    "code-memory": {
      "command": "uvx",
      "args": ["code-memory"]
    }
  }
}
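If you would rather script these edits than paste JSON by hand, a small helper can merge the entry into an existing config without touching other servers. This is a minimal sketch: `add_mcp_server` is a hypothetical helper written for this example and is not part of code-memory. Pass `root_key="servers"` for the VS Code layout and keep the default `"mcpServers"` for the other hosts shown above.

```python
import json
from pathlib import Path


def add_mcp_server(config_path, name="code-memory", command="uvx",
                   args=("code-memory",), root_key="mcpServers"):
    """Merge one server entry into a JSON config file, keeping existing entries."""
    path = Path(config_path)
    text = path.read_text() if path.exists() else ""
    config = json.loads(text) if text.strip() else {}

    # Create the top-level key if it is missing, then add or overwrite our entry.
    servers = config.setdefault(root_key, {})
    servers[name] = {"command": command, "args": list(args)}

    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

For example, `add_mcp_server(".vscode/mcp.json", root_key="servers")` would write the VS Code variant into the current workspace.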

Using Standalone Binary (No Python required)

Replace the path with the location of your downloaded binary:

{
  "mcpServers": {
    "code-memory": {
      "command": "/path/to/code-memory-linux-x86_64"
    }
  }
}

For Windows:

{
  "mcpServers": {
    "code-memory": {
      "command": "C:\\path\\to\\code-memory-windows-x86_64.exe"
    }
  }
}

Configuration

Environment Variables

Variable	Description	Default
`CODE_MEMORY_LOG_LEVEL`	Logging verbosity (DEBUG, INFO, WARNING, ERROR)	INFO
`EMBEDDING_MODEL`	HuggingFace model ID for embeddings	`nomic-ai/nomic-embed-text-v1.5`

Example:

CODE_MEMORY_LOG_LEVEL=DEBUG uvx code-memory

Custom Embedding Model

You can use a different embedding model by setting the EMBEDDING_MODEL environment variable:

EMBEDDING_MODEL="BAAI/bge-small-en-v1.5" uvx code-memory

For MCP hosts, add the environment variable to your configuration:

{
  "mcpServers": {
    "code-memory": {
      "command": "uvx",
      "args": ["code-memory"],
      "env": {
        "EMBEDDING_MODEL": "BAAI/bge-small-en-v1.5"
      }
    }
  }
}

Note: Changing the embedding model will invalidate existing indexes. You'll need to re-run index_codebase after switching models.
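To make the two variables and the re-index note concrete, here is a rough sketch of how a server could read them and detect an index built with a different model. The variable names and defaults come from the table above; the `Settings` class, the fallback to INFO on an unknown level, and the `index_is_stale` check are assumptions for illustration, not the project's actual code.

```python
import os
from dataclasses import dataclass

DEFAULT_MODEL = "nomic-ai/nomic-embed-text-v1.5"
VALID_LEVELS = {"DEBUG", "INFO", "WARNING", "ERROR"}


@dataclass(frozen=True)
class Settings:
    log_level: str
    embedding_model: str


def load_settings(env=None):
    """Read configuration from a mapping (defaults to os.environ)."""
    env = os.environ if env is None else env
    level = env.get("CODE_MEMORY_LOG_LEVEL", "INFO").upper()
    if level not in VALID_LEVELS:
        level = "INFO"  # assumption: unknown levels fall back to the default
    model = env.get("EMBEDDING_MODEL") or DEFAULT_MODEL
    return Settings(log_level=level, embedding_model=model)


def index_is_stale(indexed_model, settings):
    """Hypothetical check: vectors from different models are not comparable."""
    return indexed_model != settings.embedding_model
```

Embeddings produced by different models live in different vector spaces (and often have different dimensions), which is why switching models calls for a fresh index_codebase run.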

Tools

`index_codebase`

Indexes or re-indexes source files and documentation in the given directory. Run this before using search_code or search_docs to ensure the database is up to date. Uses tree-sitter for language-agnostic structural extraction and generates dense vector embeddings using sentence-transformers (runs locally, in-process) for semantic search.

index_codebase(directory=".")
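To give a feel for what structural extraction produces, the sketch below pulls function and class spans out of Python source. Note the substitution: it uses Python's standard-library `ast` module as a stand-in for tree-sitter, so it only handles Python, and the `extract_symbols` helper and its tuple layout are assumptions for this example.

```python
import ast


def extract_symbols(source: str):
    """Return (kind, name, start_line, end_line) for each function and class."""
    symbols = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            symbols.append(("function", node.name, node.lineno, node.end_lineno))
        elif isinstance(node, ast.ClassDef):
            symbols.append(("class", node.name, node.lineno, node.end_lineno))
    # Sort by starting line so the result follows source order.
    return sorted(symbols, key=lambda s: s[2])
```

Each span could then be embedded and stored on its own, which is what lets a search return one function instead of a whole file.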

`search_code`

Performs semantic search over code: finds structural definitions, locates where functions and classes are defined, and maps dependency references (call graphs). Uses hybrid retrieval (BM25 + vector embeddings) to surface both exact matches and semantically similar code.

search_code(query="parse python files", search_type="definition")
search_code(query="how do we establish the database connection", search_type="references")
search_code(query="src/auth/", search_type="file_structure")
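Hybrid retrieval needs some way to merge the BM25 ranking with the vector ranking. This README does not say how code-memory combines them, so the sketch below uses reciprocal rank fusion (RRF), a common choice, purely as an illustration. The function name and the constant `k=60` are assumptions.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of IDs into one list, best first.

    Each list contributes 1 / (k + rank) per item, so items ranked highly
    by more than one retriever rise to the top.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Sort by descending score; break ties by ID for a stable result.
    return sorted(scores, key=lambda d: (-scores[d], d))


bm25_hits = ["a", "b", "c"]    # keyword ranking (hypothetical IDs)
dense_hits = ["b", "c", "d"]   # embedding ranking (hypothetical IDs)
fused = reciprocal_rank_fusion([bm25_hits, dense_hits])  # ["b", "c", "a", "d"]
```

RRF works on ranks rather than raw scores, which sidesteps the fact that BM25 scores and cosine similarities sit on different scales.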

`search_docs`

Understand the codebase conceptually — how things work, architectural patterns, SOPs. Searches markdown documentation, READMEs, and docstrings extracted from code.

search_docs(query="how does the authentication flow work?")
search_docs(query="installation instructions", top_k=5)

`search_history`

Debug regressions and understand developer intent through Git history.

search_history(query="fix login timeout", search_type="commits")
search_history(query="src/auth/login.py", search_type="file_history", target_file="src/auth/login.py")
search_history(query="server.py", search_type="blame", target_file="server.py", line_start=1, line_end=20)

Project Structure

code-memory/
├── server.py          # MCP server entry point (FastMCP)
├── db.py              # SQLite database layer with sqlite-vec
├── parser.py          # Tree-sitter-based code parser
├── doc_parser.py      # Markdown documentation parser
├── queries.py         # Hybrid retrieval query layer
├── git_search.py      # Git history search module
├── errors.py          # Custom exception hierarchy
├── validation.py      # Input validation functions
├── logging_config.py  # Structured logging configuration
├── tests/             # Test suite
├── pyproject.toml     # Project metadata & dependencies
└── prompts/           # Milestone prompt engineering files

Troubleshooting

"Git repository not found" error

Make sure you're running search_history from within a git repository. The tool searches upward from the current directory to find .git.
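The upward search described above can be sketched in a few lines, which may help when checking why a directory is not recognized. `find_git_root` is a hypothetical helper for this example, not the function code-memory uses.

```python
from pathlib import Path
from typing import Optional


def find_git_root(start: str = ".") -> Optional[Path]:
    """Walk from `start` up to the filesystem root looking for a .git entry."""
    current = Path(start).resolve()
    for candidate in (current, *current.parents):
        # .git is a directory in a normal clone and a file in worktrees/submodules.
        if (candidate / ".git").exists():
            return candidate
    return None
```

If this returns None for your working directory, the directory is not inside a git repository and search_history has nothing to search.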

Empty search results

Run index_codebase(directory=".") first to index your code and documentation. The index is stored locally in code_memory.db.

Slow indexing

Indexing generates embeddings using a local sentence-transformers model. The first run downloads the model (~600MB for jina-code-embeddings-0.5b). Subsequent runs are faster.

Embedding model errors

Ensure you have enough disk space and memory. The jina-code-embeddings-0.5b model requires ~1GB RAM when loaded.

Roadmap

Milestone 1 — Project scaffolding & MCP protocol wiring
Milestone 2 — Implement search_code with AST parsing + SQLite + sqlite-vec
Milestone 3 — Implement search_history with Git integration
Milestone 4 — Implement search_docs with semantic search
Milestone 5 — Production hardening & packaging

Contributing

See CONTRIBUTING.md for development setup and guidelines.

Changelog

See CHANGELOG.md for version history.

License

MIT

Release history

Version	Release date
1.0.33	May 20, 2026
1.0.32	May 11, 2026
1.0.31	May 10, 2026
1.0.30	May 10, 2026
1.0.29	May 10, 2026
1.0.28	Mar 15, 2026
1.0.27	Mar 2, 2026
1.0.25	Feb 28, 2026
1.0.24	Feb 28, 2026
1.0.23	Feb 24, 2026
1.0.22	Feb 24, 2026
1.0.21	Feb 24, 2026
1.0.19	Feb 24, 2026
1.0.18	Feb 23, 2026
1.0.17 (this version)	Feb 23, 2026
1.0.16	Feb 23, 2026
1.0.15	Feb 23, 2026
1.0.14	Feb 23, 2026
1.0.13	Feb 23, 2026
1.0.12	Feb 23, 2026
1.0.11	Feb 22, 2026
1.0.10	Feb 22, 2026
1.0.9	Feb 22, 2026
1.0.7	Feb 22, 2026
1.0.6	Feb 22, 2026
1.0.5	Feb 22, 2026
1.0.4	Feb 22, 2026
1.0.3	Feb 22, 2026
1.0.2	Feb 22, 2026
0.1.0	Feb 22, 2026

Download files

Source Distribution

code_memory-1.0.17.tar.gz (369.0 kB view details)

Uploaded Feb 23, 2026 Source

Built Distribution

code_memory-1.0.17-py3-none-any.whl (395.6 kB view details)

Uploaded Feb 23, 2026 Python 3

File details

Details for the file code_memory-1.0.17.tar.gz.

File metadata

Download URL: code_memory-1.0.17.tar.gz
Upload date: Feb 23, 2026
Size: 369.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.10.4 {"installer":{"name":"uv","version":"0.10.4","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for code_memory-1.0.17.tar.gz
Algorithm	Hash digest
SHA256	`e2919f691b07110911390960aedbb61bfddb6674c9180756371d1bedd71e9f63`
MD5	`e550d12218888bde20878c0a1875fb88`
BLAKE2b-256	`5b25604a4b3ed5f28dcaa55a02e6c686f80711fb40f889ae3cece16a17c1348f`

File details

Details for the file code_memory-1.0.17-py3-none-any.whl.

File metadata

Download URL: code_memory-1.0.17-py3-none-any.whl
Upload date: Feb 23, 2026
Size: 395.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.10.4 {"installer":{"name":"uv","version":"0.10.4","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for code_memory-1.0.17-py3-none-any.whl
Algorithm	Hash digest
SHA256	`88025a2bf38323ab8b7dbd74c097752f265f3ad5d567f5034ac86a5c9e84783b`
MD5	`cfb616053fc8e8ebabc036969b00ff96`
BLAKE2b-256	`323f30d2b89e2adeb68e032605516e947a9d4d3760c0df496652c9a8530ed499`

code-memory 1.0.17
