Cursor-style vector search MCP plugin for Claude Code

These details have not been verified by PyPI

Project links

Project description

VecGrep

Cursor-style semantic code search as an MCP plugin for Claude Code.

Instead of grepping 50 files and sending 30,000 tokens to Claude, VecGrep returns the top 8 semantically relevant code chunks (~1,600 tokens). That's a ~95% token reduction for codebase queries.

How it works

Chunk — Parses source files with tree-sitter to extract semantic units (functions, classes, methods)
Embed — Encodes each chunk locally using all-MiniLM-L6-v2-code-search-512 (384-dim, ~80MB one-time download) via the fastembed ONNX backend (~100ms startup) or PyTorch, automatically using Metal (Apple Silicon), CUDA (NVIDIA), or CPU
Store — Saves embeddings + metadata in LanceDB under ~/.vecgrep/<project_hash>/
Search — ANN index (IVF-PQ) for fast approximate search on large codebases

Incremental re-indexing via mtime/size checks skips unchanged files.

Architecture

VecGrep architecture diagram

Installation

Requires Python 3.12 and uv.

Note: Python 3.12 is required — tree-sitter-languages does not yet have wheels for Python 3.13+.

pip install vecgrep                        # standard pip
uv tool install --python 3.12 vecgrep     # uv tool (recommended)

Claude Code integration

Run once — works for every project:

claude mcp add --scope user vecgrep -- vecgrep

This installs VecGrep as a persistent binary and registers it in your user config (~/.claude.json) so it's available globally across all projects. Starts instantly — no download delay on Claude Code launch.

Usage with Claude

You don't trigger VecGrep manually - Claude decides when to call the tools based on what you ask.

What you say to Claude	Tool invoked
"Index my project at /Users/me/myapp"	`index_codebase`
"How does authentication work in this codebase?"	`search_code`
"Find where database connections are set up"	`search_code`
"How many files are indexed?"	`get_index_status`

Typical first-time flow:

You:    "Search for how payments are handled in /Users/me/myapp"
Claude: [calls index_codebase automatically since no index exists]
Claude: [calls search_code with your query]
Claude: "Here's how payments work — in src/payments.py:42..."

After the first index, subsequent searches skip unchanged files automatically — no re-indexing needed unless your code changes.

Tools

`index_codebase(path, force=False)`

Index a project directory. Skips unchanged files on subsequent calls.

index_codebase("/path/to/myproject")
# → "Indexed 142 file(s), 1847 chunk(s) added (0 file(s) skipped, unchanged)"

`search_code(query, path, top_k=8)`

Semantic search. Auto-indexes if no index exists.

search_code("how does user authentication work", "/path/to/myproject")

Returns formatted snippets with file paths, line numbers, and similarity scores:

[1] src/auth.py:45-72 (score: 0.87)
def authenticate_user(token: str) -> User:
    ...

[2] src/middleware.py:12-28 (score: 0.81)
...

`get_index_status(path)`

Check index statistics.

Index status for: /path/to/myproject
  Files indexed:  142
  Total chunks:   1847
  Last indexed:   2026-02-22T07:20:31+00:00
  Index size:     28.4 MB

Configuration

VecGrep can be tuned via environment variables:

Variable	Default	Description
`VECGREP_BACKEND`	`onnx`	Embedding backend: `onnx` (fastembed, fast startup) or `torch` (sentence-transformers, any HF model)
`VECGREP_MODEL`	`isuruwijesiri/all-MiniLM-L6-v2-code-search-512`	HuggingFace model ID to use for embeddings

Backend comparison:

Backend	Startup	PyTorch required	Custom HF models
`onnx` (default)	~100ms	No	ONNX-exported models only
`torch`	~2–3s	Yes	Any HuggingFace model

Examples:

# Use a different model with the torch backend
VECGREP_BACKEND=torch VECGREP_MODEL=sentence-transformers/all-MiniLM-L6-v2 vecgrep

# Use a custom ONNX model
VECGREP_MODEL=my-org/my-onnx-model vecgrep

Supported languages

Python, JavaScript/TypeScript, Rust, Go, Java, C/C++, Ruby, Swift, Kotlin, C#

All other text files fall back to sliding-window line chunks.

Index location

~/.vecgrep/<sha256-of-project-path>/index.db

Each project gets its own isolated index. Delete the directory to wipe the index.

Acknowledgements

The embedding model used by VecGrep is all-MiniLM-L6-v2-code-search-512, a model fine-tuned specifically for semantic code search by @isuruwijesiri.

@misc{all_MiniLM_L6_v2_code_search_512,
  author    = {isuruwijesiri},
  title     = {all-MiniLM-L6-v2-code-search-512},
  year      = {2026},
  publisher = {Hugging Face},
  url       = {https://huggingface.co/isuruwijesiri/all-MiniLM-L6-v2-code-search-512}
}

Community


❓ Questions	Start a Q&A discussion
💡 Ideas	Share an idea
🚀 Show & Tell	Share how you use VecGrep
🐛 Bugs	Open an issue

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.8.0

May 19, 2026

1.7.0

Mar 4, 2026

This version

1.6.0

Mar 2, 2026

1.5.0

Feb 28, 2026

1.0.0

Feb 21, 2026

0.1.0

Feb 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vecgrep-1.6.0.tar.gz (652.3 kB view details)

Uploaded Mar 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vecgrep-1.6.0-py3-none-any.whl (19.7 kB view details)

Uploaded Mar 2, 2026 Python 3

File details

Details for the file vecgrep-1.6.0.tar.gz.

File metadata

Download URL: vecgrep-1.6.0.tar.gz
Upload date: Mar 2, 2026
Size: 652.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.2 {"installer":{"name":"uv","version":"0.10.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vecgrep-1.6.0.tar.gz
Algorithm	Hash digest
SHA256	`a3d94524d0b35092be928998fca65688b0645119184c35c1fd947b4de3280237`
MD5	`b2d89bfad7e853518782f25ada161ef3`
BLAKE2b-256	`4c5d69c9ae32470660fcfee70344564a1b63973ee23cb2d55327d326fd86b41c`

See more details on using hashes here.

File details

Details for the file vecgrep-1.6.0-py3-none-any.whl.

File metadata

Download URL: vecgrep-1.6.0-py3-none-any.whl
Upload date: Mar 2, 2026
Size: 19.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.2 {"installer":{"name":"uv","version":"0.10.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vecgrep-1.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`db760aa236ae13b5327cd93b0abd77009dfb90f3e2ad7e82004ac3f6d45d6d89`
MD5	`97b86d890349c92035f14730138c29aa`
BLAKE2b-256	`4f7199cee231b9792ca70e2e167feca82883a194e8f8b6bb52c07317a318f831`

See more details on using hashes here.

vecgrep 1.6.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

VecGrep

How it works

Architecture

Installation

Claude Code integration

Usage with Claude

Tools

index_codebase(path, force=False)

search_code(query, path, top_k=8)

get_index_status(path)

Configuration

Supported languages

Index location

Acknowledgements

Community

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`index_codebase(path, force=False)`

`search_code(query, path, top_k=8)`

`get_index_status(path)`