Skip to main content

Lightweight semantic code search engine — 2-stage vector + FTS + RRF fusion + MCP server

Project description

codexlens-search

Semantic code search engine with MCP server for Claude Code.

2-stage vector search + FTS + RRF fusion + reranking — install once, configure API keys, ready to use.

Quick Start (Claude Code MCP)

Add to your project .mcp.json:

{
  "mcpServers": {
    "codexlens": {
      "command": "uvx",
      "args": ["--from", "codexlens-search[mcp]", "codexlens-mcp"],
      "env": {
        "CODEXLENS_EMBED_API_URL": "https://api.openai.com/v1",
        "CODEXLENS_EMBED_API_KEY": "${OPENAI_API_KEY}",
        "CODEXLENS_EMBED_API_MODEL": "text-embedding-3-small",
        "CODEXLENS_EMBED_DIM": "1536"
      }
    }
  }
}

That's it. Claude Code will auto-discover the tools: index_projectsearch_code.

Install

# Standard install (includes vector search + API clients)
pip install codexlens-search

# With MCP server for Claude Code
pip install codexlens-search[mcp]

Optional extras for advanced use:

Extra Description
mcp MCP server (codexlens-mcp command)
gpu GPU-accelerated embedding (onnxruntime-gpu)
faiss-cpu FAISS ANN backend
watcher File watcher for auto-indexing

MCP Tools

Tool Description
search_code Semantic search with hybrid fusion + reranking
index_project Build or rebuild the search index
index_status Show index statistics
index_update Incremental sync (only changed files)
find_files Glob file discovery
list_models List models with cache status
download_models Download local fastembed models

MCP Configuration Examples

API Embedding Only (simplest)

{
  "mcpServers": {
    "codexlens": {
      "command": "uvx",
      "args": ["--from", "codexlens-search[mcp]", "codexlens-mcp"],
      "env": {
        "CODEXLENS_EMBED_API_URL": "https://api.openai.com/v1",
        "CODEXLENS_EMBED_API_KEY": "${OPENAI_API_KEY}",
        "CODEXLENS_EMBED_API_MODEL": "text-embedding-3-small",
        "CODEXLENS_EMBED_DIM": "1536"
      }
    }
  }
}

API Embedding + API Reranker (best quality)

{
  "mcpServers": {
    "codexlens": {
      "command": "uvx",
      "args": ["--from", "codexlens-search[mcp]", "codexlens-mcp"],
      "env": {
        "CODEXLENS_EMBED_API_URL": "https://api.openai.com/v1",
        "CODEXLENS_EMBED_API_KEY": "${OPENAI_API_KEY}",
        "CODEXLENS_EMBED_API_MODEL": "text-embedding-3-small",
        "CODEXLENS_EMBED_DIM": "1536",
        "CODEXLENS_RERANKER_API_URL": "https://api.jina.ai/v1",
        "CODEXLENS_RERANKER_API_KEY": "${JINA_API_KEY}",
        "CODEXLENS_RERANKER_API_MODEL": "jina-reranker-v2-base-multilingual"
      }
    }
  }
}

Multi-Endpoint Load Balancing

{
  "mcpServers": {
    "codexlens": {
      "command": "uvx",
      "args": ["--from", "codexlens-search[mcp]", "codexlens-mcp"],
      "env": {
        "CODEXLENS_EMBED_API_ENDPOINTS": "https://api1.example.com/v1|sk-key1|model,https://api2.example.com/v1|sk-key2|model",
        "CODEXLENS_EMBED_DIM": "1536"
      }
    }
  }
}

Format: url|key|model,url|key|model,...

Local Models (Offline, No API)

pip install codexlens-search[mcp]
codexlens-search download-models
{
  "mcpServers": {
    "codexlens": {
      "command": "codexlens-mcp",
      "env": {}
    }
  }
}

Pre-installed (no uvx)

{
  "mcpServers": {
    "codexlens": {
      "command": "codexlens-mcp",
      "env": {
        "CODEXLENS_EMBED_API_URL": "https://api.openai.com/v1",
        "CODEXLENS_EMBED_API_KEY": "${OPENAI_API_KEY}",
        "CODEXLENS_EMBED_API_MODEL": "text-embedding-3-small",
        "CODEXLENS_EMBED_DIM": "1536"
      }
    }
  }
}

CLI

codexlens-search --db-path .codexlens sync --root ./src
codexlens-search --db-path .codexlens search -q "auth handler" -k 10
codexlens-search --db-path .codexlens status
codexlens-search list-models
codexlens-search download-models

Environment Variables

Embedding

Variable Description Example
CODEXLENS_EMBED_API_URL Embedding API base URL https://api.openai.com/v1
CODEXLENS_EMBED_API_KEY API key sk-xxx
CODEXLENS_EMBED_API_MODEL Model name text-embedding-3-small
CODEXLENS_EMBED_API_ENDPOINTS Multi-endpoint: url|key|model,... See above
CODEXLENS_EMBED_DIM Vector dimension 1536

Reranker

Variable Description Example
CODEXLENS_RERANKER_API_URL Reranker API base URL https://api.jina.ai/v1
CODEXLENS_RERANKER_API_KEY API key jina-xxx
CODEXLENS_RERANKER_API_MODEL Model name jina-reranker-v2-base-multilingual

Tuning

Variable Default Description
CODEXLENS_BINARY_TOP_K 200 Binary coarse search candidates
CODEXLENS_ANN_TOP_K 50 ANN fine search candidates
CODEXLENS_FTS_TOP_K 50 FTS results per method
CODEXLENS_FUSION_K 60 RRF fusion k parameter
CODEXLENS_RERANKER_TOP_K 20 Results to rerank
CODEXLENS_INDEX_WORKERS 2 Parallel indexing workers
CODEXLENS_MAX_FILE_SIZE 1000000 Max file size in bytes

Architecture

Query → [Embedder] → query vector
         ├→ [BinaryStore] → candidates (Hamming)
         │     └→ [ANNIndex] → ranked IDs (cosine)
         ├→ [FTS exact] → exact matches
         └→ [FTS fuzzy] → fuzzy matches
              └→ [RRF Fusion] → merged ranking
                    └→ [Reranker] → final top-k

Development

git clone https://github.com/catlog22/codexlens-search.git
cd codexlens-search
pip install -e ".[dev]"
pytest

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

codexlens_search-0.3.0.tar.gz (57.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

codexlens_search-0.3.0-py3-none-any.whl (54.1 kB view details)

Uploaded Python 3

File details

Details for the file codexlens_search-0.3.0.tar.gz.

File metadata

  • Download URL: codexlens_search-0.3.0.tar.gz
  • Upload date:
  • Size: 57.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for codexlens_search-0.3.0.tar.gz
Algorithm Hash digest
SHA256 1f73b2e69057428d65221d2c3b0d73f84a7a2b1cec9a31b1f930b4d862969c20
MD5 23d9a82d3a4cea8f08d310ec15721593
BLAKE2b-256 616a742053d8aadaa764c99f8cb58a6fc07b0587adcbedf4e4733dfd69d5aa6a

See more details on using hashes here.

File details

Details for the file codexlens_search-0.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for codexlens_search-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ce7b423ec156faf7afba14bddb4532fef221f815ea3335f1de78564cf7e512e9
MD5 7f49d4dcfec4208e762bb4f39b3fb1f8
BLAKE2b-256 bad829f9b4ffe2816795f273fe3b4ee3f6bde541f57cbe971b9bd9fc6e189d5c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page