Skip to main content

Code indexing for better AI code generation

Project description

Helix Kodit Logo

Kodit: A Code Indexing MCP Server

Kodit connects your AI coding assistant to external codebases to provide accurate and up-to-date snippets of code.

Documentation License Discussions

:star: Help us reach more developers and grow the Helix community. Star this repo!

Helix Kodit is an MCP server that connects your AI coding assistant to external codebases. It can:

  • Improve your AI-assisted code by providing canonical examples direct from the source
  • Index local and public codebases
  • Integrates with any AI coding assistant via MCP
  • Search using keyword and semantic search
  • Integrate with any OpenAI-compatible or custom API/model

If you're an engineer working with AI-powered coding assistants, Kodit helps by providing relevant and up-to-date examples of your task so that LLMs make less mistakes and produce fewer hallucinations.

Features

Codebase Indexing

Kodit connects to a variety of local and remote codebases to build an index of your code. This index is used to build a snippet library, ready for ingestion into an LLM.

  • Index local directories and public Git repositories
  • Build comprehensive snippet libraries for LLM ingestion
  • Support for 20+ programming languages including Python, JavaScript/TypeScript, Java, Go, Rust, C/C++, C#, HTML/CSS, and more
  • Advanced code analysis with dependency tracking and call graph generation
  • Intelligent snippet extraction with context-aware dependencies
  • Efficient indexing with selective reindexing (only processes modified files)
  • Privacy first: respects .gitignore and .noindex files
  • NEW in 0.3: Auto-indexing configuration for shared server deployments
  • NEW in 0.3: Enhanced Git provider support including Azure DevOps
  • NEW in 0.3: Index private repositories via a PAT
  • NEW in 0.3: Improved progress monitoring and reporting during indexing
  • NEW in 0.3: Advanced code slicing infrastructure with Tree-sitter parsing
  • NEW in 0.4: Automatic periodic sync to keep indexes up-to-date

MCP Server

Relevant snippets are exposed to an AI coding assistant via an MCP server. This allows the assistant to request relevant snippets by providing keywords, code, and semantic intent. Kodit has been tested to work well with:

  • Seamless integration with popular AI coding assistants
  • Tested and verified with:
  • Please contribute more instructions! ... any other assistant is likely to work ...
  • New in 0.3: Advanced search filters by source, language, author, date range, and file path
  • New in 0.3: Hybrid search combining BM25 keyword search with semantic search
  • New in 0.4: Enhanced MCP tools with rich context parameters and metadata

Hosted MCP Server

New in 0.4: Try Kodit instantly with our hosted MCP server at https://kodit.helix.ml/mcp! No installation required - just add it to your AI coding assistant and start searching popular codebases immediately.

The hosted server provides:

  • Pre-indexed popular open source repositories
  • Zero configuration - works out of the box
  • Same powerful search capabilities as self-hosted Kodit
  • Perfect for trying Kodit before setting up your own instance

Find out more in the hosted Kodit documentation.

Enterprise Ready

Out of the box, Kodit works with a local SQLite database and very small, local models. But enterprises can scale out with performant databases and dedicated models. Everything can even run securely, privately, with on-premise LLM platforms like Helix.

Supported databases:

Supported providers:

  • Local (which uses tiny CPU-only open-source models)
  • OpenAI
  • Secure, private LLM enclave with Helix.
  • Any other OpenAI compatible API

NEW in 0.3: Enhanced deployment options:

  • Docker Compose configurations with VectorChord
  • Kubernetes manifests for production deployments

Quick Start

  1. Install Kodit
  2. Index codebases
  3. Integrate with your coding assistant

Documentation

Roadmap

The roadmap is currently maintained as a Github Project.

💬 Support

For commercial support, please contact Helix.ML. To ask a question, please open a discussion.

License

Apache 2.0 © 2025 HelixML, Inc.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kodit-0.5.15.tar.gz (480.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kodit-0.5.15-py3-none-any.whl (264.2 kB view details)

Uploaded Python 3

File details

Details for the file kodit-0.5.15.tar.gz.

File metadata

  • Download URL: kodit-0.5.15.tar.gz
  • Upload date:
  • Size: 480.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.11 {"installer":{"name":"uv","version":"0.9.11"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for kodit-0.5.15.tar.gz
Algorithm Hash digest
SHA256 35e70133ba8703566f322d6fa61b936a12e1f3ac8819558a5ebe63e5e849cf87
MD5 ea08c35d4abcc3085c8fa6996d1151e5
BLAKE2b-256 49f49cbc0230612a1dc629e2381705f0e01bdc480aa2660b5e5108e62368c1ba

See more details on using hashes here.

File details

Details for the file kodit-0.5.15-py3-none-any.whl.

File metadata

  • Download URL: kodit-0.5.15-py3-none-any.whl
  • Upload date:
  • Size: 264.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.11 {"installer":{"name":"uv","version":"0.9.11"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for kodit-0.5.15-py3-none-any.whl
Algorithm Hash digest
SHA256 6855bb3fbc5c039a74073566696251d8e484df3bbc7bb0674fd23ef578a87413
MD5 aa003b8b7bdaf2808348b7ff346b3e18
BLAKE2b-256 a6b4e6da9ad6ee81e9898a5e1464fbb2da4c7dda0eac6fcf5f935f80cb7a2990

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page