Skip to main content

Code indexing for better AI code generation

Project description

Helix Kodit Logo

Kodit: A Code Indexing MCP Server

Kodit connects your AI coding assistant to external codebases to provide accurate and up-to-date snippets of code.

Documentation License Discussions

:star: Help us reach more developers and grow the Helix community. Star this repo!

Helix Kodit is an MCP server that connects your AI coding assistant to external codebases. It can:

  • Improve your AI-assisted code by providing canonical examples direct from the source
  • Index local and public codebases
  • Integrates with any AI coding assistant via MCP
  • Search using keyword and semantic search
  • Integrate with any OpenAI-compatible or custom API/model

If you're an engineer working with AI-powered coding assistants, Kodit helps by providing relevant and up-to-date examples of your task so that LLMs make less mistakes and produce fewer hallucinations.

Features

Codebase Indexing

Kodit connects to a variety of local and remote codebases to build an index of your code. This index is used to build a snippet library, ready for ingestion into an LLM.

  • Index local directories and public Git repositories
  • Build comprehensive snippet libraries for LLM ingestion
  • Support for 20+ programming languages including Python, JavaScript/TypeScript, Java, Go, Rust, C/C++, C#, HTML/CSS, and more
  • Advanced code analysis with dependency tracking and call graph generation
  • Intelligent snippet extraction with context-aware dependencies
  • Efficient indexing with selective reindexing (only processes modified files)
  • Privacy first: respects .gitignore and .noindex files
  • NEW in 0.3: Auto-indexing configuration for shared server deployments
  • NEW in 0.3: Enhanced Git provider support including Azure DevOps
  • NEW in 0.3: Index private repositories via a PAT
  • NEW in 0.3: Improved progress monitoring and reporting during indexing
  • NEW in 0.3: Advanced code slicing infrastructure with Tree-sitter parsing
  • NEW in 0.4: Automatic periodic sync to keep indexes up-to-date

MCP Server

Relevant snippets are exposed to an AI coding assistant via an MCP server. This allows the assistant to request relevant snippets by providing keywords, code, and semantic intent. Kodit has been tested to work well with:

  • Seamless integration with popular AI coding assistants
  • Tested and verified with:
  • Please contribute more instructions! ... any other assistant is likely to work ...
  • New in 0.3: Advanced search filters by source, language, author, date range, and file path
  • New in 0.3: Hybrid search combining BM25 keyword search with semantic search
  • New in 0.4: Enhanced MCP tools with rich context parameters and metadata

Hosted MCP Server

New in 0.4: Try Kodit instantly with our hosted MCP server at https://kodit.helix.ml/mcp! No installation required - just add it to your AI coding assistant and start searching popular codebases immediately.

The hosted server provides:

  • Pre-indexed popular open source repositories
  • Zero configuration - works out of the box
  • Same powerful search capabilities as self-hosted Kodit
  • Perfect for trying Kodit before setting up your own instance

Find out more in the hosted Kodit documentation.

Enterprise Ready

Out of the box, Kodit works with a local SQLite database and very small, local models. But enterprises can scale out with performant databases and dedicated models. Everything can even run securely, privately, with on-premise LLM platforms like Helix.

Supported databases:

Supported providers:

  • Local (which uses tiny CPU-only open-source models)
  • OpenAI
  • Secure, private LLM enclave with Helix.
  • Any other OpenAI compatible API

NEW in 0.3: Enhanced deployment options:

  • Docker Compose configurations with VectorChord
  • Kubernetes manifests for production deployments

Quick Start

  1. Install Kodit
  2. Index codebases
  3. Integrate with your coding assistant

Documentation

Roadmap

The roadmap is currently maintained as a Github Project.

💬 Support

For commercial support, please contact Helix.ML. To ask a question, please open a discussion.

License

Apache 2.0 © 2025 HelixML, Inc.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kodit-0.5.12.tar.gz (454.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kodit-0.5.12-py3-none-any.whl (252.3 kB view details)

Uploaded Python 3

File details

Details for the file kodit-0.5.12.tar.gz.

File metadata

  • Download URL: kodit-0.5.12.tar.gz
  • Upload date:
  • Size: 454.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.10 {"installer":{"name":"uv","version":"0.9.10"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for kodit-0.5.12.tar.gz
Algorithm Hash digest
SHA256 20c1c3141e59286d77a3437ff0db1b4fd692312a9cdffd116f1c1554772023b5
MD5 158219cc0b95fe763f1f633ae5cd86bb
BLAKE2b-256 231511ff9f6ae7df57f3bb00ec503443137d9fc34d4f4fc5aa6e99490e8ec14e

See more details on using hashes here.

File details

Details for the file kodit-0.5.12-py3-none-any.whl.

File metadata

  • Download URL: kodit-0.5.12-py3-none-any.whl
  • Upload date:
  • Size: 252.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.10 {"installer":{"name":"uv","version":"0.9.10"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for kodit-0.5.12-py3-none-any.whl
Algorithm Hash digest
SHA256 e8ed54a2eaefe3810535891b02366731b0df60eb9f386778fd51abc165286a23
MD5 f9526583ab48aee9ff607b6885727d64
BLAKE2b-256 2cb5af18e4f6077dafedd347865ac526d0f6a0fbf16146ae6eba29f148b8dc86

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page