Skip to main content

Local-first codebase intelligence for AI assistants via MCP

Project description

ChunkHound

Local-first codebase intelligence

Tests License: MIT 100% AI Generated Discord

Your AI assistant searches code but doesn't understand it. ChunkHound researches your codebase—extracting architecture, patterns, and institutional knowledge at any scale. Integrates via MCP.

Features

  • cAST Algorithm - Research-backed semantic code chunking
  • Multi-Hop Semantic Search - Discovers interconnected code relationships beyond direct matches
  • Semantic search - Natural language queries like "find authentication code"
  • Regex search - Pattern matching without API keys
  • Local-first - Your code stays on your machine
  • 32 languages with structured parsing
    • Programming (via Tree-sitter): Python, JavaScript, TypeScript, JSX, TSX, Java, Kotlin, Groovy, C, C++, C#, Go, Rust, Haskell, Swift, Bash, MATLAB, Makefile, Objective-C, PHP, Dart, Lua, Vue, Svelte, Zig
    • Configuration: JSON, YAML, TOML, HCL, Markdown
    • Text-based (custom parsers): Text files, PDF
  • MCP integration - Works with Claude, VS Code, Cursor, Windsurf, Zed, etc
  • Real-time indexing - Automatic file watching, smart diffs, seamless branch switching

Documentation

Visit chunkhound.github.io for complete guides:

Requirements

Installation

# Install uv if needed
curl -LsSf https://astral.sh/uv/install.sh | sh

# Install ChunkHound
uv tool install chunkhound

Quick Start

  1. Create .chunkhound.json in project root
{
  "embedding": {
    "provider": "voyageai",
    "api_key": "your-voyageai-key"
  },
  "llm": {
    "provider": "claude-code-cli"
  }
}

Note: Use "codex-cli" instead if you prefer Codex. Both work equally well and require no API key.

  1. Index your codebase
chunkhound index

For configuration, IDE setup, and advanced usage, see the documentation.

Why ChunkHound?

Approach Capability Scale Maintenance
Keyword Search Exact matching Fast None
Traditional RAG Semantic search Scales Re-index files
Knowledge Graphs Relationship queries Expensive Continuous sync
ChunkHound Semantic + Regex + Code Research Automatic Incremental + realtime

Ideal for:

  • Large monorepos with cross-team dependencies
  • Security-sensitive codebases (local-only, no cloud)
  • Multi-language projects needing consistent search
  • Offline/air-gapped development environments

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chunkhound-4.1.0a1.tar.gz (1.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

chunkhound-4.1.0a1-py3-none-any.whl (1.3 MB view details)

Uploaded Python 3

File details

Details for the file chunkhound-4.1.0a1.tar.gz.

File metadata

  • Download URL: chunkhound-4.1.0a1.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for chunkhound-4.1.0a1.tar.gz
Algorithm Hash digest
SHA256 d118a344a4772b551be93905c67db81c5fe4b75cfa43b115490d202ae9a0ca11
MD5 3a8161401f7a53cdf5121693fb744577
BLAKE2b-256 73c9f09a416507f81ce83284fac963ff1ed23cb3701597de21ca3967d59218a1

See more details on using hashes here.

Provenance

The following attestation bundles were made for chunkhound-4.1.0a1.tar.gz:

Publisher: release-rc.yml on chunkhound/chunkhound

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file chunkhound-4.1.0a1-py3-none-any.whl.

File metadata

  • Download URL: chunkhound-4.1.0a1-py3-none-any.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for chunkhound-4.1.0a1-py3-none-any.whl
Algorithm Hash digest
SHA256 eb1b614be39873b45fafca57ebd3852acf28ce0ca7074a09f15d59ad6f735536
MD5 77281ee6afebe9737449a710cc8b2553
BLAKE2b-256 6c5a5463a843ed044fa5ecd0525eb85bfd321a4f216795e68d3d6dd9eb361bfc

See more details on using hashes here.

Provenance

The following attestation bundles were made for chunkhound-4.1.0a1-py3-none-any.whl:

Publisher: release-rc.yml on chunkhound/chunkhound

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page