Local-first codebase intelligence for AI assistants via MCP
Project description
Local-first codebase intelligence
Your AI assistant searches code but doesn't understand it. ChunkHound researches your codebase—extracting architecture, patterns, and institutional knowledge at any scale. Integrates via MCP.
Features
- cAST Algorithm - Research-backed semantic code chunking
- Multi-Hop Semantic Search - Discovers interconnected code relationships beyond direct matches
- Semantic search - Natural language queries like "find authentication code"
- Regex search - Pattern matching without API keys
- Local-first - Your code stays on your machine
- 32 languages with structured parsing
- Programming (via Tree-sitter): Python, JavaScript, TypeScript, JSX, TSX, Java, Kotlin, Groovy, C, C++, C#, Go, Rust, Haskell, Swift, Bash, MATLAB, Makefile, Objective-C, PHP, Dart, Lua, Vue, Svelte, Zig
- Configuration: JSON, YAML, TOML, HCL, Markdown
- Text-based (custom parsers): Text files, PDF
- MCP integration - Works with Claude, VS Code, Cursor, Windsurf, Zed, etc
- Real-time indexing - Automatic file watching, smart diffs, seamless branch switching
Documentation
Visit chunkhound.github.io for complete guides:
Requirements
- Python 3.10+
- uv package manager
- API keys (optional - regex search works without any keys)
- Embeddings: VoyageAI (recommended) | OpenAI | Local with Ollama
- LLM (for Code Research): Claude Code CLI or Codex CLI (no API key needed) | Anthropic | OpenAI | Grok (xAI)
Installation
# Install uv if needed
curl -LsSf https://astral.sh/uv/install.sh | sh
# Install ChunkHound
uv tool install chunkhound
Quick Start
- Create
.chunkhound.jsonin project root
{
"embedding": {
"provider": "voyageai",
"api_key": "your-voyageai-key"
},
"llm": {
"provider": "claude-code-cli"
}
}
Note: Use
"codex-cli"instead if you prefer Codex. Both work equally well and require no API key.
- Index your codebase
chunkhound index
For configuration, IDE setup, and advanced usage, see the documentation.
Why ChunkHound?
| Approach | Capability | Scale | Maintenance |
|---|---|---|---|
| Keyword Search | Exact matching | Fast | None |
| Traditional RAG | Semantic search | Scales | Re-index files |
| Knowledge Graphs | Relationship queries | Expensive | Continuous sync |
| ChunkHound | Semantic + Regex + Code Research | Automatic | Incremental + realtime |
Ideal for:
- Large monorepos with cross-team dependencies
- Security-sensitive codebases (local-only, no cloud)
- Multi-language projects needing consistent search
- Offline/air-gapped development environments
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file chunkhound-4.1.0a1.tar.gz.
File metadata
- Download URL: chunkhound-4.1.0a1.tar.gz
- Upload date:
- Size: 1.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d118a344a4772b551be93905c67db81c5fe4b75cfa43b115490d202ae9a0ca11
|
|
| MD5 |
3a8161401f7a53cdf5121693fb744577
|
|
| BLAKE2b-256 |
73c9f09a416507f81ce83284fac963ff1ed23cb3701597de21ca3967d59218a1
|
Provenance
The following attestation bundles were made for chunkhound-4.1.0a1.tar.gz:
Publisher:
release-rc.yml on chunkhound/chunkhound
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
chunkhound-4.1.0a1.tar.gz -
Subject digest:
d118a344a4772b551be93905c67db81c5fe4b75cfa43b115490d202ae9a0ca11 - Sigstore transparency entry: 1307808611
- Sigstore integration time:
-
Permalink:
chunkhound/chunkhound@ba812a720550c31b681fa46ee2dfab2dcb17b805 -
Branch / Tag:
refs/tags/v4.1.0a1 - Owner: https://github.com/chunkhound
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release-rc.yml@ba812a720550c31b681fa46ee2dfab2dcb17b805 -
Trigger Event:
push
-
Statement type:
File details
Details for the file chunkhound-4.1.0a1-py3-none-any.whl.
File metadata
- Download URL: chunkhound-4.1.0a1-py3-none-any.whl
- Upload date:
- Size: 1.3 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
eb1b614be39873b45fafca57ebd3852acf28ce0ca7074a09f15d59ad6f735536
|
|
| MD5 |
77281ee6afebe9737449a710cc8b2553
|
|
| BLAKE2b-256 |
6c5a5463a843ed044fa5ecd0525eb85bfd321a4f216795e68d3d6dd9eb361bfc
|
Provenance
The following attestation bundles were made for chunkhound-4.1.0a1-py3-none-any.whl:
Publisher:
release-rc.yml on chunkhound/chunkhound
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
chunkhound-4.1.0a1-py3-none-any.whl -
Subject digest:
eb1b614be39873b45fafca57ebd3852acf28ce0ca7074a09f15d59ad6f735536 - Sigstore transparency entry: 1307808735
- Sigstore integration time:
-
Permalink:
chunkhound/chunkhound@ba812a720550c31b681fa46ee2dfab2dcb17b805 -
Branch / Tag:
refs/tags/v4.1.0a1 - Owner: https://github.com/chunkhound
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release-rc.yml@ba812a720550c31b681fa46ee2dfab2dcb17b805 -
Trigger Event:
push
-
Statement type: