GPU-accelerated RAG module for vaultspec vault search
Project description
vaultspec-rag
Semantic search for your vaultspec vault and project codebase
vaultspec-rag adds GPU-accelerated search to projects managed by vaultspec-core. It indexes your .vault/ documents -- research notes, architecture decisions, plans, execution logs -- alongside your source code. Query both with natural language so your AI tools find relevant context on their own.
Getting started
Prerequisites
- Python 3.13 or later
- uv
- A CUDA GPU with at least 3 GB VRAM (mandatory -- no CPU fallback)
- vaultspec-core
Install
uv add vaultspec-rag
This pulls in vaultspec-core and all GPU dependencies.
Verify
vaultspec-rag --version
Index and search
vaultspec-rag indexes two sources: vault (.vault/ documents) and code (project source files).
vaultspec-rag index # both
vaultspec-rag index --type vault # vault only
vaultspec-rag index --type code # code only
vaultspec-rag search "architecture decision"
vaultspec-rag search --type code "error handling"
Using the MCP server
The Model Context Protocol (MCP) server gives AI assistants direct access to vault and codebase search. It runs in two transport modes with different project-resolution rules.
stdio mode -- one process per project. The MCP client launches vaultspec-search-mcp as a subprocess, scoped to a single workspace via VAULTSPEC_RAG_ROOT. Use this for Claude Desktop, Claude Code, and similar single-project AI tools.
{
"mcpServers": {
"vaultspec-rag": {
"command": "vaultspec-search-mcp",
"env": {
"VAULTSPEC_RAG_ROOT": "/path/to/your/project"
}
}
}
}
HTTP mode -- one daemon, many projects. Start vaultspec-rag server service start as a background daemon, then connect any MCP client to http://127.0.0.1:8766/mcp. The daemon has no default project; every tool call must include project_root. Use this to share one GPU-loaded service across workspaces.
See the MCP integration reference for the full tool list, both modes' contracts, and choosing between them.
Further reading
| Guide | What it covers |
|---|---|
| Usage modes | Ad-hoc vs. service operation |
| CLI commands | Command tree, flags, --port fast path |
| Configuration | Precedence, environment variables, .vaultragignore |
| Service management | Background daemon, health endpoint, model warmup |
| Python API | Facade functions for programmatic use |
| Architecture overview | Access layers, GPU lifecycle, multi-project support |
| Models | Embedding stack and model cards |
Getting help
Open an issue on GitHub.
Contributing and license
Contributions welcome -- bug reports, feature ideas, or pull requests. vaultspec-rag uses the MIT License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vaultspec_rag-0.2.1.tar.gz.
File metadata
- Download URL: vaultspec_rag-0.2.1.tar.gz
- Upload date:
- Size: 777.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bd853e9377c08c2c2b93da67bc97e45325a9761d4ccb7e9ed738a4952a1b65d9
|
|
| MD5 |
a1c4c8d946b1591d6f5266ffbd8cfcb6
|
|
| BLAKE2b-256 |
11d5e5a8fef5dbf28730e852d278d554c01c4d9ae72cd80fda4566de7c4632cb
|
File details
Details for the file vaultspec_rag-0.2.1-py3-none-any.whl.
File metadata
- Download URL: vaultspec_rag-0.2.1-py3-none-any.whl
- Upload date:
- Size: 196.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1f22e8e81af25c94e69f2c47cf1171767fa82dd9d7f2b72fb2476dd65d69bb6a
|
|
| MD5 |
4989658c1be027062728e004a4d3064d
|
|
| BLAKE2b-256 |
6cc9250a4b0d0443978363378f7f1435a99ef88f1f411edcade4835dd913661d
|