Skip to main content

GPU-accelerated semantic search for your docs and source code: hybrid dense + sparse RAG on a local Qdrant backend, served to Claude Code and other MCP clients.

Project description

vaultspec-rag logo

vaultspec-rag - semantic search for a vaultspec-core workspace

PyPI version Python 3.13+ License: MIT

vaultspec-rag is the semantic search companion to vaultspec-core. The RAG in the name stands for retrieval-augmented generation. That's the pattern of pulling relevant snippets out of your own files so an agent can answer with grounded context. It indexes the markdown documents in your vault and the source code that sits beside them, then lets you search them by meaning rather than by exact keyword match. Search by meaning closes the vocabulary gap. A query for "how do we handle file locks during indexing" finds a decision record about concurrent writes and per-root locks, even though it never uses the word "indexing."

Requirements

  • Python 3.13 or newer
  • uv as the package manager
  • An NVIDIA GPU with CUDA support
  • About 3 GB of free GPU memory
  • Linux or Windows

macOS, AMD GPUs, and Apple Silicon are not supported. For the reasoning behind the hardware floor, see the architecture overview.

Quickstart

uv add vaultspec-rag
uv run vaultspec-rag install
uv sync
uv run vaultspec-rag index
uv run vaultspec-rag search "your question here"

install configures the GPU PyTorch build and provisions the search models and the managed Qdrant server. uv sync then fetches the GPU PyTorch build. The first run is slower because of one-time model downloads.

For repeat use, start the server-backed service first - see the getting started guide for the full walkthrough.

What's a vault?

A vault is a .vault/ directory of markdown files - research notes, architecture decision records, plans, and execution logs - that vaultspec-core creates and manages. If you don't have one yet, set one up there first.

vaultspec-rag adds exactly one capability on top of that: semantic search over the vault and the source code beside it. Vault creation, document templates, frontmatter validation, and the spec-driven workflow all stay in vaultspec-core.

Both packages live side by side in the same project. You can use vaultspec-core on its own without ever installing vaultspec-rag. vaultspec-rag without vaultspec-core has nothing to search.

Documentation

Getting started

  • Getting started - install, index, and run your first query end to end.
  • Installation - the GPU build, dependency provisioning, and recovery steps.

Daily use

Reference

Concepts

  • Architecture - how it works, why a GPU is required, and why the service is server-first.
  • Indexing - indexing and retrieval internals.

Support and help

File bugs and ask questions on the GitHub issue tracker.

A useful bug report includes your vaultspec-rag version, your operating system, your GPU model, the exact command you ran, and the full stderr output.

What changed

See the changelog for release notes and version history.

License

vaultspec-rag is released under the MIT License. See LICENSE for the full text.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vaultspec_rag-0.2.23.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vaultspec_rag-0.2.23-py3-none-any.whl (694.0 kB view details)

Uploaded Python 3

File details

Details for the file vaultspec_rag-0.2.23.tar.gz.

File metadata

  • Download URL: vaultspec_rag-0.2.23.tar.gz
  • Upload date:
  • Size: 2.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.23 {"installer":{"name":"uv","version":"0.11.23","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for vaultspec_rag-0.2.23.tar.gz
Algorithm Hash digest
SHA256 fd9750dc333272447f1503334169d44ec6a2a5cb2891ec97ee5a08be1d0a7a3d
MD5 f55d0e88bae6369476216a418f9260e7
BLAKE2b-256 bca30b3edbd25df1dcdaf78819f53dbf8a6c3d92e9e4ac79ca84cc65b62677c3

See more details on using hashes here.

File details

Details for the file vaultspec_rag-0.2.23-py3-none-any.whl.

File metadata

  • Download URL: vaultspec_rag-0.2.23-py3-none-any.whl
  • Upload date:
  • Size: 694.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.23 {"installer":{"name":"uv","version":"0.11.23","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for vaultspec_rag-0.2.23-py3-none-any.whl
Algorithm Hash digest
SHA256 b7b16a3368e53109f77f827f7f847a6a63db02c8c3d351dc53cfc4fe80660aad
MD5 24530dafecc4f5f71c0f8937b5515f06
BLAKE2b-256 e3d99f8a9b6366917a21e9d44afb73ed6d438c08f40fd6ed53c9196a43d19f83

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page