GPU-accelerated semantic search for your docs and source code: hybrid dense + sparse RAG on a local Qdrant backend, served to Claude Code and other MCP clients.

These details have not been verified by PyPI

Project description

vaultspec-rag logo

vaultspec-rag: semantic search for a vaultspec-core workspace

A vaultspec-core project accumulates a durable record of decisions, plans, research, and the code they produced. vaultspec-rag searches that record and your source code by meaning, not by keyword.

Search "file lock concurrent write per-root" and vaultspec-rag surfaces the decision that governs it, even when the document never uses those exact words. It is the retrieval layer of the project: it finds and ranks the grounding, and a client such as an AI assistant reads it.

The architecture overview explains how it works; the glossary defines the terms used across the docs.

Requirements

Before you install, confirm your machine meets these minimum requirements:

Python 3.13 or newer
uv as the package manager
An NVIDIA GPU with CUDA support
About 3 GB of free GPU memory
Linux or Windows

macOS, AMD GPUs, and Apple Silicon are not supported. The architecture overview explains why the hardware floor sits where it does.

Quickstart

Install

Add vaultspec-rag to your project and set it up:

uv add vaultspec-rag
uv run vaultspec-rag install
uv sync

install configures the GPU PyTorch build, downloads the search models, and provisions the managed search server. uv sync then pulls in that GPU build. The models total a few gigabytes, so the first download takes several minutes, but it runs only once.

Index and search

Start the server:
```
uv run vaultspec-rag server start
```
Index your project:
```
uv run vaultspec-rag index
```

Search:

uv run vaultspec-rag search "concept plus the domain terms"

The first run builds the index. After that, the running service watches your files and reindexes changes automatically, so the index stays current without another command. See the getting started guide for the full walkthrough.

Searching by meaning

The index is hybrid. A semantic half matches concepts and a keyword half matches exact terms, so write your query as a short phrase that both describes the concept and names the domain terms the target text would use. Pure prose starves the keyword half.

uv run vaultspec-rag search "reentrant collection lock ordering during indexing"

1. adr/2026-06-12-service-concurrency-adr
   adr | feature: service-concurrency | status: accepted | 2026-06-12
   Store-layer locking distinguishes local mode (one reentrant lock per
   collection) from server mode (no point locks). The lifecycle lock is
   always acquired before any collection lock.
2. reference/2026-06-12-service-concurrency-reference
   reference | feature: service-concurrency | 2026-06-12
   Per-root locks keep concurrent writers from colliding on the shared store.

Each result is a rank, a location you can open, and the matching text. Vault hits carry a metadata line, so a superseded ADR shows as superseded before you read it.

Searching code and filtering

Search code with --type code, and narrow with filters including --language, --path, and a symbol name. Add --scores to see the relevance number beside each rank:

uv run vaultspec-rag search "gpu section wrapping the reranker predict forward pass" --type code --language python --scores

1. src/vaultspec_rag/search/_searcher.py:308 (score 0.9241)
   # The GPU lock wraps only the model forward call; the
   # score-to-float conversion below runs after release.
   with self._gpu_section(timings):
       while True:
           try:
               raw_scores = reranker.predict(

For the full filter set (path globs, document type, feature, date), see search and index.

Documentation

Getting started

Getting started - install, index, and run your first query end to end.
Installation - the GPU build, dependency provisioning, and recovery steps.

Daily use

Search and index - run searches and refresh the index.
Service mode - keep models warm in a background service for faster queries.
Backends - the managed Qdrant server versus local-only mode.
MCP integration - wire search into Claude Code and other MCP clients.
Automation - JSON output and scripting.
Preprocessing hooks - index PDFs, spreadsheets, and other formats.

Reference

CLI reference - every command and flag.
Configuration - settings, environment variables, and defaults.
Service discovery - the service.json contract for integrators.
Glossary - terms used across the docs.

Concepts

Architecture - how it works, why a GPU is required, and the server and local-only modes.
Indexing - indexing and retrieval internals.

Support and help

File bugs and ask questions on the GitHub issue tracker.

A good bug report carries five things: your vaultspec-rag version, your operating system, your GPU model, the exact command you ran, and the full stderr output. With those, a maintainer can reproduce the fault. Without them, the report is hard to act on.

Changelog and license

The changelog holds release notes and version history.

vaultspec-rag is released under the MIT License. See LICENSE for the full text.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.2.25

Jun 25, 2026

0.2.24

Jun 24, 2026

0.2.23

Jun 21, 2026

0.2.22

Jun 21, 2026

0.2.21

Jun 13, 2026

0.2.20

Jun 11, 2026

0.2.19

Jun 10, 2026

0.2.18

Jun 10, 2026

0.2.17

Jun 10, 2026

0.2.16

Jun 5, 2026

0.2.15

Jun 4, 2026

0.2.14

Jun 3, 2026

0.2.13

Jun 2, 2026

0.2.12

Jun 1, 2026

0.2.11

Jun 1, 2026

0.2.10

May 31, 2026

0.2.9

May 31, 2026

0.2.8

May 3, 2026

0.2.7

May 3, 2026

0.2.6

Apr 28, 2026

0.2.5

Apr 27, 2026

0.2.4

Apr 27, 2026

0.2.3

Apr 24, 2026

0.2.1

Apr 12, 2026

0.2.0a0 pre-release

Apr 12, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vaultspec_rag-0.2.25.tar.gz (2.3 MB view details)

Uploaded Jun 25, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vaultspec_rag-0.2.25-py3-none-any.whl (777.4 kB view details)

Uploaded Jun 25, 2026 Python 3

File details

Details for the file vaultspec_rag-0.2.25.tar.gz.

File metadata

Download URL: vaultspec_rag-0.2.25.tar.gz
Upload date: Jun 25, 2026
Size: 2.3 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.24 {"installer":{"name":"uv","version":"0.11.24","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for vaultspec_rag-0.2.25.tar.gz
Algorithm	Hash digest
SHA256	`e9152c5ade6415c4ef49cbb2f8e4159e3c723cae553f42f921abc1ae30efc258`
MD5	`9de90d4e66af1f3c6c192603e366c3ff`
BLAKE2b-256	`81ace249e3d977dabd71080ff47ec9dce5ad17386e170b31d96298fdda5ddee6`

See more details on using hashes here.

File details

Details for the file vaultspec_rag-0.2.25-py3-none-any.whl.

File metadata

Download URL: vaultspec_rag-0.2.25-py3-none-any.whl
Upload date: Jun 25, 2026
Size: 777.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.24 {"installer":{"name":"uv","version":"0.11.24","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for vaultspec_rag-0.2.25-py3-none-any.whl
Algorithm	Hash digest
SHA256	`24f11488f67fecd1f6a80fee8cd9f8bb81e46dfb5934d4cd9135b803173a65f7`
MD5	`add077923c2c7dd155632abbab119a0b`
BLAKE2b-256	`89c86eee35d6782636f2d2a6dceaed9a44f6e5cceb51180fdf59963bf170669e`

See more details on using hashes here.

vaultspec-rag 0.2.25

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

vaultspec-rag: semantic search for a vaultspec-core workspace

Requirements

Quickstart

Install

Index and search

Searching by meaning

Searching code and filtering

Documentation

Getting started

Daily use

Reference

Concepts

Support and help

Changelog and license

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes