Skip to main content

A vector-powered CLI for semantic search over files.

Project description

Vexor

Vexor

Python PyPI CI Codecov License


Vexor is a vector-powered CLI and desktop app for semantic file search. It uses configurable embedding models and ranks results by cosine similarity.

GUI

Vexor Demo Video

Why Vexor?

When you remember what a file does but forget its name or location, Vexor finds it instantly—no grep patterns or directory traversal needed.

Designed for both humans and AI coding assistants, enabling semantic file discovery in autonomous agent workflows.

Install

Download standalone binary from releases (no Python required), or:

pip install vexor  # also works with pipx, uv

Quick Start

0. Guided Setup (Recommended)

vexor init

The wizard also runs automatically on first use when no config exists.

1. Configure API Key

vexor config --set-api-key "YOUR_KEY"

Or via environment: VEXOR_API_KEY, OPENAI_API_KEY, or GOOGLE_GENAI_API_KEY.

2. Search

vexor "api client config"  # defaults to search
vexor search "api client config"  # searches current directory
# or explicit path:
vexor search "api client config" --path ~/projects/demo --top 5

Vexor auto-indexes on first search. Example output:

Vexor semantic file search results
──────────────────────────────────
#   Similarity   File path                       Lines   Preview
1   0.923        ./src/config_loader.py          -       config loader entrypoint
2   0.871        ./src/utils/config_parse.py     -       parse config helpers
3   0.809        ./tests/test_config_loader.py   -       tests for config loader

3. Explicit Index (Optional)

vexor index  # indexes current directory
# or explicit path:
vexor index --path ~/projects/demo --mode code

Useful for CI warmup or when auto_index is disabled.

Python API

Vexor can also be imported and used directly from Python:

from vexor import index, search

index(path=".", mode="head")
response = search("config loader", path=".", mode="name")

for hit in response.results:
    print(hit.path, hit.score)

By default it reads ~/.vexor/config.json. To ignore config and pass everything explicitly, set use_config=False.

Configuration

vexor config --set-provider openai          # default; also supports gemini/custom/local
vexor config --set-model text-embedding-3-small
vexor config --set-batch-size 0             # 0 = single request
vexor config --set-embed-concurrency 2       # parallel embedding requests
vexor config --set-auto-index true          # auto-index before search (default)
vexor config --rerank bm25                  # optional BM25 rerank for top-k results
vexor config --rerank flashrank             # FlashRank rerank (requires optional extra)
vexor config --rerank remote                # remote rerank via HTTP endpoint
vexor config --set-flashrank-model ms-marco-MultiBERT-L-12  # multilingual model
vexor config --set-flashrank-model          # reset FlashRank model to default
vexor config --clear-flashrank              # remove cached FlashRank models
vexor config --set-remote-rerank-url https://proxy.example.com/v1/rerank
vexor config --set-remote-rerank-model bge-reranker-v2-m3
vexor config --set-remote-rerank-api-key $VEXOR_REMOTE_RERANK_API_KEY  # or env var
vexor config --clear-remote-rerank          # clear remote rerank config
vexor config --set-base-url https://proxy.example.com  # optional proxy
vexor config --clear-base-url               # reset to official endpoint
vexor config --show                         # view current settings

Rerank defaults to off. It is highly recommended to configure the Reranker in advance to improve search accuracy. FlashRank requires pip install "vexor[flashrank]" and caches models under ~/.vexor/flashrank.

Config stored in ~/.vexor/config.json.

Rerank

Rerank reorders the semantic results with a secondary ranker. It uses 2x the requested --top as candidates (e.g., top 10 reranked to show 5).

Recommended defaults:

  • Keep off unless you want extra precision.
  • Use bm25 for lightweight lexical boosts; it is fast and lightweight.
  • BM25 uses a multilingual tokenizer (Bert pre-tokenizer), so it can handle CJK better.
  • Use flashrank for stronger reranking (requires pip install "vexor[flashrank]" and downloads a model to ~/.vexor/flashrank).
  • Use remote to call a hosted reranker that accepts {model, query, documents} and returns ranked indexes.
  • For Chinese or multi-language content, set --set-flashrank-model ms-marco-MultiBERT-L-12.
  • If unset, FlashRank defaults to ms-marco-TinyBERT-L-2-v2.

Providers: Remote vs Local

Vexor supports both remote API providers (openai, gemini, custom) and a local provider (local):

  • Remote providers use api_key and optional base_url.
  • custom is OpenAI-compatible and requires both model and base_url.
  • Local provider ignores api_key/base_url and only uses model plus local_cuda (CPU/GPU switch).

Local Model (Offline)

Install the lightweight local backend:

pip install "vexor[local]"

GPU backend (requires CUDA drivers):

pip install "vexor[local-cuda]"

Download a local embedding model and auto-configure Vexor:

vexor local --setup --model intfloat/multilingual-e5-small

Then use vexor search / vexor index as usual.

Local models are stored in ~/.vexor/models (clear with vexor local --clean-up).

GPU (optional): install onnxruntime-gpu (or vexor[local-cuda]) and use vexor local --setup --cuda (or vexor local --cuda). Switch back with vexor local --cpu.

Index Modes

Control embedding granularity with --mode:

Mode Description
auto Default. Smart routing: Python/JS/TS → code, Markdown → outline, small files → full, large files → head
name Embed filename only (fastest, zero content reads)
head Extract first snippet for lightweight semantic context
brief Extract high-frequency keywords from PRDs/requirements docs
full Chunk entire content; long documents searchable end-to-end
code AST-aware chunking by module/class/function boundaries for Python and JavaScript/TypeScript; other files fall back to full
outline Chunk Markdown by heading hierarchy with breadcrumbs; non-.md falls back to full

Cache Behavior

Index cache keys derive from: --path, --mode, --include-hidden, --no-recursive, --no-respect-gitignore, --ext, --exclude-pattern.

Keep flags consistent to reuse cache; changing flags creates a separate index.

vexor config --show-index-all    # list all cached indexes
vexor config --clear-index-all   # clear all cached indexes
vexor index --path . --clear     # clear index for specific path

Re-running vexor index only re-embeds changed files; >50% changes trigger full rebuild.

Command Reference

Command Description
vexor init Run the interactive setup wizard
vexor QUERY Shortcut for vexor search QUERY
vexor search QUERY --path PATH Semantic search (auto-indexes if needed)
vexor index --path PATH Build/refresh index manually
vexor config --show Display current configuration
vexor config --clear-flashrank Remove cached FlashRank models under ~/.vexor/flashrank
vexor local --setup [--model MODEL] Download a local model and set provider to local
vexor local --clean-up Remove local model cache under ~/.vexor/models
vexor local --cuda Enable CUDA for local embeddings (requires onnxruntime-gpu)
vexor local --cpu Disable CUDA and use CPU for local embeddings
vexor install --skills claude Install Agent Skill for Claude Code
vexor install --skills codex Install Agent Skill for Codex
vexor doctor Run diagnostic checks (command, config, cache, API key, API connectivity)
vexor update [--upgrade] [--pre] Check for new version (optionally upgrade; --pre includes pre-releases)
vexor feedback Open GitHub issue form (or use gh)
vexor alias Print a shell alias for vx and optionally apply it

Common Flags

Flag Description
--path PATH Target directory (default: current working directory)
--mode MODE Index mode (auto/name/head/brief/full/code/outline)
--top K / -k Number of results (default: 5)
--ext .py,.md / -e Filter by extension (repeatable)
--exclude-pattern PATTERN Exclude paths by gitignore-style pattern (repeatable; .js treated as **/*.js)
--include-hidden / -i Include hidden files
--no-recursive / -n Don't recurse into subdirectories
--no-respect-gitignore Include gitignored files
--format porcelain Script-friendly TSV output
--format porcelain-z NUL-delimited output

Porcelain output fields: rank, similarity, path, chunk_index, start_line, end_line, preview (line fields are - when unavailable).

AI Agent Skill

This repo includes a skill for AI agents to use Vexor effectively:

vexor install --skills claude  # Claude Code
vexor install --skills codex   # Codex

Skill source: plugins/vexor/skills/vexor-cli

Documentation

See docs for more details.

Contributions, issues, and PRs welcome! Star if you find it helpful.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vexor-0.19.0a1.tar.gz (81.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vexor-0.19.0a1-py3-none-any.whl (92.1 kB view details)

Uploaded Python 3

File details

Details for the file vexor-0.19.0a1.tar.gz.

File metadata

  • Download URL: vexor-0.19.0a1.tar.gz
  • Upload date:
  • Size: 81.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vexor-0.19.0a1.tar.gz
Algorithm Hash digest
SHA256 51af1aa9ef7a8704317b080e7ca8186184f738c98148acd9f2a5cd7b35dff586
MD5 fd9f1feb85db3c933f471880b2b85c23
BLAKE2b-256 326e5443a4e56cfe1f8ab1ff074e5d1b80bb148946c3b9cfc58ab293897c7527

See more details on using hashes here.

Provenance

The following attestation bundles were made for vexor-0.19.0a1.tar.gz:

Publisher: publish.yml on scarletkc/vexor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file vexor-0.19.0a1-py3-none-any.whl.

File metadata

  • Download URL: vexor-0.19.0a1-py3-none-any.whl
  • Upload date:
  • Size: 92.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vexor-0.19.0a1-py3-none-any.whl
Algorithm Hash digest
SHA256 8a5b70665ec6d325d7b77be692220a07d48e142c1f0e75ca4cbbabc3bf5033e7
MD5 7e002402919fd1a53ab60e72d56f5be5
BLAKE2b-256 dc0858b563dee29b681c4c5651d5d130f5afa1064fc00022964b29f166af58d5

See more details on using hashes here.

Provenance

The following attestation bundles were made for vexor-0.19.0a1-py3-none-any.whl:

Publisher: publish.yml on scarletkc/vexor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page