Skip to main content

Local-first RAG library — ingest files and SQLite, query semantically, pipe results into any AI agent

Project description

Remex

Remex

Your private knowledge base — fully offline, never leaves your machine.


GitHub Release CI PyPI License Windows Python


Remex Studio — homepage

Remex turns any folder of documents — PDFs, Word files, notes, spreadsheets, code — into a private, searchable knowledge base. Ask questions in plain language and get answers grounded in your own files, with sources cited.

Everything runs on your machine. No cloud account. No API key required for search. Bring your own AI provider (Anthropic, OpenAI, or a local Ollama instance) only when you want synthesised answers.



Remex Studio

Native desktop app for Windows. No terminal required.

⬇ Download the latest release


What you can do

🔍 Semantic search Vector similarity search across one or more collections simultaneously
🤖 AI Answer Ask a question, get a synthesised answer with cited sources (Anthropic · OpenAI · Ollama)
📄 12 file formats .pdf .docx .md .txt .csv .json .jsonl .html .pptx .xlsx .epub .odt
🗄 SQLite ingest Embed rows from any table alongside your files
♻️ Incremental ingest SHA-256 hash check — only changed files are re-processed
🎯 Source filter Narrow results to one or more documents before searching or asking AI
🔎 Chunk viewer Expand any result to read the full chunk, navigate with keyboard arrows
📦 Collections manager Rename, describe, purge, bulk-delete sources, one-click re-ingest
📤 Export JSON · CSV · Markdown · BibTeX · RIS · CSL-JSON · Obsidian vault
🌙 Themes Light, dark, auto (follows OS) + ten accent colours
⌨️ Keyboard-driven Press ? anywhere in Studio for the full shortcuts reference


Remex Pro — 29€, one-time

Free Remex covers the full local-first workflow. Remex Pro unlocks power-user features for a single one-time payment (two machines):

Feature Free Pro
Semantic search
AI Answer
All 12 file formats + SQLite
Incremental ingest
Source filter + chunk viewer
Collections manager
JSON · CSV · Markdown export
Query history (last 20)
Pro embedding models (bge-large, e5-large, nomic-embed)
Advanced exports (BibTeX · RIS · CSL-JSON · Obsidian vault)
Watch-folder auto-ingest
Unlimited searchable query history
Extra accent themes (8 additional)
Priority email support (48-hour SLA)

Activate in Settings → License inside Studio. Licenses are issued and validated offline via Lemon Squeezy.



Python CLI & Library

pip install remex-cli            # core — ingest + query
pip install "remex-cli[api]"     # + FastAPI sidecar (used by Studio)

Quick start

# Scaffold a project
remex init

# Ingest a folder of documents
remex ingest ./docs

# Semantic search
remex query "how does authentication work?"

# AI-synthesised answer (requires ANTHROPIC_API_KEY, OPENAI_API_KEY, or a running Ollama)
remex query "how does authentication work?" --ai

Command reference

Command Description
remex init [path] Scaffold docs/, remex.toml, and .gitignore entries
remex ingest <dir> Ingest files from a directory into a collection
remex ingest-sqlite <db> Ingest rows from a SQLite table
remex query <text> Semantic search; add --ai for an AI-synthesised answer
remex sources List all ingested source paths in a collection
remex stats Show chunk and source counts
remex delete-source <path> Remove all chunks for a specific source
remex purge Remove chunks whose source file no longer exists on disk
remex reset Wipe an entire collection
remex list-collections List all collections in a database
remex serve Start the FastAPI sidecar on localhost:8745
remex <command> --help    # full option reference for any command

Use as a library

from remex import ingest, query

# Ingest a folder
result = ingest("./docs", collection_name="my-kb")
print(f"{result.chunks_stored} chunks stored")

# Search
results = query("how does auth work?", collection_name="my-kb")
for r in results:
    print(f"[{r.score:.3f}] {r.source}{r.text[:120]}")


Configuration

Drop a remex.toml in your project root (or run remex init to generate one):

[remex]
db              = "./remex_db"
collection      = "my-kb"
embedding_model = "all-MiniLM-L6-v2"

# chunk_size     = 1000   # tokens per chunk
# overlap        = 200    # overlap between chunks
# min_chunk_size = 50     # discard chunks shorter than this
# chunking       = "word" # "word" or "sentence"

CLI flags always override remex.toml values.



Supported embedding models

Preset Model Size Notes
Light all-MiniLM-L6-v2 22 MB Default — fast, good accuracy
Balanced intfloat/e5-base-v2 438 MB Better retrieval quality
Multilingual paraphrase-multilingual-MiniLM-L12-v2 470 MB 50+ languages
Large (Pro) BAAI/bge-large-en-v1.5 1.3 GB Best English accuracy
E5 Large (Pro) intfloat/e5-large-v2 1.3 GB Strong retrieval benchmark
Long ctx (Pro) nomic-ai/nomic-embed-text-v1.5 547 MB 8 192-token context window

Any model from SBERT, HuggingFace sentence-similarity, or Ollama can be used by typing the model name directly.



Building from source

Studio requires Rust, Node.js 20+, and the Tauri prerequisites for Windows.

# Python CLI
pip install -e ".[dev]"
pytest

# Studio (dev server with hot-reload)
cd studio
npm install
npm run tauri dev

# Studio (production build)
npm run tauri build

See studio/README.md for the full build guide.



Changelog · Contributing · Licensing · GitHub

Python CLI: Apache-2.0 · Studio (v1.3.0+): FSL-1.1-MIT — see LICENSES.md

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

remex_cli-1.3.0.tar.gz (1.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

remex_cli-1.3.0-py3-none-any.whl (52.5 kB view details)

Uploaded Python 3

File details

Details for the file remex_cli-1.3.0.tar.gz.

File metadata

  • Download URL: remex_cli-1.3.0.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for remex_cli-1.3.0.tar.gz
Algorithm Hash digest
SHA256 8f16d3e8ffb53d058c2d1bf3303bbe1aeed20e6015d149e508a8dccd0d765446
MD5 ef853319ef74d081d5f4f94b48938bac
BLAKE2b-256 2c39172f031a905026295f589827d6a4c339aa2063d8e1af77579bce5cbed6ac

See more details on using hashes here.

Provenance

The following attestation bundles were made for remex_cli-1.3.0.tar.gz:

Publisher: publish.yml on adm-crow/remex

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file remex_cli-1.3.0-py3-none-any.whl.

File metadata

  • Download URL: remex_cli-1.3.0-py3-none-any.whl
  • Upload date:
  • Size: 52.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for remex_cli-1.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 92b4f0d086cab88315776b6a485954295eb7036724d6a7fe209bf20950cf1b5a
MD5 b7f40e2bc1aa14bd2977eabecdb84240
BLAKE2b-256 9a1c6ce0c8c5a4aa1cce71341f19e47d916a8ea1f981c49e04eb64d9165c740c

See more details on using hashes here.

Provenance

The following attestation bundles were made for remex_cli-1.3.0-py3-none-any.whl:

Publisher: publish.yml on adm-crow/remex

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page