Skip to main content

Local-first RAG library — ingest files and SQLite, query semantically, pipe results into any AI agent

Project description

Remex

Remex

Your private knowledge base — fully offline, never leaves your machine.


GitHub Release CI PyPI License Windows Python


Remex Studio — homepage

Remex turns any folder of documents — PDFs, Word files, notes, spreadsheets, code — into a private, searchable knowledge base. Ask questions in plain language and get answers grounded in your own files, with sources cited.

Everything runs on your machine. No cloud account. No API key required for search. Bring your own AI provider (Anthropic, OpenAI, or a local Ollama instance) only when you want synthesised answers.



Remex Studio

Native desktop app for Windows. No terminal required.

⬇ Download the latest release

⚠️ Windows SmartScreen warning Windows may display a "Windows protected your PC" warning when downloading or installing Remex Studio. This happens because the app is not yet code-signed with a paid certificate — the software is safe and fully open source, feel free to audit the source code in this repository. To proceed: click "More info" then "Run anyway".


What you can do

🔍 Semantic search Vector similarity search across one or more collections simultaneously
🤖 AI Answer Ask a question, get a synthesised answer with cited sources (Anthropic · OpenAI · Ollama)
📄 12 file formats .pdf .docx .md .txt .csv .json .jsonl .html .pptx .xlsx .epub .odt
🗄 SQLite ingest Embed rows from any table alongside your files
♻️ Incremental ingest SHA-256 hash check — only changed files are re-processed
🎯 Source filter Narrow results to one or more documents before searching or asking AI
🔎 Chunk viewer Expand any result to read the full chunk, navigate with keyboard arrows
📦 Collections manager Rename, describe, purge, bulk-delete sources, one-click re-ingest
📤 Export JSON · CSV · Markdown · BibTeX · RIS · CSL-JSON · Obsidian vault
🌙 Themes Light, dark, auto (follows OS) + ten accent colours
⌨️ Keyboard-driven Press ? anywhere in Studio for the full shortcuts reference


Remex Pro — 29€, one-time

Free Remex covers the full local-first workflow. Remex Pro unlocks power-user features for a single one-time payment (two machines):

Feature Free Pro
Semantic search
AI Answer
All 12 file formats + SQLite
Incremental ingest
Source filter + chunk viewer
Collections manager
JSON · CSV · Markdown export
Query history (last 20)
Pro embedding models (bge-large, e5-large, nomic-embed)
Advanced exports (BibTeX · RIS · CSL-JSON · Obsidian vault)
Watch-folder auto-ingest
Unlimited searchable query history
Extra accent themes (8 additional)
Priority email support (48-hour business-day SLA)

Activate in Settings → License inside Studio. Licenses are issued and validated offline via Lemon Squeezy.



Python CLI & Library

pip install remex-cli            # core — ingest + query
pip install "remex-cli[api]"     # + FastAPI sidecar (used by Studio)

Quick start

# Scaffold a project
remex init

# Ingest a folder of documents
remex ingest ./docs

# Semantic search
remex query "how does authentication work?"

# AI-synthesised answer (requires ANTHROPIC_API_KEY, OPENAI_API_KEY, or a running Ollama)
remex query "how does authentication work?" --ai

Command reference

Command Description
remex init [path] Scaffold docs/, remex.toml, and .gitignore entries
remex ingest <dir> Ingest files from a directory into a collection
remex ingest-sqlite <db> Ingest rows from a SQLite table
remex query <text> Semantic search; add --ai for an AI-synthesised answer
remex sources List all ingested source paths in a collection
remex stats Show chunk and source counts
remex delete-source <path> Remove all chunks for a specific source
remex purge Remove chunks whose source file no longer exists on disk
remex reset Wipe an entire collection
remex list-collections List all collections in a database
remex serve Start the FastAPI sidecar on localhost:8745
remex <command> --help    # full option reference for any command

Use as a library

from remex import ingest, query

# Ingest a folder
result = ingest("./docs", collection_name="my-kb")
print(f"{result.chunks_stored} chunks stored")

# Search
results = query("how does auth work?", collection_name="my-kb")
for r in results:
    print(f"[{r.score:.3f}] {r.source}{r.text[:120]}")


Configuration

Drop a remex.toml in your project root (or run remex init to generate one):

[remex]
db              = "./remex_db"
collection      = "my-kb"
embedding_model = "all-MiniLM-L6-v2"

# chunk_size     = 1000   # tokens per chunk
# overlap        = 200    # overlap between chunks
# min_chunk_size = 50     # discard chunks shorter than this
# chunking       = "word" # "word" or "sentence"

CLI flags always override remex.toml values.



Supported embedding models

Preset Model Size Notes
Light all-MiniLM-L6-v2 22 MB Default — fast, good accuracy
Balanced intfloat/e5-base-v2 438 MB Better retrieval quality
Multilingual paraphrase-multilingual-MiniLM-L12-v2 470 MB 50+ languages
Large (Pro) BAAI/bge-large-en-v1.5 1.3 GB Best English accuracy
E5 Large (Pro) intfloat/e5-large-v2 1.3 GB Strong retrieval benchmark
Long ctx (Pro) nomic-ai/nomic-embed-text-v1.5 547 MB 8 192-token context window

Any model from SBERT, HuggingFace sentence-similarity, or Ollama can be used by typing the model name directly.



Building from source

Studio requires Rust, Node.js 20+, and the Tauri prerequisites for Windows.

# Python CLI
pip install -e ".[dev]"
pytest

# Studio (dev server with hot-reload)
cd studio
npm install
npm run tauri dev

# Studio (production build)
npm run tauri build

See studio/README.md for the full build guide.



Changelog · Contributing · Licensing · GitHub

Python CLI: Apache-2.0 · Studio (v1.3.0+): FSL-1.1-MIT — see LICENSES.md

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

remex_cli-1.4.0.tar.gz (1.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

remex_cli-1.4.0-py3-none-any.whl (53.3 kB view details)

Uploaded Python 3

File details

Details for the file remex_cli-1.4.0.tar.gz.

File metadata

  • Download URL: remex_cli-1.4.0.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for remex_cli-1.4.0.tar.gz
Algorithm Hash digest
SHA256 812a574afafbf9c53ec4966c4d77425c5bf96149ed431ec9da8e0d3b4973e0d0
MD5 77399ad8937b6af260e25fec5a6d524d
BLAKE2b-256 1ad815603864dd5f94c1f73008cbd0e1ceb0d8c7fcad96ce944c282ab6340645

See more details on using hashes here.

Provenance

The following attestation bundles were made for remex_cli-1.4.0.tar.gz:

Publisher: publish.yml on adm-crow/remex

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file remex_cli-1.4.0-py3-none-any.whl.

File metadata

  • Download URL: remex_cli-1.4.0-py3-none-any.whl
  • Upload date:
  • Size: 53.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for remex_cli-1.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ce0c7497a9d73fa36636257ad03c54e91ef528fbfb51013e3f66555cb26d9222
MD5 ee440e9a662ea382375b454934e00847
BLAKE2b-256 a46b24be5a6343dd85bf51084b0b5f3c31ffbf0ebb335d42499f56fe62baa8eb

See more details on using hashes here.

Provenance

The following attestation bundles were made for remex_cli-1.4.0-py3-none-any.whl:

Publisher: publish.yml on adm-crow/remex

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page