Local-first RAG library — ingest files and SQLite, query semantically, pipe results into any AI agent
Project description
remex
Your private knowledge base — fully offline, never leaves your machine.
|
Semantic search |
AI answer |
|
File ingestion |
Collections manager |
Remex is a local-first knowledge base for your documents. Point it at any folder — PDFs, notes, code, spreadsheets — and it becomes instantly searchable using natural language. Ask questions and get answers backed by the exact sources in your files, with no data ever leaving your machine.
It runs entirely offline, requires no cloud account, and works with any AI provider you already have — Anthropic, OpenAI, or a local Ollama instance.
Remex Studio
A native desktop app (Windows · macOS · Linux) to ingest, search, and query your documents with AI — no terminal required.
Building from source: see
studio/README.md.
Python CLI
pip install remex-cli # core — ingest + query
pip install remex-cli[api] # adds the FastAPI sidecar (used by Studio)
Quick start
# 1. Scaffold a project
remex init
# 2. Drop files into docs/ then ingest
remex ingest docs/
# 3. Semantic search
remex query "how does authentication work?"
# 4. AI answer (auto-detects Anthropic / OpenAI / Ollama)
remex query "how does authentication work?" --ai
All commands
| Command | Description |
|---|---|
remex init [path] |
Scaffold docs/, remex.toml, and .gitignore |
remex ingest [dir] |
Ingest files from a directory |
remex ingest-sqlite <db> |
Ingest rows from a SQLite table |
remex query <text> |
Semantic search (add --ai for AI answer) |
remex sources |
List all ingested source paths |
remex stats |
Show chunk/source counts for a collection |
remex delete-source <path> |
Remove all chunks for a source |
remex purge |
Remove chunks whose source file no longer exists |
remex reset |
Wipe an entire collection |
remex list-collections |
List all collections in a database |
remex serve |
Start the FastAPI sidecar (used by Studio) |
Use remex <command> --help for full option reference.
Features
| Fully offline | No data leaves your machine — local embeddings, local storage |
| 12 file formats | .pdf .docx .md .txt .csv .json .html .pptx .xlsx .epub .odt .jsonl |
| SQLite ingest | Embed rows from any table — or all tables at once — alongside your files |
| Incremental ingest | SHA-256 hash check — unchanged files are skipped automatically |
| Batch embedding | Chunks are embedded in batches for fast ingestion of large directories |
| AI answers | Auto-detects Anthropic, OpenAI, or a local Ollama instance |
| Multi-collection search | Query across collections, results merged by relevance score |
| Source filtering | Narrow query results to a specific file or table |
| Export results | Copy or export query results for use elsewhere |
| Collections manager | Rename, delete, and inspect your knowledge bases from the UI |
| Keyboard shortcuts | Full keyboard navigation — press ? in Studio for the reference |
Configuration
Place a remex.toml in your project root (created by remex init):
[remex]
db = "./remex_db"
collection = "remex"
embedding_model = "all-MiniLM-L6-v2"
# chunk_size = 1000
# overlap = 200
# min_chunk_size = 50
# chunking = "word" # "word" or "sentence"
CLI flags always override remex.toml values.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file remex_cli-1.2.0.tar.gz.
File metadata
- Download URL: remex_cli-1.2.0.tar.gz
- Upload date:
- Size: 1.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
82fade832b02676cedff472bd5ae80e70aea0db9084bf64fdb4b3c1a94571c4c
|
|
| MD5 |
de8322f8c5227a19d22c6fe0e8a9be9a
|
|
| BLAKE2b-256 |
41c3eae973e374adf2a82455dd9973e774396434cb03f221abcfa980698ead4a
|
Provenance
The following attestation bundles were made for remex_cli-1.2.0.tar.gz:
Publisher:
publish.yml on adm-crow/remex
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
remex_cli-1.2.0.tar.gz -
Subject digest:
82fade832b02676cedff472bd5ae80e70aea0db9084bf64fdb4b3c1a94571c4c - Sigstore transparency entry: 1340362667
- Sigstore integration time:
-
Permalink:
adm-crow/remex@80f8ff89faa563569f7fddaeda1481af49197d96 -
Branch / Tag:
refs/tags/v1.2.0 - Owner: https://github.com/adm-crow
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@80f8ff89faa563569f7fddaeda1481af49197d96 -
Trigger Event:
push
-
Statement type:
File details
Details for the file remex_cli-1.2.0-py3-none-any.whl.
File metadata
- Download URL: remex_cli-1.2.0-py3-none-any.whl
- Upload date:
- Size: 46.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7e7b40873096f36bcc6e2086b15c729c4deef93259f1a1cefabe1c1708889421
|
|
| MD5 |
0466c2e7ff1bce5e2cadd31d17458b80
|
|
| BLAKE2b-256 |
eb8258a3aa553717e4a411e166ddcda493c255fdfea9c9a3fdde0b203838476d
|
Provenance
The following attestation bundles were made for remex_cli-1.2.0-py3-none-any.whl:
Publisher:
publish.yml on adm-crow/remex
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
remex_cli-1.2.0-py3-none-any.whl -
Subject digest:
7e7b40873096f36bcc6e2086b15c729c4deef93259f1a1cefabe1c1708889421 - Sigstore transparency entry: 1340362681
- Sigstore integration time:
-
Permalink:
adm-crow/remex@80f8ff89faa563569f7fddaeda1481af49197d96 -
Branch / Tag:
refs/tags/v1.2.0 - Owner: https://github.com/adm-crow
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@80f8ff89faa563569f7fddaeda1481af49197d96 -
Trigger Event:
push
-
Statement type: