Skip to main content

A high-performance, local-first RAG document assistant.

Project description

🔬 Quaero

High-Performance, Local-First RAG Document Assistant

Transform your local documents into an intelligent, queryable knowledge base—without the bloat.

Python 3.11+ License: MIT LanceDB Ollama


🎯 Overview

Quaero is a streamlined, local-first Retrieval-Augmented Generation (RAG) engine. Built for developers, researchers, and engineers, it completely bypasses heavy frameworks like LangChain in favor of a custom, memory-flat ingestion pipeline and blazing-fast vector search via LanceDB.

Your data never leaves your machine.

✨ The Engineering Edge

  • Tiered Ingestion Router: Automatically routes files to the most efficient parser (e.g., C-bound PyMuPDF for PDFs, native python-docx for Word, and raw streaming for code/text) while bouncing binary executables at the door.
  • Memory-Flat Processing: Reads and hashes massive files (like 1,000-page textbooks) using lazy generators, keeping your RAM usage practically at zero during ingestion.
  • State Reconciliation: Native sync tracking detects when you modify or delete a physical file and automatically purges or updates the orphaned vectors via relational metadata.
  • Zero-Config Vector Search: Powered by LanceDB's PyArrow backend for native, sub-millisecond Cosine distance retrieval.

🚀 Quick Start

Prerequisites

  • Python 3.11+
  • Ollama installed and running locally.

1. Installation

Install directly via pip (or pipx for isolated environments):

pip install quaero

2. Initial Setup

Run the interactive wizard to configure your models and chunk sizes:

quaero setup

(We recommend embeddinggemma for embeddings and a fast, instruction-tuned model like gemma or llama3 for inference).

3. Build Your Knowledge Base

Point Quaero at a single file or an entire directory. It will recursively crawl and index supported formats.

quaero ingest /path/to/your/documents/

4. Start Querying

Launch the interactive terminal UI to chat with your documents:

quaero chat

Or execute a single-shot query:

quaero chat "What are the main persistence mechanisms described in the malware textbook?"

💻 CLI Command Reference

Quaero features a modern, Rich-powered CLI.

  • quaero status - View database health and vector counts.
  • quaero ingest
  • quaero sync - Reconcile the vector database with your physical filesystem (purges orphans, updates modifications).
  • quaero config show - Display active thresholds, models, and chunk parameters.
  • quaero config set - Tune the engine on the fly (e.g., quaero config set score_threshold 0.6).
  • quaero db reset - Nuke the database and start fresh.

🏗️ Architecture

graph TD A[Local Filesystem] -->|quaero sync / ingest| B[Tiered Extraction Router] B --> C[Memory-Flat Text Splitter] C --> D[Ollama Embedding Engine] D --> E[(LanceDB Vector Store)]

F[User Query] --> G[Cosine Similarity Search]
G --> E
E --> H[Context Assembly]
H --> I[Ollama Inference]
I --> J[Grounded Terminal Response]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

quaerite-0.1.0.tar.gz (18.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

quaerite-0.1.0-py3-none-any.whl (22.8 kB view details)

Uploaded Python 3

File details

Details for the file quaerite-0.1.0.tar.gz.

File metadata

  • Download URL: quaerite-0.1.0.tar.gz
  • Upload date:
  • Size: 18.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for quaerite-0.1.0.tar.gz
Algorithm Hash digest
SHA256 74f3b2d19fcf59b0f58595c9af1795d9629efda5ba0aede1a5191dd414244883
MD5 995980a5352a74ccd657a0c7d89537a4
BLAKE2b-256 e7d1a71cec0bbae5021174c9f6296d17d11bbf0fbec53125b5d894785a0dd5a4

See more details on using hashes here.

Provenance

The following attestation bundles were made for quaerite-0.1.0.tar.gz:

Publisher: publish.yml on ADPer0705/quaero

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file quaerite-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: quaerite-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 22.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for quaerite-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3b618111a88056899abe231a332ee9abcf67557123e31e169b469e981e0a7c6f
MD5 9adc74501408782d5178fabf3bebafb8
BLAKE2b-256 dfd951f801fb6cacbe337d540efc2a829bd890c20781834ad13fb10dfe6ddb7a

See more details on using hashes here.

Provenance

The following attestation bundles were made for quaerite-0.1.0-py3-none-any.whl:

Publisher: publish.yml on ADPer0705/quaero

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page