A framework that replaces traditional RAG pipelines. Ingest any number of documents into multiple workspaces (channels, departments, etc.), index them with BM25, and let the agent search, fetch, and reason over them, exactly like searching the web, but entirely on your machine. No vector store and no embeddings needed.

Maintainers

wissam-metawee

Project description

Local Search Agent

Give your AI agent a search engine for your local files.

What is this?

Local Search Agent is a Python framework that lets your AI agent search, fetch, and reason over your local documents the same way a researcher searches the web, but entirely on your machine.

Point it at a folder. Ask a question. The agent searches your documents, reads the relevant ones, and gives you an answer with citations — no cloud upload, no API calls to external search services, no embeddings, no vector stores.

"What was the AWS spend in Q3?"  →  agent searches index  →  fetches relevant docs  →  answers with sources

Why not RAG?

Traditional RAG (Retrieval-Augmented Generation) has a fundamental problem: it converts your documents into embeddings and stores them in a vector database. That means:

Stale indexes — embeddings go out of date silently. You never know if the agent is reading your latest documents or a six-month-old snapshot
Black-box retrieval — you can't see why a document was retrieved or not. Debugging poor answers is guesswork
Chunking anxiety — split too small and you lose context. Split too large and retrieval quality degrades. There's no right answer
Infrastructure overhead — a vector database is another service to run, maintain, and pay for
Semantic drift — embeddings are sensitive to how questions are phrased. A question about "cloud expenditure" may never match a document that says "AWS spend"

Local Search Agent takes a different approach: BM25 keyword search via Meilisearch, structured metadata, and a LangGraph agent loop with tools. The agent searches your document index the same way a developer searches Stack Overflow — with real queries, real results, and full transparency into what was retrieved and why.

The result is deterministic, auditable, and fast. You can see exactly what the agent fetched for every answer.
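
To make the "deterministic and auditable" point concrete, here is a minimal sketch of Okapi BM25 scoring in plain Python. It is not the ranking code used by Meilisearch or by this package: the function names, the simple word tokenizer, and the parameter values k1=1.5 and b=0.75 are assumptions chosen for illustration. It only shows that a keyword score is a fixed function of term counts, so the same query over the same documents always yields the same numbers.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens (illustrative tokenizer)."""
    return re.findall(r"\w+", text.lower())


def bm25_scores(query: str, docs: list[str], k1: float = 1.5, b: float = 0.75) -> list[float]:
    """Return one Okapi BM25 score per document for the given query."""
    if not docs:
        return []
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Fall back to 1.0 so empty corpora of empty documents do not divide by zero.
    avg_len = sum(len(t) for t in tokenized) / n_docs or 1.0

    # Document frequency: in how many documents each term appears.
    doc_freq: Counter = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        score = 0.0
        for term in tokenize(query):
            tf = term_freq.get(term, 0)
            if tf == 0:
                continue  # a term that is absent contributes nothing
            df = doc_freq[term]
            idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
            length_norm = k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf * (k1 + 1) / (tf + length_norm)
        scores.append(score)
    return scores
```

Because every term's contribution is computed separately, a score can be broken down per query term, which is what makes keyword retrieval straightforward to debug.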

How it works

1. INGEST     Your documents → parsed, cleaned, chunked, indexed into Meilisearch
2. SERVE      FastAPI file server makes documents available to the agent via HTTP
3. SEARCH     LangGraph agent loop: search_local_index → fetch_local_url → reason
4. ANSWER     Agent returns an answer with inline source citations

Everything runs locally. Meilisearch downloads automatically on first use; no manual setup is needed.

Screenshots

Desktop UI

Local Search Agent UI

CLI Interactive Mode

Local Search Agent CLI

Python API

Local Search Agent Python API

Video Demos

0.3.0 Release — Watch the Role-based Access Control
0.2.1 Release — Watch the Zooming + Exporting conversation
0.2.0 Release — Watch the Reranker + Watch mode
Native UI — Watch the UI design and configuration video demo
CLI AGENT — Watch the Terminal document querying video demo
Python API — Watch the Local Search Agent API Integration video demo

Install

pip install local-search-agent

Set your API key

# Google AI Studio (free tier, recommended); paid OpenAI or Anthropic keys also work
local-search config set-key --provider google --key YOUR_KEY

# Or use Ollama for a fully local, zero-cost setup (no key needed)
# Install from https://ollama.com
# Download any model that supports function calling and system instructions:
ollama pull gemma4:e2b            # 7.2 GB
ollama pull gemma4:e4b            # 9.6 GB
ollama pull nemotron-3-nano:4b    # 2.8 GB, highly recommended

Quick Start

Desktop UI

local-search ui

The desktop UI opens:

Create a workspace, name it, point it at a directory of files. The "Database path" field is optional — leave it blank to use the default location shown in the hint, or paste a custom path and click "Set & Restart".
Ingest (parse, clean, chunk).
Get a free Google API key from AI Studio.
Set your API key at the top bar's right corner, or add a paid key for Anthropic or OpenAI. Note: for paid models or Ollama, you also need to set the model name via the config button at the top bar's right corner.
Click Ingest in the left sidebar.
Watch the progress bar in the bottom bar and wait until all files are marked as completed.
Start asking questions.

CLI

# Create a workspace and ingest documents
local-search workspace create finance "C:\my_docs"
local-search ingest --workspace finance --dirs "C:\my_docs"

# Start the file server (keep this running)
local-search serve --workspace finance

# Ask a question
local-search query "What was the AWS spend in Q3?" --workspace finance --provider google

# Use interactive mode
local-search query --workspace finance --provider google

Python API

from local_search_agent import SearchAgentFramework, SearchAgentConfig

config = SearchAgentConfig(
    document_dirs=["C:/my_docs"],
    workspace_name="finance",
    provider="google",
    # db_path defaults to your OS user config dir — same location as keys.json
    # override only if you need a custom location:
    # db_path="D:/mydata/search.db",
)

framework = SearchAgentFramework(config)
framework.ingest_and_index()
framework.start_file_server()

response = framework.query("What was the AWS spend in Q3?")
print(response["answer"])
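
When several questions need answering in one run, a small helper keeps the calling code tidy. The sketch below relies only on what the example above shows, namely a query method whose result exposes an "answer" key; the helper name ask_all and the Queryable protocol are hypothetical, and treating the response as a dict with a .get method is an assumption.

```python
from typing import Protocol


class Queryable(Protocol):
    """Anything with a query(question) method, e.g. a SearchAgentFramework."""

    def query(self, question: str) -> dict: ...


def ask_all(framework: Queryable, questions: list[str]) -> dict[str, str]:
    """Run each question in order and map it to the returned answer text."""
    answers: dict[str, str] = {}
    for question in questions:
        response = framework.query(question)
        # Assumes a dict-like response; a missing "answer" becomes an empty string.
        answers[question] = response.get("answer", "")
    return answers
```

Passing the framework object built in the example above as the first argument should work as long as its query method behaves as shown there.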

Agent Tool Integration

Wrap an indexed workspace as a tool and plug it into any external AI agent — LangChain, LangGraph, Google Gemini SDK, or any framework that calls a function.

from local_search_agent import SearchAgentFramework, SearchAgentConfig, LocalSearchTool

config = SearchAgentConfig(
    document_dirs=["C:/skills"],
    workspace_name="skills",
    provider="google",
    model_name="gemini-3.1-flash-lite",  # cheap model for retrieval
)

# Index once
framework = SearchAgentFramework(config)
framework.ingest_and_index()
framework.start_file_server()

# Create the tool
skill_tool = LocalSearchTool(config)

# Use inside a LangChain / LangGraph agent
from langchain_core.tools import tool

@tool
def search_skills(query: str) -> str:
    """Search the skills knowledge base for coding patterns and techniques."""
    return skill_tool.run(query).answer

Pass return_raw=True to bypass the internal LLM summarisation and return the full document text verbatim — useful when the calling agent should reason over the raw content itself:

skill_tool = LocalSearchTool(config, return_raw=True)

See the Python API Reference for the full LocalSearchTool documentation.

Multi-Tenant RBAC (Self-Hosting)

Running this for more than just yourself? Turn one shared deployment into a multi-user server with role-based access control. There are three roles (superadmin, admin, member): admin and member are granted per subject per workspace, while superadmin applies unconditionally across the whole deployment. Access is enforced on every protected route. The feature also includes optional per-role model/provider cost controls and concurrency/rate-limit management for cloud LLM accounts or shared Ollama hardware.

Three pluggable identity providers cover the common setups out of the box: a header provider for when you already have an authenticating reverse proxy in front, an API-key provider (with browser sessions) for when you don't have existing auth infrastructure, and a JWT provider that validates against your company's own IdP (Auth0, Okta, Azure AD, Google Workspace) when employees already sign in via SSO.

# Bootstrap: create a workspace, a superadmin key, an admin's key, and grant admin access
local-search workspace create finance "/srv/docs/finance"
local-search auth create-key --subject root@acme.com --display-name "IT/Ops" --superadmin
local-search auth create-key --subject alice@acme.com --display-name "Alice" --created-by root@acme.com
local-search grant-access --subject alice@acme.com --workspace finance --role admin

# Run the dashboard with RBAC turned on
local-search ui --multi-tenant --db /var/lib/local-search-agent/prod.db

This is entirely opt-in: set identity_provider on SearchAgentConfig (or pass --multi-tenant to local-search ui) to turn it on. Every existing single-user workflow above works exactly as written if you never do. See Role-Based Access Control for the full guide, and Production Deployment for running this as a shared server (Docker, systemd, reverse proxy). Note that the Dockerfile, systemd, and Caddy files live at the repo root, not in the PyPI package itself, so self-hosters should either clone the repo or copy the file contents straight out of that doc.
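
The role model described in this section can be sketched in a few lines. This is not the package's implementation: the AccessTable class, its method names, and the rule that an admin grant also satisfies a member-level check are assumptions made for illustration. Only the three roles and the per-subject, per-workspace scoping come from the description above.

```python
from dataclasses import dataclass, field

# Assumed ordering: an admin grant also satisfies a member-level requirement.
ROLE_RANK = {"member": 1, "admin": 2}


@dataclass
class AccessTable:
    """In-memory stand-in for the grants stored by a deployment."""

    superadmins: set[str] = field(default_factory=set)
    # (subject, workspace) -> role
    grants: dict[tuple[str, str], str] = field(default_factory=dict)

    def grant(self, subject: str, workspace: str, role: str) -> None:
        """Record a per-workspace role for a subject."""
        if role not in ROLE_RANK:
            raise ValueError(f"unknown role: {role}")
        self.grants[(subject, workspace)] = role

    def is_allowed(self, subject: str, workspace: str, required: str = "member") -> bool:
        """Superadmins always pass; others need a sufficient grant on this workspace."""
        if subject in self.superadmins:
            return True
        role = self.grants.get((subject, workspace))
        return role is not None and ROLE_RANK[role] >= ROLE_RANK[required]
```

A check like is_allowed would sit in front of each protected route, with the subject supplied by whichever identity provider is configured.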

Optional: Faster OCR for Scanned PDFs (Tesseract)

By default, scanned or image-based PDFs are processed using RapidOCR. Installing Tesseract enables a faster OCR path (~5 seconds per page vs. minutes without it).

Digitally-created PDFs (with a text layer) are never affected — they use direct text extraction and skip OCR entirely.

Windows

Download and run the installer from https://github.com/UB-Mannheim/tesseract/wiki
Make sure "Add Tesseract to the system PATH" is checked during installation.

Linux

sudo apt install tesseract-ocr        # Ubuntu / Debian
sudo dnf install tesseract             # Fedora / RHEL
sudo pacman -S tesseract               # Arch

macOS

brew install tesseract

After installation, restart the application — Tesseract is detected automatically. If it's not found, ingestion continues normally using RapidOCR with no errors.
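
The detect-or-fall-back behaviour can be sketched with the standard library. shutil.which is a real stdlib call that looks up an executable on the PATH; the function name pick_ocr_engine and the returned labels are hypothetical, and the package's actual detection logic is not shown in this document.

```python
import shutil
from typing import Callable, Optional


def pick_ocr_engine(which: Callable[[str], Optional[str]] = shutil.which) -> str:
    """Return "tesseract" if the binary is on PATH, otherwise "rapidocr"."""
    # `which` is injectable so the choice can be tested without installing anything.
    return "tesseract" if which("tesseract") else "rapidocr"
```

Running `python -c "import shutil; print(shutil.which('tesseract'))"` is a quick way to confirm whether the installer put Tesseract on the PATH.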

Supported File Types

Format	Extension
PDF	`.pdf`
Word	`.docx`
Excel	`.xlsx`
PowerPoint	`.pptx`
HTML	`.html`, `.htm`
Plain text	`.txt`, `.md`
CSV	`.csv`
JSON	`.json`
XML	`.xml`
Email	`.eml`

Key Features

One command install — pip install local-search-agent. Meilisearch downloads automatically
No embeddings, no vector stores — BM25 search with structured metadata. Fast, deterministic, auditable
Native desktop UI — pywebview window with live streaming agent responses, workspace management, and chat history
Multi-provider LLM — Google, Ollama (local), OpenAI, Anthropic
Multi-workspace — isolate document collections by department, project, channel, or topic. Each workspace is its own search index
Incremental sync — background scheduler re-indexes only changed files. A 10,000-document corpus with 50 changes re-indexes only the 50
Multi-tenant RBAC — opt-in three-role access control (superadmin/admin/member) per workspace, with pluggable identity (header, API key, or JWT/SSO), plus optional per-role model/provider cost controls and concurrency/rate-limit management, for running one shared deployment across a team
Full CLI parity — everything you can do in the UI you can do from the terminal
Python API — embed the framework directly in your own application
Cross-platform — Windows, macOS, Linux

Documentation

Guide	Description
Getting Started	First steps, quick start for UI, CLI, and Python API
Installation	Full install guide, API keys, Ollama setup, platform notes
Architecture	Full architecture and design guide
CLI Reference	All commands and flags
Python API Reference	Full API documentation
Configuration	All config options and patterns
Ingestion	How ingestion works, supported formats, chunking, scheduler
Multi-Workspace	Managing multiple document collections
Role-Based Access Control	Multi-tenant RBAC: identity providers, roles, CLI walkthrough
Production Deployment	Self-hosting as a shared server: Docker, systemd, reverse proxy
Semantic Search	Experimental: concept extraction, query expansion
Troubleshooting	Common issues and fixes

Contributing

Contributions are welcome. Clone the repo and install in editable mode with dev dependencies:

git clone https://github.com/wiss84/local-search-agent.git
cd local-search-agent
pip install -e ".[dev]"

Run tests before submitting a PR:

pytest tests/ -v --cov=local_search_agent --cov-report=term-missing
ruff check .
ruff format .

License

MIT — see LICENSE for details.

Built by Wissam Metawee

Project details

Release history

Version	Release date
0.3.0 (this version)	Jul 12, 2026
0.2.1	Jun 27, 2026
0.2.0	Jun 24, 2026
0.1.9	Jun 23, 2026
0.1.8	Jun 11, 2026
0.1.7	Jun 4, 2026
0.1.6	Jun 2, 2026
0.1.5	Jun 1, 2026
0.1.4	May 30, 2026
0.1.3	May 29, 2026
0.1.2	May 29, 2026
0.1.1	May 29, 2026
0.1.0	May 28, 2026

Download files

Source Distribution

local_search_agent-0.3.0.tar.gz (2.1 MB)

Uploaded Jul 12, 2026 Source

Built Distribution

local_search_agent-0.3.0-py3-none-any.whl (2.0 MB)

Uploaded Jul 12, 2026 Python 3

File details

Details for the file local_search_agent-0.3.0.tar.gz.

File metadata

Download URL: local_search_agent-0.3.0.tar.gz
Upload date: Jul 12, 2026
Size: 2.1 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for local_search_agent-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`aca43e6f2fe8e5859339864801f2b7e171ba2298ce353033dfb48e506e17a3c9`
MD5	`c6c8d2d946fa65b0f8e719e56b5593fd`
BLAKE2b-256	`3ed6f2eb4a9b58d3dbc5285915ce6c78858d39943f0facb958567d21ed375fa1`
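
A downloaded file can be checked against the SHA256 digest in the table with the standard library. This is a generic sketch using hashlib and hmac.compare_digest; the function names are hypothetical, and the file path is whatever location you saved the download to.

```python
import hashlib
import hmac


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large archives do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path: str, expected_hex: str) -> bool:
    """Compare the file's SHA-256 to an expected hex digest, ignoring case."""
    return hmac.compare_digest(sha256_of(path), expected_hex.strip().lower())
```

Pass the path of the downloaded archive and the SHA256 value from the table; a False result means the file differs from the one that was published.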

Provenance

The following attestation bundles were made for local_search_agent-0.3.0.tar.gz:

Publisher: ci-cd.yml on wiss84/local-search-agent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: local_search_agent-0.3.0.tar.gz
- Subject digest: aca43e6f2fe8e5859339864801f2b7e171ba2298ce353033dfb48e506e17a3c9
- Sigstore transparency entry: 2152186218
- Sigstore integration time: Jul 12, 2026
Source repository:
- Permalink: wiss84/local-search-agent@2a32b307117690bc1f4d88691ee6263c7035e5df
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/wiss84
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci-cd.yml@2a32b307117690bc1f4d88691ee6263c7035e5df
- Trigger Event: push

File details

Details for the file local_search_agent-0.3.0-py3-none-any.whl.

File metadata

Download URL: local_search_agent-0.3.0-py3-none-any.whl
Upload date: Jul 12, 2026
Size: 2.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for local_search_agent-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2dae28491b96f5a4435327f5c6158362c9f73ca456cf529347e9f7aa38a92408`
MD5	`238536f116d5794ddd707e6b4bcaf07d`
BLAKE2b-256	`51fb64698124f037c35614c5c4846d19244d2dfca2cb03a7c78133e0c7a2ca68`

Provenance

The following attestation bundles were made for local_search_agent-0.3.0-py3-none-any.whl:

Publisher: ci-cd.yml on wiss84/local-search-agent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: local_search_agent-0.3.0-py3-none-any.whl
- Subject digest: 2dae28491b96f5a4435327f5c6158362c9f73ca456cf529347e9f7aa38a92408
- Sigstore transparency entry: 2152186322
- Sigstore integration time: Jul 12, 2026
Source repository:
- Permalink: wiss84/local-search-agent@2a32b307117690bc1f4d88691ee6263c7035e5df
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/wiss84
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci-cd.yml@2a32b307117690bc1f4d88691ee6263c7035e5df
- Trigger Event: push
