Agent-friendly local web search using SearXNG for snippets and Crawl4AI for full-text retrieval.
Project description
Local Web Search
Local Web Search is an agent-friendly local web search backend. It gives agents two simple tools:
web_searchasks a local SearXNG instance for search results and returns compact snippets.web_fetchuses Crawl4AI to fetch full page text only when the agent asks for a specific result.
SQLite caching keeps result IDs and fetched page text stable across tool calls. The tool names intentionally look like normal web tools, but all execution happens in your local environment.
Install
python -m pip install local-web-search
Install optional integrations as needed:
python -m pip install "local-web-search[server]"
python -m pip install "local-web-search[agents]"
python -m pip install "local-web-search[server,agents]"
The Python import package is local_agentic_search:
from local_agentic_search import LocalSearchService
Quick Start
Clone the repository if you want the bundled Docker Compose SearXNG setup:
git clone https://github.com/maestromaximo/local-web-search.git
cd local-web-search
docker compose up -d searxng
python -m pip install -e ".[server,agents,dev]"
python -m crawl4ai-setup
Check that SearXNG is reachable:
local-web-search doctor
Search from the CLI:
local-web-search search "OpenAI Agents SDK function tools" --max-results 5
Fetch full text for a result returned by search:
local-web-search fetch res_... --max-chars 4000
Fetch a URL directly:
local-web-search fetch https://example.com --max-chars 4000
Run the HTTP API:
local-web-search serve --host 127.0.0.1 --port 8099
CLI
local-web-search doctor
local-web-search search "query" [--max-results 5] [--language en]
local-web-search fetch res_... [--start 0] [--max-chars 4000]
local-web-search fetch https://example.com [--max-chars 4000]
local-web-search serve [--host 127.0.0.1] [--port 8099]
Run local-web-search --help or local-web-search <command> --help for the
full command reference.
OpenAI Agents SDK Usage
from agents import Agent, Runner
from local_agentic_search.agent_tools import build_agent_tools
web_search, web_fetch = build_agent_tools(build_container_if_missing=True)
agent = Agent(
name="Research assistant",
instructions=(
"Use web_search when current web information is useful. Search results "
"are snippets. Call web_fetch with a result_id before relying on page "
"details not present in a snippet."
),
model="gpt-4.1-mini",
tools=[web_search, web_fetch],
)
result = Runner.run_sync(agent, "Find recent information about Crawl4AI.")
print(result.final_output)
By default, build_agent_tools() assumes Docker/SearXNG is already running and
prints a yellow console warning once per process. To let the tool factory start
SearXNG when the container is missing or stopped:
web_search, web_fetch = build_agent_tools(build_container_if_missing=True)
If port 8888 is already used on the host, choose another host port and keep
Docker plus the Python client aligned:
web_search, web_fetch = build_agent_tools(
build_container_if_missing=True,
searxng_port=8899,
)
If the SearXNG container is already running on the old port, recreate it after changing ports:
docker compose down
$env:LOCAL_WEB_SEARCH_SEARXNG_PORT = "8899"
docker compose up -d searxng
To silence the warning while keeping startup manual:
web_search, web_fetch = build_agent_tools(suppress_docker_warning=True)
Responses API Tool Schemas
For direct OpenAI API tool loops, use:
from local_agentic_search.tool_schemas import responses_tool_schemas
tools = responses_tool_schemas()
Your application still executes web_search and web_fetch locally and returns
their JSON outputs as function call outputs.
HTTP API
Start the server with:
local-web-search serve
Routes:
GET /healthGET /search?q=...&max_results=5POST /searchPOST /fetchGET /openai/tools
See examples/http_client_example.py for a minimal async HTTP client.
Examples
examples/agents_sdk_example.py: build OpenAI Agents SDK tools namedweb_searchandweb_fetch.examples/http_client_example.py: call the local HTTP API withhttpx.examples/responses_schema_example.py: print the Responses API tool schemas.
Search Result Shape
Each web_search result includes:
{
"result_id": "res_...",
"search_id": "search_...",
"position": 1,
"title": "Page title",
"url": "https://example.com",
"snippet": "Compact SearXNG snippet",
"site_links": [],
"full_text_available": true,
"full_text_command": {
"tool": "web_fetch",
"arguments": {
"result_id": "res_...",
"start": 0,
"max_chars": 4000
}
},
"full_text_command_text": "web_fetch(result_id='res_...', start=0, max_chars=4000)"
}
web_fetch returns a page slice with start, end, total_chars, has_more,
and a next_fetch_command when more text is available.
Configuration
Environment variables:
SEARXNG_BASE_URL: defaults tohttp://127.0.0.1:8888LOCAL_WEB_SEARCH_CACHE: defaults to.cache/local_agentic_search.sqlite3LOCAL_WEB_SEARCH_RESULTS_TTL_SECONDS: defaults to86400LOCAL_WEB_SEARCH_PAGES_TTL_SECONDS: defaults to604800LOCAL_WEB_SEARCH_FETCH_CHARS: default fetch slice size, defaults to4000LOCAL_WEB_SEARCH_MAX_FETCH_CHARS: maximum fetch slice size, defaults to20000LOCAL_WEB_SEARCH_CRAWL_TIMEOUT_MS: Crawl4AI timeout, defaults to45000LOCAL_WEB_SEARCH_DOCKER_COMPOSE_FILE: optional compose file pathLOCAL_WEB_SEARCH_DOCKER_CONTAINER: optional SearXNG container nameLOCAL_WEB_SEARCH_SEARXNG_HOST: host used by the Python client whenSEARXNG_BASE_URLis unset, defaults to127.0.0.1LOCAL_WEB_SEARCH_SEARXNG_PORT: host port for the bundled SearXNG service, defaults to8888LOCAL_WEB_SEARCH_SEARXNG_BIND: Docker bind address for the bundled SearXNG service, defaults to127.0.0.1
The automatic Docker path uses a fast docker container inspect check. If the
configured container is paused, it runs docker container unpause; if it is
missing or stopped, it runs docker compose up -d --build searxng.
Publishing
The repository includes GitHub Actions for CI and PyPI publishing. The publish
workflow runs on every push to main and on manual dispatch. It checks PyPI for
existing local-web-search releases, bumps the patch version if necessary,
commits that version bump back to main, then builds and publishes with PyPI
Trusted Publishing.
To enable publishing, add a PyPI Trusted Publisher for:
- PyPI project:
local-web-search - Owner:
maestromaximo - Repository:
local-web-search - Workflow:
publish.yml - Environment:
pypi
License and Notices
Local Web Search is licensed under the Apache License 2.0.
SearXNG is a separate service licensed under the GNU Affero General Public
License v3.0 or later. Crawl4AI is licensed under the Apache License 2.0. See
THIRD_PARTY_NOTICES.md for details.
You are responsible for respecting robots.txt, website terms, copyright, authentication boundaries, privacy rules, and rate limits when searching and fetching public web content.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file local_web_search-0.1.2.tar.gz.
File metadata
- Download URL: local_web_search-0.1.2.tar.gz
- Upload date:
- Size: 26.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
98061a51435bd04b21f8166c5278f33a0670ae4c795c7c741071548398d8b313
|
|
| MD5 |
6780d983b039161fd9815757d257385e
|
|
| BLAKE2b-256 |
03f40138d0386b744ee764f5f3b7d7d9e472263652de3b9195d606f9d69fd605
|
Provenance
The following attestation bundles were made for local_web_search-0.1.2.tar.gz:
Publisher:
publish.yml on maestromaximo/local-web-search
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
local_web_search-0.1.2.tar.gz -
Subject digest:
98061a51435bd04b21f8166c5278f33a0670ae4c795c7c741071548398d8b313 - Sigstore transparency entry: 1673178681
- Sigstore integration time:
-
Permalink:
maestromaximo/local-web-search@1a0ff9af28198ae3e78e6e429ce0ac88d9dc0e60 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/maestromaximo
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@1a0ff9af28198ae3e78e6e429ce0ac88d9dc0e60 -
Trigger Event:
push
-
Statement type:
File details
Details for the file local_web_search-0.1.2-py3-none-any.whl.
File metadata
- Download URL: local_web_search-0.1.2-py3-none-any.whl
- Upload date:
- Size: 26.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d26e3f65fe7dfd3de02769eac90a80ea8695c54196fe7b032d7c17f085a5b13e
|
|
| MD5 |
05fc5b61f71739ffb140ea655fc50276
|
|
| BLAKE2b-256 |
4580448c1ab86a03c0a68d56395f28b84a6fcfbee33d397a64920ad5fe46e69e
|
Provenance
The following attestation bundles were made for local_web_search-0.1.2-py3-none-any.whl:
Publisher:
publish.yml on maestromaximo/local-web-search
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
local_web_search-0.1.2-py3-none-any.whl -
Subject digest:
d26e3f65fe7dfd3de02769eac90a80ea8695c54196fe7b032d7c17f085a5b13e - Sigstore transparency entry: 1673178684
- Sigstore integration time:
-
Permalink:
maestromaximo/local-web-search@1a0ff9af28198ae3e78e6e429ce0ac88d9dc0e60 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/maestromaximo
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@1a0ff9af28198ae3e78e6e429ce0ac88d9dc0e60 -
Trigger Event:
push
-
Statement type: