OmniScout CLI: local-first multi-browser automation, semantic search, and research for AI agents

Maintainers

sriramramnath

OmniScout

Local-first browser automation, semantic search, and research for AI agents.

Website: omniscout.xyz · Docs: docs.omniscout.xyz

No cloud APIs. No hosted browser sessions. No MCP yet. No SDK.

The CLI is the interface. Install the omniscout command and drive everything from the terminal or via JSON (--json / OMNISCOUT_JSON=1).

scout is a short alias. harness is a legacy dev alias kept for compatibility.

Install

Requires Python 3.11+ and a Chromium-based browser (Chrome by default; Edge, Brave, Vivaldi, and others supported — see Settings below).

# One-liner (pip + browser + models + agent skill)
curl -fsSL https://omniscout.xyz/install.sh | bash

# Or step by step
pip install omniscout
omniscout install --skill            # browser + models + agent skill files
omniscout install --browser brave    # non-interactive browser choice
omniscout settings browsers          # list supported / installed browsers

If no Chromium browser is installed, add --bundled to download Playwright Chromium (~190MB).

Search commands auto-start the local daemon and keep the embedding model loaded in RAM across invocations — no manual warm-up step required.

Features

Browser automation (daemon-backed)

Long-lived daemon at 127.0.0.1:7720 for sub-second per-action latency.

Playwright backend (default) — local Chrome with persistent profiles
Chrome extension backend (opt-in) — drives your real running Chrome via chrome.debugger; same JSON vocabulary, real cookies and logins
Atomic actions: navigate, snapshot, click, fill, type, paste, select, scroll, key, hover, back, forward, reload, get, is, wait, mouse, screenshot, pdf, eval, tabs, network, console, upload, login, captcha, close
Stable @eN refs from the accessibility tree (preferred over CSS selectors); list them with snapshot --refs-only (there is no separate refs alias)
Persistent profiles — log in once, stay logged in
CAPTCHA: local-first manual handoff; optional 2captcha / capsolver solvers
Network + console capture with list and incremental tail for agent debugging
Session restore across daemon restarts

Semantic search

DuckDuckGo HTML search with optional local embedding rerank
Sources: ddg, index (local crawl corpus), memory (remembered visits), hybrid (memory + DDG)
omniscout answer returns grounded one-sentence answers. It tries direct DDG answers first (snippets, Search Assist), then extractive parsing, a local LLM, and a limited crawl. Depth modes: auto, fast, balanced, deep, with an extractive fallback.

Warm embedding model

Search, research, and memory commands route embeddings through the daemon. The sentence-transformers model (all-MiniLM-L6-v2) loads once (~2s) and stays hot. omniscout daemon status reports embed_model_loaded.

Content extraction

Fetch URLs to clean Markdown, plain text, structured JSON fields, or full JSON metadata via trafilatura + markdownify.

--format structured — auto-extract everything found (company, pricing, socials, docs/blog/careers URLs, contact info, labeled fields). NLP only, no LLM. Empty fields omitted. Quiet stdout (fields JSON only).
--query / -q — search DuckDuckGo, crawl top hits, follow same-host links (--depth, default 3), merge pages, extract structured fields (no URL required).
--fields company,pricing,... — limit structured output to specific keys
--data — include full ExtractResult plus stderr diagnostics

omniscout extract https://example.com --format structured
omniscout extract https://example.com --format structured --fields twitter,pricing
omniscout extract -q "SpaceX founder" --format structured --fields founder

Research pipeline

Multi-step: search → crawl → extract → embed → rerank → summarize.
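The rerank step can be illustrated with a small sketch. This is not OmniScout's implementation: the README does not say how candidates are scored, so cosine similarity is an assumption, the vectors are toy values rather than all-MiniLM-L6-v2 embeddings, and `cosine` and `rerank` are hypothetical helper names.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rerank(query_vec, candidates, top_k=3):
    """Order (doc, vector) pairs by similarity to the query, best first."""
    scored = [(cosine(query_vec, vec), doc) for doc, vec in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:top_k]]
```

In a real pipeline the vectors would come from the embedding model, and the reranked documents would feed the summarize step.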

Knowledge graphs

omniscout graph maps an entity (product, company, person) into a structured Unicode tree — Company, Founders, Competitors, Pricing, Features, Reviews, and more. Default: 3 web sources, local LLM synthesis (same model as answer). Pass a URL or --website / -w to crawl only that site and same-host links (no DuckDuckGo). Use --no-llm for heuristic-only graphs, --data for sources and timing, --json for agents.

omniscout graph "Cursor"
omniscout graph "cursor.com"
omniscout graph "Cursor" -w cursor.com --data

Browser memory

Remember visits and notes; semantic search over your browsing history.

omniscout remember <url> — visit, extract, index
omniscout memory list|show|note|delete|stats|clear

Workflow shortcuts

Top-level commands for agent ergonomics:

omniscout open <url|index> — open URL or latest search result
omniscout snapshot, omniscout context, omniscout reset
omniscout workflow export — JSON steps from workflow state + action history

Replay & observability

Every daemon action is logged to $OMNISCOUT_DATA_DIR/daemon/actions.jsonl:

omniscout daemon trace — recent activity table or JSON
omniscout daemon replay <action_id> — re-run a single action
omniscout daemon watch — live SSE event stream
Top-level omniscout replay action-<id> and omniscout replay session-<name>
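Because the action log is a JSON Lines file, it can also be read without the CLI. A minimal sketch, assuming one JSON object per line; the record schema is not documented here, so the function returns the parsed objects unchanged, and `read_recent_actions` is a hypothetical helper name.

```python
import json
from pathlib import Path


def read_recent_actions(log_path, limit=20):
    """Return up to `limit` of the most recent records from a JSONL action log."""
    if limit <= 0:
        return []
    records = []
    with Path(log_path).open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            try:
                records.append(json.loads(line))
            except json.JSONDecodeError:
                # Skip a line that is malformed or still being written.
                continue
    return records[-limit:]
```

Pass the path to actions.jsonl under your data directory's daemon/ folder. For anything beyond a quick look, omniscout daemon trace is the documented interface.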

Benchmarks

omniscout benchmark answers — latency + correctness matrix over answer modes
omniscout benchmark startup — CLI process launch overhead

Quickstart

# Search
omniscout search "local-first browser agents"
omniscout answer "who is the president" --depth balanced

# Extract
omniscout extract https://example.com
omniscout extract https://example.com --format structured

# Browser (daemon auto-starts)
omniscout browser navigate https://example.com
omniscout browser snapshot --refs-only
omniscout browser click '@e1'
omniscout browser screenshot --out /tmp/state.png
omniscout browser screenshot --full-length --out /tmp/full.png   # full page
omniscout browser close --all

# Research
omniscout research "state of local AI agents in 2026"

# Knowledge graph
omniscout graph "Cursor"
omniscout graph "https://cursor.com"

# Profiles & sessions
omniscout profile create work
omniscout browser open https://news.ycombinator.com --profile work --headful
omniscout session start --headful

Optional warm-up before a batch of searches:

omniscout warmup

JSON output (for agents)

Every command supports --json. Set OMNISCOUT_JSON=1 to make JSON the default for an entire shell session. Logs go to stderr; stdout is the structured result.

export OMNISCOUT_JSON=1
omniscout search "robotics simulators" --limit 5
omniscout browser navigate https://example.com --session demo
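An agent written in Python could wrap the CLI along these lines. This is a sketch, not an official SDK (the project states there is none): `build_invocation` and `run_omniscout` are hypothetical helper names, and the shape of the returned JSON is deliberately not assumed.

```python
import json
import os
import subprocess


def build_invocation(args, base_env=None):
    """Return (argv, env) for an omniscout call with JSON output enabled."""
    env = dict(os.environ if base_env is None else base_env)
    env["OMNISCOUT_JSON"] = "1"  # documented switch that makes JSON the default
    return ["omniscout", *args], env


def run_omniscout(args, timeout=120.0):
    """Run the CLI and parse stdout as JSON.

    Only stdout is captured; logs go to stderr and pass through untouched.
    """
    argv, env = build_invocation(args)
    proc = subprocess.run(
        argv, env=env, stdout=subprocess.PIPE, text=True, timeout=timeout, check=True
    )
    return json.loads(proc.stdout)
```

For example, `run_omniscout(["search", "robotics simulators", "--limit", "5"])` mirrors the shell command above. With `check=True`, a non-zero exit raises `subprocess.CalledProcessError` instead of returning partial output.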

Direct HTTP (no CLI wrapper):

curl -s -X POST http://127.0.0.1:7720/command \
  -H 'Content-Type: application/json' \
  -d '{"action":"navigate","args":{"url":"https://example.com"},"session":"demo"}'
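The same request can be issued from Python with only the standard library. A minimal sketch, assuming the daemon replies with a JSON body (the response format is not documented here); `build_request` and `send_command` are hypothetical helper names.

```python
import json
import urllib.request

DAEMON_URL = "http://127.0.0.1:7720/command"


def build_request(action, args, session=None, url=DAEMON_URL):
    """Build the POST request for one daemon action without sending it."""
    payload = {"action": action, "args": args}
    if session is not None:
        payload["session"] = session
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_command(action, args, session=None, timeout=30.0):
    """Send one action to a running daemon and decode the reply as JSON (assumed)."""
    request = build_request(action, args, session)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

`send_command("navigate", {"url": "https://example.com"}, session="demo")` is equivalent to the curl call above. It requires the daemon to be running already, since a plain HTTP request does not go through the CLI's auto-start.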

Architecture

omniscout CLI ──HTTP POST /command──▶ omniscout daemon (127.0.0.1:7720)
     │                                      ├─ Playwright backend
     │                                      ├─ Extension backend (opt-in)
     │                                      └─ Embed service (warm model)
     └── Search / Extract / Research engines (local Qdrant + DDG)

Python package layout (for contributors):

cli/omniscout/
  app.py              # Typer root (binary: omniscout)
  commands/           # CLI sub-commands
  daemon/             # HTTP server, backends, replay, events
  engines/            # browser, search, research, extractor, crawler
  store/              # SQLite cache, sessions, workflow, memory
  models.py           # pydantic JSON contract

On-disk state

Path	Purpose
`profiles/`	Persistent Chrome user-data-dirs
`qdrant/`	Embedded vector index
`models/sentence-transformers/`	Prefetched embedding model
`memory.sqlite`	Browser memory (visits + notes)
`sessions.sqlite`	Long-lived browser session registry
`cache/pages/`	Content-hashed HTML cache
`daemon/`	PID, port, logs, action history, session restore

Default locations:

macOS — ~/Library/Application Support/omniscout/
Linux — ~/.local/share/omniscout/

Override with OMNISCOUT_DATA_DIR, OMNISCOUT_CONFIG_DIR, OMNISCOUT_CACHE_DIR. Legacy HARNESS_* names are still accepted.
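A script that needs to find the data directory could mirror this lookup as follows. The sketch rests on several assumptions: the new variable name takes precedence over the legacy one, the legacy name is HARNESS_DATA_DIR (the README only says HARNESS_* names are accepted), and any non-macOS platform uses the Linux layout. `resolve_data_dir` is a hypothetical helper, not part of OmniScout.

```python
import os
import sys
from pathlib import Path


def default_data_dir(platform=sys.platform, home=None):
    """Documented default location for macOS; Linux layout otherwise (assumption)."""
    home = Path.home() if home is None else Path(home)
    if platform == "darwin":
        return home / "Library" / "Application Support" / "omniscout"
    return home / ".local" / "share" / "omniscout"


def resolve_data_dir(env=None, platform=sys.platform, home=None):
    """Apply the environment override, then fall back to the default location."""
    env = os.environ if env is None else env
    # Assumed precedence: current name first, then the legacy HARNESS_ name.
    for name in ("OMNISCOUT_DATA_DIR", "HARNESS_DATA_DIR"):
        value = env.get(name)
        if value:
            return Path(value).expanduser()
    return default_data_dir(platform, home)
```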

Configuration

config.toml (in config dir):

default_source = "ddg"
search_limit = 10
research_results = 8
request_throttle_seconds = 1.0
embedding_model = "sentence-transformers/all-MiniLM-L6-v2"
embedding_local_only = true
browser = "chrome"                    # chrome | edge | brave | vivaldi | opera | arc | dia | thorium | chromium | custom
# browser_executable = "/path/to/binary"  # optional override or required for custom
summary_sentences = 6

Or use the settings command:

omniscout settings browsers
omniscout settings set browser brave
omniscout settings set browser custom --executable /path/to/chromium
omniscout settings show

Supported browser ids: chrome, edge, brave, vivaldi, opera, arc, dia, thorium, chromium, custom. Legacy browser_channel in config.toml is still honored.

Environment variables

Variable	Purpose
`OMNISCOUT_JSON=1`	Force JSON output on every command
`OMNISCOUT_EMBED_DAEMON=1`	Route embeds through daemon (default on)
`OMNISCOUT_DAEMON_AUTO_START=0`	Don't auto-start daemon
`OMNISCOUT_DAEMON_PORT`	Daemon port (default 7720)
`OMNISCOUT_DATA_DIR`	Override data directory
`OMNISCOUT_BROWSER`	Browser id (same as `browser` in config.toml)
`OMNISCOUT_EMBED_LOCAL_ONLY=0`	Allow runtime Hugging Face fetches
`TWOCAPTCHA_API_KEY`	CAPTCHA solver API key

Legacy HARNESS_* equivalents work for all of the above.
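A client that talks to the daemon over HTTP should honor the port variable rather than hard-coding 7720. A minimal sketch, assuming the daemon stays bound to 127.0.0.1 on a custom port, that the legacy name is HARNESS_DAEMON_PORT, and that the current name wins when both are set; `daemon_command_url` is a hypothetical helper.

```python
import os

DEFAULT_PORT = 7720


def daemon_command_url(env=None):
    """Build the /command URL from the environment, defaulting to port 7720."""
    env = os.environ if env is None else env
    # Assumed precedence: current name first, then the legacy HARNESS_ name.
    raw = env.get("OMNISCOUT_DAEMON_PORT") or env.get("HARNESS_DAEMON_PORT")
    port = int(raw) if raw else DEFAULT_PORT
    if not 1 <= port <= 65535:
        raise ValueError(f"invalid daemon port: {port}")
    return f"http://127.0.0.1:{port}/command"
```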

Why your own browser?

Using your installed Chromium browser (Chrome, Edge, Brave, etc.) gives you real cookies, login state, extensions, and the same fingerprint as daily browsing — without a separate ~190MB Chromium download. OmniScout falls back to other installed Chromium builds automatically, then to Playwright's bundled Chromium when nothing else is available.

License

Modified MIT (see LICENSE). Products built on OmniScout must prominently display "Powered by OmniScout" in the user interface.

Release history

0.2.10 (Jun 13, 2026)
0.2.9.1 (Jun 11, 2026)
0.2.9 (Jun 10, 2026), this version
0.2.8 (Jun 9, 2026)
0.2.7.1 (Jun 7, 2026)
0.2.7 (Jun 7, 2026)
0.2.6 (Jun 6, 2026)
0.2.5 (Jun 4, 2026)
0.2.4 (Jun 4, 2026)
0.2.3 (May 31, 2026)
0.2.2 (May 31, 2026)
0.2.1 (May 30, 2026)
0.2.0 (May 30, 2026)
0.1.0 (May 29, 2026)

Download files

Source Distribution

omniscout-0.2.9.tar.gz (167.0 kB)

Uploaded Jun 10, 2026 Source

Built Distribution

omniscout-0.2.9-py3-none-any.whl (203.9 kB)

Uploaded Jun 10, 2026 Python 3

File details

Details for the file omniscout-0.2.9.tar.gz.

File metadata

Download URL: omniscout-0.2.9.tar.gz
Upload date: Jun 10, 2026
Size: 167.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omniscout-0.2.9.tar.gz
Algorithm	Hash digest
SHA256	`06e06e4b325ef374f9542e6bc43db5194f6e338f84b4ef808996f359c405f85f`
MD5	`2aeee42e8bda4dba74c1648864213ba8`
BLAKE2b-256	`a91ad7b815ab0d7d68ac00f51157f257ee20c817fd51f1c13e001935a093fb30`

Provenance

The following attestation bundles were made for omniscout-0.2.9.tar.gz:

Publisher: pypi-publish.yml on sriramramnath/omniscout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: omniscout-0.2.9.tar.gz
- Subject digest: 06e06e4b325ef374f9542e6bc43db5194f6e338f84b4ef808996f359c405f85f
- Sigstore transparency entry: 1779486306
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: sriramramnath/omniscout@83ff9bbefaca44b18990f14b4f73afa9a82c76f2
- Branch / Tag: refs/heads/main
- Owner: https://github.com/sriramramnath
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-publish.yml@83ff9bbefaca44b18990f14b4f73afa9a82c76f2
- Trigger Event: workflow_dispatch

File details

Details for the file omniscout-0.2.9-py3-none-any.whl.

File metadata

Download URL: omniscout-0.2.9-py3-none-any.whl
Upload date: Jun 10, 2026
Size: 203.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omniscout-0.2.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`93efdd12b27cdba3f7d90f88544faedb39366b053a0505d8b1b0f053b5e3874d`
MD5	`bee6f7c6e63b0e42ea17b5643fe70deb`
BLAKE2b-256	`8761ceaf8b99528e6d64424b1030ad4cad20a86d1b32dfceb14bb22f35e0bf7e`

Provenance

The following attestation bundles were made for omniscout-0.2.9-py3-none-any.whl:

Publisher: pypi-publish.yml on sriramramnath/omniscout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: omniscout-0.2.9-py3-none-any.whl
- Subject digest: 93efdd12b27cdba3f7d90f88544faedb39366b053a0505d8b1b0f053b5e3874d
- Sigstore transparency entry: 1779486413
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: sriramramnath/omniscout@83ff9bbefaca44b18990f14b4f73afa9a82c76f2
- Branch / Tag: refs/heads/main
- Owner: https://github.com/sriramramnath
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-publish.yml@83ff9bbefaca44b18990f14b4f73afa9a82c76f2
- Trigger Event: workflow_dispatch
