Skip to main content

Undetectable browser automation for LLM agents, spoken over MCP.

Project description

zendriver-mcp

Undetectable browser automation for LLM agents, spoken over MCP.

zendriver-mcp is an MCP server that gives your coding agent (Claude, Cursor, Gemini, Copilot) a real Chrome browser it can actually use on the real web - behind Cloudflare, behind login walls, on pages that detect and block WebDriver.

It's built on Zendriver, which speaks the Chrome DevTools Protocol directly instead of going through WebDriver. That means no navigator.webdriver flag, no headless telltales, and a fingerprint that looks like an ordinary Chrome install.

On top of that foundation, zendriver-mcp layers everything an agent needs to get work done: a token-efficient DOM walker, an accessibility tree with stable uids, performance traces, Lighthouse audits, heap snapshots, human-like input, device emulation, cookie round-tripping, and more - 96 tools across 22 modules.

Highlights

  • Undetectable by design. Zendriver keeps a clean fingerprint; we ship a Cloudflare Turnstile solver, identity overrides (UA, locale, timezone, geolocation), and bezier-curve mouse movement plus gaussian typing timing.
  • Token-efficient DOM. Two ways to see the page - the upstream DOM walker that reports 96% fewer tokens than raw HTML, and a CDP accessibility tree keyed by stable uids that survive re-renders.
  • Full DevTools parity. Performance traces that load in Chrome DevTools, heap snapshots in the standard .heapsnapshot format, Lighthouse audits via the Lighthouse CLI reusing the same browser.
  • Session round-tripping. Export all cookies (including HTTP-only ones) to JSON, re-import on the next session. Log in once, reuse everywhere.
  • Emulation. iPhone 15 Pro, Pixel 8, iPad Pro, and desktop presets; CPU throttling; Slow 3G / Fast 3G / 4G / offline network profiles; force prefers-color-scheme.
  • Screencasts. Write frames to disk as JPEG or PNG at configurable FPS.

Tool catalogue

Module Tools
browser start, stop, status
navigation navigate, back, forward, reload, page info
tabs new / list / switch / close
elements click, type, clear, focus, select, upload
query find element(s), text, attributes, buttons, inputs
content HTML, text, interaction tree, scroll
storage cookies (document.cookie), localStorage
logging network + console logs, wait-for-request
forms fill form, submit, key press, mouse click
utils screenshot, execute JS, wait, security audit
stealth Cloudflare bypass, UA / locale / timezone / geolocation overrides
humanlike human_click, human_type, estimated_typing_duration
emulation viewport, device presets, CPU + network throttle, media
devtools start/stop trace, take heap snapshot
lighthouse run audit, check CLI availability
screencast start / stop (writes frame directory)
accessibility AX snapshot with stable uids, click_by_uid, describe_uid
cookies export / import / list / clear (CDP-level, all origins)
network_control block URLs, extra headers, bypass service worker
permissions grant / reset, list names
proxy configure / clear (restarts browser with proxy args)
interception mock_response, fail_requests, list, stop
screencast + export_screencast_mp4, check_ffmpeg_available

Full signatures live in the docstrings of src/tools/*.py and are auto-listed by the MCP handshake.

Install

Published on PyPI: https://pypi.org/project/zendriver-mcp/.

Requires Python 3.10+ and a Chrome / Chromium install.

# Zero-setup, re-resolves on every run - great for Claude Desktop configs
uvx zendriver-mcp

# Or install once, invoke many
uv tool install zendriver-mcp
pipx install zendriver-mcp
pip install zendriver-mcp

Use with Claude Desktop / Claude Code

{
  "mcpServers": {
    "zendriver": {
      "command": "uvx",
      "args": ["zendriver-mcp"]
    }
  }
}

No clone, no --directory, no absolute path. If you prefer a permanent install instead, swap "command": "uvx" for "command": "zendriver-mcp" after running uv tool install zendriver-mcp.

Development checkout

Working on zendriver-mcp itself? Clone + uv sync, then point your MCP client at the checkout:

{
  "mcpServers": {
    "zendriver": {
      "command": "uv",
      "args": ["--directory", "/absolute/path/to/zendriver-mcp", "run", "zendriver-mcp"]
    }
  }
}

Optional flags on the CLI:

  • --browser-path /path/to/chrome - point at a specific Chrome binary
  • --transport stdio - only stdio for now; SSE/HTTP arrive when upstream mcp ships stable support

The 30-second tour

# Ask the browser to start, log in once, save the session.
await start_browser()
await navigate("https://example.com/login")
await fill_form({"#email": "me@me.com", "#pw": "..."})
await export_cookies("~/sessions/example.json")

# Next run: skip the login entirely.
await start_browser()
await import_cookies("~/sessions/example.json")
await navigate("https://example.com/dashboard")

# Get an accessibility snapshot, click by stable uid.
snap = await get_accessibility_snapshot()
await click_by_uid("ax_1b2c3d4e")

# Record a performance trace while you click around.
await start_trace()
await human_click(selector="#buy-now")
await stop_trace("/tmp/buy-flow.json")   # loads in Chrome DevTools

# Run Lighthouse against the current browser.
await run_lighthouse("https://example.com", form_factor="mobile")

Token-optimised DOM walker

The interaction tree emits compact rows like {"id": 1, "t": "btn", "l": "Search", "r": "hdr"}:

  • Compact keys: t (type), l (label), r (region)
  • Smart labels: inferred from aria-label, aria-labelledby, associated <label>, placeholder, text, title, alt
  • Noise filtering: SVG internals, nested interactive children skipped
  • Region tagging: hdr, nav, main, side, ftr, dlg
  • Type compression: button -> btn, checkbox -> chk, etc.

Reported reduction on perplexity.ai: ~96% fewer tokens than raw HTML (~11k -> ~400).

For flows that span multiple actions, prefer get_accessibility_snapshot + click_by_uid - the uids stay valid as long as the underlying backend node survives, even across re-renders.

Development

uv sync
uv run ruff check .
uv run ruff format --check .
uv run mypy src
uv run pytest

CI runs the same four on every push and PR.

Roadmap

Everything on the original roadmap shipped in the 0.2 / 0.3 releases:

  • Stealth: Cloudflare solver, UA / locale / timezone / geolocation
  • Human-like input: bezier mouse paths, gaussian keystroke timing
  • DevTools parity: traces, heap snapshots, Lighthouse
  • Screencast + mp4 export via ffmpeg
  • Accessibility tree with stable uids
  • Cookie import/export, blocking, extra headers, permissions
  • Request interception + response mocking (Fetch.enable)
  • Proxy configuration (restart with --proxy-server)
  • ToolResponse envelope adopted in rich-output tools
  • PyPI publish workflow (Trusted Publishing)

Documentation site: https://bituq.github.io/zendriver-mcp/

What's next is driven by actual usage. Ideas on deck:

  • Per-request proxy routing via Fetch interception
  • Binary body support in mock_response
  • Fetch.enable with patterns (faster than our "match all then filter")

License

MIT. See LICENSE.

Acknowledgements

  • Zendriver does the heavy lifting underneath.
  • The token-optimised DOM walker and the original 49-tool foundation come from ShubhamChoulkar/Zendriver-MCP. This project started as a fork and has since grown its own identity and feature set.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

zendriver_mcp-0.3.3.tar.gz (67.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

zendriver_mcp-0.3.3-py3-none-any.whl (77.3 kB view details)

Uploaded Python 3

File details

Details for the file zendriver_mcp-0.3.3.tar.gz.

File metadata

  • Download URL: zendriver_mcp-0.3.3.tar.gz
  • Upload date:
  • Size: 67.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for zendriver_mcp-0.3.3.tar.gz
Algorithm Hash digest
SHA256 9fedcb9581a4dd3c1adbed5657a6f4965df0e83d932d8948edecea8d7aab24e1
MD5 87b72e27ff21c6caea251124b919fa11
BLAKE2b-256 066cdbacc96a9ccd08f777a39e21151c74abf7f609ad7eedbd3d128eda8fa8f8

See more details on using hashes here.

Provenance

The following attestation bundles were made for zendriver_mcp-0.3.3.tar.gz:

Publisher: publish.yml on bituq/zendriver-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file zendriver_mcp-0.3.3-py3-none-any.whl.

File metadata

  • Download URL: zendriver_mcp-0.3.3-py3-none-any.whl
  • Upload date:
  • Size: 77.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for zendriver_mcp-0.3.3-py3-none-any.whl
Algorithm Hash digest
SHA256 6bbe95ec173045f3db4d085cfc1a47d9bd226f38c299c773d145815f260daf05
MD5 b20d2854bd22ea923f04ec545c516efd
BLAKE2b-256 b2eabcaa3fd28322ff8a5e600a036637cb89fc2b65da1c629820be492c14c414

See more details on using hashes here.

Provenance

The following attestation bundles were made for zendriver_mcp-0.3.3-py3-none-any.whl:

Publisher: publish.yml on bituq/zendriver-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page