Skip to main content

Undetectable browser automation for LLM agents, spoken over MCP.

Project description

zendriver-mcp

Undetectable browser automation for LLM agents, spoken over MCP.

zendriver-mcp is an MCP server that gives your coding agent (Claude, Cursor, Gemini, Copilot) a real Chrome browser it can actually use on the real web - behind Cloudflare, behind login walls, on pages that detect and block WebDriver.

It's built on Zendriver, which speaks the Chrome DevTools Protocol directly instead of going through WebDriver. That means no navigator.webdriver flag, no headless telltales, and a fingerprint that looks like an ordinary Chrome install.

On top of that foundation, zendriver-mcp layers everything an agent needs to get work done: a token-efficient DOM walker, an accessibility tree with stable uids, performance traces, Lighthouse audits, heap snapshots, human-like input, device emulation, cookie round-tripping, and more - 96 tools across 22 modules.

Highlights

  • Undetectable by design. Zendriver keeps a clean fingerprint; we ship a Cloudflare Turnstile solver, identity overrides (UA, locale, timezone, geolocation), and bezier-curve mouse movement plus gaussian typing timing.
  • Token-efficient DOM. Two ways to see the page - the upstream DOM walker that reports 96% fewer tokens than raw HTML, and a CDP accessibility tree keyed by stable uids that survive re-renders.
  • Full DevTools parity. Performance traces that load in Chrome DevTools, heap snapshots in the standard .heapsnapshot format, Lighthouse audits via the Lighthouse CLI reusing the same browser.
  • Session round-tripping. Export all cookies (including HTTP-only ones) to JSON, re-import on the next session. Log in once, reuse everywhere.
  • Emulation. iPhone 15 Pro, Pixel 8, iPad Pro, and desktop presets; CPU throttling; Slow 3G / Fast 3G / 4G / offline network profiles; force prefers-color-scheme.
  • Screencasts. Write frames to disk as JPEG or PNG at configurable FPS.

Tool catalogue

Module Tools
browser start, stop, status
navigation navigate, back, forward, reload, page info
tabs new / list / switch / close
elements click, type, clear, focus, select, upload
query find element(s), text, attributes, buttons, inputs
content HTML, text, interaction tree, scroll
storage cookies (document.cookie), localStorage
logging network + console logs, wait-for-request
forms fill form, submit, key press, mouse click
utils screenshot, execute JS, wait, security audit
stealth Cloudflare bypass, UA / locale / timezone / geolocation overrides
humanlike human_click, human_type, estimated_typing_duration
emulation viewport, device presets, CPU + network throttle, media
devtools start/stop trace, take heap snapshot
lighthouse run audit, check CLI availability
screencast start / stop (writes frame directory)
accessibility AX snapshot with stable uids, click_by_uid, describe_uid
cookies export / import / list / clear (CDP-level, all origins)
network_control block URLs, extra headers, bypass service worker
permissions grant / reset, list names
proxy configure / clear (restarts browser with proxy args)
interception mock_response, fail_requests, list, stop
screencast + export_screencast_mp4, check_ffmpeg_available

Full signatures live in the docstrings of src/tools/*.py and are auto-listed by the MCP handshake.

Install

Published on PyPI: https://pypi.org/project/zendriver-mcp/.

Requires Python 3.10+ and a Chrome / Chromium install.

# Zero-setup, re-resolves on every run - great for Claude Desktop configs
uvx zendriver-mcp

# Or install once, invoke many
uv tool install zendriver-mcp
pipx install zendriver-mcp
pip install zendriver-mcp

Use with Claude Desktop / Claude Code

{
  "mcpServers": {
    "zendriver": {
      "command": "uvx",
      "args": ["zendriver-mcp"]
    }
  }
}

No clone, no --directory, no absolute path. If you prefer a permanent install instead, swap "command": "uvx" for "command": "zendriver-mcp" after running uv tool install zendriver-mcp.

Development checkout

Working on zendriver-mcp itself? Clone + uv sync, then point your MCP client at the checkout:

{
  "mcpServers": {
    "zendriver": {
      "command": "uv",
      "args": ["--directory", "/absolute/path/to/zendriver-mcp", "run", "zendriver-mcp"]
    }
  }
}

Optional flags on the CLI:

  • --browser-path /path/to/chrome - point at a specific Chrome binary
  • --transport stdio - only stdio for now; SSE/HTTP arrive when upstream mcp ships stable support

The 30-second tour

# Ask the browser to start, log in once, save the session.
await start_browser()
await navigate("https://example.com/login")
await fill_form({"#email": "me@me.com", "#pw": "..."})
await export_cookies("~/sessions/example.json")

# Next run: skip the login entirely.
await start_browser()
await import_cookies("~/sessions/example.json")
await navigate("https://example.com/dashboard")

# Get an accessibility snapshot, click by stable uid.
snap = await get_accessibility_snapshot()
await click_by_uid("ax_1b2c3d4e")

# Record a performance trace while you click around.
await start_trace()
await human_click(selector="#buy-now")
await stop_trace("/tmp/buy-flow.json")   # loads in Chrome DevTools

# Run Lighthouse against the current browser.
await run_lighthouse("https://example.com", form_factor="mobile")

Token-optimised DOM walker

The interaction tree emits compact rows like {"id": 1, "t": "btn", "l": "Search", "r": "hdr"}:

  • Compact keys: t (type), l (label), r (region)
  • Smart labels: inferred from aria-label, aria-labelledby, associated <label>, placeholder, text, title, alt
  • Noise filtering: SVG internals, nested interactive children skipped
  • Region tagging: hdr, nav, main, side, ftr, dlg
  • Type compression: button -> btn, checkbox -> chk, etc.

Reported reduction on perplexity.ai: ~96% fewer tokens than raw HTML (~11k -> ~400).

For flows that span multiple actions, prefer get_accessibility_snapshot + click_by_uid - the uids stay valid as long as the underlying backend node survives, even across re-renders.

Development

uv sync
uv run ruff check .
uv run ruff format --check .
uv run mypy src
uv run pytest

CI runs the same four on every push and PR.

Roadmap

Everything on the original roadmap shipped in the 0.2 / 0.3 releases:

  • Stealth: Cloudflare solver, UA / locale / timezone / geolocation
  • Human-like input: bezier mouse paths, gaussian keystroke timing
  • DevTools parity: traces, heap snapshots, Lighthouse
  • Screencast + mp4 export via ffmpeg
  • Accessibility tree with stable uids
  • Cookie import/export, blocking, extra headers, permissions
  • Request interception + response mocking (Fetch.enable)
  • Proxy configuration (restart with --proxy-server)
  • ToolResponse envelope adopted in rich-output tools
  • PyPI publish workflow (Trusted Publishing)

Documentation site: https://bituq.github.io/zendriver-mcp/

What's next is driven by actual usage. Ideas on deck:

  • Per-request proxy routing via Fetch interception
  • Binary body support in mock_response
  • Fetch.enable with patterns (faster than our "match all then filter")

License

MIT. See LICENSE.

Acknowledgements

  • Zendriver does the heavy lifting underneath.
  • The token-optimised DOM walker and the original 49-tool foundation come from ShubhamChoulkar/Zendriver-MCP. This project started as a fork and has since grown its own identity and feature set.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

zendriver_mcp-0.3.1.tar.gz (62.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

zendriver_mcp-0.3.1-py3-none-any.whl (71.9 kB view details)

Uploaded Python 3

File details

Details for the file zendriver_mcp-0.3.1.tar.gz.

File metadata

  • Download URL: zendriver_mcp-0.3.1.tar.gz
  • Upload date:
  • Size: 62.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for zendriver_mcp-0.3.1.tar.gz
Algorithm Hash digest
SHA256 c0c4a37df93c4f793acf697a9e52adf1701b84cc2c60274a1e104f4c200c1ef4
MD5 fe7458a6ec603d5ea2cc52d59eeb09d1
BLAKE2b-256 318920382d872c84053735aa386f5e3608e9720198a09bee7e2e1a706036c249

See more details on using hashes here.

Provenance

The following attestation bundles were made for zendriver_mcp-0.3.1.tar.gz:

Publisher: publish.yml on bituq/zendriver-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file zendriver_mcp-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: zendriver_mcp-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 71.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for zendriver_mcp-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2a4d74a75113d3a05732f868e5060ea649972e95fe9f55ed58ddcf1c32e176bd
MD5 30e208de617e25c1978f8bf3b38aad9c
BLAKE2b-256 aa9ced5ef674425d36b095dc81e3efc3452eda4e38eefb43eb31ff7455b77f91

See more details on using hashes here.

Provenance

The following attestation bundles were made for zendriver_mcp-0.3.1-py3-none-any.whl:

Publisher: publish.yml on bituq/zendriver-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page