Production-ready MCP server integrating Selenium 4 (BiDi) for browser automation

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

selenium-mcp

A production-ready MCP (Model Context Protocol) server that exposes Selenium 4 browser automation as MCP tools. Supports Chrome and Firefox with full BiDi (Bidirectional API) event streaming.

Architecture

selenium-mcp/
├── server.py               # MCP entrypoint (JSON-RPC 2.0 over stdio)
├── config/
│   ├── settings.py         # ENV + YAML config loader (singleton)
│   ├── default.yaml        # Default configuration values
│   └── logging_config.py   # Structured logging setup
├── driver/
│   ├── factory.py          # WebDriver factory (Chrome / Firefox + BiDi)
│   ├── session.py          # BrowserSession – wraps a single WebDriver
│   └── session_manager.py  # Registry of all active sessions
├── events/
│   ├── dispatcher.py       # Async pub/sub event dispatcher (asyncio.Queue)
│   ├── bidi_listeners.py   # BiDi WebSocket event listeners
│   └── network_interceptor.py  # CDP/BiDi network interception
├── tools/
│   ├── base.py             # BaseTool + error-screenshot decorator
│   ├── navigation_tools.py # open_page, navigate_back/forward, get_dom
│   ├── interaction_tools.py# click, type_text, get_text, wait_for
│   ├── script_tools.py     # execute_js, screenshot
│   ├── log_tools.py        # get_console_logs, get_network_logs, intercept_requests
│   ├── session_tools.py    # create_session, close_session, list_sessions
│   └── registry.py         # Tool name → callable map + MCP descriptors
└── models/
    ├── session.py          # SessionInfo, BrowserType, SessionStatus
    ├── events.py           # BrowserEvent, ConsoleLogEvent, NetworkRequestEvent, …
    ├── network.py          # NetworkLog, ConsoleLog, InterceptRule, PerformanceMetrics
    └── exceptions.py       # Custom exception hierarchy

Layered design

MCP Client (Claude / any MCP host)
        ↕  JSON-RPC 2.0 / stdio
    server.py  (MCPServer)
        ↕
    tools/registry.py  →  tools/*.py  (business logic)
        ↕
    driver/session_manager.py  →  driver/session.py
        ↕                               ↕
    driver/factory.py           events/bidi_listeners.py
    (WebDriver creation)        (BiDi / CDP event capture)
        ↕                               ↕
    Selenium 4 WebDriver       events/dispatcher.py
    (Chrome / Firefox)         (async pub/sub)

Quick start

Prerequisites

Requirement	Version
Python	3.11+
Chrome / ChromeDriver	latest stable
Firefox / GeckoDriver	latest stable (optional)

Installation

# 1. Clone / enter the project
git clone <repo> selenium-mcp
cd selenium-mcp

# 2. Create a virtual environment
python -m venv .venv
# Windows
.venv\Scripts\activate
# macOS / Linux
source .venv/bin/activate

# 3. Install dependencies
pip install -r requirements.txt

# 4. (Optional) install as editable package
pip install -e .

Run the server

# Stdio mode (standard MCP transport)
python server.py

# Or via the installed entry-point
selenium-mcp

The server reads JSON-RPC 2.0 messages from stdin and writes responses to stdout.

Configuration

Configuration is loaded in priority order:

Environment variables (SMCP_*)
YAML file (SMCP_CONFIG_FILE env var or config/default.yaml)
Built-in defaults

Key settings

ENV variable	YAML key	Default	Description
`SMCP_BROWSER`	`browser.default`	`chrome`	Default browser (`chrome`/`firefox`)
`SMCP_HEADLESS`	`browser.headless`	`true`	Run headless
`SMCP_MAX_SESSIONS`	`browser.max_sessions`	`5`	Max concurrent sessions
`SMCP_BIDI_ENABLED`	`bidi.enabled`	`true`	Enable BiDi WebSocket
`SMCP_LOG_LEVEL`	`server.log_level`	`INFO`	Log level
`SMCP_DEBUG`	`server.debug`	`false`	Verbose debug logging
`SMCP_SCREENSHOT_ON_ERROR`	`screenshot.on_error`	`true`	Auto-screenshot on errors
`SMCP_SCREENSHOT_DIR`	`screenshot.directory`	`screenshots/`	Screenshot output dir

Custom YAML config

SMCP_CONFIG_FILE=/path/to/my-config.yaml python server.py

Example my-config.yaml:

browser:
  default: firefox
  headless: false
  max_sessions: 3
bidi:
  enabled: true
screenshot:
  on_error: true
  directory: /tmp/mcp-screenshots

MCP Tools reference

Session management

Tool	Description	Key params
`create_session`	Open a new browser	`browser`, `headless`
`close_session`	Close a session	`session_id`
`list_sessions`	List active sessions	—
`get_session_info`	Get session metadata	`session_id`

Navigation

Tool	Description	Key params
`open_page`	Navigate to URL	`url`
`navigate_back`	History back	—
`navigate_forward`	History forward	—
`get_dom`	Full page HTML	—

Element interaction

Tool	Description	Key params
`click`	Click a CSS selector	`selector`
`type_text`	Type into an input	`selector`, `text`
`get_text`	Get element text	`selector`
`wait_for`	Wait until visible	`selector`, `timeout`
`wait_for_dom_stable`	Smart DOM-stability wait	`timeout`

Script & media

Tool	Description	Key params
`execute_js`	Run JavaScript	`script`
`screenshot`	Capture viewport (base64 PNG)	—

Logs & network

Tool	Description	Key params
`get_console_logs`	Browser console entries	—
`get_network_logs`	Network request/response log	—
`get_performance_metrics`	Page timing data	—
`intercept_requests`	Register URL intercept rule	`pattern`, `action`

Connecting to MCP clients

Via uvx (recommended — zero install)

uvx mcp_selenium

Via pip

pip install mcp_selenium
selenium-mcp

Local development (uv run)

git clone https://github.com/SCV-Consultants/selenium-mcp.git
cd selenium-mcp
uv run selenium-mcp

Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "selenium": {
      "command": "uvx",
      "args": ["mcp_selenium"],
      "env": {
        "SMCP_HEADLESS": "false",
        "SMCP_BROWSER": "chrome"
      }
    }
  }
}

Antigravity / Gemini

Add to your MCP config (mcp_config.json):

{
  "mcpServers": {
    "selenium": {
      "command": "uvx",
      "args": ["mcp_selenium"],
      "env": {
        "SMCP_HEADLESS": "false",
        "SMCP_BROWSER": "chrome"
      }
    }
  }
}

For local development, use uv run instead:

{
  "mcpServers": {
    "selenium": {
      "command": "uv",
      "args": ["run", "--directory", "/path/to/selenium-mcp", "selenium-mcp"],
      "env": {
        "SMCP_HEADLESS": "false",
        "SMCP_BROWSER": "chrome"
      }
    }
  }
}

Claude CLI

claude mcp add selenium -- uvx mcp_selenium

Install via Smithery

npx -y @smithery/cli install mcp_selenium --client claude

BiDi / Event system

When bidi.enabled: true the server attaches BiDi WebSocket listeners to each session:

Console events – console.log, console.error, etc. are captured and stored per-session. Retrieved via get_console_logs.
JS errors – JavaScript runtime errors are captured as error-level console entries.
Network events – CDP Network.enable (Chrome) captures request/response data. Retrieved via get_network_logs.
Event dispatcher – All events flow through an asyncio.Queue-backed pub/sub hub (events/dispatcher.py). Custom async handlers can be registered per event type for real-time streaming use cases.

Error handling

All tools wrap failures in a typed exception hierarchy:

Exception	Trigger
`SessionNotFoundError`	Invalid `session_id`
`SessionLimitError`	Too many concurrent sessions
`ElementNotFoundError`	CSS selector matched nothing
`ElementInteractionError`	Element not clickable/typeable
`NavigationError`	`get()` / history navigation failed
`ScriptExecutionError`	JavaScript threw or timed out
`TimeoutError`	`wait_for` condition not met
`NetworkInterceptionError`	CDP interception setup failed
`BiDiNotSupportedError`	BiDi requested but unavailable

When screenshot.on_error: true, a PNG is saved to screenshot.directory automatically on any SeleniumMCPError.

Development

# Lint
ruff check .

# Type check
mypy .

# Tests
pytest tests/ -v

Retry mechanism

All element interactions use an internal _retry() helper that retries on StaleElementReferenceException and transient WebDriverException. Configurable via:

retry:
  max_attempts: 3
  backoff_seconds: 1.0

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

dobrzycki

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.0.2

Apr 6, 2026

This version

1.0.1

Apr 6, 2026

1.0.0

Apr 6, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcp_selenium-1.0.1.tar.gz (40.1 kB view details)

Uploaded Apr 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mcp_selenium-1.0.1-py3-none-any.whl (42.2 kB view details)

Uploaded Apr 6, 2026 Python 3

File details

Details for the file mcp_selenium-1.0.1.tar.gz.

File metadata

Download URL: mcp_selenium-1.0.1.tar.gz
Upload date: Apr 6, 2026
Size: 40.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mcp_selenium-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`976f2e14ea66cfdea9d147b3b0ecb87b826014d632190593b2a9fc64f7ceea00`
MD5	`2e8288f255f5fde17382b6ecf49634a4`
BLAKE2b-256	`7f92ba799c8471986fd63e0bba6d6691acd673a6c16236a62c042802c57f8bc2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mcp_selenium-1.0.1.tar.gz:

Publisher: publish.yml on SCV-Consultants/selenium-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mcp_selenium-1.0.1.tar.gz
- Subject digest: 976f2e14ea66cfdea9d147b3b0ecb87b826014d632190593b2a9fc64f7ceea00
- Sigstore transparency entry: 1244389673
- Sigstore integration time: Apr 6, 2026
Source repository:
- Permalink: SCV-Consultants/selenium-mcp@46b0f5c8967bc4cffefeb66adfc6cecf12fcc69c
- Branch / Tag: refs/tags/v1.0.1
- Owner: https://github.com/SCV-Consultants
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@46b0f5c8967bc4cffefeb66adfc6cecf12fcc69c
- Trigger Event: release

File details

Details for the file mcp_selenium-1.0.1-py3-none-any.whl.

File metadata

Download URL: mcp_selenium-1.0.1-py3-none-any.whl
Upload date: Apr 6, 2026
Size: 42.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mcp_selenium-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8dc670f4cdc5c62890cc8e471bdbfa03497e9075d39103e6a3ee551cc0b00f25`
MD5	`70c23168d21de143f874a55071c139f1`
BLAKE2b-256	`ec5252eb4b7c8fb82fbb9350bce0c133aeff0c77c52a7378d5a2a4e09e13858d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mcp_selenium-1.0.1-py3-none-any.whl:

Publisher: publish.yml on SCV-Consultants/selenium-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mcp_selenium-1.0.1-py3-none-any.whl
- Subject digest: 8dc670f4cdc5c62890cc8e471bdbfa03497e9075d39103e6a3ee551cc0b00f25
- Sigstore transparency entry: 1244389705
- Sigstore integration time: Apr 6, 2026
Source repository:
- Permalink: SCV-Consultants/selenium-mcp@46b0f5c8967bc4cffefeb66adfc6cecf12fcc69c
- Branch / Tag: refs/tags/v1.0.1
- Owner: https://github.com/SCV-Consultants
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@46b0f5c8967bc4cffefeb66adfc6cecf12fcc69c
- Trigger Event: release

mcp-selenium 1.0.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

selenium-mcp

Architecture

Layered design

Quick start

Prerequisites

Installation

Run the server

Configuration

Key settings

Custom YAML config

MCP Tools reference

Session management

Navigation

Element interaction

Script & media

Logs & network

Connecting to MCP clients

Via uvx (recommended — zero install)

Via pip

Local development (uv run)

Claude Desktop

Antigravity / Gemini

Claude CLI

Install via Smithery

BiDi / Event system

Error handling

Development

Retry mechanism

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance