MCP Server for browser automation via Selenium WebDriver + geckodriver (Firefox)
Project description
mcp-server-webdriver
MCP Server that lets AI agents control a real web browser via Selenium WebDriver (Firefox + geckodriver).
Built with FastMCP.
What it does
The server eliminates the copy-paste loop between the browser and the AI assistant. Instead of opening DevTools, copying errors, pasting them into a chat, and repeating, the assistant opens the browser itself, navigates, captures errors and screenshots, and diagnoses the problem directly.
You: "Why is the checkout button broken on /cart?"
AI: browser_open → browser_navigate("/cart")
→ devtools_report # JS errors? network failures?
→ browser_screenshot # what does it look like?
→ devtools_computed_css("button#checkout") # hidden? wrong z-index?
→ "The button has pointer-events: none — overridden by .disabled class
applied when cart.js fails to load (404 on /static/cart.js)."
Requirements
| Dependency | Version |
|---|---|
| Python | ≥ 3.11 |
| FastMCP | ≥ 2.10 |
| selenium | ≥ 4.0 |
| Firefox | any recent |
| geckodriver | ≥ 0.34 |
Installation
Recommended — Debian package from VitexSoftware repository
sudo curl -fsSL http://repo.vitexsoftware.com/KEY.gpg -o /usr/share/keyrings/vitexsoftware-archive-keyring.gpg
echo "deb [signed-by=/usr/share/keyrings/vitexsoftware-archive-keyring.gpg] http://repo.vitexsoftware.com trixie main" \
| sudo tee /etc/apt/sources.list.d/vitexsoftware.list
sudo apt update
sudo apt install python3-mcp-server-webdriver
This installs gecko-driver, python3-selenium, python3-fastmcp, and
mcp-server-webdriver in a single step.
Alternative — system package manager
# Debian/Ubuntu (official repos — may be older geckodriver):
sudo apt install firefox-geckodriver
# macOS:
brew install geckodriver
# Rust / cargo (build from source):
cargo install geckodriver
Then install Python dependencies:
pip install fastmcp selenium
Fallback — webdriver-manager (auto-download)
pip install fastmcp selenium webdriver-manager
# geckodriver is downloaded automatically on first browser_open call
Usage
mcp-server-webdriver [OPTIONS]
OPTIONS
-P <profile> Start Firefox with a named profile
--profile <path> Start Firefox with a profile directory at <path>
-h, --help Show help and exit
The server speaks MCP over stdin/stdout and is launched automatically by the MCP client — not by hand.
Use cases
Debug a broken page
Ask: "Why does /dashboard show a blank screen?"
The assistant will:
browser_open— open Firefox headlesslybrowser_navigate— go to/dashboarddevtools_report— get JS errors, console output, and failed network resources in one callbrowser_screenshot— see what the page actually looks like- Explain the root cause from the combined evidence
devtools_report is the primary diagnostic tool — equivalent to opening the Console
and Network tabs in DevTools and reading them simultaneously.
Diagnose a CSS / layout problem
Ask: "The sidebar overlaps the content area on mobile. Why?"
browser_open— open the pagebrowser_screenshot— capture the broken layoutdevtools_computed_css(".sidebar")— checkposition,width,z-index,overflowdevtools_css_variables("--")— verify design tokens loaded correctlydevtools_network_failed— check whether any stylesheet failed to load
Automate a login flow
Ask: "Log into the app at /login with user=admin, password=secret and screenshot the dashboard."
browser_open— start the browserbrowser_navigate— go to/loginbrowser_fill("#username", "admin")— type usernamebrowser_fill("#password", "secret")— type passwordbrowser_press_key("enter")— submit the formbrowser_wait("#dashboard", condition="visible")— wait for redirectbrowser_screenshot— capture the result
Inject a session cookie to skip login
Ask: "Check the admin panel using my existing session token."
browser_open— start the browserbrowser_navigate— go to the app's root so the cookie domain matchesbrowser_set_cookie("session", "<token>")— inject the auth cookiebrowser_navigate— now navigate to the protected pagebrowser_screenshot— confirm access
Enumerate page content
Ask: "List all the navigation links on the homepage."
browser_open+browser_navigate— open the pagebrowser_find_elements("nav a")— get all links with their text, href, and visibility- Return the structured list
Interact with hover menus
Ask: "Click the third item in the Products dropdown."
browser_hover(".nav-products")— trigger the:hoverstate that reveals the dropdownbrowser_wait(".dropdown-menu", condition="visible")— wait for animationbrowser_find_elements(".dropdown-menu a")— list the itemsbrowser_click(".dropdown-menu a:nth-child(3)")— click the right one
Test a multi-step form
Ask: "Fill out the registration form and submit it."
browser_fill("#first-name", "Alice")browser_fill("#last-name", "Smith")browser_fill("#email", "alice@example.com")browser_select("#country", "Czech Republic")browser_press_key("tab")— move focus to next fieldbrowser_click("button[type=submit"]")browser_wait(".success-message", condition="visible")devtools_report— check for any JS errors or failed API calls during submission
Handle JS dialogs
Ask: "Click the Delete button and confirm the dialog."
browser_click("#delete-btn")browser_accept_dialog— click OK on theconfirm("Are you sure?")browser_wait(".deleted-notice", condition="present")
Scroll and capture a long page
Ask: "Screenshot the footer of the page."
browser_open+browser_navigatebrowser_scroll("footer")— scroll the footer element into viewbrowser_screenshot("footer")— capture just the footer element
Or scroll by offset to trigger lazy-loaded content:
browser_scroll(by=True, y=1000)— scroll down 1000 pxbrowser_wait(".lazy-section", condition="visible")— wait for lazy contentbrowser_screenshot— capture the now-loaded content
Use a real Firefox profile (stay logged in)
Configure the server with a named profile that already has your session:
{
"mcpServers": {
"webdriver": {
"command": "mcp-server-webdriver",
"args": ["-P", "work"]
}
}
}
The browser starts with your existing cookies, saved passwords, and extensions. Ask: "Check my GitHub notifications." — no login step needed.
Audit network performance
Ask: "Which resources on /shop are slowest to load?"
browser_open+browser_navigate("/shop")devtools_network_all(slow_ms=500, limit=20)— requests over 500 ms, capped at 20 entries- Report the slowest assets with their URLs, types, and durations
Available Tools
Session management
| Tool | Description |
|---|---|
browser_open |
Open Firefox (URL optional, default about:blank); accepts width, height, user_agent for mobile emulation |
browser_close |
Quit the browser session |
browser_status |
Session state, geckodriver version, BiDi status, current viewport size, buffer counts |
browser_set_viewport |
Resize the viewport mid-session (e.g. 390×844 for iPhone 14) |
Navigation
| Tool | Description |
|---|---|
browser_navigate |
Navigate to a URL (bare hostnames get https://) |
browser_back |
Go back in history |
browser_forward |
Go forward in history |
browser_refresh |
Reload the current page |
Page inspection
| Tool | Description |
|---|---|
browser_screenshot |
Full-page or element PNG screenshot |
browser_get_title |
Current page <title> |
browser_get_url |
Current URL |
browser_get_source |
Raw HTML source |
browser_get_text |
Visible text (whole page or CSS selector) |
browser_get_attribute |
Value of an HTML attribute on an element |
browser_find_elements |
List all elements matching a CSS selector |
Interaction
| Tool | Description |
|---|---|
browser_click |
Click element (CSS selector) |
browser_fill |
Type text into an input field (clears first by default) |
browser_select |
Select <option> in a <select> dropdown |
browser_execute_js |
Run JavaScript — returns JSON |
browser_wait |
Wait: visible / clickable / present / text:<str> |
browser_scroll |
Scroll to coords, by offset, or element into view |
browser_press_key |
Send enter / tab / escape / arrow / F-keys |
browser_hover |
Hover mouse over element (:hover states, tooltips, dropdowns) |
browser_switch_frame |
Switch into <iframe> or back to main document |
Dialogs & cookies
| Tool | Description |
|---|---|
browser_accept_dialog |
Accept a JS alert() / confirm() / prompt() |
browser_dismiss_dialog |
Dismiss a JS confirm() / prompt() |
browser_get_cookies |
Read all cookies for the current page |
browser_set_cookie |
Inject a cookie (auth tokens, session IDs) |
DevTools (require BiDi — Firefox + geckodriver ≥ 0.34)
| Tool | Description |
|---|---|
devtools_report |
Main diagnostic tool — JS errors + console + failed/slow network |
devtools_js_errors |
JavaScript exceptions only |
devtools_console |
Console output (log / warn / error / info / debug) |
devtools_network_failed |
Failed resources (4xx, 5xx, DNS errors) |
devtools_network_all |
All network requests (supports limit= and filters) |
devtools_clear |
Clear buffered DevTools data (use before navigating) |
devtools_enable_bidi |
Attach BiDi listeners to a running session |
devtools_computed_css |
Computed CSS properties of an element |
devtools_element_info |
Bounding box, visibility, attributes, aria, outerHTML |
devtools_css_variables |
CSS custom properties (--var) in scope |
Environment variables
| Variable | Default | Description |
|---|---|---|
GECKODRIVER_PATH |
(unset) | Absolute path to geckodriver binary (highest priority) |
GECKODRIVER_AUTO_INSTALL |
true |
Set to false to disable webdriver-manager fallback |
FIREFOX_BINARY |
(unset) | Path to a custom Firefox executable |
FIREFOX_PROFILE |
(unset) | Named Firefox profile — same as -P |
FIREFOX_PROFILE_DIR |
(unset) | Profile directory path — same as --profile |
geckodriver resolution order
| # | Source | Configure via |
|---|---|---|
| 1 | GECKODRIVER_PATH env variable |
Absolute path to the binary |
| 2 | System PATH (default) | apt install gecko-driver from repo.vitexsoftware.com |
| 3 | webdriver-manager auto-download | Fallback; disable with GECKODRIVER_AUTO_INSTALL=false |
MCP client configuration
Minimal config:
{
"mcpServers": {
"webdriver": {
"command": "mcp-server-webdriver"
}
}
}
With a named Firefox profile (stays logged in, uses saved passwords):
{
"mcpServers": {
"webdriver": {
"command": "mcp-server-webdriver",
"args": ["-P", "work"]
}
}
}
With a profile directory and explicit geckodriver path:
{
"mcpServers": {
"webdriver": {
"command": "mcp-server-webdriver",
"args": ["--profile", "/home/user/.mozilla/firefox/abc123.dev"],
"env": {
"GECKODRIVER_PATH": "/usr/bin/geckodriver"
}
}
}
}
Running tests
# Unit tests only (no browser required):
pytest tests/ -m "not integration"
# All tests including browser integration:
pytest tests/
Related MCP Servers by VitexSoftware
| Server | Description |
|---|---|
| abraflexi-mcp-server | AbraFlexi accounting/ERP integration — invoices, contacts, products, bank transactions |
| mastodon-mcp-server | Mastodon integration — timelines, posting, account management, search |
| semaphore-mcp-server | Semaphore UI integration — manage Ansible, Terraform and other automation workflows |
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mcp_server_webdriver-0.6.0.tar.gz.
File metadata
- Download URL: mcp_server_webdriver-0.6.0.tar.gz
- Upload date:
- Size: 45.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4d60e569232351054c420f67b382fa57daeafa773af9e00c8791eb4db5e19260
|
|
| MD5 |
4db9dbf7eb7fba8937f86a9f0e24f900
|
|
| BLAKE2b-256 |
8ae93da86f7008df37305682986ec6f61419b19d2b7a3c6b4161a18154fc6a13
|
File details
Details for the file mcp_server_webdriver-0.6.0-py3-none-any.whl.
File metadata
- Download URL: mcp_server_webdriver-0.6.0-py3-none-any.whl
- Upload date:
- Size: 24.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ce62abb0af683b3bae6eb60ec6846d4d9960fb683d7da506c433d5bbea9a48bd
|
|
| MD5 |
da90cf0e95208d1b1ff897355f5d6c45
|
|
| BLAKE2b-256 |
ca1a06fd4f3df2114df7e182712005e5404c7d3c58321e78462f562e970525ef
|