CLI tool for AI agents to observe and interact with Chrome via CDP

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

captivus

These details have not been verified by PyPI

Project description

chrome-agent

A CLI tool that gives AI coding agents the ability to observe and interact with Chrome browsers.

Built as a replacement for browser MCP tools. Faster, lower token overhead, and supports something MCP tools can't do: multiple agents sharing the same browser instance.

Why this exists

AI coding agents need to see and interact with browsers -- to test their code, debug automation, inspect page state. The standard approach (browser MCP tools) uses a persistent server with protocol negotiation and verbose response formatting. chrome-agent takes a different approach: each command is a standalone CLI call that connects to Chrome via the DevTools Protocol, does one thing, and disconnects. No server, no session state, no bloat.

This also enables a workflow that MCP tools can't support: one process drives the browser (your automation code) while a separate agent observes the same browser to diagnose issues and improve the code.

Installation

uv tool install chrome-agent
playwright install chromium

Or add to a project:

uv add chrome-agent
uv run playwright install chromium

Two ways to use it

Drive mode -- you control the browser

Launch a browser and interact with it directly. This is the MCP replacement use case.

chrome-agent launch &
chrome-agent navigate "https://example.com"
chrome-agent text                        # Read page content
chrome-agent element "h1"                # Inspect an element
chrome-agent fill "#search" "query"      # Fill a form field
chrome-agent click "#submit"             # Click a button
chrome-agent screenshot /tmp/page.png    # Capture the screen

Attach mode -- observe a running browser

Your automation code launches a browser with --remote-debugging-port=9222. You connect to observe what the code is doing, diagnose failures, and figure out what to change.

chrome-agent status                      # Is the browser running?
chrome-agent url                         # Where is it?
chrome-agent element "#submit-btn"       # Why can't the code click this?
chrome-agent eval "document.querySelectorAll('.error').length"
chrome-agent screenshot                  # What does it look like?

The feedback loop: write code -> run it -> observe the browser -> diagnose -> modify code -> repeat.

Commands

chrome-agent [--port PORT] <command> [args...]

Check browser status

status                Check if a browser is running on the CDP port
launch                Launch a browser with CDP enabled
                      [--fingerprint PATH] [--headless] [--no-pin-desktop]
help                  Print command reference

Observe (read-only, always safe)

url                   Print current URL and page title
screenshot [path]     Save a screenshot (default: /tmp/cdp-screenshot.png)
snapshot              Print the ARIA accessibility tree
text                  Print visible text content
html [selector]       Print page HTML or a specific element's HTML
element <selector>    Detailed element inspection (visibility, dimensions,
                      attributes, position, disabled state)
find <selector>       Count and list all matching elements
value <selector>      Get an input element's current value
eval <code>           Execute JavaScript and print the result
cookies               List all cookies
tabs                  List all open tabs/pages
wait <target>         Wait for a selector, milliseconds, or load state

Navigate

navigate <url>        Go to a URL
back                  Browser back
forward               Browser forward
reload                Reload the page

Interact

click <selector>      Click an element (JS fallback for hidden elements)
fill <selector> <val> Fill a form field (clears first)
type <selector> <txt> Type text character by character
press <key>           Press a keyboard key (Enter, Escape, Tab, etc.)
select <sel> <value>  Select a dropdown option
check <selector>      Check a checkbox
uncheck <selector>    Uncheck a checkbox
hover <selector>      Hover over an element
scroll <target>       Scroll to element, or scroll up/down
clickxy <x> <y>       Click at page coordinates
close                 Close the current page
viewport <w> <h>      Resize the viewport

For AI agents

The primary user of this tool is an AI coding agent, not a human. See INSTRUCTIONS.md for comprehensive agent instructions covering:

Drive mode vs attach mode mental model
Safety rules for shared browser access
The development feedback loop
When to observe vs intervene
Command recipes for common tasks
Failure modes and recovery

Include the contents of INSTRUCTIONS.md in your project's CLAUDE.md or agent instructions file.

Browser fingerprinting (optional)

For sites that detect automated browsers, launch with a fingerprint profile:

chrome-agent launch --fingerprint path/to/fingerprint.json

The fingerprint JSON overrides the browser's user agent, viewport, locale, timezone, and platform to match a real desktop browser:

{
    "userAgent": "Mozilla/5.0 (X11; Linux x86_64) ...",
    "platform": "Linux x86_64",
    "vendor": "Google Inc.",
    "language": "en-US",
    "timezone": "America/Chicago",
    "viewport": {"width": 1920, "height": 1080}
}

Without --fingerprint, the browser launches with default Chromium settings.

Requirements

Python >= 3.11
Playwright >= 1.50.0
Chromium (installed via playwright install chromium)
Linux with xdotool (optional, for virtual desktop pinning)

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

captivus

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.4.1

Apr 22, 2026

0.4.0

Apr 15, 2026

0.3.0

Apr 14, 2026

0.2.1

Apr 11, 2026

This version

0.2.0

Apr 11, 2026

0.1.0

Apr 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chrome_agent-0.2.0.tar.gz (12.4 kB view details)

Uploaded Apr 11, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

chrome_agent-0.2.0-py3-none-any.whl (15.4 kB view details)

Uploaded Apr 11, 2026 Python 3

File details

Details for the file chrome_agent-0.2.0.tar.gz.

File metadata

Download URL: chrome_agent-0.2.0.tar.gz
Upload date: Apr 11, 2026
Size: 12.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for chrome_agent-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`09d6e94c678f4c66eaa5c60fac21c8066ce500f652519587745ef25402777d8a`
MD5	`00a93e52d63f6a6eed6998519b2b8e42`
BLAKE2b-256	`3d266dc0b2fec8d6838009133d7de6cf4718e1c724b7a46fdc4dcb770c56c8d1`

See more details on using hashes here.

File details

Details for the file chrome_agent-0.2.0-py3-none-any.whl.

File metadata

Download URL: chrome_agent-0.2.0-py3-none-any.whl
Upload date: Apr 11, 2026
Size: 15.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for chrome_agent-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7b6a83d0bca54f2c0640ad204d8cc47628c729b24146d8a755ff7549d9eb8a18`
MD5	`b0a6430afd83307ae3823412ccf13ac8`
BLAKE2b-256	`909b99225dc149396240bb6e7278ff6004ee07e3cfe5935ce9aa22bbddaff071`

See more details on using hashes here.

chrome-agent 0.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

chrome-agent

Why this exists

Installation

Two ways to use it

Drive mode -- you control the browser

Attach mode -- observe a running browser

Commands

Check browser status

Observe (read-only, always safe)

Navigate

Interact

For AI agents

Browser fingerprinting (optional)

Requirements

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes