Browser screenshot tool using Playwright async API

These details have not been verified by PyPI

Project links

Project description

brosh - Browser Screenshot Tool

A powerful browser screenshot tool that captures scrolling screenshots of webpages using Playwright's async API. Supports intelligent section identification, multiple output formats including animated PNG, and MCP (Model Context Protocol) integration.

1. Table of Contents

Features
How It Works
Installation
Quick Start
Usage
Command Reference
- Global Options
- Commands
Output
Advanced Usage
MCP Integration
Architecture
Development
Troubleshooting
Contributing
License

2. Features

🚀 Async Playwright Integration: Fast and reliable browser automation
🔍 Smart Section Detection: Automatically identifies visible sections for descriptive filenames
🖼️ Multiple Formats: PNG, JPG, and animated PNG (APNG) output
🌐 Browser Support: Chrome, Edge, and Safari (macOS)
🔌 Remote Debugging: Connects to existing browser sessions preserving cookies/auth
🤖 MCP Server: Integrate with AI tools via Model Context Protocol with configurable output
📄 HTML Extraction: Optionally capture HTML content of visible elements
📝 Text Extraction: Automatically converts visible content to Markdown text with optional trimming
📐 Flexible Scrolling: Configurable scroll steps and starting positions
🎯 Precise Control: Set viewport size, zoom level, and output scaling
🔄 Automatic Retries: Robust error handling with configurable retry logic

3. How It Works

brosh works by:

Browser Connection: Connects to an existing browser in debug mode or launches a new instance
Page Navigation: Navigates to the specified URL and waits for content to load
Smart Scrolling: Scrolls through the page in configurable steps, capturing screenshots
Section Detection: Identifies visible headers and elements to create meaningful filenames
Image Processing: Applies scaling, format conversion, and creates animations if requested
Output Organization: Saves screenshots with descriptive names including domain, timestamp, and section

The tool is especially useful for:

Documentation: Capturing long technical documentation or API references
QA Testing: Visual regression testing and bug reporting
Content Archival: Preserving web content with full page captures
Design Reviews: Sharing complete page designs with stakeholders
AI Integration: Providing visual context to language models via MCP

4. Installation

4.1. Using uv/uvx (Recommended)

uv is a fast Python package manager that replaces pip, pip-tools, pipx, poetry, pyenv, and virtualenv.

# Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Run brosh directly with uvx (no installation needed)
uvx brosh shot "https://example.com"

# Or install globally
uv tool install brosh

# Install with all extras
uv tool install "brosh[all]"

4.2. Using pip

# Basic installation
pip install brosh

# With all optional dependencies
pip install "brosh[all]"

4.3. Using pipx

pipx installs Python applications in isolated environments.

# Install pipx (if not already installed)
python -m pip install --user pipx
python -m pipx ensurepath

# Install brosh
pipx install brosh

4.4. From Source

git clone https://github.com/twardoch/brosh.git
cd brosh
pip install -e ".[all]"

4.5. Install Playwright Browsers

After installation, you need to install the browser drivers:

playwright install

5. Quick Start

# Capture a single webpage
brosh shot "https://example.com"

# Capture a local HTML file
brosh shot "file:///path/to/local/file.html"

# Start browser in debug mode for better performance
brosh run
brosh shot "https://example.com"

# Create an animated PNG showing the scroll
brosh shot "https://example.com" --format apng

# Capture with custom viewport
brosh --width 1920 --height 1080 shot "https://example.com"

# Extract HTML content
brosh shot "https://example.com" --fetch_html --json > content.json

6. Usage

6.1. Command Line Interface

brosh provides a Fire-based CLI with intuitive commands and options.

6.1.1. Basic Screenshot Capture

# Simple capture
brosh shot "https://example.com"

# Capture with custom settings
brosh --width 1920 --height 1080 --zoom 125 shot "https://example.com"

# Capture entire page height (no viewport limit)
brosh --height -1 shot "https://example.com"

# Save to specific directory
brosh --output_dir ~/Screenshots shot "https://example.com"

# Organize by domain
brosh --subdirs shot "https://example.com"

6.1.2. Advanced Capture Options

# Start from specific element
brosh shot "https://docs.python.org" --from_selector "#functions"

# Limit number of screenshots
brosh shot "https://example.com" --max_frames 5

# Adjust scroll step (percentage of viewport)
brosh shot "https://example.com" --scroll_step 50

# Scale output images
brosh shot "https://example.com" --scale 75

# Create animated PNG
brosh shot "https://example.com" --format apng --anim_spf 1.0

# Extract visible HTML
brosh shot "https://example.com" --html --json > page_content.json

6.2. MCP Server Mode

Run as an MCP server for AI tool integration:

# Using the dedicated command
brosh-mcp

# Or via the main command
brosh mcp

6.3. Python API

import asyncio
from brosh import capture_webpage, capture_webpage_async

# Synchronous usage (automatically handles async for you)
def capture_sync():
    # Basic capture
    result = capture_webpage(
        url="https://example.com",
        width=1920,
        height=1080,
        scroll_step=100,
        format="png"
    )

    print(f"Captured {len(result)} screenshots")
    for path, data in result.items():
        print(f"  - {path}")
        print(f"    Text: {data['text'][:100]}...")

# Asynchronous usage (for integration with async applications)
async def capture_async():
    # Capture with HTML extraction
    result = await capture_webpage_async(
        url="https://example.com",
        html=True,
        max_frames=3,
        from_selector="#main-content"
    )

    # Result is a dict with paths as keys and metadata as values
    for path, data in result.items():
        print(f"\nScreenshot: {path}")
        print(f"Selector: {data['selector']}")
        print(f"Text preview: {data['text'][:200]}...")
        if 'html' in data:
            print(f"HTML preview: {data['html'][:200]}...")

# Run the examples
capture_sync()
asyncio.run(capture_async())

# Convenience functions
from brosh import capture_full_page, capture_visible_area, capture_animation

# Capture entire page in one screenshot
full_page = capture_full_page("https://example.com")

# Capture only visible viewport
visible = capture_visible_area("https://example.com")

# Create animated PNG
animation = capture_animation("https://example.com")

7. Command Reference

7.1. Global Options

These options can be used with any command:

Option	Type	Default	Description
`--app`	str	auto-detect	Browser to use: `chrome`, `edge`, `safari`
`--width`	int	screen width	Viewport width in pixels
`--height`	int	screen height	Viewport height in pixels (-1 for full page)
`--zoom`	int	100	Zoom level percentage (10-500)
`--output_dir`	str	~/Pictures	Output directory for screenshots
`--subdirs`	bool	False	Create subdirectories for each domain
`--verbose`	bool	False	Enable debug logging

7.2. Commands

7.2.1. `run` - Start Browser in Debug Mode

brosh [--app BROWSER] run [--force_run]

Starts the browser in remote debugging mode for better performance with multiple captures.

Options:

--force_run: Force restart even if already running

7.2.2. `quit` - Quit Browser

brosh [--app BROWSER] quit

Closes the browser started in debug mode.

7.2.3. `shot` - Capture Screenshots

brosh [OPTIONS] shot URL [SHOT_OPTIONS]

Required:

URL: The webpage URL to capture

Shot Options:

Option	Type	Default	Description
`--scroll_step`	int	100	Scroll step as % of viewport height (10-200)
`--scale`	int	100	Scale output images by % (10-200)
`--format`	str	png	Output format: `png`, `jpg`, `apng`
`--anim_spf`	float	0.5	Seconds per frame for APNG
`--fetch_html`	bool	False	Extract HTML content of visible elements
`--fetch_image`	bool	False	Include image data in MCP output
`--fetch_image_path`	bool	True	Include image paths in MCP output
`--fetch_text`	bool	True	Include extracted text in MCP output
`--trim_text`	bool	True	Trim text to 200 characters
`--json`	bool	False	Output results as JSON
`--max_frames`	int	0	Maximum screenshots (0 = all)
`--from_selector`	str	""	CSS selector to start capture from

7.2.4. `mcp` - Run MCP Server

brosh mcp

Starts the MCP server for AI tool integration.

8. Output

8.1. File Naming Convention

Screenshots are saved with descriptive filenames:

{domain}-{timestamp}-{scroll_position}-{section}.{format}

Example:

github_com-250612-185234-00500-readme.png
│         │              │     │
│         │              │     └── Section identifier
│         │              └──────── Scroll position (0-9999)
│         └─────────────────────── Timestamp (YYMMDD-HHMMSS)
└───────────────────────────────── Domain name

8.2. Output Formats

PNG: Lossless compression, best quality (default)
JPG: Smaller file size, good for photos
APNG: Animated PNG showing the scroll sequence

8.3. JSON Output

The tool now always extracts text content from visible elements. When using --json:

Default output (without --fetch_html):

{
  "/path/to/screenshot1.png": {
    "selector": "main.content",
    "text": "# Main Content\n\nThis is the extracted text in Markdown format..."
  }
}

With --fetch_html flag:

{
  "/path/to/screenshot1.png": {
    "selector": "main.content",
    "html": "<main class='content'>...</main>",
    "text": "# Main Content\n\nThis is the extracted text in Markdown format..."
  }
}

The text field contains the visible content converted to Markdown format using html2text, making it easy to process the content programmatically.

9. Advanced Usage

9.1. Browser Management

brosh can connect to your existing browser session, preserving cookies, authentication, and extensions:

# Start Chrome in debug mode
brosh --app chrome run

# Your regular browsing session remains active
# brosh connects to it for screenshots

# Take screenshots with your logged-in session
brosh shot "https://github.com/notifications"

# Quit when done
brosh --app chrome quit

9.2. Custom Viewports

Simulate different devices by setting viewport dimensions:

# Desktop - 4K
brosh --width 3840 --height 2160 shot "https://example.com"

# Desktop - 1080p
brosh --width 1920 --height 1080 shot "https://example.com"

# Tablet
brosh --width 1024 --height 768 shot "https://example.com"

# Mobile
brosh --width 375 --height 812 shot "https://example.com"

9.3. HTML Extraction

Extract the HTML content of visible elements for each screenshot:

# Get HTML with screenshots
brosh shot "https://example.com" --fetch_html --json > content.json

# Process the extracted content
cat content.json | jq 'to_entries | .[] | {
  screenshot: .key,
  wordCount: (.value.html | split(" ") | length)
}'

9.4. Animation Creation

Create smooth animations showing page scroll:

# Standard animation (0.5 seconds per frame)
brosh shot "https://example.com" --format apng

# Faster animation
brosh shot "https://example.com" --format apng --anim_spf 0.2

# Slower, more detailed
brosh shot "https://example.com" --format apng --anim_spf 1.0 --scroll_step 50

10. MCP Integration

10.1. What is MCP?

The Model Context Protocol (MCP) is an open standard that enables seamless integration between AI applications and external data sources or tools. brosh implements an MCP server that allows AI assistants like Claude to capture and analyze web content.

10.2. Setting Up MCP Server

10.2.1. Using uvx (Recommended)

# Run directly without installation
uvx brosh-mcp

# Or install as a tool
uv tool install brosh
uvx brosh-mcp

10.2.2. Configure Claude Desktop

Add brosh to your Claude Desktop configuration:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "brosh": {
      "command": "uvx",
      "args": ["brosh-mcp"],
      "env": {
        "FASTMCP_LOG_LEVEL": "INFO"
      }
    }
  }
}

Note: If you encounter issues with uvx, you can use the full path to brosh-mcp:

{
  "mcpServers": {
    "brosh": {
      "command": "/path/to/python/bin/brosh-mcp",
      "args": [],
      "type": "stdio"
    }
  }
}

To find the full path:

# On Unix-like systems
which brosh-mcp

# Or with Python
python -c "import shutil; print(shutil.which('brosh-mcp'))"

10.2.3. Alternative Configurations

Using Python directly:

{
  "mcpServers": {
    "brosh": {
      "command": "python",
      "args": ["-m", "brosh", "mcp"]
    }
  }
}

Using specific Python path:

{
  "mcpServers": {
    "brosh": {
      "command": "/usr/local/bin/python3",
      "args": ["-m", "brosh", "mcp"]
    }
  }
}

10.3. Using with Claude

Once configured, you can ask Claude to:

"Take a screenshot of python.org documentation"
"Capture the entire React homepage with animations"
"Get screenshots of the GitHub trending page and extract the visible HTML"
"Show me what the Hacker News homepage looks like"

Claude will use brosh to capture the screenshots and can analyze the visual content or extracted HTML.

10.4. MCP Output Format

The MCP server returns results in a flat JSON structure for better compatibility:

{
  "image_path": "/path/to/screenshot.png",
  "selector": "main.content",
  "text": "Extracted text content (trimmed to 200 chars by default)...",
  "html": "<main>...</main>"  // Only if fetch_html=True
}

Control output with flags:

fetch_image: Include actual image data (default: False)
fetch_image_path: Include image file path (default: True)
fetch_text: Include extracted text (default: True)
trim_text: Trim text to 200 characters (default: True)
fetch_html: Include HTML content (default: False)

11. Architecture

The project is organized into modular components:

src/brosh/
├── __init__.py      # Package exports
├── __main__.py      # CLI entry point
├── api.py           # Public API functions
├── cli.py           # Command-line interface
├── tool.py          # Main screenshot tool
├── browser.py       # Browser management
├── capture.py       # Screenshot capture logic
├── image.py         # Image processing
├── models.py        # Data models
├── mcp.py           # MCP server implementation
└── texthtml.py      # HTML/text processing

11.1. Key Components

API Layer: Provides both sync (capture_webpage) and async (capture_webpage_async) interfaces
BrowserManager: Handles browser detection, launching, and connection
CaptureManager: Manages scrolling and screenshot capture
ImageProcessor: Handles image scaling, conversion, and animation
BrowserScreenshotTool: Orchestrates the capture process
BrowserScreenshotCLI: Provides the command-line interface
MCP Server: FastMCP-based server for AI tool integration

12. Development

12.1. Setup Development Environment

# Clone the repository
git clone https://github.com/twardoch/brosh.git
cd brosh

# Install with development dependencies
pip install -e ".[dev,test,all]"

# Install pre-commit hooks
pre-commit install

12.2. Running Tests

# Run all tests
pytest

# Run with coverage
pytest --cov=src/brosh --cov-report=term-missing

# Run specific test
pytest tests/test_capture.py -v

12.3. Code Quality

# Format code
ruff format src/brosh tests

# Lint code
ruff check src/brosh tests

# Type checking
mypy src/brosh

12.4. Building Documentation

# Install docs dependencies
pip install -e ".[docs]"

# Build documentation
sphinx-build -b html docs/source docs/build

13. Troubleshooting

13.1. Common Issues

13.1.1. Browser Not Found

Error: "Could not find chrome installation"

Solution: Ensure Chrome/Edge/Safari is installed in the default location, or specify the browser explicitly:

brosh --app edge shot "https://example.com"

13.1.2. Connection Timeout

Error: "Failed to connect to browser"

Solution: Start the browser in debug mode first:

brosh run
# Then in another terminal:
brosh shot "https://example.com"

13.1.3. Screenshot Timeout

Error: "Screenshot timeout for position X"

Solution: Increase the timeout or reduce the page complexity:

# Simpler format
brosh shot "https://example.com" --format jpg

# Fewer screenshots
brosh shot "https://example.com" --scroll_step 200

13.1.4. Permission Denied

Error: "Permission denied" when saving screenshots

Solution: Check the output directory permissions or use a different directory:

brosh --output_dir /tmp/screenshots shot "https://example.com"

13.2. Debug Mode

Enable verbose logging to troubleshoot issues:

brosh --verbose shot "https://example.com"

13.3. Platform-Specific Notes

13.3.1. macOS

Safari requires enabling "Allow Remote Automation" in Develop menu
Retina displays are automatically detected and handled

13.3.2. Windows

Run as administrator if you encounter permission issues
Chrome/Edge must be installed in default Program Files location

13.3.3. Linux

Install additional dependencies: sudo apt-get install libnss3 libxss1
Headless mode may be required on servers without display

14. Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

14.1. Development Process

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

14.2. Code Style

Follow PEP 8 guidelines
Use type hints for all functions
Add docstrings to all public functions
Keep functions focused and modular
Write tests for new features

15. License

This project is licensed under the MIT License - see the LICENSE file for details.

Created by Adam Twardoch
Created with Anthropic software
Uses Playwright for reliable browser automation
Uses Fire for the CLI interface
Implements FastMCP for Model Context Protocol support

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.4.7

Jun 13, 2025

1.4.0

Jun 13, 2025

1.3.3

Jun 13, 2025

1.3.1

Jun 13, 2025

1.3.0

Jun 13, 2025

1.2.8

Jun 13, 2025

1.2.2

Jun 13, 2025

1.2.1

Jun 12, 2025

1.0.5

Jun 12, 2025

1.0.3

Jun 12, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

brosh-1.4.7.tar.gz (20.1 kB view details)

Uploaded Jun 13, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

brosh-1.4.7-py3-none-any.whl (10.3 kB view details)

Uploaded Jun 13, 2025 Python 3

File details

Details for the file brosh-1.4.7.tar.gz.

File metadata

Download URL: brosh-1.4.7.tar.gz
Upload date: Jun 13, 2025
Size: 20.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.13

File hashes

Hashes for brosh-1.4.7.tar.gz
Algorithm	Hash digest
SHA256	`a3bd0c8ca63fa9a7338a26b53873ad199532a4663a5d826eb9b20183394998ea`
MD5	`37b2c1079bd123fdf8c5333e05527b0f`
BLAKE2b-256	`b9f0bb7b21a4171a767ffab0a42b30f9a3f690d24fe61ef0ab6663a153700ec6`

See more details on using hashes here.

File details

Details for the file brosh-1.4.7-py3-none-any.whl.

File metadata

Download URL: brosh-1.4.7-py3-none-any.whl
Upload date: Jun 13, 2025
Size: 10.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.13

File hashes

Hashes for brosh-1.4.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`90f3cdb40ce42165803bf7bdb6cfb4e43e53b54340c501bea7abcbc342f9a821`
MD5	`56fe5646210bfcc86489d88db0ca043a`
BLAKE2b-256	`feacfc1af04bf3d0079de19115403e1ad8a0b75edf3813a8ace4d2de874e6d97`

See more details on using hashes here.

brosh 1.4.7

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

brosh - Browser Screenshot Tool

1. Table of Contents

2. Features

3. How It Works

4. Installation

4.1. Using uv/uvx (Recommended)

4.2. Using pip

4.3. Using pipx

4.4. From Source

4.5. Install Playwright Browsers

5. Quick Start

6. Usage

6.1. Command Line Interface

6.1.1. Basic Screenshot Capture

6.1.2. Advanced Capture Options

6.2. MCP Server Mode

6.3. Python API

7. Command Reference

7.1. Global Options

7.2. Commands

7.2.1. run - Start Browser in Debug Mode

7.2.2. quit - Quit Browser

7.2.3. shot - Capture Screenshots

7.2.4. mcp - Run MCP Server

8. Output

8.1. File Naming Convention

8.2. Output Formats

8.3. JSON Output

9. Advanced Usage

9.1. Browser Management

9.2. Custom Viewports

9.3. HTML Extraction

9.4. Animation Creation

10. MCP Integration

10.1. What is MCP?

10.2. Setting Up MCP Server

10.2.1. Using uvx (Recommended)

10.2.2. Configure Claude Desktop

10.2.3. Alternative Configurations

10.3. Using with Claude

10.4. MCP Output Format

11. Architecture

11.1. Key Components

12. Development

12.1. Setup Development Environment

12.2. Running Tests

12.3. Code Quality

12.4. Building Documentation

13. Troubleshooting

13.1. Common Issues

13.1.1. Browser Not Found

13.1.2. Connection Timeout

13.1.3. Screenshot Timeout

13.1.4. Permission Denied

13.2. Debug Mode

13.3. Platform-Specific Notes

13.3.1. macOS

13.3.2. Windows

13.3.3. Linux

14. Contributing

14.1. Development Process

14.2. Code Style

15. License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

7.2.1. `run` - Start Browser in Debug Mode

7.2.2. `quit` - Quit Browser

7.2.3. `shot` - Capture Screenshots

7.2.4. `mcp` - Run MCP Server