AI's camera - Multi-desktop screenshot capture with MCP server for AI agents

These details have not been verified by PyPI

Project links

Project description

CAM - Python Screen Capture with MCP Server Support

- shared visual context between users and AI agents -

A lightweight, efficient screen capture library with automatic error detection. Features full MCP (Model Context Protocol) server integration for seamless AI assistant workflows. Optimized for WSL2 to capture Windows host screens.

🎬 Live Demos & Screenshots

📸 Single Screenshot Capture

Example of CAM's single screenshot capture functionality with custom message

Feature Demo	Description
🔄 Real Monitoring Session Real GIF from actual monitoring session Session Details: • 30s monitoring interval • 11 automatic captures • 2.8MB optimized GIF • JPEG compression • Timestamp-based naming	📋 Workflow Documentation Step-by-step process capture (230KB, 7 frames) Features: • Sequential capture • Auto file organization • Visual documentation • Efficient compression
🖥️ Continuous Monitoring Real-time progress tracking (429KB, 12 frames) Technical: • Real-time monitoring • Progress visualization • Frame compression • Session management	🚨 Error Detection Context-aware categorization (322KB, 5 frames) Smart Features: • Error detection • stdout/stderr tagging • Exception integration • Intelligent naming

Installation

From PyPI (Recommended)

pip install cammy
# pip install cammy[full] # For full features including mss and Pillow support:

From Source

git clone https://github.com/ywatanabe1989/cammy.git
cd cammy

# Install dependencies
pip install -r requirements.txt

Quick Start

🐍 Python API - Beautiful Simplicity

import cammy

# Core functions
cammy.snap()                          # Capture single screenshot
cammy.start()                        # Start monitoring
cammy.stop()                         # Stop monitoring
cammy.create_gif_from_latest_session()  # Create GIF summary

# Multi-desktop features
cammy.get_info()           # List all monitors and windows
cammy.snap(monitor_id=1)             # Capture specific monitor
cammy.snap(capture_all=True)         # Capture all monitors
cammy.start(monitor_id=0)           # Monitor specific screen

📋 Detailed Usages in Python

🐛 Debug Your Code Visually

import cammy

def process_data(df):
    cammy.snap("before transformation")
    df = df.transform(complex_operation)
    cammy.snap("after transformation")
    return df

🚨 Automatic Error Screenshots

import cammy

try:
    selenium_driver.click(button)
    api_response = fetch_data()
except Exception as e:
    cammy.snap()  # Auto-adds -stderr suffix
    raise

🔍 Monitor Long-Running Processes

import cammy

cammy.start()  # Start taking screenshots every second
train_model()  # Your long operation
cammy.stop()

# Multi-monitor monitoring
cammy.start(capture_all=True, interval=2.0)  # All monitors, 2s interval

🎬 Create GIF Summaries

import cammy

# Method 1: Use Context Manager
with cammy.session() as session:
    # ... your process ...

# Method 2: Start/Stop Manually
cammy.start()
# ... your process ...  
cammy.stop()
cammy.create_gif_from_latest_session()
# 📹 GIF created: ~/.cache/cammy/20250823_104523_summary.gif

Configuration

All configuration through function parameters - no config files needed!

cammy.start(
    output_dir="~/screenshots",  # Where to save
    interval=2.0,                # Seconds between captures
    quality=85,                  # JPEG quality (1-100)
    verbose=False,               # Silent mode
    monitor_id=0,                # Specific monitor (0-based)
    capture_all=False            # Capture all monitors
)

🖥️ Multi-Desktop & Monitor Features

import cammy

# Enumerate available monitors and windows
info = cammy.get_info()
print(f"Monitors: {info['Monitors']['Count']}")
print(f"Windows: {info['Windows']['VisibleCount']}")

# Capture specific monitor
cammy.snap(monitor_id=0)    # Primary monitor
cammy.snap(monitor_id=1)    # Secondary monitor

# Capture all monitors combined
cammy.snap(capture_all=True)

# Capture specific window by handle
windows = info['Windows']['Details']
if windows:
    handle = windows[0]['Handle']
    path = cammy.capture_window(handle)
    print(f"Captured window: {path}")

# Monitor specific screen continuously
cammy.start(monitor_id=1, interval=3.0)

File Structure

~/.cache/cammy/
├── 20250823_104523-message-stdout.jpg    # Normal capture
├── 20250823_104525-error-stderr.jpg      # Error capture  
└── 20250823_104530_0001_*.jpg            # Monitoring mode

Cache automatically managed (1GB default limit, oldest files removed)

Requirements

Python 3.7+
WSL environment (for Windows capture)
PowerShell access to Windows host

Optional:

Pillow - JPEG compression (recommended)
mss - Cross-platform fallback

🤖 MCP Server Integration

AI Assistant Ready - Built-in MCP server for Claude Code and other AI assistants
Direct Screenshot Control - AI can capture, monitor, and analyze screenshots programmatically
Automated Workflows - Perfect for debugging, documentation, and monitoring tasks
Real-time Interaction - AI assistants can respond to visual changes instantly

Setup - Just Add JSON!

// Add to your Claude Code settings
{
  "mcpServers": {
    "cammy": {
      "command": "python", 
      "args": ["-m", "cammy", "--mcp"]
    }
  }
}

Available MCP Tools

Core Capture:

capture_screenshot - Take single screenshots with custom messages
start_monitoring / stop_monitoring - Continuous monitoring at configurable intervals
get_monitoring_status - Check current monitoring status

Analysis & Management:

analyze_screenshot - AI-powered error detection and categorization
list_recent_screenshots - Browse capture history by category (stdout/stderr)
clear_cache - Manage screenshot cache size

Advanced Features:

create_gif - Generate animated summaries from monitoring sessions
list_sessions - List available sessions for GIF creation

Use Cases & Examples

📸 Essential Use Cases

Debug visually - Capture before/after states during development
Monitor processes - Continuous screenshots during long operations
Document workflows - Step-by-step visual documentation
Error analysis - Automatic error categorization and screenshots
Multi-monitor setup - Capture specific monitors or all screens simultaneously
Window enumeration - List and target specific application windows
AI-powered automation - Let AI assistants capture and analyze screens via MCP integration

Platform Support

Platform	Support	Notes
WSL → Windows	✅ Full	Primary focus - DPI-aware, multi-monitor
Linux (X11)	⚠️ Limited	Falls back to mss/scrot
macOS	⚠️ Limited	Requires mss library
Windows Native	❌	Not implemented
Linux (Wayland)	❌	Not supported

License

MIT License

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Contact

Yusuke.Watanabe@scitex.ai

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.1

Oct 17, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cammy-0.2.1.tar.gz (30.1 kB view details)

Uploaded Oct 17, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cammy-0.2.1-py3-none-any.whl (25.0 kB view details)

Uploaded Oct 17, 2025 Python 3

File details

Details for the file cammy-0.2.1.tar.gz.

File metadata

Download URL: cammy-0.2.1.tar.gz
Upload date: Oct 17, 2025
Size: 30.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.0rc1

File hashes

Hashes for cammy-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`73027f676a7e999481bf01366ed338ea1ccf8244690f3330f951b1b60d6bb16f`
MD5	`6f702359aae73178e31b7fffb4e8300c`
BLAKE2b-256	`85a91e8c94e1dadde26bc0b180bb373ece1db2825df5958b4d12a89703aaf9a3`

See more details on using hashes here.

File details

Details for the file cammy-0.2.1-py3-none-any.whl.

File metadata

Download URL: cammy-0.2.1-py3-none-any.whl
Upload date: Oct 17, 2025
Size: 25.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.0rc1

File hashes

Hashes for cammy-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b1b37025a45631dbe548bd900867e924bd52251f26da33a7222f391c9c284581`
MD5	`7a26f77fb47a28049a38873f90964c21`
BLAKE2b-256	`0abdeaea8ffc50658b5675071b78f12d76458cd1e04fb94a8873bcdc1266130c`

See more details on using hashes here.

cammy 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

CAM - Python Screen Capture with MCP Server Support

🎬 Live Demos & Screenshots

📸 Single Screenshot Capture

Installation

From PyPI (Recommended)

From Source

Quick Start

🐍 Python API - Beautiful Simplicity

🐛 Debug Your Code Visually

🚨 Automatic Error Screenshots

🔍 Monitor Long-Running Processes

🎬 Create GIF Summaries

Configuration

🖥️ Multi-Desktop & Monitor Features

File Structure

Requirements

Setup - Just Add JSON!

Available MCP Tools

Use Cases & Examples

📸 Essential Use Cases

Platform Support

License

Contributing

Contact

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes