Like Selenium, but for your entire screen. Find and interact with text anywhere using OCR.
Project description
🎯 Screenium
Screenium is like Selenium after discovering there's a whole world outside the browser. Find and click any text on your screen using OCR. That "Accept Cookies" popup for the millionth time? Banish it.
🔍 Installation
pip install screenium
or
uv add screenium
✨ Use Cases
- Auto-Accept Cookies: Run a background task that automatically clicks "Accept Cookies" popups
- Meeting Assistant: Auto-join video calls by detecting "Join Meeting" buttons and automatically enable/disable your camera/mic based on meeting names
- Game Automation: Automate repetitive game tasks by detecting and clicking on in-game text (e.g., "Collect Rewards", "Start Battle")
- System Dialog Monitor: Handling system or application dialogs, like "Are you sure?" or "Run command," in Cursor based on set rules.
- Multi-Browser Testing: Perform UI tests on browsers that lack automation APIs or web drivers with no separate web driver installations needed.
🔍 Finding Text
The most basic way to find text is:
from screenium import Text
# Simple text matching
text = Text("Login") # Finds "Login" anywhere on screen
text = Text("LOG") # Matches the first of "LOGIN", "LOGOUT", etc.
# Exact matching
text = Text("Login", exact_match=True) # Only matches "Login" exactly
text = Text("login", case_sensitive=True) # Case-sensitive matching
📍 Spatial Relationships
You can find text based on its position relative to other text:
# Basic relationships
Text("Password").below("Username") # Find "Password" below "Username"
Text("Cancel").left_of("Submit") # Find "Cancel" left of "Submit"
Text("Help").right_of("Back") # Find "Help" right of "Back"
Text("Title").above("Content") # Find "Title" above "Content"
# Chain relationships
Text("Save").below("Options").right_of("Cancel")
📐 Aligned Elements
For more precise positioning, use aligned to require elements to line up:
# Elements must be vertically aligned (same x-axis)
Text("Email").aligned.below("Username")
# Elements must be horizontally aligned (same y-axis)
Text("Back").aligned.left_of("Next")
🎨 Background Color Matching
Find text by its background color:
# Match text with specific background colors
Text("Error", background_color="red")
Text("Success", background_color="green")
Text("Info", background_color="#0088ff")
# Adjust color matching tolerance (0-100)
Text("Warning", background_color="yellow", background_tolerance=50)
🖱️ Mouse Actions
Interact with matched text:
# Basic mouse actions
Text("Submit").click()
Text("Options").right_click()
Text("Link").double_click()
Text("Button").mouse_move() # Just move mouse without clicking
# Type text
Text("Username").click().typewrite("myuser")
Text("Password").click().typewrite("mypass")
# Special keys
Text("Terminal").click().typewrite(["command", "k"]) # Clear terminal
⏰ Waiting and Monitoring
Handle timing and background monitoring:
# Wait for specific duration
Text("Loading").wait(2).click() # Wait 2 seconds then click
🔍 Inspecting Matches
Get detailed information about matches:
# Get all matches
matches = Text("Button").matches
for match in matches:
print(f"Found at: ({match['x']}, {match['y']})")
print(f"Size: {match['width']}x{match['height']}")
print(f"Confidence: {match['confidence']}")
🎨 Visual Debugging
Draw boxes around matches to debug:
# Draw green box for 3 seconds
Text("Username").draw(duration=3, color="green")
# Chain with other operations
Text("Password").below("Username").draw(duration=2, color="red").click()
🎯 Tips & Best Practices
- Start with basic text matching before adding spatial relationships
- Use
draw()to visually verify matches - Adjust background color tolerance if color matching is too strict/loose
- Use
alignedwhen elements should line up precisely - Chain operations for more precise matching
⚠️ Limitations
- macOS only (uses Apple Vision framework)
- Requires macOS Monterey (12.x) or higher
- Screen resolution and scaling can affect matching
🔐 Required macOS Permissions
Screenium needs two key macOS permissions to function. The app running your Python script (e.g., Terminal, VS Code, PyCharm) will need:
-
Screen Recording
- Open System Settings > Privacy & Security > Screen Recording
- Toggle on your terminal app/IDE
- Required for:
- Finding text on screen using OCR
- Background monitoring
- Visual debugging with
draw()
-
Accessibility
- Open System Settings > Privacy & Security > Accessibility
- Toggle on your terminal app/IDE
- Required for:
- Mouse actions (
click(),right_click(), etc.) - Keyboard input (
typewrite(),hotkey())
- Mouse actions (
💡 Tip: If running from different apps, each one needs separate permissions. For example, if you run scripts from both Terminal and VS Code, you'll need to grant permissions to both.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file screenium-0.1.1.tar.gz.
File metadata
- Download URL: screenium-0.1.1.tar.gz
- Upload date:
- Size: 55.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ae14dd61ecfa62d11281aef08c345a90521bfadedf2701fe213b73f5887b32c4
|
|
| MD5 |
f2d42afd8fa27265a004fb319308961a
|
|
| BLAKE2b-256 |
aafd8c6b9e127d4ff95556704915f6bcc1c8ce8810a4056c580394e21f2af8d7
|
File details
Details for the file screenium-0.1.1-py3-none-any.whl.
File metadata
- Download URL: screenium-0.1.1-py3-none-any.whl
- Upload date:
- Size: 17.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9ae673cbcf9254d985c429449a0764b6e5270fe65b604ac5d07db1a2750efd62
|
|
| MD5 |
2a0df750e1cb58638151c8937c6876a6
|
|
| BLAKE2b-256 |
154ddb643efcee6d31f5a611c03b8145686883d99a8deb05b636f07ef3dfbe1f
|