Advance GUI automation
Project description
PyAutoScene
Advanced GUI Automation with Scene-Based State Management
PyAutoScene is a Python library that provides a declarative approach to GUI automation by modeling application interfaces as scenes and transitions. It combines element detection with state machine patterns to create robust, maintainable automation scripts.
🌟 Features
- Scene-Based Architecture: Model your application as a collection of scenes with defined elements and transitions
- Visual Element Detection: Supports both image-based and text-based element recognition
- Region Specification: Flexible region syntax for targeting specific screen areas
- Automatic Navigation: Intelligent pathfinding between scenes using graph algorithms
- Action Decorators: Clean, declarative syntax for defining scene actions and transitions
🚀 Quick Start
Installation
pip install pyautoscene
Basic Example
Here's how to automate a simple login flow:
import pyautogui as gui
from pyautoscene import ImageElement, TextElement, Scene, Session
from pyautoscene.utils import locate_and_click
# Define scenes
login = Scene(
"Login",
elements=[
TextElement("Welcome to Login"),
ImageElement("references/login_button.png"),
],
initial=True,
)
dashboard = Scene(
"Dashboard",
elements=[
TextElement("Dashboard"),
ImageElement("references/user_menu.png"),
],
)
# Define actions with transitions
@login.action(transitions_to=dashboard)
def perform_login(username: str, password: str):
"""Performs login and transitions to dashboard."""
locate_and_click("references/username_field.png")
gui.write(username, interval=0.1)
gui.press("tab")
gui.write(password, interval=0.1)
gui.press("enter")
# Create session and navigate
session = Session(scenes=[login, dashboard])
session.expect(dashboard, username="user", password="pass")
📖 Core Concepts
Scenes
A Scene represents a distinct state in your application's UI. Each scene contains:
- Elements: Visual markers that identify when the scene is active
- Actions: Functions that can be performed in this scene
- Transitions: Connections to other scenes
scene = Scene(
"SceneName",
elements=[
ImageElement("path/to/image.png"),
TextElement("Expected Text"),
],
initial=False # Set to True for starting scene
)
Reference Elements
PyAutoScene supports two types of reference elements:
ImageElement
Detects scenes using image matching:
ImageElement("path/to/reference/image.png")
TextElement
Detects scenes using text recognition:
TextElement("Expected text on screen")
Both elements support region specification for limiting search areas. You can specify regions using a string format like "x:2/3 y:(1-2)/3" to easily define screen regions:
# Search for text in the top-left third of the screen
TextElement("Login", region="x:1/3 y:1/3")
# Search in a horizontal band across the middle third of the screen
ImageElement("button.png", region="y:2/3")
# Search in a specific area spanning columns 1-2 out of 3 columns
TextElement("Welcome", region="x:(1-2)/3 y:1/3")
Actions and Transitions
Actions are decorated functions that define what can be done in a scene:
@scene.action(transitions_to=target_scene) # Action that changes scenes
def action_with_transition():
# Perform GUI operations
pass
@scene.action() # Action that stays in current scene
def action_without_transition():
# Perform GUI operations
pass
Session Management
The Session class manages the state machine and provides navigation:
session = Session(scenes=[scene1, scene2, scene3])
# Navigate to a specific scene (finds optimal path)
session.expect(target_scene, **action_params)
# Invoke an action in the current scene
session.invoke("action_name", **action_params)
# Get current scene
current = session.current_scene
Automatic Scene Detection
PyAutoScene automatically detects which scene is currently active:
from pyautoscene.session import get_current_scene
current_scene = get_current_scene(scenes)
print(f"Currently on: {current_scene.name}")
Path Finding
The library uses NetworkX to find optimal paths between scenes:
# This will automatically navigate: Login → Dashboard → Cart
session.expect(cart_scene, username="user", password="pass")
Error Handling
from pyautoscene.session import SceneRecognitionError
try:
session.expect(target_scene)
except SceneRecognitionError as e:
print(f"Navigation failed: {e}")
🤝 Contributing
- Fork the repository
- Create a feature branch:
git checkout -b feature-name - Make your changes and add tests
- Run pre-commit hooks:
pre-commit run --all-files - Submit a pull request
📝 License
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
🔮 Roadmap
- Enhanced image matching algorithms
- TemplateScene
- Relation between images
- Multiple session support
🙏 Credits
PyAutoScene builds on the work of several open-source projects:
- PyAutoGUI by Al Sweigart
- python-statemachine by Filipe Macedo
- RapidOCR by RapidAI contributors
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pyautoscene-0.2.0.tar.gz.
File metadata
- Download URL: pyautoscene-0.2.0.tar.gz
- Upload date:
- Size: 53.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dd11adec7a20aa641cc083d58f310ce9181602cf0a814a472b0f4bbb268093fa
|
|
| MD5 |
f5c7460401b4d943dfd73c30587a6686
|
|
| BLAKE2b-256 |
ead420f4e0aa9308352fd760305f23985e1b8a889a1832ecc3c155e5b0fcde21
|
File details
Details for the file pyautoscene-0.2.0-py3-none-any.whl.
File metadata
- Download URL: pyautoscene-0.2.0-py3-none-any.whl
- Upload date:
- Size: 15.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a646cae3decec4ed09454ed418d4c1fe0581917d3a716dbb57a4dfdacfc6d359
|
|
| MD5 |
77bec652521306481c197de82ed23c9d
|
|
| BLAKE2b-256 |
f3ba6e6b810dcf73dea99eddf3959b1b0f950fa4081ca8735810a21e9d12af9f
|