Advance GUI automation

Project description

PyAutoScene

Advanced GUI Automation with Scene-Based State Management

PyAutoScene is a Python library that provides a declarative approach to GUI automation by modeling application interfaces as scenes and transitions. It combines element detection with state machine patterns to create robust, maintainable automation scripts.

🌟 Features

Scene-Based Architecture: Model your application as a collection of scenes with defined elements and transitions
Visual Element Detection: Supports both image-based and text-based element recognition
Region Specification: Flexible region syntax for targeting specific screen areas
Automatic Navigation: Intelligent pathfinding between scenes using graph algorithms
Action Decorators: Clean, declarative syntax for defining scene actions and transitions

🚀 Quick Start

Installation

pip install pyautoscene

Basic Example

Here's how to automate a simple login flow:

import pyautogui as gui
from pyautoscene import ImageElement, TextElement, Scene, Session
from pyautoscene.utils import locate_and_click

# Define scenes
login = Scene(
    "Login",
    elements=[
        TextElement("Welcome to Login"),
        ImageElement("references/login_button.png"),
    ],
    initial=True,
)

dashboard = Scene(
    "Dashboard",
    elements=[
        TextElement("Dashboard"),
        ImageElement("references/user_menu.png"),
    ],
)

# Define actions with transitions
@login.action(transitions_to=dashboard)
def perform_login(username: str, password: str):
    """Performs login and transitions to dashboard."""
    locate_and_click("references/username_field.png")
    gui.write(username, interval=0.1)
    gui.press("tab")
    gui.write(password, interval=0.1)
    gui.press("enter")

# Create session and navigate
session = Session(scenes=[login, dashboard])
session.expect(dashboard, username="user", password="pass")

📖 Core Concepts

Scenes

A Scene represents a distinct state in your application's UI. Each scene contains:

Elements: Visual markers that identify when the scene is active
Actions: Functions that can be performed in this scene
Transitions: Connections to other scenes

scene = Scene(
    "SceneName",
    elements=[
        ImageElement("path/to/image.png"),
        TextElement("Expected Text"),
    ],
    initial=False  # Set to True for starting scene
)

Reference Elements

PyAutoScene supports two types of reference elements:

ImageElement

Detects scenes using image matching:

ImageElement("path/to/reference/image.png")

TextElement

Detects scenes using text recognition:

TextElement("Expected text on screen")

Both elements support region specification for limiting search areas. You can specify regions using a string format like "x:2/3 y:(1-2)/3" to easily define screen regions:

# Search for text in the top-left third of the screen
TextElement("Login", region="x:1/3 y:1/3")

# Search in a horizontal band across the middle third of the screen
ImageElement("button.png", region="y:2/3")

# Search in a specific area spanning columns 1-2 out of 3 columns
TextElement("Welcome", region="x:(1-2)/3 y:1/3")

Actions and Transitions

Actions are decorated functions that define what can be done in a scene:

@scene.action(transitions_to=target_scene)  # Action that changes scenes
def action_with_transition():
    # Perform GUI operations
    pass

@scene.action()  # Action that stays in current scene
def action_without_transition():
    # Perform GUI operations
    pass

Session Management

The Session class manages the state machine and provides navigation:

session = Session(scenes=[scene1, scene2, scene3])

# Navigate to a specific scene (finds optimal path)
session.expect(target_scene, **action_params)

# Invoke an action in the current scene
session.invoke("action_name", **action_params)

# Get current scene
current = session.current_scene

Automatic Scene Detection

PyAutoScene automatically detects which scene is currently active:

from pyautoscene.session import get_current_scene

current_scene = get_current_scene(scenes)
print(f"Currently on: {current_scene.name}")

Path Finding

The library uses NetworkX to find optimal paths between scenes:

# This will automatically navigate: Login → Dashboard → Cart
session.expect(cart_scene, username="user", password="pass")

Error Handling

from pyautoscene.session import SceneRecognitionError

try:
    session.expect(target_scene)
except SceneRecognitionError as e:
    print(f"Navigation failed: {e}")

🤝 Contributing

Fork the repository
Create a feature branch: git checkout -b feature-name
Make your changes and add tests
Run pre-commit hooks: pre-commit run --all-files
Submit a pull request

📝 License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

🔮 Roadmap

Enhanced image matching algorithms
TemplateScene
Relation between images
Multiple session support

🙏 Credits

PyAutoScene builds on the work of several open-source projects:

PyAutoGUI by Al Sweigart
python-statemachine by Filipe Macedo
RapidOCR by RapidAI contributors

Project details

Release history Release notifications | RSS feed

0.2.3

Jul 19, 2025

This version

0.2.2

Jul 19, 2025

0.2.1

Jul 15, 2025

0.2.0

Jul 14, 2025

0.1.0

Jul 12, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyautoscene-0.2.2.tar.gz (56.7 kB view details)

Uploaded Jul 19, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pyautoscene-0.2.2-py3-none-any.whl (17.3 kB view details)

Uploaded Jul 19, 2025 Python 3

File details

Details for the file pyautoscene-0.2.2.tar.gz.

File metadata

Download URL: pyautoscene-0.2.2.tar.gz
Upload date: Jul 19, 2025
Size: 56.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pyautoscene-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`8a90a8bccb62dfbe930ca6cc1d3ccd42418c5aa89a75cbb486538530ee52259a`
MD5	`f4c4201e7dff767f131ea32fba5fb718`
BLAKE2b-256	`dbf08be7fecdbb761af76d0fa1c3dac6a884ea7ccbe946de58c62b8cd11b0d71`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyautoscene-0.2.2.tar.gz:

Publisher: release.yml on pritam-dey3/PyAutoScene

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pyautoscene-0.2.2.tar.gz
- Subject digest: 8a90a8bccb62dfbe930ca6cc1d3ccd42418c5aa89a75cbb486538530ee52259a
- Sigstore transparency entry: 292437819
- Sigstore integration time: Jul 19, 2025
Source repository:
- Permalink: pritam-dey3/PyAutoScene@dded90f2d9ff9a69560e8675cbc59016b65edeb6
- Branch / Tag: refs/heads/main
- Owner: https://github.com/pritam-dey3
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@dded90f2d9ff9a69560e8675cbc59016b65edeb6
- Trigger Event: workflow_dispatch

File details

Details for the file pyautoscene-0.2.2-py3-none-any.whl.

File metadata

Download URL: pyautoscene-0.2.2-py3-none-any.whl
Upload date: Jul 19, 2025
Size: 17.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pyautoscene-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c546fdc6373702e2de7b0a9c44a2a3d12b0b98e90b77bcbba3e088b7530348cb`
MD5	`f05c0b752cfce5307960bf8d34297784`
BLAKE2b-256	`63640938129b5133bf2c1b5c07a82773a7ad207f6112444ff996a99325910017`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyautoscene-0.2.2-py3-none-any.whl:

Publisher: release.yml on pritam-dey3/PyAutoScene

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pyautoscene-0.2.2-py3-none-any.whl
- Subject digest: c546fdc6373702e2de7b0a9c44a2a3d12b0b98e90b77bcbba3e088b7530348cb
- Sigstore transparency entry: 292437834
- Sigstore integration time: Jul 19, 2025
Source repository:
- Permalink: pritam-dey3/PyAutoScene@dded90f2d9ff9a69560e8675cbc59016b65edeb6
- Branch / Tag: refs/heads/main
- Owner: https://github.com/pritam-dey3
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@dded90f2d9ff9a69560e8675cbc59016b65edeb6
- Trigger Event: workflow_dispatch

pyautoscene 0.2.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

PyAutoScene

🌟 Features

🚀 Quick Start

Installation

Basic Example

📖 Core Concepts

Scenes

Reference Elements

ImageElement

TextElement

Actions and Transitions

Session Management

Automatic Scene Detection

Path Finding

Error Handling

🤝 Contributing

📝 License

🔮 Roadmap

🙏 Credits

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance