Skip to main content

A desktop automation tool for image recognition.

Project description

pyauto-desktop

The resolution-agnostic, high-performance automation library for Python.

📘 Documentation: Read the full docs here

Built on mss and OpenCV. Designed for scalability. Includes a built-in GUI Inspector.

pyauto-desktop is a drop-in replacement for pyautogui designed for developers who need their automation scripts to work across different monitors, resolutions, and scaling settings (DPR).

It introduces the concept of a Session: a portable definition of your development environment that allows your code to auto-scale intelligently on any target machine.


Why Switch?

Most automation libraries fail when moved from a Dev machine (e.g., 4K monitor, 125% scale) to a Production machine (e.g., 1080p, 100% scale). pyauto-desktop solves this mathematically.

Feature pyauto-desktop pyautogui
Cross-Resolution&DPR Automatic. Uses Session logic to scale coordinates & images automatically. Manual. Scripts break if resolution changes.
Performance Up to 5x Faster. Uses mss & Pyramid Template Matching & Image caching. Standard speed.
Logic locateAny / locateAll built-in. Finds first or all matches from a list of images. Requires complex for loops / try-except blocks.
Tooling Built-in GUI Inspector to snip, edit, test, and generate code. None. Requires external tools.
Backend opencv-python, mss, pynput pyscreeze, pillow, mouse

💻 Operating System Support

  • Windows:Tested and Supported.
  • Mac / Linux: ⚠️ Experimental. While the underlying libraries (OpenCV, Qt6) are cross-platform, these environments have not been verified. Use at your own risk.

📦 Installation

pyauto-desktop relies on robust standard libraries like OpenCV and Qt6.

pip install pyauto-desktop

The Core Concept: The "Session"

In pyautogui, you write code for your screen. If you share that script, it usually fails on other screens.

In pyauto-desktop, you define a Session that records your source resolution and DPR. At runtime, the library compares your source environment with the current machine and automatically scales all clicks and image searches.

The result: the same script works across 1080p, 1440p, 4K, and retina displays without modification.


1. The GUI Inspector (No More Guessing Coordinates)

You don’t write the automation code by hand. Instead, open the Inspector to:

  • Snip UI elements directly from the screen
  • Test image matches in real time
  • Auto-generate production-ready Python code
import pyauto_desktop

# Opens the snipping and code generation tool
pyauto_desktop.inspector()

2. Generated Code Example

The code below works on a 1080p screen, a 4K screen, or a retina display without modification.

import pyauto_desktop

# Define the environment where you CREATED the script (e.g., your 1440p monitor)
# The library detects the CURRENT screen at runtime and scales accordingly
session = pyauto_desktop.Session(
    screen=1,
    source_resolution=(2560, 1440),
    source_dpr=1.25,
    scaling_type="dpr"
)

# Search for the image
# 'grayscale' and 'confidence' behave like standard automation tools
# 'use_pyramid=True' (default) handles slight size variations in the UI
image = session.locateOnScreen(
    'images/submit_btn.png',
    grayscale=True,
    confidence=0.9
)

if image:
    session.click(image)

⚠️ Important Note on Auto-Scaling & Confidence

The auto-scaling logic is robust, but not perfect. It is possible when an image is scaled down, a small amount of pixel detail can be lost, which affects template matching that could require confidence score tweaking.


Credits

Special thanks to the developers below for amkgin this module possible:

  • PyQt6 by Riverbank Computing - The GUI framework.
  • OpenCV - Computer vision and image processing.
  • NumPy - Fundamental package for scientific computing.
  • Pillow - Python Imaging Library.
  • RapidOCR - OCR capabilities.

Utilities

  • pynput by Moses Palmér - Monitor and control input devices.
  • mss by Mickaël Schoentgen - Fast cross-platform screenshots.
  • pydirectinput by Ben Johnson - Direct input for games.
  • pywinctl by Kalmat - Cross-platform window control.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyauto_desktop-0.4.0.tar.gz (41.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyauto_desktop-0.4.0-py3-none-any.whl (43.1 kB view details)

Uploaded Python 3

File details

Details for the file pyauto_desktop-0.4.0.tar.gz.

File metadata

  • Download URL: pyauto_desktop-0.4.0.tar.gz
  • Upload date:
  • Size: 41.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pyauto_desktop-0.4.0.tar.gz
Algorithm Hash digest
SHA256 e1ade58e2c823d62654c9758bd98cfbb4449f92811b109377dad4fa541bd8305
MD5 fcbe3db66d1ee1d440f38fea7c08af93
BLAKE2b-256 67cedf670dde7b2215b208b7110b49189aa2bfc4b332c78d361d63c1520affde

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyauto_desktop-0.4.0.tar.gz:

Publisher: python-publish.yml on Omar-F-Rashed/pyauto-desktop

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pyauto_desktop-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: pyauto_desktop-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 43.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pyauto_desktop-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 56820965f7944ce52fe70edcdc753d7cff1811bfa5193d82b99fbdf7a9eb65c0
MD5 a8ff272c11f47545b2eaf68e93670337
BLAKE2b-256 a1976b4870077345903f7a717a2d4040945f82632830f68cfcabdb550f53d4f3

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyauto_desktop-0.4.0-py3-none-any.whl:

Publisher: python-publish.yml on Omar-F-Rashed/pyauto-desktop

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page