Skip to main content

A desktop automation tool for image recognition.

Project description

pyauto-desktop

The resolution-agnostic, high-performance automation library for Python.

📘 Documentation: Read the full docs here

Built on mss and OpenCV. Designed for scalability. Includes a built-in GUI Inspector.

pyauto-desktop is a drop-in replacement for pyautogui designed for developers who need their automation scripts to work across different monitors, resolutions, and scaling settings (DPR).

It introduces the concept of a Session: a portable definition of your development environment that allows your code to auto-scale intelligently on any target machine.


Why Switch?

Most automation libraries fail when moved from a Dev machine (e.g., 4K monitor, 125% scale) to a Production machine (e.g., 1080p, 100% scale). pyauto-desktop solves this mathematically.

Feature pyauto-desktop pyautogui
Cross-Resolution&DPR Automatic. Uses Session logic to scale coordinates & images automatically. Manual. Scripts break if resolution changes.
Performance Up to 5x Faster. Uses mss & Pyramid Template Matching & Image caching. Standard speed.
Logic locateAny / locateAll built-in. Finds first or all matches from a list of images. Requires complex for loops / try-except blocks.
Tooling Built-in GUI Inspector to snip, edit, test, and generate code. None. Requires external tools.
Backend opencv-python, mss, pynput pyscreeze, pillow, mouse

💻 Operating System Support

  • Windows:Tested and Supported.
  • Mac / Linux: ⚠️ Experimental. While the underlying libraries (OpenCV, Qt6) are cross-platform, these environments have not been verified. Use at your own risk.

📦 Installation

pyauto-desktop relies on robust standard libraries like OpenCV and Qt6.

pip install pyauto-desktop

The Core Concept: The "Session"

In pyautogui, you write code for your screen. If you share that script, it usually fails on other screens.

In pyauto-desktop, you define a Session that records your source resolution and DPR. At runtime, the library compares your source environment with the current machine and automatically scales all clicks and image searches.

The result: the same script works across 1080p, 1440p, 4K, and retina displays without modification.


1. The GUI Inspector (No More Guessing Coordinates)

You don’t write the automation code by hand. Instead, open the Inspector to:

  • Snip UI elements directly from the screen
  • Test image matches in real time
  • Auto-generate production-ready Python code
import pyauto_desktop

# Opens the snipping and code generation tool
pyauto_desktop.inspector()

2. Generated Code Example

The code below works on a 1080p screen, a 4K screen, or a retina display without modification.

import pyauto_desktop

# Define the environment where you CREATED the script (e.g., your 1440p monitor)
# The library detects the CURRENT screen at runtime and scales accordingly
session = pyauto_desktop.Session(
    screen=1,
    source_resolution=(2560, 1440),
    source_dpr=1.25,
    scaling_type="dpr"
)

# Search for the image
# 'grayscale' and 'confidence' behave like standard automation tools
# 'use_pyramid=True' (default) handles slight size variations in the UI
image = session.locateOnScreen(
    'images/submit_btn.png',
    grayscale=True,
    confidence=0.9
)

if image:
    session.click(image)

⚠️ Important Note on Auto-Scaling & Confidence

The auto-scaling logic is robust, but not perfect. It is possible when an image is scaled down, a small amount of pixel detail can be lost, which affects template matching that could require confidence score tweaking.


Credits

Special thanks to the developers below for making this module possible:

  • PyQt6 by Riverbank Computing - The GUI framework.
  • OpenCV - Computer vision and image processing.
  • NumPy - Fundamental package for scientific computing.
  • Pillow - Python Imaging Library.
  • RapidOCR - OCR capabilities.

Utilities

  • pynput by Moses Palmér - Monitor and control input devices.
  • mss by Mickaël Schoentgen - Fast cross-platform screenshots.
  • pydirectinput by Ben Johnson - Direct input for games.
  • pywinctl by Kalmat - Cross-platform window control.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyauto_desktop-0.4.3.tar.gz (42.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyauto_desktop-0.4.3-py3-none-any.whl (44.1 kB view details)

Uploaded Python 3

File details

Details for the file pyauto_desktop-0.4.3.tar.gz.

File metadata

  • Download URL: pyauto_desktop-0.4.3.tar.gz
  • Upload date:
  • Size: 42.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pyauto_desktop-0.4.3.tar.gz
Algorithm Hash digest
SHA256 d19e9abf37124ddab6f824f5805f2c20cc488c4e9fed0a2e280dd0ad56ac7bdf
MD5 69f474a99505877ac7bab8891bd69586
BLAKE2b-256 90d79712408252c461b50f436cb44414ce28dbfa0be6ffc3295636cd23aac789

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyauto_desktop-0.4.3.tar.gz:

Publisher: python-publish.yml on Omar-F-Rashed/pyauto-desktop

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pyauto_desktop-0.4.3-py3-none-any.whl.

File metadata

  • Download URL: pyauto_desktop-0.4.3-py3-none-any.whl
  • Upload date:
  • Size: 44.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pyauto_desktop-0.4.3-py3-none-any.whl
Algorithm Hash digest
SHA256 ace5623d666e3499600981dfad0d9388a50915222b52acc1009af36b8569047e
MD5 cd830051277912372ffea503e3b85586
BLAKE2b-256 b065cd15826d5c616406cd432df65170d62e8b0fe1aa103261953d3830d5fdf0

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyauto_desktop-0.4.3-py3-none-any.whl:

Publisher: python-publish.yml on Omar-F-Rashed/pyauto-desktop

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page