Simple python library useful for automating tasks using images.

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Gui automation

Simple python library useful for automating tasks using images. It can run on Windows background applications.

It uses OpenCV and PyAutoGui. Made with Python 3.7.

Simple example:

import cv2
from gui_automation import GuiAuto

image_path = "win10key.png"
ga = GuiAuto()
if ga.detect(cv2.imread(image_path), 0.8):
    ga.click()

It searches windows 10 key image in the screen with 80% or more similarity. If it is found it gets clicked (it opens windows 10 start menu).

The core class is GuiAuto:

Wraps detection and controlling of the GUI. By default it will use normal image detection and foreground app automation.

Parameters

  detector: detector instance. Default is TMDetector()
  handler:  handler instance. Default is ForegroundHandler()

Methods

  detect(tpl, img=None): 
      returns Spot instance if it finds the tpl in the image. Internally, it keeps the last spot found.
  move(coords=None): 
      same as detect but it moves the cursor to the center of the found image withing the screen.
  click(coords=None): 
      Clicks the left buttons the quantity specified in @param click(default 1) in the center of the found image.
  hold(coords=None): 
      Same as click but instead of clicking X times, it holds the click @param time seconds.
  drag(start_coords, end_coords): 
      Drags from one point to another using start and end coordinates.
  drag_within(start_x_fraction, start_y_fraction, end_x_fraction, end_y_fraction): 
      Drags from one point to another inside the bounding box of the image found.

  For move, click, and hold methods if no coords are given it performs the action on the last spot found.

drag_within method

  Drags the mouse from one point to another using the tpl image width and height.
  All params are fractions in the following string format: 'number/number'.
  Eg: ga.drag_within('3/4', '0/1', '7/8', '4/5')

     3/4 of the width and 0/1 of the height for START
    __o___o_  7/8 of the width for END
   |  S     |   S = start
   |   \\   |   E = end
   |    \\  |   \\ = the mouse drag path
   |      E o 4/5 of the height for END
   |________|

Detector:

Searches an image inside another image using template match from OpenCV. Classes with default parameters:

    TMDetector(method=SQDIFF, thresh=False):
    Applies the normal detection.

    MultiscaledTMDetector(method=SQDIFF, thresh=False, reduce_sc=0.2, magnify_sc=2.0, cant_sc=40):
    Applies detection multiples times while resizing the image. Parameters specify how image is resized.

Parameters

  method: template method to use. Could be SQDIFF, CCOEFF or CCORR.
  thresh: boolean that specifies if binary threshold filter must be used for detection.
  reduce_sc: how much the image is reduced.
  magnify_sc how much the image is enlarged.
  cant_sc= how many resizing will be applied.

Handler:

Interacts with the app or environment to be automated. Performs clicks, drags among others; and also obtains the screen of the app/environment on an image format.

ForegroundHandler():
Normal handler that takes screenshot and simulates mouse action normally.

BackgroundHandlerWin32(app_name, *args):
Handler that works in not vieawable/background applications. It requires an application/window name, and it's possible to pass as arguments a names hierarchy of the UI elements of the application.
Works only for Windows.

Spot:

Wraps all position/coordinates calculations for the found image.

Methods:

  upper_left_position()
  upper_right_position()
  bottom_left_position()
  bottom_right_position()
  center_position()
  custom_position(x_multiplier, x_modifier, y_multiplier, y_modifier)

custom_position method:

  This method helps calculate any coordinate within the image detected.
  Here is some expanation of its parameters:
  x_multiplier: how many parts of the divided width to take.
  x_modifier: in how many parts the width is going to be divided.
  y_multiplier: same as x_multiplier but with height.
  y_modifier: same as x_modifier but with height.

  Eg: x, y = custom_position(3, 8, 1, 2)
     3/8 of the width
    __o_____
   |        |
   |        |
   |  x     o 1/2 of the height
   |        |
   |________|

Image loader:

Little module to help load images from a directory. Loads all images of a folder given in param path, and assign them to a dictionary in this way: name=>image name would be the filename without the extension. image would be the numpy array with the image data loaded with OpenCV. Path: relative or absolute path where the images are. Must finish with '/'. It returns a dictionary with the names of the images as keys, and the images themselves as values. Returns False if any error.

from gui_automation import GuiAuto, load_images
from time import sleep

buttons = load_images("images/buttons/")

ga = GuiAuto()
while not ga.detect(buttons['start'], 0.8).detect():
    ga.click()
sleep(2)
if ga.detect(buttons['accept'], 0.8):
    print("Found accept button")

In this case we load all images in "images/buttons/" folder and the wait until start button is found. After that, it waits 2 seconds and then it tries to find accept button.

Another example:

image_path = "win10keyresized.PNG"
ga = GuiAuto(detector=MultiscaledTMDetector())
spot = ga.detect(cv2.imread(image_path), 0.8)
if spot:
    ga.click(coords=spot.bottom_right(), clicks=3 )

In this case we have a similar win10key image to our original located on our rendered screen. So using MultiscaledTMDetector fixes our problem resizing the screen multiple times. If it is detected, it clicks the bottom right of the image found three times.

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

3.2.1

May 11, 2021

3.2.0

May 11, 2021

3.1.1

May 10, 2021

3.1.0

May 6, 2021

3.0.3

Aug 24, 2019

3.0.2

Aug 23, 2019

3.0.1

Aug 23, 2019

2.1

Jul 31, 2019

Jul 12, 2019

Jun 23, 2019

0.21

Jun 23, 2019

0.2

Jun 23, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gui_automation-3.2.1.tar.gz (9.8 kB view details)

Uploaded May 11, 2021 Source

Built Distribution

gui_automation-3.2.1-py3-none-any.whl (13.9 kB view details)

Uploaded May 11, 2021 Python 3

File details

Details for the file gui_automation-3.2.1.tar.gz.

File metadata

Download URL: gui_automation-3.2.1.tar.gz
Upload date: May 11, 2021
Size: 9.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.6.13

File hashes

Hashes for gui_automation-3.2.1.tar.gz
Algorithm	Hash digest
SHA256	`99c5024cf85842803031b6bd5582bfd1287cb4816e64ac57d6e7794ae6a9f77c`
MD5	`e78d0c241ce5168438df69a278f50f53`
BLAKE2b-256	`bf1d689c92b4d713e362e57519fe363f29fbdee5c31713b0aadccdbac7a7e991`

See more details on using hashes here.

File details

Details for the file gui_automation-3.2.1-py3-none-any.whl.

File metadata

Download URL: gui_automation-3.2.1-py3-none-any.whl
Upload date: May 11, 2021
Size: 13.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.6.13

File hashes

Hashes for gui_automation-3.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bf10d0d2107aa2f2fbf81849ff9e1072f1ff9cccc028472ad1616bb6cf78c37b`
MD5	`d3277149d91eab5ccbe112043fffd4af`
BLAKE2b-256	`2f215415be48122e44baa5c80466cc15507bf3d0d677129032810e4ea2adce43`

See more details on using hashes here.

gui-automation 3.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Gui automation

Simple example:

The core class is GuiAuto:

Detector:

Handler:

Spot:

Image loader:

Another example:

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes