Skip to main content

Gesture actions library for Appium

Project description

Appium Gestures Library

This library is to provide a number gesture/interaction functions for Appium mobile automation.
The gestures are platform agnostic, which allows the user to provide a WebElement - or locators for both Android and iOS in the same function call.

Available Functions

Swipe Gestures

  • up()
  • down()
  • left()
  • right()
  • next()
  • previous()
  • on_element()
  • element_into_view()

Drag and Drop Gestures

  • drag_and_drop()

Pinch Gestures

  • open()
  • close()

Documentation


Compatibility

  • Android
    Requires: appium driver install uiautomator2
  • iOS (unverified)
    Requires: appium driver install xcuitest

Install

pip install appium-gesture-actions
# or
uv add appium-gesture-actions

Changelog

## 0.2.1 (2024-11-21) 🥳

- Renamed package
- Complete re-write of functionality
- Added platform-specific code
- Added a number of new gestures
  - Drag and Drop
  - Pinch/Zoom In
  - Pinch/Zoom Out
- Added safe inserts for Swipe On Element
- Added error handling
- Added API docs
- Changed to uv for packaging

See full list of changes: CHANGES.md

To-do

  • Expand functionality to include gestures
  • Add cross-platform functionality
  • Robust error handling
  • Add documentation
  • Add examples for other gestures
  • Return bool for most functions
  • Return WebElement on element_into_view()
  • Completely rewrite the tests
  • Allow for the specifying of values in sub-classes (such as _max_attempts or CROP_FACTOR_ in SwipeGestures)
  • Reduce minimum Python version
  • Remove direct accessing of ActionBuilder - or not..
  • Handling of different orientations
  • Allow for re-initialisation of viewport calculations

Demo and Example Usage

Library Demo

from appium.gesture.actions import GestureActions
from appium.gesture.enums import Direction, SeekDirection, UiSelector

class TestDemo(TestCore):
    def test_cool_stuff(self):
        # Initialise class object
        action = GestureActions(self.driver, self.options["platformName"])

        # Drag and Drop
        image_element = driver.find_element(
            by=AppiumBy.XPATH,
            value='//android.widget.ImageView[0]',
        )
        delete_element = driver.find_element(
            by=AppiumBy.XPATH,
            value='//android.widget.ImageView[1]',
        )
        action.drag_drop.drag_and_drop(image_element, delete_element)

        # Swipe
        action.swipe.up()
        action.swipe.next()

        # Pinch
        action.pinch.open(image_element)

        # Swipe on Element
        action.swipe.on_element(
            image_element,
            Direction.LEFT,
        )

        # Scroll to Element (Android - UiAutomator)
        action.swipe.element_into_view(
            value_a='new UiSelector().description("Save")',
            locator_method_a=AppiumBy.ANDROID_UIAUTOMATOR,
        )

        # Scroll to Element (Android - XPATH)
        action.swipe.element_into_view(
            value_a="//android.widget.ImageView",
            locator_method_a=AppiumBy.XPATH,
            direction=SeekDirection.UP,
        )

        # Scroll to Element (Multi-platform, single code base)
        action.swipe.element_into_view(
            value_a='//android.widget.Button[@content-desc="Save"]',
            value_i="label == 'Save'"
            locator_method_a=AppiumBy.XPATH,
            locator_method_i=AppiumBy.IOS_PREDICATE,
            direction=SeekDirection.DOWN,
        )

An Explainer on Swipe Element Into View

This function (part of the SwipeGestures class) has cross-platform support.
It is achieved by using parameters with different suffixes (_a and _i for Android and iOS respectively).

This will allow you to use a single function call for use on both platforms.

Additionally, it includes a fallback method (which is less efficient) if the element cannot be located initially.
This is achieved by scrolling the viewport until the element is located, or the maximum number of swipes is achieved (default: 5).
An example situation would be if an element is not yet present in the viewport and is loaded after scrolling.
If this situation applies to you, the direction parameter will need to be specified.

A Quick Word on Perform Navigation (Full and Partial)

The fallback method will calculate the viewport size, and then define a scrollable region based on crop factors to avoid triggering the notification shade/center or multi-tasking view.
If it cannot locate the element, it will call _perform_navigation_partial_ until it does - or it exceeds the max attempts.
In the event that the element is present but not in the viewport, it will calculate the distance to the element.
Then, it will determine the correct number of full (and if necessary) partial swipes to bring the element into the center of the viewport.

Additionally, the if actions_partial > SWIPE_ACTION_THRESHOLD check ensures the pixel distance is large enough to warrant an action.
When this value is less than 50px, the swipe action will be interpreted by the OS as a double-tap.

Please look at _fallback_scroll_to_element if you would like to learn more.
https://github.com/tanakrit-d/appium-gesture-actions/blob/ae1d229d14ced2a20169982d6284fcf7ed92c22b/src/appium/gesture/swipe.py#L176

Defining a Scrollable Region

This library divides the viewport into four bounds: upper, lower, left, and right. The default values cannot be overwritten (this will change in a future release).
Using these bounds, we then define a 'scrollable region'. We can then perform our scroll/swipe actions within this space.
The impetus for this is to recreate scrolling/swiping behaviour more similar to a user and avoid hardcoding coordinates.
Additionally, it avoids the automation attempting to perform actions on top of elements (such as headers or footers).
Viewport Diagram

Defining Element Points

The importance of dynamically generating 'points' of an element to interact with allows us to account for re-sizing under a number of conditions (such as different devices/resolutions).

For the purpose of this library, we are only concerned with two attributes of an element: position and size.
The element's coordinates within the viewport is considered the top-left-point.

We can then use the element size to determine where it occupies relative to the view-port position. Element Diagram

top_left_point      = element.location["x"], element.location["y"]
top_mid_point       = element.location["x"] + (element.size["width"] // 2), element.location["y"]
top_right_point     = element.location["x"] + element.size["width"], element.location["y"]

left_mid_point      = element.location["x"], element.location["y"] + (element.size["height"] // 2)
mid_point           = element.location["x"] + (element.size["width"] // 2), element.location["y"] + (element.size["height"] // 2)
right_mid_point     = element.location["x"] + element.size["width"], element.location["y"] + (element.size["height"] // 2)

bottom_left_point   = element.location["x"], element.location["y"] + element.size["height"]
bottom_mid_point    = element.location["x"] + (element.size["width"] // 2), element.location["y"] + element.size["height"]
bottom_right_point  = element.location["x"] + element.size["width"], element.location["y"] + element.size["height"]

Using the example element from the image, the above calculations would output as follows:

Top-Left-Point:  (20, 20)  
Top-Mid-Point:  (40, 20)  
Top-Right-Point:  (60, 20)
  
Left-Mid-Point:  (20, 30)  
Mid-Point:  (40, 30)  
Right-Mid-Point:  (60, 30)
  
Bottom-Left-Point:  (20, 40)  
Bottom-Mid-Point:  (40, 40)  
Bottom-Right-Point:  (60, 40)

An example of this is available here: demo/calc_coordinates.py

Notes

Gesture Execution, Design Decisions, and Documentation

I wrote this library based on issues I had with automating scroll-to-element functionality in my testing.
The majority of the tutorials or documentation I have found on the subject is either sparse, low quality and SEO orientated, or relies on hardcoding coordinates for the scroll actions.
Additionally, I did not find these to be robust enough or support different screen sizes when using the same code for different devices.


The Appium Python Client implementation relies on ActionChains, and the .swipe() and .scroll() functions require either both elements to be directly provided - or the x and y coordinates to be specified.

In this library, Drag and Drop and Pinch call the Selenium JavaScript .execute_script() function - which is more reliable and robust than using ActionChains.

Initially, a prototype implementation using ActionChains was attempted, however the performance was poor and buggy since Selenium implements a number of logical checks when executing it.
I found it threw numerous exceptions due to some form of built-in element detection.

Swipe contains a combination of .execute_script() and ActionChains.

For Android, the preferred method is AppiumBy.ANDROID_UIAUTOMATOR which uses new UiScrollable() as it is incredibly quick and reliable.
It will use ActionChains if any other locator method is called.

For iOS, it will initially attempt to use .execute_script(), and then fallback to ActionChains if the element cannot be located.

I would recommend reading the following documentation which helped inform the design and implementation.

Android

If you would like to see the pointer interactions and coordinates, this can be enabled on a device level in Settings > Developer Options > Pointer location

Contributing

Contributions or feedback is welcome! Please feel free to submit a Pull Request.
As this is my first Python package I am open to any and all suggestions :^)

Support

If you encounter any issues or have questions, please file an issue on the GitHub repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

appium_gesture_actions-0.2.1.tar.gz (14.0 kB view details)

Uploaded Source

Built Distribution

appium_gesture_actions-0.2.1-py3-none-any.whl (6.2 kB view details)

Uploaded Python 3

File details

Details for the file appium_gesture_actions-0.2.1.tar.gz.

File metadata

File hashes

Hashes for appium_gesture_actions-0.2.1.tar.gz
Algorithm Hash digest
SHA256 acef2542c89f558f5a022c8e0d60b0e9e0d88826dbc1a335e7fc59e191b7f083
MD5 1c2e14f7690c261fbe35edc20a5c6d8e
BLAKE2b-256 e94df3f6b30ab1a6c46f3fb78ad31536e1fb1bafb552d264ead60166dc7cec6b

See more details on using hashes here.

File details

Details for the file appium_gesture_actions-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for appium_gesture_actions-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 54d6c7bfbb872d7e45dde328df8fcaa257c361b4978ec22c9d94f3e538f87d6e
MD5 44a8cad608b05768e2bf6288ff98c8fd
BLAKE2b-256 2ef308adbedba2f212ecbf2d5dde78f53efcb0b8f68a03d99cb10b5ad96acdb3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page