Skip to main content

Python SDK for controlling iPhones via TapKit

Project description

TapKit Python SDK

Python SDK for controlling iPhones programmatically via the TapKit API.

Installation

pip install tapkit

Or install from source:

pip install git+https://github.com/Jootsing-Research/tapkit-python.git

Requirements

  • Python 3.11 or later
  • A TapKit account with API key
  • TapKit Mac companion app running
  • An iPhone connected through the Mac app

Quick Start

Authentication

Set your API key as an environment variable:

export TAPKIT_API_KEY="your-api-key"

Or pass it directly to the client:

from tapkit import TapKitClient

client = TapKitClient(api_key="your-api-key")

Basic Usage

from tapkit import TapKitClient

# Initialize client
client = TapKitClient()

# Get your phone (works when you have exactly one phone)
phone = client.get_phone()

print(f"Connected to {phone.name}")
print(f"Screen size: {phone.width}x{phone.height}")

# Tap center of screen
phone.tap(phone.screen.center)

# Take a screenshot
screenshot = phone.screenshot()
with open("screen.png", "wb") as f:
    f.write(screenshot)

Multiple Phones

# List all phones
phones = client.list_phones()

# Get a specific phone by name
phone = client.phone("iPhone 15 Pro")

# Or set a default phone for the session
client.use_phone("iPhone 15 Pro")
client.tap((100, 200))  # Uses the default phone

Features

Gestures

# Tap at coordinates
phone.tap((100, 200))

# Tap at screen center
phone.tap(phone.screen.center)

# Tap by description (uses vision AI)
phone.tap("the Settings icon")

# Double tap
phone.double_tap((100, 200))

# Tap and hold (long press)
phone.tap_and_hold((100, 200), duration_ms=1000)

# Flick (quick swipe)
phone.flick(phone.screen.center, "up")

# Pan (slower scroll)
phone.pan(phone.screen.center, "down", duration_ms=500)

# Drag from one point to another
phone.drag((100, 200), (300, 400))

# Hold and drag
phone.hold_and_drag((100, 200), (300, 400), hold_duration_ms=500)

# Pinch gestures
phone.pinch(phone.screen.center, "pinch_in")   # Zoom out
phone.pinch(phone.screen.center, "pinch_out")  # Zoom in
phone.pinch(phone.screen.center, "rotate_cw")  # Rotate clockwise

Device Control

# Navigation
phone.home()           # Go to home screen
phone.app_switcher()   # Open app switcher
phone.control_center() # Open control center
phone.spotlight()      # Open spotlight search
phone.siri()           # Activate Siri

# Lock/Unlock
phone.lock()
phone.unlock(passcode="123456")

# Volume
phone.volume_up()
phone.volume_down()

# Rotation
phone.rotate("portrait")
phone.rotate("left")
phone.rotate("right")

# Action button (iPhone 15 Pro+)
phone.action_button()

App Control

# Open an app
phone.open_app("Safari")

# Type text
phone.type_text("Hello, world!")
phone.type_text("pasted text", method="paste")  # Use clipboard

# Run iOS Shortcuts
phone.run_shortcut(name="My Shortcut")
phone.run_shortcut(index=0)  # Run first shortcut

Screenshots

# Capture screenshot as PNG bytes
screenshot = phone.screenshot()

# Save to file
with open("screenshot.png", "wb") as f:
    f.write(screenshot)

Geometry Utilities

TapKit includes geometry primitives for working with coordinates:

from tapkit.geometry import Point, BBox, NormalizedPoint, NormalizedBBox, Screen

# Points
point = Point(100, 200)
x, y = point  # Tuple unpacking works

# Bounding boxes
bbox = BBox(x1=100, y1=200, x2=300, y2=400)
phone.tap(bbox.center)  # Tap center of bounding box

# Normalized coordinates (0.0-1.0)
normalized = NormalizedPoint(0.5, 0.5)  # Center of screen
absolute = normalized.to_absolute(phone.width, phone.height)

# Screen utilities
screen = phone.screen
screen.center        # Center point
screen.contains(p)   # Check if point is in bounds
screen.clamp(p)      # Clamp point to screen bounds

OAGI Integration

TapKit includes handlers for OAGI workflows:

from tapkit import TapKitClient
from tapkit.oagi import TapKitAsyncActionHandler, TapKitAsyncImageProvider

client = TapKitClient()
phone = client.get_phone()

# Create handlers
action_handler = TapKitAsyncActionHandler(phone)
image_provider = TapKitAsyncImageProvider(phone)

# Use with OAGI agents
# action_handler can execute OAGI Action objects
# image_provider provides screenshots for vision models

Configuration

Environment Variable Description Default
TAPKIT_API_KEY API key for authentication Required
TAPKIT_BASE_URL Base URL for API https://api.tapkit.ai/v1

Documentation

For full documentation, visit docs.tapkit.ai.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tapkit-0.1.2.tar.gz (13.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tapkit-0.1.2-py3-none-any.whl (17.9 kB view details)

Uploaded Python 3

File details

Details for the file tapkit-0.1.2.tar.gz.

File metadata

  • Download URL: tapkit-0.1.2.tar.gz
  • Upload date:
  • Size: 13.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.13

File hashes

Hashes for tapkit-0.1.2.tar.gz
Algorithm Hash digest
SHA256 790e89a2dada1a4c51c4e7c0b884170330249946ebc3fd0e5112cfdb47d10b38
MD5 c7445e97a34f917b9bd295c736cad412
BLAKE2b-256 4a97c29c6a390e92c846e29927cf18cd7c5d17d7059427ccbe0fbd5277addbf0

See more details on using hashes here.

File details

Details for the file tapkit-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: tapkit-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 17.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.13

File hashes

Hashes for tapkit-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 2af10c69ecd1f63ec0d0b08f12d04e9587d1f487255cd1cd9421c20cc3e24098
MD5 be1a916f591306005406eaf972b72673
BLAKE2b-256 5f5d87de3ba89985a9ede269c36427322cc6a6e6db45351439db2412fe51266c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page