Skip to main content

Python SDK for controlling iPhones via TapKit

Project description

TapKit Python SDK

Python SDK for controlling iPhones programmatically via the TapKit API.

Installation

pip install tapkit

Or install from source:

pip install git+https://github.com/Jootsing-Research/tapkit-python.git

Requirements

  • Python 3.11 or later
  • A TapKit account with API key
  • TapKit Mac companion app running
  • An iPhone connected through the Mac app

Quick Start

Authentication

Set your API key as an environment variable:

export TAPKIT_API_KEY="your-api-key"

Or pass it directly to the client:

from tapkit import TapKitClient

client = TapKitClient(api_key="your-api-key")

Basic Usage

from tapkit import TapKitClient

# Initialize client
client = TapKitClient()

# Get your phone (works when you have exactly one phone)
phone = client.get_phone()

print(f"Connected to {phone.name}")
print(f"Screen size: {phone.width}x{phone.height}")

# Tap center of screen
phone.tap(phone.screen.center)

# Take a screenshot
screenshot = phone.screenshot()
with open("screen.png", "wb") as f:
    f.write(screenshot)

Multiple Phones

# List all phones
phones = client.list_phones()

# Get a specific phone by name
phone = client.phone("iPhone 15 Pro")

# Or set a default phone for the session
client.use_phone("iPhone 15 Pro")
client.tap((100, 200))  # Uses the default phone

Features

Gestures

# Tap at coordinates
phone.tap((100, 200))

# Tap at screen center
phone.tap(phone.screen.center)

# Tap by description (uses vision AI)
phone.tap("the Settings icon")

# Double tap
phone.double_tap((100, 200))

# Tap and hold (long press)
phone.tap_and_hold((100, 200), duration_ms=1000)

# Flick (quick swipe)
phone.flick(phone.screen.center, "up")

# Pan (slower scroll)
phone.pan(phone.screen.center, "down", duration_ms=500)

# Drag from one point to another
phone.drag((100, 200), (300, 400))

# Hold and drag
phone.hold_and_drag((100, 200), (300, 400), hold_duration_ms=500)

# Pinch gestures
phone.pinch(phone.screen.center, "pinch_in")   # Zoom out
phone.pinch(phone.screen.center, "pinch_out")  # Zoom in
phone.pinch(phone.screen.center, "rotate_cw")  # Rotate clockwise

Device Control

# Navigation
phone.home()           # Go to home screen
phone.app_switcher()   # Open app switcher
phone.control_center() # Open control center
phone.spotlight()      # Open spotlight search
phone.siri()           # Activate Siri

# Lock/Unlock
phone.lock()
phone.unlock(passcode="123456")

# Volume
phone.volume_up()
phone.volume_down()

# Rotation
phone.rotate("portrait")
phone.rotate("left")
phone.rotate("right")

# Action button (iPhone 15 Pro+)
phone.action_button()

App Control

# Type text
phone.type_text("Hello, world!")
phone.type_text("pasted text", method="paste")  # Use clipboard

# Run iOS Shortcuts
phone.run_shortcut(name="My Shortcut")
phone.run_shortcut(index=0)  # Run first shortcut

Screenshots

# Capture screenshot as PNG bytes
screenshot = phone.screenshot()

# Save to file
with open("screenshot.png", "wb") as f:
    f.write(screenshot)

Geometry Utilities

TapKit includes geometry primitives for working with coordinates:

from tapkit.geometry import Point, BBox, NormalizedPoint, NormalizedBBox, Screen

# Points
point = Point(100, 200)
x, y = point  # Tuple unpacking works

# Bounding boxes
bbox = BBox(x1=100, y1=200, x2=300, y2=400)
phone.tap(bbox.center)  # Tap center of bounding box

# Normalized coordinates (0.0-1.0)
normalized = NormalizedPoint(0.5, 0.5)  # Center of screen
absolute = normalized.to_absolute(phone.width, phone.height)

# Screen utilities
screen = phone.screen
screen.center        # Center point
screen.contains(p)   # Check if point is in bounds
screen.clamp(p)      # Clamp point to screen bounds

OAGI Integration

TapKit includes handlers for OAGI workflows:

from tapkit import TapKitClient
from tapkit.oagi import TapKitAsyncActionHandler, TapKitAsyncImageProvider

client = TapKitClient()
phone = client.get_phone()

# Create handlers
action_handler = TapKitAsyncActionHandler(phone)
image_provider = TapKitAsyncImageProvider(phone)

# Use with OAGI agents
# action_handler can execute OAGI Action objects
# image_provider provides screenshots for vision models

Configuration

Environment Variable Description Default
TAPKIT_API_KEY API key for authentication Required
TAPKIT_BASE_URL Base URL for API https://api.tapkit.ai/v1
TAPKIT_PHONE_ID Default phone name or UUID Auto-select if one phone

Documentation

For full documentation, visit docs.tapkit.ai.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tapkit-0.1.4.tar.gz (15.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tapkit-0.1.4-py3-none-any.whl (20.7 kB view details)

Uploaded Python 3

File details

Details for the file tapkit-0.1.4.tar.gz.

File metadata

  • Download URL: tapkit-0.1.4.tar.gz
  • Upload date:
  • Size: 15.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for tapkit-0.1.4.tar.gz
Algorithm Hash digest
SHA256 f522c7165ddc5d8164f75f211de4985ebe87ffa949b3743d56aa21ea626ff313
MD5 3dc049cd6940ae2e023ae7d1dab3b72a
BLAKE2b-256 4b4bb24ce6ed85a295a0011faf41820f241f392090e82a07fd14be161c658ca4

See more details on using hashes here.

Provenance

The following attestation bundles were made for tapkit-0.1.4.tar.gz:

Publisher: publish.yml on Jootsing-Research/tapkit-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file tapkit-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: tapkit-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 20.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for tapkit-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 27b73cfc5d87e77f75adcc921deda5409cd98b8a6f78e1ce76b93fb029e55488
MD5 410308204bc4fcb996202d48e8c357dc
BLAKE2b-256 1a53da9d6489c8f5beb2627dc357bb4bfa99ce6f49774135b23168fb0ee070d1

See more details on using hashes here.

Provenance

The following attestation bundles were made for tapkit-0.1.4-py3-none-any.whl:

Publisher: publish.yml on Jootsing-Research/tapkit-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page