Python SDK for controlling iPhones via TapKit
Project description
TapKit Python SDK
Python SDK for controlling iPhones programmatically via the TapKit API.
Installation
pip install tapkit
Or install from source:
pip install git+https://github.com/Jootsing-Research/tapkit-python.git
Requirements
- Python 3.11 or later
- A TapKit account with API key
- TapKit Mac companion app running
- An iPhone connected through the Mac app
Quick Start
Authentication
Set your API key as an environment variable:
export TAPKIT_API_KEY="your-api-key"
Or pass it directly to the client:
from tapkit import TapKitClient
client = TapKitClient(api_key="your-api-key")
Basic Usage
from tapkit import TapKitClient
# Initialize client
client = TapKitClient()
# Get your phone (works when you have exactly one phone)
phone = client.get_phone()
print(f"Connected to {phone.name}")
print(f"Screen size: {phone.width}x{phone.height}")
# Tap center of screen
phone.tap(phone.screen.center)
# Take a screenshot
screenshot = phone.screenshot()
with open("screen.png", "wb") as f:
f.write(screenshot)
Multiple Phones
# List all phones
phones = client.list_phones()
# Get a specific phone by name
phone = client.phone("iPhone 15 Pro")
# Or set a default phone for the session
client.use_phone("iPhone 15 Pro")
client.tap((100, 200)) # Uses the default phone
Features
Gestures
# Tap at coordinates
phone.tap((100, 200))
# Tap at screen center
phone.tap(phone.screen.center)
# Tap by description (uses vision AI)
phone.tap("the Settings icon")
# Double tap
phone.double_tap((100, 200))
# Tap and hold (long press)
phone.tap_and_hold((100, 200), duration_ms=1000)
# Flick (quick swipe)
phone.flick(phone.screen.center, "up")
# Pan (slower scroll)
phone.pan(phone.screen.center, "down", duration_ms=500)
# Drag from one point to another
phone.drag((100, 200), (300, 400))
# Hold and drag
phone.hold_and_drag((100, 200), (300, 400), hold_duration_ms=500)
# Pinch gestures
phone.pinch(phone.screen.center, "pinch_in") # Zoom out
phone.pinch(phone.screen.center, "pinch_out") # Zoom in
phone.pinch(phone.screen.center, "rotate_cw") # Rotate clockwise
Device Control
# Navigation
phone.home() # Go to home screen
phone.app_switcher() # Open app switcher
phone.control_center() # Open control center
phone.spotlight() # Open spotlight search
phone.siri() # Activate Siri
# Lock/Unlock
phone.lock()
phone.unlock(passcode="123456")
# Volume
phone.volume_up()
phone.volume_down()
# Rotation
phone.rotate("portrait")
phone.rotate("left")
phone.rotate("right")
# Action button (iPhone 15 Pro+)
phone.action_button()
App Control
# Type text
phone.type_text("Hello, world!")
phone.type_text("pasted text", method="paste") # Use clipboard
# Run iOS Shortcuts
phone.run_shortcut(name="My Shortcut")
phone.run_shortcut(index=0) # Run first shortcut
Screenshots
# Capture screenshot as PNG bytes
screenshot = phone.screenshot()
# Save to file
with open("screenshot.png", "wb") as f:
f.write(screenshot)
Geometry Utilities
TapKit includes geometry primitives for working with coordinates:
from tapkit.geometry import Point, BBox, NormalizedPoint, NormalizedBBox, Screen
# Points
point = Point(100, 200)
x, y = point # Tuple unpacking works
# Bounding boxes
bbox = BBox(x1=100, y1=200, x2=300, y2=400)
phone.tap(bbox.center) # Tap center of bounding box
# Normalized coordinates (0.0-1.0)
normalized = NormalizedPoint(0.5, 0.5) # Center of screen
absolute = normalized.to_absolute(phone.width, phone.height)
# Screen utilities
screen = phone.screen
screen.center # Center point
screen.contains(p) # Check if point is in bounds
screen.clamp(p) # Clamp point to screen bounds
OAGI Integration
TapKit includes handlers for OAGI workflows:
from tapkit import TapKitClient
from tapkit.oagi import TapKitAsyncActionHandler, TapKitAsyncImageProvider
client = TapKitClient()
phone = client.get_phone()
# Create handlers
action_handler = TapKitAsyncActionHandler(phone)
image_provider = TapKitAsyncImageProvider(phone)
# Use with OAGI agents
# action_handler can execute OAGI Action objects
# image_provider provides screenshots for vision models
Configuration
| Environment Variable | Description | Default |
|---|---|---|
TAPKIT_API_KEY |
API key for authentication | Required |
TAPKIT_BASE_URL |
Base URL for API | https://api.tapkit.ai/v1 |
TAPKIT_PHONE_ID |
Default phone name or UUID | Auto-select if one phone |
Documentation
For full documentation, visit docs.tapkit.ai.
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tapkit-0.1.3.tar.gz.
File metadata
- Download URL: tapkit-0.1.3.tar.gz
- Upload date:
- Size: 19.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5c6dcbb4825d2a9956036aef99d10a367ec547efa48ebb676e8d68c264f15a48
|
|
| MD5 |
e12cd72cd1ecdcdf21bf7d32216d2999
|
|
| BLAKE2b-256 |
ad0fd72906f8eac163beb0839a3deca99d3c7602c395711300eb7999b8f2a7f2
|
File details
Details for the file tapkit-0.1.3-py3-none-any.whl.
File metadata
- Download URL: tapkit-0.1.3-py3-none-any.whl
- Upload date:
- Size: 20.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9cc93c9e83b5b6f0e54a7c0d75cbb08915f119f6eba565e2ea301ce0f7a11be7
|
|
| MD5 |
55a64eccf75b743adc3c33a302103fd2
|
|
| BLAKE2b-256 |
641256ecfe27d807ff21d2990585ea2047cbcdb3d78933f85ab5f2e5b4e34381
|