Model Context Protocol server for controlling Android & iOS devices with natural language
Project description
Minitap MCP Server
A Model Context Protocol (MCP) server that enables AI assistants to control and interact with real mobile devices (Android & iOS) through natural language commands.
Quick Start
Installation
pip install minitap-mcp
Prerequisites
Before running the MCP server, ensure you have the required mobile automation tools installed:
-
For Android devices:
- ADB (Android Debug Bridge) - For device communication
- Maestro - For mobile automation
-
For iOS devices (macOS only):
- Xcode Command Line Tools with
xcrun - Maestro - For mobile automation
- Xcode Command Line Tools with
For detailed setup instructions, see the mobile-use repository.
Running the Server
The simplest way to start:
minitap-mcp --server --api-key your_minitap_api_key
This starts the server on localhost:8000 with your API key. Get your free API key at platform.minitap.ai/api-keys.
Available CLI options:
minitap-mcp --server --api-key YOUR_KEY --llm-profile PROFILE_NAME
--api-key: Your Minitap API key (overridesMINITAP_API_KEYenv var). Get yours at platform.minitap.ai/api-keys.--llm-profile: LLM profile name to use (overridesMINITAP_LLM_PROFILE_NAMEenv var). If unset, uses the default profile. Configure profiles at platform.minitap.ai/llm-profiles.
Configuration (Optional)
Alternatively, you can set environment variables instead of using CLI flags:
export MINITAP_API_KEY="your_minitap_api_key"
export MINITAP_API_BASE_URL="https://platform.minitap.ai/api/v1"
export MINITAP_LLM_PROFILE_NAME="default"
You can set these in your .bashrc or equivalent, then simply run:
minitap-mcp --server
CLI flags always override environment variables when both are present.
By default, the server will bind to 0.0.0.0:8000. You can customize the port:
# Using CLI argument
minitap-mcp --server --port 9000
# Or using environment variable
export MCP_SERVER_PORT="9000"
minitap-mcp --server
# You can also customize the host
export MCP_SERVER_HOST="0.0.0.0"
IDE Integration
- Start the server:
minitap-mcp --server --api-key your_minitap_api_key - Add to your IDE MCP settings file:
# For Windsurf
{
"mcpServers": {
"minitap-mcp": {
"serverUrl": "http://localhost:8000/mcp"
}
}
}
# For Cursor
{
"mcpServers": {
"minitap-mcp": {
"transport": "http",
"url": "http://localhost:8000/mcp"
}
}
}
Available Resources & Tools
Once connected, your AI assistant can use these resources and tools:
Resource: data://devices
Lists all connected mobile devices (Android and iOS).
Returns: Array of device information objects with:
device_id: Device serial (Android) or UDID (iOS)platform:"android"or"ios"name: Device namestate: Device state ("connected"or"Booted")
Tool: analyze_screen
Captures a screenshot from a mobile device and analyzes it using a vision language model.
Parameters:
prompt(required): Analysis prompt describing what information to extractdevice_id(optional): Specific device ID to target. If not provided, uses the first available device.
Returns: AI-generated analysis of the screenshot based on the prompt.
Example:
Prompt: "What app is currently open? List all visible UI elements."
The tool will:
- Find the specified device (or first available)
- Capture a screenshot
- Analyze it with the vision model
- Return the analysis
Tool: execute_mobile_command
Execute natural language commands on your mobile device using the mobile-use SDK. This tool allows you to control your Android or iOS device with simple instructions.
Parameters:
goal(required): Natural language command to execute on the deviceoutput_description(optional): Description of the expected output format (e.g., "A JSON list of objects with sender and subject keys")profile(optional): Name of the profile to use for this task. Defaults to 'default'
Returns: Execution result with status, output, and any extracted data.
Examples:
# Simple command
goal: "Go to settings and tell me my current battery level"
# Data extraction with structured output
goal: "Open Gmail, find first 3 unread emails, and list their sender and subject line"
output_description: "A JSON list of objects, each with 'sender' and 'subject' keys"
# App navigation
goal: "Open Twitter and scroll to the latest tweet"
The tool will:
- Find the specified device (or first available)
- Execute the command using the mobile-use AI agent
- Return the result or extracted data
Tool: save_figma_assets
Fetch Figma design assets and React implementation code, then save them locally in the workspace.
Parameters:
node_id(required): The node ID of the Figma design in format "1:2" (colon-separated). Extract from URLs likehttps://figma.com/design/:fileKey/:fileName?node-id=1-2file_key(required): The file key from the Figma URL (e.g., "abc123" fromhttps://figma.com/design/abc123/MyFile)workspace_path(optional): The workspace path where assets should be saved. Defaults to current directory.
Returns: Download summary with list of successfully downloaded assets and any failures.
Example:
node_id: "1:2"
file_key: "abc123xyz"
workspace_path: "."
The tool will:
- Call
get_design_contextfrom Figma MCP to get React/TypeScript code - Extract all asset URLs from the code implementation
- Download each asset to
.mobile-use/figma_assets/<node-id>/folder - Save the code implementation to
.mobile-use/figma_assets/<node-id>/code_implementation.ts - Return a list of downloaded files with success/failure status
Tool: compare_screenshot_with_figma
Compare a screenshot of the current mobile device state with a Figma design to identify visual differences.
Parameters:
node_id(required): The node ID of the Figma design in format "1:2" (colon-separated). Extract from URLs likehttps://figma.com/design/:fileKey/:fileName?node-id=1-2
Returns: Detailed comparison report with both the Figma design and current device screenshots for visual context.
The tool will:
- Capture a screenshot of the current device state
- Fetch the Figma design screenshot
- Compare both screenshots using vision AI
- Return a detailed analysis highlighting differences
Advanced Configuration
Custom ADB Server
If using a remote or custom ADB server (like on WSL):
export ADB_SERVER_SOCKET="tcp:192.168.1.100:5037"
Vision Model
Customize the vision model used for screen analysis:
export VISION_MODEL="qwen/qwen-2.5-vl-7b-instruct"
Device Setup
Android
- Enable USB debugging on your device
- Connect via USB or network ADB
- Verify connection:
adb devices
iOS (macOS only)
- Install Xcode Command Line Tools
- Start a simulator or connect a physical device
- Verify:
xcrun simctl list devices booted
Troubleshooting
No devices found:
- Verify ADB/xcrun connection
- Check USB debugging is enabled (Android)
- Ensure device is unlocked
Connection refused errors:
- Check ADB/xcrun connection
API authentication errors:
- Verify
MINITAP_API_KEYis set correctly
Links
- Mobile-Use SDK: github.com/minitap-ai/mobile-use
- Mobile-Use Documentation: docs.minitap.ai
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file minitap_mcp-0.4.3.tar.gz.
File metadata
- Download URL: minitap_mcp-0.4.3.tar.gz
- Upload date:
- Size: 21.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.9.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0aeab4b15e1b08468073d59f20930539d7f0c6106117397e7d36506545ec1374
|
|
| MD5 |
9fd5a2af23be586a7ae9d15995cce850
|
|
| BLAKE2b-256 |
71cb33418aa5b2b871dbe16cd1d12cfaa522cceca4de2fff7b11c763bfca7bcf
|
File details
Details for the file minitap_mcp-0.4.3-py3-none-any.whl.
File metadata
- Download URL: minitap_mcp-0.4.3-py3-none-any.whl
- Upload date:
- Size: 30.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.9.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
400e87a6bee4379290e02a26706cdd97f3f142066f069aaa4b996fbbdfcc65e5
|
|
| MD5 |
f5f53e20d3c1b81d4ec9dd72b4730ab0
|
|
| BLAKE2b-256 |
88df70a4a33e60838ad27c2c8b331451e18170b374cad70103525c663df795e9
|