Web GUI for AutoGLM Phone Agent - AI-powered Android automation
Project description
AutoGLM-GUI
AutoGLM 手机助手的现代化 Web 图形界面 - 让 AI 自动化操作 Android 设备变得简单
✨ 特性
- 对话式任务管理 - 通过聊天界面控制 Android 设备
- 实时屏幕预览 - 基于 scrcpy 的低延迟视频流,随时查看设备正在执行的操作
- 直接操控手机 - 在实时画面上直接点击操作,支持精准坐标转换和视觉反馈
- 零配置部署 - 支持任何 OpenAI 兼容的 LLM API
- ADB 深度集成 - 通过 Android Debug Bridge 直接控制设备
📸 界面预览
任务开始
任务执行完成
🚀 快速开始
🎯 模型服务配置
AutoGLM-GUI 只需要一个 OpenAI 兼容的模型服务。你可以:
- 使用官方已托管的第三方服务
- 智谱 BigModel:
--base-url https://open.bigmodel.cn/api/paas/v4,--model autoglm-phone,--apikey <你的 API Key> - ModelScope:
--base-url https://api-inference.modelscope.cn/v1,--model ZhipuAI/AutoGLM-Phone-9B,--apikey <你的 API Key>
- 智谱 BigModel:
- 或自建服务:参考上游项目的部署文档用 vLLM/SGLang 部署
zai-org/AutoGLM-Phone-9B,启动 OpenAI 兼容端口后将--base-url指向你的服务。
示例:
# 使用智谱 BigModel
pip install autoglm-gui
autoglm-gui \
--base-url https://open.bigmodel.cn/api/paas/v4 \
--model autoglm-phone \
--apikey sk-xxxxx
# 使用 ModelScope
pip install autoglm-gui
autoglm-gui \
--base-url https://api-inference.modelscope.cn/v1 \
--model ZhipuAI/AutoGLM-Phone-9B \
--apikey sk-xxxxx
# 指向你自建的 vLLM/SGLang 服务
pip install autoglm-gui
autoglm-gui --base-url http://localhost:8000/v1 --model autoglm-phone-9b
前置要求
- Python 3.10+
- 已开启 USB 调试的 Android 设备
- 已安装 ADB 并添加到系统 PATH
- 一个 OpenAI 兼容的 API 端点
快捷运行(推荐)
无需手动准备环境,直接安装运行:
# 通过 pip 安装并启动
pip install autoglm-gui
autoglm-gui --base-url http://localhost:8080/v1
也可以使用 uvx 免安装启动(需已安装 uv,安装教程):
uvx autoglm-gui --base-url http://localhost:8080/v1
传统安装
# 从源码安装
git clone https://github.com/your-repo/AutoGLM-GUI.git
cd AutoGLM-GUI
uv sync
# 构建前端(必须)
uv run python scripts/build.py
# 启动服务
uv run autoglm-gui --base-url http://localhost:8080/v1
启动后,在浏览器中打开 http://localhost:8000 即可开始使用!
📖 使用说明
AI 自动化模式
- 连接设备 - 启用 USB 调试并通过 ADB 连接设备
- 对话 - 描述你想要做什么(例如:"去美团点一杯霸王茶姬的伯牙绝弦")
- 观察 - Agent 会逐步执行操作
手动控制模式
除了 AI 自动化,你也可以直接在实时画面上操控手机:
- 实时画面 - 界面右侧显示手机屏幕的实时视频流(基于 scrcpy)
- 点击操作 - 直接点击画面中的任意位置,操作会立即发送到手机
- 视觉反馈 - 每次点击都会显示涟漪动画和成功提示
- 精准转换 - 自动处理屏幕缩放和坐标转换,确保点击位置准确
技术细节:
- 使用 scrcpy 提供低延迟(~30-50ms)的 H.264 视频流
- 前端自动获取设备实际分辨率(如 1080x2400)
- 智能处理视频流缩放(如 576x1280)与设备分辨率的映射
- 支持 letterbox 黑边的精确坐标计算
🛠️ 开发指南
# 后端开发(自动重载)
uv run autoglm-gui --base-url http://localhost:8080/v1 --reload
# 前端开发服务器
cd frontend && pnpm dev
# 构建完整包
uv run python scripts/build.py --pack
📝 开源协议
Apache License 2.0
🙏 致谢
本项目基于 Open-AutoGLM 构建,感谢 zai-org 团队在 AutoGLM 上的卓越工作。
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file autoglm_gui-0.3.1.tar.gz.
File metadata
- Download URL: autoglm_gui-0.3.1.tar.gz
- Upload date:
- Size: 240.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5671c78f43c56b379f6bc65b9a0c445395ace6d7ffcde630dc096e8ede82866c
|
|
| MD5 |
b5040f68ee4c6968b905aef934e82378
|
|
| BLAKE2b-256 |
251c2e7f504b37896587d0f27c98242ab21d8059160d8261171e525fe082ffa6
|
Provenance
The following attestation bundles were made for autoglm_gui-0.3.1.tar.gz:
Publisher:
release.yml on suyiiyii/AutoGLM-GUI
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
autoglm_gui-0.3.1.tar.gz -
Subject digest:
5671c78f43c56b379f6bc65b9a0c445395ace6d7ffcde630dc096e8ede82866c - Sigstore transparency entry: 760878095
- Sigstore integration time:
-
Permalink:
suyiiyii/AutoGLM-GUI@ae502966a5bd6c04ad7bbeefe938ef32fd8ea79f -
Branch / Tag:
refs/tags/v0.3.1 - Owner: https://github.com/suyiiyii
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@ae502966a5bd6c04ad7bbeefe938ef32fd8ea79f -
Trigger Event:
push
-
Statement type:
File details
Details for the file autoglm_gui-0.3.1-py3-none-any.whl.
File metadata
- Download URL: autoglm_gui-0.3.1-py3-none-any.whl
- Upload date:
- Size: 252.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0fae5ce0b4b0e998517205c847f603ea8869afbb76e383c89b97ea3cf13c4d4a
|
|
| MD5 |
e7a79ea32c8d53770c86298cbe45edd3
|
|
| BLAKE2b-256 |
aef675780ea36b2c3160336086a684983c6ec7ca29e650e199684d0aae490b23
|
Provenance
The following attestation bundles were made for autoglm_gui-0.3.1-py3-none-any.whl:
Publisher:
release.yml on suyiiyii/AutoGLM-GUI
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
autoglm_gui-0.3.1-py3-none-any.whl -
Subject digest:
0fae5ce0b4b0e998517205c847f603ea8869afbb76e383c89b97ea3cf13c4d4a - Sigstore transparency entry: 760878096
- Sigstore integration time:
-
Permalink:
suyiiyii/AutoGLM-GUI@ae502966a5bd6c04ad7bbeefe938ef32fd8ea79f -
Branch / Tag:
refs/tags/v0.3.1 - Owner: https://github.com/suyiiyii
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@ae502966a5bd6c04ad7bbeefe938ef32fd8ea79f -
Trigger Event:
push
-
Statement type: