Skip to main content

PageEyes Agent 是一个轻量级 UI Agent,通过自然语言指令驱动,无需编写脚本既可实现Web、Android平台的UI自动化任务。

Project description

PageEyes Agent

PageEyes Agent 是基于 Pydantic AI 框架开发的一个轻量级 UI Agent, 其中元素信息感知能力依靠 OmniParserV2 模型,整个 Agent 的优势在于不依赖视觉语言大模型, 即使小参数的 LLM 也能胜任路径规划能力,同时支持多平台(Web、Android),目前主要包含以下功能:

  1. 完全由自然语言指令驱动,无需编写脚本,既可实现自动化测试,UI巡检等任务
  2. 跨平台、夸端支持,在 Python 环境中安装 page-eyes 库和配置 OmniParser 服务后即可开始 Web、Android 平台的自动化任务,未来还将继续支持iOS平台
  3. 支持多种大模型接入,包括DeepSeek、OpenAI、千问等,默认使用 DeepSeek V3 模型,后续会支持更多大模型接入
  4. 可通过自然语言进行断言,并生成详细的执行日志和报告,方便测试人员查看执行过程和结果

安装

pip install page-eyes

快速开始

配置环境变量

环境变量 默认值 说明
AGENT_MODEL openai:deepseek-v3 使用的AI模型,当前设置为deepseek-v3
AGENT_DEBUG False 是否启用调试模式
AGENT_HEADLESS False 是否使用无头模式
AGENT_LOG_GRAPH_NODE False 是否记录图节点日志
AGENT_OMNI_KEY test-UfcWMpXW Omni服务的认证密钥
OPENAI_BASE_URL http://deepseek.tmeoa.com/v1/ DeepSeek API的服务端点
OPENAI_API_KEY a22a37d7-xxx 调用DeepSeek API所需的认证密钥
COS_SECRET_ID xxx 腾讯云COS服务的Secret ID
COS_SECRET_KEY xxx 腾讯云COS服务的Secret Key

使用示例

import asyncio

from page_eyes.agent import WebAgent, MobileAgent


async def main():
    # Web 端
    ui_agent = await WebAgent.create(simulate_device='iPhone 15 Pro')

    # 移动端
    # ui_agent = await MobileAgent.create(serial='android-udid')

    report = await ui_agent.run(
        ('1.打开 url "https://yobang.tencentmusic.com/chart/uni-chart/rankList/"\n'
         '2.点击"查找icon"\n'
         '3.在搜索输入框中输入"小美满"\n'
         '4.点击"小美满> "\n'
         '5.点击"日榜"'
         ))


if __name__ == "__main__":
    asyncio.run(main())

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

page_eyes-0.2.4.tar.gz (242.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

page_eyes-0.2.4-py3-none-any.whl (182.4 kB view details)

Uploaded Python 3

File details

Details for the file page_eyes-0.2.4.tar.gz.

File metadata

  • Download URL: page_eyes-0.2.4.tar.gz
  • Upload date:
  • Size: 242.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.9

File hashes

Hashes for page_eyes-0.2.4.tar.gz
Algorithm Hash digest
SHA256 58e5413b4e29f24a34a56123ef18d60171935dca046867d30ff2c678e7a4121b
MD5 da99af100f015713a89280296fc049c6
BLAKE2b-256 b79dd66cbcf532ee4d547c8c2cae8cb3e2f3c86ac54f2482ac6f4fd0f8b0730b

See more details on using hashes here.

File details

Details for the file page_eyes-0.2.4-py3-none-any.whl.

File metadata

  • Download URL: page_eyes-0.2.4-py3-none-any.whl
  • Upload date:
  • Size: 182.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.9

File hashes

Hashes for page_eyes-0.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 2b2315b8f8266483eeaacff5ab426fe4c24b7dac9470a0fb3145e9c6cf9d84c1
MD5 088d38f90532ad35772c39a398c0aa55
BLAKE2b-256 f8b6a600efba42942e9d51439aefd56e82140d0ab9eb07ef6a596a4d7f1a7e20

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page