Skip to main content

WebQA Agent is an autonomous web browser agent that audits performance, functionality & UX for engineers and vibe-coding creators.

Project description

WebQA Agent

License GitHub stars GitHub forks GitHub issues Ask DeepWiki

Try Demo 🤗HuggingFace | 🚀ModelScope
Join us on 🎮Discord | 💬WeChat

English · 简体中文

If you like WebQA Agent, please give us a ⭐ on GitHub!
Click Star

🤖 WebQA Agent is a fully automated web testing agent for multi-modal understanding, test generation, and end-to-end evaluation of functionality, performance, and UX. ✨

📑 Table of Contents

🚀 Core Features

📋 Feature Overview

WebQA-Agent provides two testing modes to support different scenarios 🤖 Generate Mode and 📋 Run Mode.

Capability 🤖 Generate Mode 📋 Run Mode
Core Features AI-driven discovery -> Dynamic generation -> Precise execution Execute based on instructions and expected verification
Use Cases New feature, comprehensive quality assurance Repeatable and regression testing scenarios
User Input Minimal: Only URL or a one-sentence business goal Structured: Simple natural language step descriptions
Advantages Reflection-based planning, adaptive to UI changes; Configurable functional / performance / security / UX evaluation for comprehensive QA Stable and predictable results; No selector maintenance; Real-time Console and Network monitoring

🧭 Architecture

WebQA Agent Architecture

📹 Examples

Baidu Image Generation Test Demo

Try Demo: 🤗Hugging Face · 🚀ModelScope

🚀 Quick Start

🏎️ Recommended uv (Python>=3.11):

# 1) Create project and install package
uv init my-webqa && cd my-webqa
uv add webqa-agent

# 2) Install browser (required)
uv run playwright install chromium

# 3) Generate Mode
# Initialize Gen mode configuration (config.yaml)
uv run webqa-agent init -m gen
# Edit config.yaml: target.url, llm_config.api_key
# Configure test_config
# For more details, see "Usage > Generate Mode - Configuration" below
# Run Generate Mode
uv run webqa-agent gen
# Generate Mode with specified config and 4 parallel workers
uv run webqa-agent gen -c /path/to/config.yaml -w 4

# 4) Run Mode
# Initialize Run mode configuration (config_run.yaml)
uv run webqa-agent init -m run
# Edit config.yaml: target.url, llm_config.api_key
# Write natural language test cases
# For more details, see "Usage > Run Mode - Configuration" below
# Run Run Mode
uv run webqa-agent run
# Run Mode with specified config and 4 parallel workers
uv run webqa-agent run -c /path/to/config_run.yaml -w 4

🔧 Generate Mode - Optional Dependencies

Performance testing (Lighthouse): npm install lighthouse chrome-launcher (requires Node.js ≥18)

Security testing (Nuclei):

brew install nuclei      # macOS
nuclei -ut               # Update templates
# Linux/Windows: https://github.com/projectdiscovery/nuclei/releases

🐳 Generate Mode - Docker One-liner Start

Please ensure Docker is installed (recommended Docker >= 24.0, Docker Compose >= 2.32). Official guide: Docker Installation

mkdir -p config \
  && curl -fsSL https://raw.githubusercontent.com/MigoXLab/webqa-agent/main/config/config.yaml.example -o config/config.yaml

# Edit config.yaml: set target.url, llm_config.api_key, etc.

curl -fsSL https://raw.githubusercontent.com/MigoXLab/webqa-agent/main/start.sh | bash

⚙️ Usage

Generate Mode - Configuration

The configuration file must include the test_config field to define test types.

  • Functional Testing (AI type): Validates correctness of page functionality. Optional configurations:
    1. business_objectives: Specifies business goals to steer test focus and coverage.
    2. dynamic_step_generation: Enables automatic generation of additional steps when new UI elements are detected during execution.
    3. filter_model: Configures a lightweight model for pre-filtering page elements to improve planning efficiency.
  • Functional Testing (default type): Does not rely on LLMs; focuses only on interaction success (clicks, navigation, etc.).
  • User Experience Testing: Evaluates visual quality, typography/grammar, layout rendering, and provides optimization suggestions based on best practices.
  • Performance Testing: Based on Lighthouse; evaluates performance, SEO, and related metrics.
  • Security Testing: Based on Nuclei, scans web security vulnerabilities and potential risks.

For more details, please refer to docs/MODES&CLI.md

target:
  url: https://example.com              # Website URL to test
  description: Website QA testing

test_config:
  function_test:                        # Functional testing
    enabled: True
    type: ai                            # 'default' or 'ai'
    business_objectives: Test search functionality, generate 3 test cases
  ux_test:                              # User experience testing
    enabled: True
  performance_test:                     # Performance analysis (requires Lighthouse)
    enabled: False
  security_test:                        # Security scanning (requires Nuclei)
    enabled: False

llm_config:                             # LLM configuration, supports OpenAI, Anthropic Claude, Google Gemini, and OpenAI-compatible models (e.g., Doubao, Qwen)
  model: gpt-4.1-2025-04-14             # Primary model
  filter_model: gpt-4o-mini             # Lightweight model for element filtering (optional)
  api_key: your_api_key                 # Or set via environment variable (OPENAI_API_KEY)
  base_url: https://api.openai.com/v1   # Optional, API endpoint. For OpenAI-compatible models (Doubao, Qwen, etc.), set to their API endpoint

browser_config:
  headless: False                       # Auto True in Docker
  language: en-US

report:
  language: en-US                       # zh-CN or en-US

Run Mode - Configuration

Run Mode configuration must include the cases field.

  • Multi-modal Interaction: Use action to describe visible text, images, or relative positions on the page. Supported browser actions include click, hover, input, clear, keyboard input, scrolling, mouse movement, file upload, drag-and-drop, and wait; page actions include navigation, back.
  • Multi-modal Verification: Use verify to ensure the agent stays on track, validating visual content, URLs, paths, and combined image–element conditions.
  • End-to-End Monitoring: Monitoring Console logs and Network request status, and supporting configuration of ignore_rules to ignore known errors.

For more details and test case writing specifications, please refer to docs/MODES&CLI.md

target:
  url: https://example.com              # Target website URL

llm_config:                             # LLM configuration
  api: openai
  model: gpt-4o-mini
  api_key: your_api_key_here
  base_url: https://api.openai.com/v1

browser_config:
  viewport: {"width": 1280, "height": 720}
  headless: False                       # Auto True in Docker
  language: en-US
  # cookies: /path/to/cookie.json

ignore_rules:                           # Ignore rules configuration (optional)
  network:                              # Network request ignore rules
    - pattern: ".*\\.google-analytics\\.com.*"
      type: "domain"
  console:                              # Console log ignore rules
    - pattern: "Failed to load resource.*favicon"
      match_type: "regex"
    - pattern: "Warning:"
      match_type: "contains"

cases:                                  # Test case list
  - name: Image Upload                  # Test case name
    steps:                              # Test steps
      - action: Upload icon is the image icon in the input box, located next to the Baidu search button, used for uploading files
        args:
          file_path: ./tests/data/test.jpeg
      - action: Wait for image upload
      - verify: Verify that the input field displays an open palm/hand icon image
      - action: Enter "How many fingers are in the image?" in the search input box, then press Enter, wait 2 seconds

📊 View Results

Test reports are generated in the reports/ directory. Open the HTML file to view detailed results.

🛠️ Extending WebQA Agent Tools

WebQA Agent supports custom tool development for domain-specific testing capabilities.

Document Description
Custom Tool Development Quick reference for creating custom tools
LLM Context Document Comprehensive guide for AI-assisted development, useful for vibe coding

We welcome contributions! Check out existing tools for examples.

🗺️ RoadMap

  1. Interaction & Visualization: Real-time display of reasoning processes
  2. Generate Mode Expansion: Integration of additional evaluation dimensions
  3. Tool Agent Context Integration: More comprehensive and precise execution

🙏 Acknowledgements

  • natbot: Drive a browser with GPT-3
  • Midscene.js: AI Operator for Web, Android, Automation & Testing
  • browser-use: AI Agent for Browser control

📄 License

This project is licensed under the Apache 2.0 License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

webqa_agent-0.2.3.1.tar.gz (3.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

webqa_agent-0.2.3.1-py3-none-any.whl (3.6 MB view details)

Uploaded Python 3

File details

Details for the file webqa_agent-0.2.3.1.tar.gz.

File metadata

  • Download URL: webqa_agent-0.2.3.1.tar.gz
  • Upload date:
  • Size: 3.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for webqa_agent-0.2.3.1.tar.gz
Algorithm Hash digest
SHA256 705e6ac54f5c63b2fa9dda6356bf54f0f260c8ee7f7bdaefb8e66491cca41129
MD5 ce27605229a89d0692954624fdc28b52
BLAKE2b-256 2d6617dc890e6c9d1ac6c1aa4c3a6b5f66eac94b917db93a1e365495ffc805ae

See more details on using hashes here.

Provenance

The following attestation bundles were made for webqa_agent-0.2.3.1.tar.gz:

Publisher: workflow.yml on MigoXLab/webqa-agent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file webqa_agent-0.2.3.1-py3-none-any.whl.

File metadata

  • Download URL: webqa_agent-0.2.3.1-py3-none-any.whl
  • Upload date:
  • Size: 3.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for webqa_agent-0.2.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e7a6cbfb9a9a8d5a7a098b6e7b842ad0f3b408e4b3ee5f00de5dfebf3a35bb29
MD5 e07d6d3094fa5986379a67d197495ede
BLAKE2b-256 3c77dde4aa38be18b68b0b839b4b56bbbe60362d45742e4c0397d9eda025d8f5

See more details on using hashes here.

Provenance

The following attestation bundles were made for webqa_agent-0.2.3.1-py3-none-any.whl:

Publisher: workflow.yml on MigoXLab/webqa-agent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page