Agentic exploratory QA testing for web applications

QA Agent

Automated exploratory QA testing for web applications. Simulates real user interactions (mouse, keyboard, form input, accessibility checks) and optionally uses Claude to generate custom test steps from plain-English instructions.

[Image: console output showing a test run in progress]

Features
Installation
Quick Start
Agentic Testing
Web Interface
CLI Reference
Programmatic Usage
Test Categories
Output Formats
CI/CD Integration
Architecture
Exit Codes

Features

Category	What it does
Agentic testing	Give Claude a bug report or feature spec; it generates custom Playwright test steps automatically
Two modes	`focused` tests only given URLs; `explore` crawls and discovers additional pages
Six test suites	Keyboard nav, mouse interaction, form handling, accessibility (WCAG), WCAG 2.1 AA compliance (opt-in), error detection
Auth support	Username/password, cookies, Bearer tokens, custom headers
Four output formats	Console, Markdown, JSON, PDF
Screenshots & video	On-error or every-interaction screenshots; full session video recording
Web UI	Browser-based dashboard for launching runs, watching live output, and browsing past sessions

Installation

# Core install
pip install qa-agent
playwright install chromium

# PDF report support
pip install "qa-agent[pdf]"

# Web UI support
pip install "qa-agent[web]"

# Everything
pip install "qa-agent[all]"
playwright install chromium

Requirements: Python 3.10+, Playwright ≥ 1.40

Note: Playwright requires browser binaries installed separately after the Python package. Run playwright install chromium (or playwright install for all browsers) once after install.

Agentic testing requires an Anthropic API key:

export ANTHROPIC_API_KEY=sk-ant-...

Quick Start

# Test a single URL
qa-agent https://example.com

# Test multiple URLs
qa-agent https://example.com https://example.com/about

# Crawl and test discovered pages (depth 2, up to 20 pages)
qa-agent --mode explore --max-depth 2 https://example.com

# Generate JSON + Markdown reports
qa-agent --output json,markdown https://example.com

Agentic Testing

Pass natural-language instructions and Claude generates custom test steps that run alongside the standard suite.

# From a bug report
qa-agent --instructions "The login button does nothing when email is blank — no validation error is shown" \
  https://example.com/login

# From a feature description
qa-agent --instructions "We added a 'Remember me' checkbox to the login form. \
  It should persist the session across browser restarts and be unchecked by default." \
  https://example.com/login

# From a file (for longer specs)
qa-agent --instructions-file feature-spec.txt https://example.com

What happens

1. Before any browser testing, Claude receives your instructions and the target URL.
2. Claude returns a structured plan: summary, focus areas, custom Playwright test steps, and suggested URLs.
3. The agent prints the plan, then runs those custom steps on every tested page alongside the five standard suites.
4. Assertion failures become findings in the report with the severity and category Claude assigned.

If the API call fails, a warning is printed and the run continues with standard tests only.
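The fallback behaviour described above can be sketched as follows. This is a minimal illustration, not the package's implementation: `run_with_optional_plan` and both planner functions are hypothetical names, the planner is simulated rather than calling the Anthropic API, and running custom steps before the standard suites is an assumption.

```python
import warnings
from typing import Callable, Optional

# Names of the five standard suites listed in this README.
STANDARD_SUITES = ["keyboard", "mouse", "forms", "accessibility", "errors"]


def run_with_optional_plan(
    url: str,
    instructions: Optional[str],
    generate_plan: Callable[[str, str], list[str]],
) -> list[str]:
    """Return the steps to run for `url` (hypothetical helper).

    Standard suites always run; custom steps are added only when
    plan generation succeeds.
    """
    if not instructions:
        return list(STANDARD_SUITES)
    try:
        custom = generate_plan(instructions, url)
    except Exception as exc:  # any planner failure degrades to standard tests
        warnings.warn(f"Plan generation failed, running standard tests only: {exc}")
        return list(STANDARD_SUITES)
    return custom + STANDARD_SUITES


def failing_planner(instructions: str, url: str) -> list[str]:
    raise ConnectionError("API unreachable")  # simulated API failure


def working_planner(instructions: str, url: str) -> list[str]:
    return [f"custom: {instructions}"]
```

With `failing_planner`, the call emits a warning and returns only the five standard suites; with `working_planner`, the custom step is added to them.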

Model & caching

# Use a different model (default: claude-sonnet-4-6)
qa-agent --ai-model claude-opus-4-6 --instructions "Test checkout" https://shop.example.com

# Bypass the plan cache and always call the API
qa-agent --no-cache --instructions "..." https://example.com

Generated test plans are cached by default; rerunning with the same instructions and URL reuses the cached plan.
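To illustrate how a plan cache keyed on instructions and URL might behave, here is a small sketch. The key scheme (SHA-256 over a JSON payload that also includes the model name) and the `PlanCache` class are assumptions made for illustration; the README does not document how plan_cache.py stores or keys its entries.

```python
import hashlib
import json


def plan_cache_key(instructions: str, url: str, model: str = "claude-sonnet-4-6") -> str:
    """Derive a stable key from the inputs that determine a plan (assumed scheme)."""
    payload = json.dumps(
        {"instructions": instructions.strip(), "url": url, "model": model},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


class PlanCache:
    """In-memory stand-in for a persistent plan cache (hypothetical)."""

    def __init__(self) -> None:
        self._plans: dict[str, dict] = {}

    def get_or_create(self, instructions, url, make_plan, use_cache=True):
        key = plan_cache_key(instructions, url)
        # use_cache=False mirrors the effect of --no-cache: always regenerate.
        if use_cache and key in self._plans:
            return self._plans[key]
        plan = make_plan(instructions, url)
        self._plans[key] = plan
        return plan
```

Calling `get_or_create` twice with the same instructions and URL invokes `make_plan` once; passing `use_cache=False` forces a fresh call.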

Web Interface

[Image: web interface configuration form]

A browser-based dashboard for configuring and monitoring runs.

# Start the server (opens at http://127.0.0.1:5000)
python -m qa_agent web
# or
qa-agent-web

# Custom host/port
qa-agent-web --host 0.0.0.0 --port 8080

Features:

Configuration form with all options (collapsible sections, preset save/load)
Real-time streaming output via Server-Sent Events
Stop a running test mid-run
Browse past sessions grouped by domain
Session detail: findings table, severity breakdown, screenshot gallery, report downloads

[Image: session detail view showing findings table]

The web interface has no authentication and is intended for local or internal use only. Binding to 0.0.0.0 exposes it to every host that can reach the machine.

All output is written to output/ in the project directory. CLI sessions are also visible in the web UI as long as JSON output format was used.

CLI Reference

Modes

qa-agent --mode focused https://example.com   # default: test only given URLs
qa-agent --mode explore  https://example.com   # crawl and test discovered pages

Exploration options (explore mode)

Flag	Default	Description
`--max-depth N`	`3`	Max link depth to follow
`--max-pages N`	`20`	Max pages to test
`--allow-external`	off	Follow links to other domains
`--ignore PATTERN`	—	Regex pattern(s) for URLs to skip (repeatable)

Authentication

# Username/password with login URL
qa-agent --auth "username:password@https://example.com/login" https://example.com/dashboard

# JSON auth file
qa-agent --auth-file auth.json https://example.com

# Pre-set cookies
qa-agent --cookies cookies.json https://example.com

# Custom header (repeatable)
qa-agent --header "Authorization: Bearer token123" https://example.com

auth.json schema:

{
  "username": "testuser",
  "password": "testpass",
  "auth_url": "https://example.com/login",
  "username_selector": "input#email",
  "password_selector": "input#password",
  "submit_selector": "button[type=submit]"
}
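As a rough sketch of how the two credential formats above could be parsed, the following uses only the standard library. `AuthSettings`, `parse_auth_string`, and `parse_auth_json` are hypothetical names, and the split rules (first `@http` separates credentials from the URL, first `:` separates username from password) and the choice of required keys are assumptions, not the package's documented behaviour.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass
class AuthSettings:
    username: str
    password: str
    auth_url: str
    username_selector: Optional[str] = None
    password_selector: Optional[str] = None
    submit_selector: Optional[str] = None


def parse_auth_string(value: str) -> AuthSettings:
    """Parse 'username:password@https://host/login' (assumed split rules)."""
    creds, sep, rest = value.partition("@http")
    if not sep or ":" not in creds:
        raise ValueError("expected 'username:password@URL'")
    username, _, password = creds.partition(":")
    return AuthSettings(username, password, "http" + rest)


def parse_auth_json(text: str) -> AuthSettings:
    """Parse the auth.json schema shown above; selectors are treated as optional."""
    data = json.loads(text)
    missing = {"username", "password", "auth_url"} - data.keys()
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")
    return AuthSettings(**data)


auth = parse_auth_string("username:password@https://example.com/login")
```

Splitting on `@http` rather than the first `@` keeps passwords that contain `@` intact.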

Output

# Formats: console, markdown, json, pdf (comma-separated, default: console,markdown)
qa-agent --output console,markdown,json,pdf https://example.com

# Custom output directory (default: <project-root>/output)
qa-agent --output-dir ./reports https://example.com

Output is organized as output/{domain}/{session_id}/, with qa_reports/, screenshots/, and recordings/ subdirectories.
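The directory layout can be reproduced with pathlib when locating reports from a script. This is a sketch under assumptions: `session_dirs` is a hypothetical helper, and using the URL's network location as the {domain} segment is a guess at how the package names that directory.

```python
from pathlib import Path
from urllib.parse import urlparse


def session_dirs(url: str, session_id: str, output_dir: str = "output") -> dict[str, Path]:
    """Build the three per-session directories for a tested URL."""
    domain = urlparse(url).netloc  # assumed to be the {domain} segment
    base = Path(output_dir) / domain / session_id
    return {name: base / name for name in ("qa_reports", "screenshots", "recordings")}


dirs = session_dirs("https://example.com/login", "a1b2c3d4")
```

For the values above, `dirs["screenshots"]` is the path output/example.com/a1b2c3d4/screenshots.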

PDF output requires weasyprint; install it with pip install "qa-agent[pdf]". If weasyprint is not installed, the PDF report falls back to Markdown.

Screenshots & recording

qa-agent --screenshots       https://example.com  # capture on errors
qa-agent --screenshots-all   https://example.com  # capture after every interaction
qa-agent --full-page         https://example.com  # full-page screenshots
qa-agent --record            https://example.com  # record session video

Browser options

qa-agent --no-headless        https://example.com  # visible browser window
qa-agent --viewport 1920x1080 https://example.com  # custom viewport (default: 1280x720)
qa-agent --timeout 60000      https://example.com  # timeout in ms (default: 30000)

Test category flags

# Skip standard suites
qa-agent --skip-keyboard      https://example.com
qa-agent --skip-mouse         https://example.com
qa-agent --skip-forms         https://example.com
qa-agent --skip-accessibility https://example.com
qa-agent --skip-errors        https://example.com

# Enable opt-in suites
qa-agent --wcag-compliance    https://example.com  # detailed WCAG 2.1 AA audit

Programmatic Usage

from qa_agent import QAAgent, TestConfig, TestMode, OutputFormat

config = TestConfig(
    urls=["https://example.com"],
    mode=TestMode.EXPLORE,
    output_formats=[OutputFormat.CONSOLE, OutputFormat.JSON, OutputFormat.PDF],
    max_depth=2,
    max_pages=10,
    # Optional: agentic testing
    instructions="Verify the password reset flow sends an email and the link expires after 24 hours.",
    ai_model="claude-opus-4-6",
)

agent = QAAgent(config)
session = agent.run()

print(f"Pages tested:   {len(session.pages)}")
print(f"Total findings: {session.total_findings}")

for finding in session.get_all_findings():
    print(f"  [{finding.severity.value.upper()}] {finding.title}")

Test Categories

Keyboard Navigation

TAB order and focusability · Arrow key navigation in widgets · Enter key activation · Escape key for closing modals · Keyboard trap detection · Focus visibility indicators

Mouse Interaction

Click target functionality · Hover states · Double-click behavior · Right-click/context menus · Click target sizes (WCAG 2.5.5 minimum 44×44 px) · Overlapping element detection

Form Handling

Required field indicators · Input validation feedback · Error message accessibility · Label associations · HTML5 input types · Autocomplete attributes

Accessibility (WCAG)

Image alt text · Heading structure (h1–h6) · Link text quality · Color contrast · ARIA usage · Landmark regions · Language attributes · Skip navigation links

Error Detection

Console errors and warnings · Network errors (4xx, 5xx) · JavaScript exceptions · Broken images · Broken anchor links · Mixed content (HTTP on HTTPS)

WCAG 2.1 AA Compliance (opt-in: `--wcag-compliance`)

Covers WCAG criteria not already in the standard accessibility suite: non-text contrast (1.4.11) · use of color (1.4.1) · content on hover/focus (1.4.13) · meaningful sequence (1.3.2) · input purpose (1.3.5) · focus visible (2.4.7) · label in name (2.5.3) · target size (2.5.5, a Level AAA criterion in WCAG 2.1) · language of parts (3.1.2) · error identification (3.3.1) · detailed ARIA role/property validation

Output Formats

Console

[Image: colorized console output with summary table]

======================================================================
  QA AGENT TEST REPORT
======================================================================
  Session ID: a1b2c3d4
  Started:    2024-01-15 10:30:00
  Duration:   45.2 seconds
  Mode:       explore
======================================================================

SUMMARY
  Pages tested:   5
  Total findings: 12

  By Severity:
    HIGH:   2
    MEDIUM: 5
    LOW:    5

JSON

{
  "meta": {
    "session_id": "a1b2c3d4",
    "start_time": "2024-01-15T10:30:00",
    "duration_seconds": 45.2
  },
  "summary": {
    "pages_tested": 5,
    "total_findings": 12,
    "findings_by_severity": { "high": 2, "medium": 5, "low": 5 }
  },
  "findings": [...]
}
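A report with this shape can be consumed from a script using the standard json module. The sketch below relies only on the meta and summary keys shown above; the sample uses an empty findings array because the structure of individual findings is not shown here, and `summarize` is a hypothetical helper.

```python
import json

SAMPLE_REPORT = """
{
  "meta": {
    "session_id": "a1b2c3d4",
    "start_time": "2024-01-15T10:30:00",
    "duration_seconds": 45.2
  },
  "summary": {
    "pages_tested": 5,
    "total_findings": 12,
    "findings_by_severity": { "high": 2, "medium": 5, "low": 5 }
  },
  "findings": []
}
"""


def summarize(report_text: str) -> str:
    """Produce a one-line summary from a JSON report string."""
    report = json.loads(report_text)
    summary = report["summary"]
    by_severity = summary["findings_by_severity"]
    # Severities with no findings may be absent, so default them to 0.
    blocking = by_severity.get("critical", 0) + by_severity.get("high", 0)
    return (
        f"{report['meta']['session_id']}: {summary['pages_tested']} pages, "
        f"{summary['total_findings']} findings, {blocking} critical/high"
    )


print(summarize(SAMPLE_REPORT))  # a1b2c3d4: 5 pages, 12 findings, 2 critical/high
```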

Severity levels

Level	Meaning
`CRITICAL`	Security issues, data loss
`HIGH`	Major usability blockers
`MEDIUM`	UX problems, accessibility issues
`LOW`	Minor improvements, best practices
`INFO`	Informational findings

CI/CD Integration

# GitHub Actions example
- name: Run QA Tests
  env:
    ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
  run: |
    pip install qa-agent
    playwright install chromium
    qa-agent --output json --output-dir ./qa-results https://staging.example.com

- name: Upload Results
  if: always()  # upload the report even when the QA step exits non-zero
  uses: actions/upload-artifact@v4
  with:
    name: qa-results
    path: ./qa-results/

The process exits with code 1 when critical or high severity issues are found, failing the CI step automatically.

Architecture

qa_agent/
├── cli.py               # Argument parsing, entry point
├── agent.py             # Core orchestrator
├── config.py            # TestConfig, AuthConfig, ScreenshotConfig, RecordingConfig
├── models.py            # Finding, PageAnalysis, TestSession
├── ai_planner.py        # Claude integration (plan generation & caching)
├── plan_cache.py        # Plan cache persistence
├── testers/
│   ├── keyboard.py         # Keyboard navigation tests
│   ├── mouse.py            # Mouse interaction tests
│   ├── forms.py            # Form handling tests
│   ├── accessibility.py    # WCAG / accessibility tests
│   ├── wcag_compliance.py  # Detailed WCAG 2.1 AA compliance (opt-in)
│   └── errors.py           # Console & network error detection
└── reporters/
    ├── console.py       # Real-time colored output
    ├── markdown.py      # Markdown report
    ├── json_reporter.py # JSON report
    └── pdf.py           # PDF report (requires weasyprint)

Adding a custom tester

1. Create testers/my_tester.py with a class that extends BaseTester and implements run() -> list[Finding].
2. Export it from testers/__init__.py.
3. Add a test_my_feature: bool = True flag to TestConfig in config.py.
4. Call it from agent.py in _test_page().
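The shape of such a tester might look like the sketch below. To keep it runnable without the package installed, `BaseTester` and `Finding` here are simplified stand-ins, not the real classes: the actual base class most likely receives a Playwright page rather than an HTML string, and the real Finding model has its own fields. `MetaDescriptionTester` is a hypothetical example.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass
class Finding:  # stand-in; the real model lives in qa_agent/models.py
    title: str
    severity: str
    category: str


class BaseTester(ABC):  # stand-in for the package's base class
    def __init__(self, page_html: str) -> None:
        self.page_html = page_html

    @abstractmethod
    def run(self) -> list[Finding]:
        """Inspect the page and return any findings."""


class MetaDescriptionTester(BaseTester):
    """Hypothetical tester: flags pages without a meta description."""

    def run(self) -> list[Finding]:
        if '<meta name="description"' in self.page_html:
            return []
        return [Finding("Missing meta description", "low", "seo")]


findings = MetaDescriptionTester("<html><head></head></html>").run()
```

Returning an empty list when nothing is wrong keeps the tester composable with the others: the orchestrator can simply concatenate the lists from every suite.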

Exit Codes

Code	Meaning
`0`	All tests passed (no critical/high findings)
`1`	Critical or high severity issues found
`2`	Error running tests
`130`	Interrupted by user (Ctrl+C)
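For wrapper scripts that call the CLI, the table above can be encoded directly. This is a sketch: `exit_code_for` and `describe_exit` are hypothetical helpers that mirror the documented rule (exit 1 when critical or high findings exist), not functions exported by the package.

```python
EXIT_MEANINGS = {
    0: "All tests passed (no critical/high findings)",
    1: "Critical or high severity issues found",
    2: "Error running tests",
    130: "Interrupted by user (Ctrl+C)",
}


def exit_code_for(findings_by_severity: dict[str, int]) -> int:
    """Return 1 if any critical or high findings were counted, else 0."""
    blocking = findings_by_severity.get("critical", 0) + findings_by_severity.get("high", 0)
    return 1 if blocking > 0 else 0


def describe_exit(code: int) -> str:
    """Translate a process exit code into the meaning documented above."""
    return EXIT_MEANINGS.get(code, f"Unexpected exit code {code}")
```

A wrapper could pass the return code of a subprocess call to `describe_exit` to log a readable reason before failing the build.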

License

MIT

Maintainers

billrichards

Release history

0.2.1 (Apr 14, 2026)
0.2.0 (Apr 14, 2026)
0.1.1 (Apr 10, 2026)
0.1.0 (Apr 7, 2026, this version)
