Skip to main content

Open-source tool for UI-driven API regression testing.

Project description

scout

UI-driven API regression testing. Maintained by BoxProbe.

PyPI Python License: MIT

scout drives your web app's UI like a real user, records the resulting API traffic, and produces a diff report between two runs. When a deploy silently changes a response shape, scout tells you which endpoint changed, in which user flow, and exactly how.

Scenarios are recorded once and replay deterministically — no AI in the hot path, no per-request API fees, no SaaS subscription. Same scenario on the same app produces the same trace, at $0 per run.


Why it has to be open source

scout will run inside your CI, drive your app's UI, and capture every HTTP request and response that flows through your app during execution. That level of access has to be auditable, not taken on trust:

  • Source is open and short. Every Python file is reviewable before scout touches your repo.
  • Nothing leaves your machine. Recordings, diff reports, and run history all stay on the host running scout. No telemetry, no phone-home, no upload step.
  • No account required. scout is a local CLI. If you block outbound network at the firewall, it still works against a locally running app.
  • The recording proxy is a child process on localhost. You can attach a debugger to it like any other process.

If something looks off, you read the source. That's the deal.


Why scout

  • Deterministic, and free to run. Pixel-anchored locators make element resolution pure math, not a probabilistic selector match. No AI runs in the hot path. No tokens consumed, no per-request API fees. Cheap enough to run on every PR and nightly without anyone asking about cost.
  • Locators survive refactors. Elements are referenced by their on-page bounding box at recording time, not by CSS class names. Renaming .btn-primary-large to .PrimaryButton__lg doesn't break tests.
  • Real cross-version diff. Run against v1.0, run against v1.1, scout diff produces an HTML report grouping changes by endpoint and user flow. Surface real regressions, not selector breakage.
  • Narrow scope. scout catches one specific class of bug: API behavior drift that survives your existing tests because it only manifests through real UI interaction.

What scout isn't

scout has a deliberately narrow scope. It does not try to be:

  • A contract testing tool — you don't write an OpenAPI spec; scout observes what the UI actually triggers
  • A load test runner — one scenario, one user, deterministic replay
  • A unit test replacement — unit tests catch logic bugs in your code; scout catches behavioral drift in your API
  • A cloud service — local CLI, no account, no upload

Install

pip install boxprobe-scout
playwright install chromium

Requires Python ≥ 3.11.


Quickstart

A scenario is a Python file declaring locators and an async test function:

# scenarios/login/test.py
from scout.runner import Locator, Page, Scenario

scenario = Scenario(
    name="login",
    base_url="https://your-app.example.com",
    viewport_width=1280,
    viewport_height=800,
)

email = Locator(name="email", tag="input", bbox=(640, 320, 280, 32))
password = Locator(name="password", tag="input", bbox=(640, 372, 280, 32))
submit = Locator(name="submit", tag="button", bbox=(640, 428, 280, 40))


@scenario.test
async def test(page: Page) -> None:
    await page.goto("/login")
    await page.fill(email, "user@example.com")
    await page.fill(password, "password123")
    await page.click(submit)
    await page.wait(2000)


if __name__ == "__main__":
    scenario.run()

Run it once against your baseline, then again against the target version:

scout run scenarios/              # records API traffic, stores under .scout/runs/
scout run scenarios/              # second run after deploy

scout runs                        # list run IDs
scout diff <baseline-id> <target-id>

scout opens an HTML report grouped by endpoint, with structural and value diffs side-by-side.


How it works

scout run
  └── Playwright browser → recording proxy → your app
                                 │
                                 └── API traffic → .scout/runs/<id>/record.db

scout diff baseline target
  └── Pair endpoints by path + structural query → diff status + JSON shape + values
                                 │
                                 └── HTML report grouped by user flow + endpoint

The recording proxy is mitmproxy running in a child process. Tests run headless by default; pass scout verify for a debug mode with screenshots and no proxy.


Status

scout is alpha (0.1.x). The CLI surface and scenario DSL are stabilizing; expect breaking changes between minor versions until 1.0. Production-quality diff reports and stable file formats are the v1.0 bar.

What works today:

  • Pixel-anchored locators with absolute, relative, and delta-from-parent positioning modes
  • Recording proxy + per-scenario API capture
  • Structural and value diffs with known-change suppression via diff_ignore.json
  • HTML diff report with filterable endpoint table

What's coming:

  • npm-distributable wrapper for TS projects
  • GitHub Action template for CI
  • MCP server for AI agent integration

Using scout

The common pattern is accepting scout-runnable test PRs:

  1. A contributor opens a PR adding scenarios under tests-regression/ (or wherever you prefer).
  2. Your CI runs scout run against the baseline and target versions of the app, then scout diff against the two recordings.
  3. The diff report goes up as a PR comment or build artifact.
  4. You review like any other PR. The tests are plain Python, the diff report is a self-contained HTML file — no proprietary format to inspect.

Hand-writing scenarios from scratch is also supported but tedious at scale, since pixel-anchored Locator coordinates are awkward to type by hand. The scenario file format is open and stable enough to target with your own recording tooling — point a browser extension or annotation pipeline at it and emit test.py files in the format shown in the Quickstart.


Contributing

See CONTRIBUTING.md. PRs welcome; please open an issue first for non-trivial changes.


License

MIT — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

boxprobe_scout-0.1.3.tar.gz (314.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

boxprobe_scout-0.1.3-py3-none-any.whl (77.0 kB view details)

Uploaded Python 3

File details

Details for the file boxprobe_scout-0.1.3.tar.gz.

File metadata

  • Download URL: boxprobe_scout-0.1.3.tar.gz
  • Upload date:
  • Size: 314.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for boxprobe_scout-0.1.3.tar.gz
Algorithm Hash digest
SHA256 1199e73cd08441135d39d77d1da2aa8c2fc36aa8fd6a9d4208b8a9f31129930f
MD5 36292882f41c6bb85ecfcce44d86a6d7
BLAKE2b-256 63f7c8d909b45e6ef8f0422cc6fb0676caad4dd0c38e0206bb1fa7f1addff139

See more details on using hashes here.

Provenance

The following attestation bundles were made for boxprobe_scout-0.1.3.tar.gz:

Publisher: release.yml on boxprobe/scout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file boxprobe_scout-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: boxprobe_scout-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 77.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for boxprobe_scout-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 257006dfdd471db501d9ffae6fe10ba20675480fd2b8b2e21831b27e2a2bc8c9
MD5 24937cbe99d37a52e880c510e5f1f7c3
BLAKE2b-256 a0d2b482ec2f7bf0604c00edf118a95e199b7fcc4952a4650b279e9dabebb186

See more details on using hashes here.

Provenance

The following attestation bundles were made for boxprobe_scout-0.1.3-py3-none-any.whl:

Publisher: release.yml on boxprobe/scout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page