Browser-based human-in-the-loop UIs for AI coding agents

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

OpenWebGoggles

AI coding agents are good at writing code. They are not good at showing you things. An agent can generate a 200-line diff, but it has no way to pull up a side-by-side review UI, highlight the parts that matter, and wait for you to say "approved" or "try again with fewer abstractions."

OpenWebGoggles fixes that. It gives any agent — Claude Code, a shell script, anything that can write JSON — the ability to open a browser-based UI and get structured decisions back from a human.

Not a chat interface. Not a terminal dump. A real interactive panel: forms, approval flows, dashboards, multi-step wizards. The kind of thing you'd build if you had a few days and a frontend team. Except the agent builds it on the fly from a JSON schema, and the whole round-trip takes seconds.

Agent ←→ OpenWebGoggles Server ←→ Browser UI ←→ Human

"The goggles — they do everything."

What This Actually Looks Like

Here's a concrete example. Your agent finishes a security audit and has 12 findings to triage. Without OpenWebGoggles, it dumps them into the terminal and asks you to type approve or reject twelve times. With OpenWebGoggles, it opens a tabbed wizard in your browser — one finding per screen, editable severity dropdowns, analyst notes, a progress bar — and reads back your structured decisions when you're done.

The agent doesn't need to know HTML. It writes a JSON object describing what it wants to show, and the built-in dynamic renderer handles the rest:

{
  "title": "Security Finding 1 of 12",
  "status": "waiting_input",
  "data": {
    "sections": [
      { "type": "text", "content": "**SQL Injection** in `/api/users` endpoint" },
      { "type": "form", "fields": [
        { "key": "severity", "label": "Severity", "type": "select",
          "options": ["critical", "high", "medium", "low"], "value": "high" },
        { "key": "notes", "label": "Analyst Notes", "type": "textarea" }
      ]}
    ]
  },
  "actions_requested": [
    { "id": "confirm", "type": "approve", "label": "Confirmed" },
    { "id": "fp", "type": "reject", "label": "False Positive" }
  ]
}

The agent gets back:

{
  "actions": [{
    "action_id": "confirm",
    "type": "approve",
    "value": { "severity": "critical", "notes": "Escalated — no parameterized queries anywhere in this module." }
  }]
}

Structured data in, structured data out. The browser is just the rendering layer in between.

Quick Start

Install from PyPI:

pip install openwebgoggles

Then bootstrap for your editor:

Claude Code

openwebgoggles init claude

This creates .mcp.json and .claude/settings.json with the right permissions. Restart Claude Code and you're live.

OpenCode

openwebgoggles init opencode

This creates opencode.json with the MCP server configured. Restart OpenCode and you're live.

Try It

Tell your agent:

"Show me a review UI for these changes and wait for my approval."

"Create a dashboard showing the build progress."

"Walk me through these security findings one at a time with severity dropdowns."

The agent figures out the JSON schema, calls webview_ask, and a panel opens in your browser. You make your decisions, click approve, and the agent continues with your structured response.

What Gets Installed

Four MCP tools — that's the entire API surface:

Tool	What it does
`webview_ask(state)`	Show a UI and block until the human responds
`webview_show(state)`	Show a UI without blocking (dashboards, progress)
`webview_read()`	Poll for actions without blocking
`webview_close()`	Close the session

Manual Setup

If you'd rather configure things by hand, add to your project's .mcp.json:

{
  "mcpServers": {
    "openwebgoggles": {
      "command": "openwebgoggles"
    }
  }
}

Or for OpenCode, add to opencode.json:

{
  "mcp": {
    "openwebgoggles": {
      "type": "local",
      "command": ["openwebgoggles"],
      "enabled": true
    }
  }
}

Bash Scripts (for shell-based agents)

If your agent orchestrates via shell scripts — or if you just want to understand the mechanics — the bash interface exposes the same capabilities:

# Start a session
bash scripts/start_webview.sh --app dynamic

# Push state to the browser
bash scripts/write_state.sh '{"version":1, "status":"pending_review", "title":"Review Changes", ...}'

# Block until the human responds (up to 5 minutes)
ACTIONS=$(bash scripts/wait_for_action.sh --timeout 300)

# Clean up
bash scripts/stop_webview.sh

Script	Purpose
`start_webview.sh --app <name> [--port N]`	Launch server and open browser
`write_state.sh '<json>'`	Atomic state write
`wait_for_action.sh [--timeout N]`	Block until human acts
`read_actions.sh [--clear]`	Read actions, optionally clear
`stop_webview.sh`	Graceful shutdown
`init_webview_app.sh <name>`	Scaffold a custom app

How It Works Under the Hood

The architecture is deliberately simple. Three JSON files in a .openwebgoggles/ directory are the entire interface between the agent and the browser.

File	Direction	Purpose
`state.json`	Agent → Browser	What to show: data, UI schema, requested actions
`actions.json`	Browser → Agent	What the human decided
`manifest.json`	Shared	Session config: ports, app name, auth token

The Python server watches these files and pushes updates to the browser over WebSocket in real time. The browser renders the UI and writes responses back. The agent reads the response file and continues.

This means you can debug the entire system by looking at three JSON files. No hidden state, no message queues, no databases. If something looks wrong in the browser, cat .openwebgoggles/state.json and you'll see exactly what the agent sent.

The Dynamic Renderer

Most use cases don't require custom HTML. The built-in dynamic app takes a JSON schema and renders a complete, styled interface.

Section types: text, items, form, actions

Form field types: text, textarea, number, select, checkbox, email, url, static

Action styles: primary, success, danger, warning, ghost, approve, reject, submit, delete

You can combine these to build approval flows, configuration wizards, data entry forms, triage interfaces — really any structured interaction that runs on fields, selections, and decisions. For 80% of use cases, you never touch HTML.

Custom Apps

When the dynamic renderer isn't enough — complex visualizations, custom layouts, domain-specific interactions — you can build a custom app:

bash scripts/init_webview_app.sh my-dashboard

This scaffolds index.html, app.js, and style.css with the SDK already wired up. The client SDK is vanilla JavaScript with zero dependencies:

const wv = new OpenWebGoggles();
await wv.connect();

// Listen for state updates from the agent
wv.onStateUpdate((state) => {
  // Render however you want
});

// Send structured responses back
await wv.approve("action-id", { comment: "Looks good" });
await wv.reject("action-id");
await wv.submitInput("field-id", "user input");
await wv.sendAction("custom-id", "custom", { any: "data" });

Two working examples are included in examples/:

approval-review — Code review UI with unified diffs, per-file toggles, approve/reject with comments
security-qa — Step-by-step security findings triage with editable fields, severity dropdowns, and a progress bar

These aren't toy demos. They're functional interfaces that handle real workflows. Start by reading their source if you're building something custom.

Patterns That Work Well

Single approval. Agent shows a summary, human clicks approve or reject. The simplest case, and probably the most common.

Multi-step wizard. For N items that need review, show one at a time. The agent calls webview_ask in a loop, advancing to the next item after each response. This avoids overwhelming the user with a wall of decisions.

Live dashboard. Agent calls webview_show (non-blocking) to display progress, then updates state periodically. Useful for long-running operations where the human wants visibility but doesn't need to act.

Batch triage. Show all items at once with per-item actions — tabs, cards, or a list with inline controls. Works well when the total count is under 10 or so.

Security

The trust model is straightforward: the agent and the browser are on the same machine, and nobody else should be able to read or tamper with the communication between them.

Nine defense layers enforce this, all enabled by default:

Localhost-only binding — the server only listens on 127.0.0.1
Bearer token auth — 32-byte session token, constant-time comparison
WebSocket first-message auth — token verified before any data flows
Ed25519 signatures — server signs every state update (cryptographic proof of origin)
HMAC-SHA256 — browser signs every action (tamper detection)
Nonce replay prevention — each action can only be submitted once
Content Security Policy — per-request nonce blocks inline script injection
SecurityGate — 22 XSS patterns, zero-width character detection, schema validation
Rate limiting — 30 actions per minute per session

All cryptographic keys are ephemeral — generated in memory at session start, zeroed on shutdown, never written to disk in plaintext. The test suite covers OWASP Top 10, MITRE ATT&CK techniques, and LLM-specific attack vectors across 471 tests.

The tradeoff is real, though. This level of defense adds complexity to the codebase. If you're running in a fully trusted local environment and want to understand what each layer does, the security tests are the best documentation.

Development

# Run the full test suite
python -m pytest -v

# Lint
ruff check scripts/

Python 3.11+ required. Core dependencies: websockets, PyNaCl, mcp.

Reference Documentation

For the full details:

Data Contract — JSON file formats, state lifecycle, status values
SDK API — Complete client SDK reference
Integration Guide — Step-by-step patterns for connecting from other tools

License

Apache License 2.0 — see LICENSE.

Built by Techtoboggan.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

techtoboggan

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.17.19

Apr 25, 2026

0.17.18

Apr 25, 2026

0.17.17

Apr 9, 2026

0.17.16

Apr 9, 2026

0.17.15

Mar 29, 2026

0.17.11

Mar 29, 2026

0.17.10

Mar 28, 2026

0.17.9

Mar 28, 2026

0.17.8

Mar 27, 2026

0.17.7

Mar 27, 2026

0.17.6

Mar 27, 2026

0.17.5

Mar 27, 2026

0.17.4

Mar 27, 2026

0.17.3

Mar 27, 2026

0.17.2

Mar 27, 2026

0.17.1

Mar 26, 2026

0.17.0

Mar 25, 2026

0.16.1

Mar 10, 2026

0.16.0

Mar 10, 2026

0.15.0

Mar 8, 2026

0.14.2

Mar 4, 2026

0.13.0

Mar 2, 2026

0.12.5

Mar 2, 2026

0.12.4

Mar 2, 2026

0.12.3

Mar 2, 2026

0.12.2

Mar 2, 2026

0.12.1

Mar 2, 2026

0.11.0

Mar 1, 2026

0.8.2

Mar 1, 2026

0.8.1

Feb 28, 2026

0.8.0

Feb 27, 2026

0.7.1

Feb 27, 2026

0.7.0

Feb 27, 2026

0.6.0

Feb 26, 2026

0.5.0

Feb 26, 2026

0.4.3

Feb 26, 2026

0.4.2

Feb 26, 2026

0.4.1

Feb 26, 2026

0.4.0

Feb 26, 2026

0.3.1

Feb 26, 2026

This version

0.3.0

Feb 26, 2026

0.2.0

Feb 26, 2026

0.1.0

Feb 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openwebgoggles-0.3.0.tar.gz (468.3 kB view details)

Uploaded Feb 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

openwebgoggles-0.3.0-py3-none-any.whl (440.9 kB view details)

Uploaded Feb 26, 2026 Python 3

File details

Details for the file openwebgoggles-0.3.0.tar.gz.

File metadata

Download URL: openwebgoggles-0.3.0.tar.gz
Upload date: Feb 26, 2026
Size: 468.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for openwebgoggles-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`468a9d38da3dfd2a7898f3bc456f96964d7d3ee8539cc578a471b78f423173fd`
MD5	`f6937d81c81fc66f97bb1e07a8b4f814`
BLAKE2b-256	`7bf2003578fcc0350e705255ab70ec242229d8d7e709c64fcc24b1778fbe7b19`

See more details on using hashes here.

Provenance

The following attestation bundles were made for openwebgoggles-0.3.0.tar.gz:

Publisher: publish.yml on techtoboggan/openwebgoggles

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: openwebgoggles-0.3.0.tar.gz
- Subject digest: 468a9d38da3dfd2a7898f3bc456f96964d7d3ee8539cc578a471b78f423173fd
- Sigstore transparency entry: 995146517
- Sigstore integration time: Feb 26, 2026
Source repository:
- Permalink: techtoboggan/openwebgoggles@a650ebd77e46f1f9c20cda106b6129ee1fc802bb
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/techtoboggan
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@a650ebd77e46f1f9c20cda106b6129ee1fc802bb
- Trigger Event: release

File details

Details for the file openwebgoggles-0.3.0-py3-none-any.whl.

File metadata

Download URL: openwebgoggles-0.3.0-py3-none-any.whl
Upload date: Feb 26, 2026
Size: 440.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for openwebgoggles-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f6ca1033f1fe895caf30b72dff0ff85948a7c396cafe4a388cb0a32382895464`
MD5	`b470798942e9fdd06883acc2a2015d2e`
BLAKE2b-256	`c26c69118afc9c017e9117350f312a258d9add8b6a2f06ad766f172f83a5f3c7`

See more details on using hashes here.

Provenance

The following attestation bundles were made for openwebgoggles-0.3.0-py3-none-any.whl:

Publisher: publish.yml on techtoboggan/openwebgoggles

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: openwebgoggles-0.3.0-py3-none-any.whl
- Subject digest: f6ca1033f1fe895caf30b72dff0ff85948a7c396cafe4a388cb0a32382895464
- Sigstore transparency entry: 995146519
- Sigstore integration time: Feb 26, 2026
Source repository:
- Permalink: techtoboggan/openwebgoggles@a650ebd77e46f1f9c20cda106b6129ee1fc802bb
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/techtoboggan
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@a650ebd77e46f1f9c20cda106b6129ee1fc802bb
- Trigger Event: release

openwebgoggles 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

OpenWebGoggles

What This Actually Looks Like

Quick Start

Claude Code

OpenCode

Try It

What Gets Installed

Manual Setup

Bash Scripts (for shell-based agents)

How It Works Under the Hood

The Dynamic Renderer

Custom Apps

Patterns That Work Well

Security

Development

Reference Documentation

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance