GUI interaction capture - platform-agnostic event streams with time-aligned media

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

abrichr

These details have not been verified by PyPI

Project description

OpenAdapt Capture

OpenAdapt Capture is the data collection component of the OpenAdapt GUI automation ecosystem.

Capture platform-agnostic GUI interaction streams with time-aligned screenshots and audio for training ML models or replaying workflows.

Status: Pre-alpha. See docs/DESIGN.md for architecture discussion.

The OpenAdapt Ecosystem

                          OpenAdapt GUI Automation Pipeline
                          =================================

    +-----------------+          +------------------+          +------------------+
    |                 |          |                  |          |                  |
    | openadapt-      |  ------> | openadapt-ml     |  ------> |    Deploy        |
    | capture         |  Convert | (Train & Eval)   |  Export  |    (Inference)   |
    |                 |          |                  |          |                  |
    +-----------------+          +------------------+          +------------------+
          |                             |                             |
          v                             v                             v
    - Record GUI                  - Fine-tune VLMs              - Run trained
      interactions                - Evaluate on                   agent on new
    - Mouse, keyboard,              benchmarks (WAA)              tasks
      screen, audio               - Compare models              - Real-time
    - Privacy scrubbing           - Cloud GPU training            automation

Component	Purpose	Repository
openadapt-capture	Record human demonstrations	GitHub
openadapt-ml	Train and evaluate GUI automation models	GitHub
openadapt-privacy	PII scrubbing for recordings	Coming soon

Installation

uv add openadapt-capture

This includes everything needed to capture and replay GUI interactions (mouse, keyboard, screen recording).

For audio capture with Whisper transcription (large download):

uv add "openadapt-capture[audio]"

Quick Start

Capture

from openadapt_capture import Recorder

# Record GUI interactions
with Recorder("./my_capture", task_description="Demo task") as recorder:
    # Captures mouse, keyboard, and screen until context exits
    input("Press Enter to stop recording...")

print(f"Captured {recorder.event_count} events")

Replay / Analysis

from openadapt_capture import Capture

# Load and iterate over time-aligned events
capture = Capture.load("./my_capture")

for action in capture.actions():
    # Each action has an associated screenshot
    print(f"{action.timestamp}: {action.type} at ({action.x}, {action.y})")
    screenshot = action.screenshot  # PIL Image at time of action

Low-Level API

from openadapt_capture import (
    create_capture, process_events,
    MouseDownEvent, MouseButton,
)

# Create storage (platform and screen size auto-detected)
capture, storage = create_capture("./my_capture")

# Write raw events
storage.write_event(MouseDownEvent(timestamp=1.0, x=100, y=200, button=MouseButton.LEFT))

# Query and process
raw_events = storage.get_events()
actions = process_events(raw_events)  # Merges clicks, drags, typed text

Event Types

Raw events (captured):

mouse.move, mouse.down, mouse.up, mouse.scroll
key.down, key.up
screen.frame, audio.chunk

Actions (processed):

mouse.singleclick, mouse.doubleclick, mouse.drag
key.type (merged keystrokes → text)

Architecture

capture_directory/
├── capture.db      # SQLite: events, metadata
├── video.mp4       # Screen recording
└── audio.flac      # Audio (optional)

Performance Statistics

Track event write latency and analyze capture performance:

from openadapt_capture import Recorder

with Recorder("./my_capture") as recorder:
    input("Press Enter to stop...")

# Access performance statistics
summary = recorder.stats.summary()
print(f"Mean latency: {summary['mean_latency_ms']:.1f}ms")

# Generate performance plot
recorder.stats.plot(output_path="performance.png")

Performance Statistics

Frame Extraction Verification

Compare extracted video frames against original images to verify lossless capture:

from openadapt_capture import compare_video_to_images, plot_comparison

# Compare frames
report = compare_video_to_images(
    "capture/video.mp4",
    [(timestamp, image) for timestamp, image in captured_frames],
)

print(f"Mean diff: {report.mean_diff_overall:.2f}")
print(f"Lossless: {report.is_lossless}")

# Visualize comparison
plot_comparison(report, output_path="comparison.png")

Frame Comparison

Visualization

Generate animated demos and interactive viewers from recordings:

Animated GIF Demo

from openadapt_capture import Capture, create_demo

capture = Capture.load("./my_capture")
create_demo(capture, output="demo.gif", fps=10, max_duration=15)

Interactive HTML Viewer

from openadapt_capture import Capture, create_html

capture = Capture.load("./my_capture")
create_html(capture, output="viewer.html", include_audio=True)

The HTML viewer includes:

Timeline scrubber with event markers
Frame-by-frame navigation
Synchronized audio playback
Event list with details panel
Keyboard shortcuts (Space, arrows, Home/End)

Capture Viewer

Generate Demo from Command Line

uv run python scripts/generate_readme_demo.py --duration 10

Optional Extras

Extra	Features
`audio`	Audio capture + Whisper transcription
`privacy`	PII scrubbing (openadapt-privacy)
`all`	Everything

Training with OpenAdapt-ML

Captured recordings can be used to train vision-language models with openadapt-ml.

End-to-End Workflow

# 1. Capture a workflow demonstration
uv run python -c "
from openadapt_capture import Recorder

with Recorder('./my_capture', task_description='Turn off Night Shift') as recorder:
    input('Perform the task, then press Enter to stop...')
"

# 2. Train a model on the capture (requires openadapt-ml)
uv pip install openadapt-ml
uv run python -m openadapt_ml.cloud.local train \
  --capture ./my_capture \
  --open  # Opens training dashboard

# 3. Compare human vs model predictions
uv run python -m openadapt_ml.scripts.compare \
  --capture ./my_capture \
  --checkpoint checkpoints/model \
  --open

Cloud GPU Training

For faster training with cloud GPUs:

# Train on Lambda Labs A10 (~$0.75/hr)
uv run python -m openadapt_ml.cloud.lambda_labs train \
  --capture ./my_capture \
  --goal "Turn off Night Shift"

See the openadapt-ml documentation for cloud setup.

Data Format

OpenAdapt-ML converts captures to its Episode format automatically:

from openadapt_ml.ingest.capture import capture_to_episode

episode = capture_to_episode("./my_capture")
print(f"Loaded {len(episode.steps)} steps")
print(f"Instruction: {episode.instruction}")

The conversion maps capture event types to ML action types:

mouse.singleclick / mouse.click -> CLICK
mouse.doubleclick -> DOUBLE_CLICK
mouse.drag -> DRAG
mouse.scroll -> SCROLL
key.type -> TYPE

Development

uv sync --dev
uv run pytest

Related Projects

openadapt-ml - Train and evaluate GUI automation models
Windows Agent Arena - Benchmark for Windows GUI agents

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

abrichr

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.5.2

Mar 17, 2026

0.5.1

Mar 17, 2026

0.5.0

Mar 4, 2026

0.4.0

Mar 3, 2026

0.3.0

Feb 6, 2026

This version

0.2.0

Jan 29, 2026

0.1.0

Dec 13, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openadapt_capture-0.2.0.tar.gz (10.9 MB view details)

Uploaded Jan 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

openadapt_capture-0.2.0-py3-none-any.whl (69.6 kB view details)

Uploaded Jan 29, 2026 Python 3

File details

Details for the file openadapt_capture-0.2.0.tar.gz.

File metadata

Download URL: openadapt_capture-0.2.0.tar.gz
Upload date: Jan 29, 2026
Size: 10.9 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for openadapt_capture-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`8df9779779d506bbf91610c2b30717cb68fcd59db5f280fad6cc00b502b30c15`
MD5	`de3a677cb400fde9e8f6bbdca89ea785`
BLAKE2b-256	`dfad7e1d47e4adce1928bc7d8d80e9261b6548d6817c64d4f59ef0234caf87f6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for openadapt_capture-0.2.0.tar.gz:

Publisher: release.yml on OpenAdaptAI/openadapt-capture

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: openadapt_capture-0.2.0.tar.gz
- Subject digest: 8df9779779d506bbf91610c2b30717cb68fcd59db5f280fad6cc00b502b30c15
- Sigstore transparency entry: 871196410
- Sigstore integration time: Jan 29, 2026
Source repository:
- Permalink: OpenAdaptAI/openadapt-capture@93cdbb8ff6f87a6ad96dc74d8092bbad58a34d51
- Branch / Tag: refs/heads/main
- Owner: https://github.com/OpenAdaptAI
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@93cdbb8ff6f87a6ad96dc74d8092bbad58a34d51
- Trigger Event: push

File details

Details for the file openadapt_capture-0.2.0-py3-none-any.whl.

File metadata

Download URL: openadapt_capture-0.2.0-py3-none-any.whl
Upload date: Jan 29, 2026
Size: 69.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for openadapt_capture-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`811fd74974ff26d64a3c6cb1f7b51e21ce61b41af1c4cb5abb0be3f7453e2e19`
MD5	`e2e10b62de25c70e030244c74b32e67d`
BLAKE2b-256	`489fc6841ea4bd0fb64621ed9779a3ba83b71427a06e98a131afc54927e84782`

See more details on using hashes here.

Provenance

The following attestation bundles were made for openadapt_capture-0.2.0-py3-none-any.whl:

Publisher: release.yml on OpenAdaptAI/openadapt-capture

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: openadapt_capture-0.2.0-py3-none-any.whl
- Subject digest: 811fd74974ff26d64a3c6cb1f7b51e21ce61b41af1c4cb5abb0be3f7453e2e19
- Sigstore transparency entry: 871196415
- Sigstore integration time: Jan 29, 2026
Source repository:
- Permalink: OpenAdaptAI/openadapt-capture@93cdbb8ff6f87a6ad96dc74d8092bbad58a34d51
- Branch / Tag: refs/heads/main
- Owner: https://github.com/OpenAdaptAI
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@93cdbb8ff6f87a6ad96dc74d8092bbad58a34d51
- Trigger Event: push

openadapt-capture 0.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

OpenAdapt Capture

The OpenAdapt Ecosystem

Installation

Quick Start

Capture

Replay / Analysis

Low-Level API

Event Types

Architecture

Performance Statistics

Frame Extraction Verification

Visualization

Animated GIF Demo

Interactive HTML Viewer

Generate Demo from Command Line

Optional Extras

Training with OpenAdapt-ML

End-to-End Workflow

Cloud GPU Training

Data Format

Development

Related Projects

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance