Record browser sessions and reverse-engineer them into automation scripts.
Project description
Your activity, into automation.
AutomatiQ
Alpha — Things will break and change. Read VISION.md to understand why Automatiq exists and where it's headed.
AutomatiQ watches you browse, then an AI agent reverse-engineers your session into a standalone Python automation/extraction script; no manual inspection needed.
What it does
1. RECORD 2. COMPILE 3. AGENT
Browse a website ==> AI analyses video ==> LLM investigates,
normally clips & network writes & tests
requests a Python script
- Record — Opens Chrome, captures your browsing (screen video, network
requests, user actions). Press
Ctrl+Cwhen you're done. - Compile — Vision AI analyses video clips around each action; network requests are decoded, deduplicated, and structured into a workspace dump.
- Agent — An LLM investigator reads the workspace, experiments in a sandboxed IPython environment, and iteratively produces a working script.
Quick start
pip install automatiq
Set your API key (any litellm-supported model):
GEMINI_API_KEY=your-key-here
Run:
automatiq run https://example.com
That's it. Browse the site, press Ctrl+C, and the agent takes over.
Keyboard shortcuts
| Phase | Key | Action |
|---|---|---|
| Recording | Ctrl+C |
Stop recording and save session |
| Compilation | Esc |
Skip AI analysis for remaining segments |
| Compilation | y / n |
Confirm or deny the skip prompt |
| Agent | q |
Quit the agent session |
| Agent | Esc |
Cancel current LLM call or code execution |
Ctrl+C force-quits at any phase.
CLI options
| Flag | Description |
|---|---|
--model MODEL |
LiteLLM model string for the agent |
--recorder-model MODEL |
Vision model for video-clip analysis |
--base-url URL |
Custom OpenAI-compatible API endpoint |
--max-steps N |
Maximum agent loop iterations (default: 60) |
--sandbox-timeout SEC |
Seconds per IPython cell (default: 60) |
--output-dir PATH |
Root directory for all output (default: ./output) |
--no-banner |
Skip the startup animation |
--verbose |
Show detailed diagnostic output |
-V, --version |
Show version |
-h, --help |
Show help message |
How it works
- Browser capture — Chrome is launched with CDP instrumentation. Every network request, response body, cookie, and user interaction (clicks, typing, navigation) is recorded with timestamps.
- Vision analysis — The recording is split into per-action video clips. A vision LLM watches each clip and produces structured annotations (what was clicked, what changed, whether the action succeeded).
- Sandboxed agent — The investigator runs Python code in an isolated IPython worker process. It can read the captured data, test hypotheses against the live site, and build the final script incrementally, with guardrails against loops and repetition.
Configuration
On first run, AutomatiQ creates ~/.automatiq/config.toml with commented
defaults. Edit it to override models, timeouts, recording settings, etc.
[models]
agent = "gemini/gemini-3-flash-preview"
recorder = "gemini/gemini-3.1-flash-lite-preview"
# base_url = "http://localhost:11434/v1" # Ollama / LM Studio / vLLM
[agent]
max_steps = 60
sandbox_timeout = 60
[recording]
fps = 3
segment_pad = 2
merge_gap_threshold = 1.5
max_frames_per_prompt = 8
Priority: CLI flag > ~/.automatiq/config.toml > built-in defaults.
Step-by-step usage
automatiq record https://example.com # just record
automatiq agent # build automation script from last recording
Install from source
git clone https://github.com/StoneSteel27/AutomatiQ.git
cd AutomatiQ
pip install -e .
Dev setup
pip install -e ".[dev]"
pre-commit install
This installs ruff, build, twine, and pre-commit hooks (lint + format
on every commit).
Requirements
- Python 3.11+
- A supported LLM API key (Gemini, OpenAI, OpenRouter, or any
OpenAI-compatible endpoint via
--base-url)
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file automatiq-0.1.1.tar.gz.
File metadata
- Download URL: automatiq-0.1.1.tar.gz
- Upload date:
- Size: 89.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8156399f744bc8c68ec732e28ff9ee1d3a3dc5f524edac41b7c932cdb05bb3c9
|
|
| MD5 |
297ef82dfdc3a433658f9fe889cd0aec
|
|
| BLAKE2b-256 |
9bcbdf226b12639030b962f6b31e0044834062a39e0bdd90269d0270b4b0ef4e
|
File details
Details for the file automatiq-0.1.1-py3-none-any.whl.
File metadata
- Download URL: automatiq-0.1.1-py3-none-any.whl
- Upload date:
- Size: 93.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6ed9ffed90e99fef94da394bd38046c265b5853c9d1affa5b9b70f02cf59363b
|
|
| MD5 |
3994e9c84846b28803dc417f075a017f
|
|
| BLAKE2b-256 |
571c0b568a0ba5c2442d37ed19b7a717a5b33a4c9412f5ca233cd729db546648
|