GUI Automation Framework

These details have not been verified by PyPI

Project links

Development Status
- 2 - Pre-Alpha
Environment
Operating System
- OS Independent
Programming Language
- Python :: 3.10

Project description

AutoControl

AutoControl is a cross-platform Python GUI automation framework providing mouse control, keyboard input, image recognition, screen capture, action scripting, and report generation — all through a unified API that works on Windows, macOS, and Linux (X11).

繁體中文 | 简体中文

Features
Architecture
Installation
Requirements
Quick Start
Command-Line Interface
Platform Support
Development
License

Features

Mouse Automation — move, click, press, release, drag, and scroll with precise coordinate control
Keyboard Automation — press/release individual keys, type strings, hotkey combinations, key state detection
Image Recognition — locate UI elements on screen using OpenCV template matching with configurable threshold
Accessibility Element Finder — query the OS accessibility tree (Windows UIA / macOS AX) to locate buttons, menus, and controls by name/role
AI Element Locator (VLM) — describe a UI element in plain language and let a vision-language model (Anthropic / OpenAI) find its screen coordinates
OCR — extract text from screen regions using Tesseract; wait for, click, or locate rendered text
Clipboard — read/write system clipboard text on Windows, macOS, and Linux
Screenshot & Screen Recording — capture full screen or regions as images, record screen to video (AVI/MP4)
Action Recording & Playback — record mouse/keyboard events and replay them
JSON-Based Action Scripting — define and execute automation flows using JSON action files (dry-run + step debug)
Scheduler — run scripts on an interval or cron expression; jobs persist across restarts
Global Hotkey Daemon — bind OS-level hotkeys to action scripts (Windows today; macOS/Linux stubs in place)
Event Triggers — fire scripts when an image appears, a window opens, a pixel changes, or a file is modified
Run History — SQLite-backed run log across scheduler / triggers / hotkeys / REST with auto error-screenshot artifacts
Report Generation — export test records as HTML, JSON, or XML reports with success/failure status
MCP Server — JSON-RPC 2.0 Model Context Protocol server (stdio + HTTP/SSE) so Claude Desktop / Claude Code / custom tool-use loops can drive AutoControl. ~90 tools, full protocol coverage (resources, prompts, sampling, roots, logging, progress, cancellation, elicitation), bearer-token auth + TLS, audit log, rate limit, plugin hot-reload, CI fake backend
Remote Automation — TCP socket server and REST API server to receive automation commands
Plugin Loader — drop .py files exposing AC_* callables into a directory and register them as executor commands at runtime
Shell Integration — execute shell commands within automation workflows with async output capture
Callback Executor — trigger automation functions with callback hooks for chaining operations
Dynamic Package Loading — extend the executor at runtime by importing external Python packages
Project & Template Management — scaffold automation projects with keyword/executor directory structure
Window Management — send keyboard/mouse events directly to specific windows (Windows/Linux)
GUI Application — built-in PySide6 graphical interface with live language switching (English / 繁體中文 / 简体中文 / 日本語)
CLI Runner — python -m je_auto_control.cli run|list-jobs|start-server|start-rest
Cross-Platform — unified API across Windows, macOS, and Linux (X11)

Architecture

The runtime is layered: client surfaces (CLI, GUI, MCP/REST/socket servers) sit on top of the headless API (wrapper/ + utils/), which resolves to a per-OS backend chosen at import time by wrapper/platform_wrapper.py. The package façade (je_auto_control/__init__.py) re-exports every public name so users need only import je_auto_control regardless of which surface or backend they hit.

flowchart LR
    subgraph Clients["Client Surfaces"]
        direction TB
        Claude[["Claude Desktop /<br/>Claude Code"]]
        APIUser[["Custom Anthropic /<br/>OpenAI tool loops"]]
        HTTPClient[["HTTP / SSE clients"]]
        TCPClient[["Socket / REST clients"]]
        GUIUser[["PySide6 GUI"]]
        CLIUser[["python -m<br/>je_auto_control[.cli]"]]
        Library[["Library users<br/>(import je_auto_control)"]]
    end

    subgraph Transports["Transports & Servers"]
        direction TB
        Stdio["MCP stdio<br/>JSON-RPC 2.0"]
        HTTPMCP["MCP HTTP /<br/>SSE + auth + TLS"]
        REST["REST server<br/>:9939"]
        Socket["Socket server<br/>:9938"]
    end

    subgraph MCP["mcp_server/"]
        direction TB
        Dispatcher["MCPServer<br/>(JSON-RPC dispatcher)"]
        Tools["tools/<br/>~90 ac_* + aliases"]
        Resources["resources/<br/>files · history ·<br/>commands · screen-live"]
        Prompts["prompts/<br/>built-in templates"]
        Context["context · audit ·<br/>rate-limit · log-bridge"]
        FakeBE["fake_backend<br/>(CI smoke)"]
    end

    subgraph Core["Headless Core (wrapper/ + utils/)"]
        direction TB
        Wrapper["wrapper/<br/>mouse · keyboard · screen ·<br/>image · record · window"]
        Executor["executor/<br/>AC_* JSON action engine"]
        Vision["vision/ · ocr/ ·<br/>accessibility/"]
        Recorder["scheduler/ · triggers/ ·<br/>hotkey/ · plugin_loader/<br/>run_history/"]
        IOUtils["clipboard/ · cv2_utils/ ·<br/>shell_process/ · json/"]
    end

    subgraph Backends["Per-OS Backends"]
        direction TB
        Win["windows/<br/>Win32 ctypes"]
        Mac["osx/<br/>pyobjc · Quartz"]
        X11["linux_with_x11/<br/>python-Xlib"]
    end

    Claude --> Stdio
    APIUser --> Stdio
    HTTPClient --> HTTPMCP
    TCPClient --> Socket
    TCPClient --> REST

    Stdio --> Dispatcher
    HTTPMCP --> Dispatcher
    Dispatcher --> Tools
    Dispatcher --> Resources
    Dispatcher --> Prompts
    Dispatcher -.- Context
    Tools -.optional.-> FakeBE

    Tools --> Wrapper
    Tools --> Executor
    Tools --> Vision
    Tools --> Recorder
    Tools --> IOUtils
    Resources --> Recorder
    Resources --> Wrapper

    REST --> Executor
    Socket --> Executor

    GUIUser --> Wrapper
    GUIUser --> Recorder
    CLIUser --> Executor
    Library --> Wrapper
    Library --> Executor

    Wrapper --> Backends
    Vision -.- Wrapper
    Recorder -.- Executor

je_auto_control/
├── wrapper/                    # Platform-agnostic API layer
│   ├── platform_wrapper.py     # Auto-detects OS and loads the correct backend
│   ├── auto_control_mouse.py   # Mouse operations
│   ├── auto_control_keyboard.py# Keyboard operations
│   ├── auto_control_image.py   # Image recognition (OpenCV template matching)
│   ├── auto_control_screen.py  # Screenshot, screen size, pixel color
│   ├── auto_control_window.py  # Cross-platform window manager facade
│   └── auto_control_record.py  # Action recording/playback
├── windows/                    # Windows-specific backend (Win32 API / ctypes)
├── osx/                        # macOS-specific backend (pyobjc / Quartz)
├── linux_with_x11/             # Linux-specific backend (python-Xlib)
├── gui/                        # PySide6 GUI application
└── utils/
    ├── mcp_server/             # MCP server (stdio + HTTP/SSE) — server, tools/, resources, prompts, audit, rate_limit, fake_backend, plugin_watcher
    ├── executor/               # JSON action executor engine
    ├── callback/               # Callback function executor
    ├── cv2_utils/              # OpenCV screenshot, template matching, video recording
    ├── accessibility/          # UIA (Windows) / AX (macOS) element finder
    ├── vision/                 # VLM-based locator (Anthropic / OpenAI backends)
    ├── ocr/                    # Tesseract-backed text locator
    ├── clipboard/              # Cross-platform clipboard (text + image)
    ├── scheduler/              # Interval + cron scheduler
    ├── hotkey/                 # Global hotkey daemon
    ├── triggers/               # Image/window/pixel/file triggers
    ├── run_history/            # SQLite run log + error-screenshot artifacts
    ├── rest_api/               # Stdlib HTTP/REST server
    ├── plugin_loader/          # Dynamic AC_* plugin discovery
    ├── socket_server/          # TCP socket server for remote automation
    ├── shell_process/          # Shell command manager
    ├── generate_report/        # HTML / JSON / XML report generators
    ├── test_record/            # Test action recording
    ├── script_vars/            # Script variable interpolation
    ├── watcher/                # Mouse / pixel / log watchers (Live HUD)
    ├── recording_edit/         # Trim, filter, re-scale recorded actions
    ├── json/                   # JSON action file read/write
    ├── project/                # Project scaffolding & templates
    ├── package_manager/        # Dynamic package loading
    ├── logging/                # Logging
    └── exception/              # Custom exception classes

The platform_wrapper.py module automatically detects the current operating system and imports the corresponding backend, so all wrapper functions work identically regardless of platform.

Installation

Basic Installation

pip install je_auto_control

With GUI Support (PySide6)

pip install je_auto_control[gui]

Linux Prerequisites

On Linux, install the following system packages before installing:

sudo apt-get install cmake libssl-dev

Requirements

Python >= 3.10
pip >= 19.3

Dependencies

Package	Purpose
`je_open_cv`	Image recognition (OpenCV template matching)
`pillow`	Screenshot capture
`mss`	Fast multi-monitor screenshot
`pyobjc`	macOS backend (auto-installed on macOS)
`python-Xlib`	Linux X11 backend (auto-installed on Linux)
`PySide6`	GUI application (optional, install with `[gui]`)
`qt-material`	GUI theme (optional, install with `[gui]`)
`uiautomation`	Windows accessibility backend (optional, loaded on demand)
`pytesseract` + Tesseract	OCR engine (optional, loaded on demand)
`anthropic`	VLM locator — Anthropic backend (optional, loaded on demand)
`openai`	VLM locator — OpenAI backend (optional, loaded on demand)

See Third_Party_License.md for a full list of third-party components and their licenses.

Quick Start

Mouse Control

import je_auto_control

# Get current mouse position
x, y = je_auto_control.get_mouse_position()
print(f"Mouse at: ({x}, {y})")

# Move mouse to coordinates
je_auto_control.set_mouse_position(500, 300)

# Left click at current position (use key name)
je_auto_control.click_mouse("mouse_left")

# Right click at specific coordinates
je_auto_control.click_mouse("mouse_right", x=800, y=400)

# Scroll down
je_auto_control.mouse_scroll(scroll_value=5)

Keyboard Control

import je_auto_control

# Press and release a single key
je_auto_control.type_keyboard("a")

# Type a whole string character by character
je_auto_control.write("Hello World")

# Hotkey combination (e.g., Ctrl+C)
je_auto_control.hotkey(["ctrl_l", "c"])

# Check if a key is currently pressed
is_pressed = je_auto_control.check_key_is_press("shift_l")

Image Recognition

import je_auto_control

# Find all occurrences of an image on screen
positions = je_auto_control.locate_all_image("button.png", detect_threshold=0.9)
# Returns: [[x1, y1, x2, y2], ...]

# Find a single image and get its center coordinates
cx, cy = je_auto_control.locate_image_center("icon.png", detect_threshold=0.85)
print(f"Found at: ({cx}, {cy})")

# Find an image and automatically click it
je_auto_control.locate_and_click("submit_button.png", mouse_keycode="mouse_left")

Accessibility Element Finder

Query the OS accessibility tree to locate controls by name, role, or app. Works on Windows (UIA, via uiautomation) and macOS (AX).

import je_auto_control

# List all visible buttons in the Calculator app
elements = je_auto_control.list_accessibility_elements(app_name="Calculator")

# Find a specific element
ok = je_auto_control.find_accessibility_element(name="OK", role="Button")
if ok is not None:
    print(ok.bounds, ok.center)

# Click it directly
je_auto_control.click_accessibility_element(name="OK", app_name="Calculator")

Raises AccessibilityNotAvailableError if no accessibility backend is installed for the current platform.

AI Element Locator (VLM)

When template matching and accessibility both fail, describe the element in plain language and let a vision-language model find its coordinates.

import je_auto_control

# Uses Anthropic by default if ANTHROPIC_API_KEY is set, else OpenAI.
x, y = je_auto_control.locate_by_description("the green Submit button")

# Or click it in one shot
je_auto_control.click_by_description(
    "the cookie-banner 'Accept all' button",
    screen_region=[0, 800, 1920, 1080],   # optional crop
)

Configuration (environment variables only — keys are never persisted or logged):

Variable	Effect
`ANTHROPIC_API_KEY`	Enables the Anthropic backend
`OPENAI_API_KEY`	Enables the OpenAI backend
`AUTOCONTROL_VLM_BACKEND`	`anthropic` or `openai` to force a backend
`AUTOCONTROL_VLM_MODEL`	Override the default model (e.g. `claude-opus-4-7`, `gpt-4o-mini`)

Raises VLMNotAvailableError if neither SDK is installed or no API key is set.

OCR (Text on Screen)

import je_auto_control as ac

# Locate all matches of a piece of text
matches = ac.find_text_matches("Submit")

# Center of the first match, or None
cx, cy = ac.locate_text_center("Submit")

# Click text in one call
ac.click_text("Submit")

# Block until text appears (or timeout)
ac.wait_for_text("Loading complete", timeout=15.0)

If Tesseract is not on PATH, point at it explicitly:

ac.set_tesseract_cmd(r"C:\Program Files\Tesseract-OCR\tesseract.exe")

Clipboard

import je_auto_control as ac
ac.set_clipboard("hello")
text = ac.get_clipboard()

Backends: Windows (Win32 via ctypes), macOS (pbcopy/pbpaste), Linux (xclip or xsel).

Screenshot

import je_auto_control

# Take a full-screen screenshot and save to file
je_auto_control.pil_screenshot("screenshot.png")

# Take a screenshot of a specific region [x1, y1, x2, y2]
je_auto_control.pil_screenshot("region.png", screen_region=[100, 100, 500, 400])

# Get screen resolution
width, height = je_auto_control.screen_size()

# Get pixel color at coordinates
color = je_auto_control.get_pixel(500, 300)

Action Recording & Playback

import je_auto_control
import time

# Start recording mouse and keyboard events
je_auto_control.record()

time.sleep(10)  # Record for 10 seconds

# Stop recording and get the action list
actions = je_auto_control.stop_record()

# Replay the recorded actions
je_auto_control.execute_action(actions)

JSON Action Scripting

Create a JSON action file (actions.json):

[
    ["AC_set_mouse_position", {"x": 500, "y": 300}],
    ["AC_click_mouse", {"mouse_keycode": "mouse_left"}],
    ["AC_write", {"write_string": "Hello from AutoControl"}],
    ["AC_screenshot", {"file_path": "result.png"}],
    ["AC_hotkey", {"key_code_list": ["ctrl_l", "s"]}]
]

Execute it:

import je_auto_control

# Execute from file
je_auto_control.execute_action(je_auto_control.read_action_json("actions.json"))

# Or execute from a list directly
je_auto_control.execute_action([
    ["AC_set_mouse_position", {"x": 100, "y": 200}],
    ["AC_click_mouse", {"mouse_keycode": "mouse_left"}]
])

Available action commands:

Category	Commands
Mouse	`AC_click_mouse`, `AC_set_mouse_position`, `AC_get_mouse_position`, `AC_get_mouse_table`, `AC_press_mouse`, `AC_release_mouse`, `AC_mouse_scroll`, `AC_mouse_left`, `AC_mouse_right`, `AC_mouse_middle`
Keyboard	`AC_type_keyboard`, `AC_press_keyboard_key`, `AC_release_keyboard_key`, `AC_write`, `AC_hotkey`, `AC_check_key_is_press`, `AC_get_keyboard_keys_table`
Image	`AC_locate_all_image`, `AC_locate_image_center`, `AC_locate_and_click`
Screen	`AC_screen_size`, `AC_screenshot`
Accessibility	`AC_a11y_list`, `AC_a11y_find`, `AC_a11y_click`
VLM (AI Locator)	`AC_vlm_locate`, `AC_vlm_click`
OCR	`AC_locate_text`, `AC_click_text`, `AC_wait_text`
Clipboard	`AC_clipboard_get`, `AC_clipboard_set`
Window	`AC_list_windows`, `AC_focus_window`, `AC_wait_window`, `AC_close_window`
Flow control	`AC_loop`, `AC_break`, `AC_continue`, `AC_if_image_found`, `AC_if_pixel`, `AC_while_image`, `AC_wait_image`, `AC_wait_pixel`, `AC_sleep`, `AC_retry`
Record	`AC_record`, `AC_stop_record`, `AC_set_record_enable`
Report	`AC_generate_html`, `AC_generate_json`, `AC_generate_xml`, `AC_generate_html_report`, `AC_generate_json_report`, `AC_generate_xml_report`
Run history	`AC_history_list`, `AC_history_clear`
Project	`AC_create_project`
Shell	`AC_shell_command`
Process	`AC_execute_process`
Executor	`AC_execute_action`, `AC_execute_files`, `AC_add_package_to_executor`, `AC_add_package_to_callback_executor`
MCP server	`AC_start_mcp_server`, `AC_start_mcp_http_server`

MCP Server (Use AutoControl from Claude)

Expose AutoControl as a Model Context Protocol server so any MCP-compatible client (Claude Desktop, Claude Code, custom Anthropic / OpenAI tool-use loops) can drive the host machine. Stdlib-only — JSON-RPC 2.0 over stdio or HTTP+SSE.

Register with Claude Code:

claude mcp add autocontrol -- python -m je_auto_control.utils.mcp_server

Register with Claude Desktop (claude_desktop_config.json):

{
  "mcpServers": {
    "autocontrol": {
      "command": "python",
      "args": ["-m", "je_auto_control.utils.mcp_server"]
    }
  }
}

Start programmatically:

import je_auto_control as ac

# Stdio (blocks until stdin closes)
ac.start_mcp_stdio_server()

# Or HTTP / SSE with bearer-token auth + optional TLS
ac.start_mcp_http_server(host="127.0.0.1", port=9940,
                         auth_token="hunter2")

Inspect the catalogue without starting the server:

je_auto_control_mcp --list-tools
je_auto_control_mcp --list-tools --read-only
je_auto_control_mcp --list-resources
je_auto_control_mcp --list-prompts

What ships:

Surface	Coverage
Tools (~90)	mouse · keyboard · drag · screen / multi-monitor · screenshot-as-image · diff · OCR · image · windows (move/min/max/restore/...) · clipboard text+image · process / shell · recording · screen recording · scheduler / triggers / hotkeys · accessibility tree · VLM locator · executor · history
Aliases	`click`, `type`, `screenshot`, `find_image`, `drag`, `shell`, `wait_image`, ... — toggle with `JE_AUTOCONTROL_MCP_ALIASES=0`
Resources	`autocontrol://files/<name>`, `autocontrol://history`, `autocontrol://commands`, `autocontrol://screen/live` (with `resources/subscribe`)
Prompts	`automate_ui_task`, `record_and_generalize`, `compare_screenshots`, `find_widget`, `explain_action_file`
Protocol	tools / resources / prompts / sampling / roots / logging / progress / cancellation / list_changed / elicitation
Transports	stdio, HTTP `POST /mcp`, SSE streaming when `Accept: text/event-stream`
Safety	tool annotations · `JE_AUTOCONTROL_MCP_READONLY` · `JE_AUTOCONTROL_MCP_CONFIRM_DESTRUCTIVE` · audit log · token-bucket rate limiter · auto-screenshot on error
Ops	bearer-token auth · TLS via `ssl_context` · `PluginWatcher` hot-reload · `JE_AUTOCONTROL_FAKE_BACKEND=1` for CI

See docs/source/Eng/doc/mcp_server/mcp_server_doc.rst for the full reference (or the 繁體中文 version).

⚠️ The MCP server can move the mouse, send keystrokes, capture the screen, and execute arbitrary AC_* actions. Only register it with MCP clients you trust. HTTP defaults to 127.0.0.1; binding to 0.0.0.0 requires explicit reason and must be paired with auth_token plus ssl_context.

Scheduler (Interval & Cron)

import je_auto_control as ac

# Interval job — run every 30 seconds
job = ac.default_scheduler.add_job(
    script_path="scripts/poll.json", interval_seconds=30, repeat=True,
)

# Cron job — 09:00 on weekdays (minute hour dom month dow)
cron_job = ac.default_scheduler.add_cron_job(
    script_path="scripts/daily.json", cron_expression="0 9 * * 1-5",
)

ac.default_scheduler.start()

Both flavours coexist; job.is_cron tells them apart.

Global Hotkey Daemon

Bind OS-level hotkeys to action JSON scripts (Windows backend today; macOS / Linux raise NotImplementedError on start() with Strategy- pattern seams in place).

from je_auto_control import default_hotkey_daemon

default_hotkey_daemon.bind("ctrl+alt+1", "scripts/greet.json")
default_hotkey_daemon.start()

Event Triggers

Poll-based triggers that fire a script when a condition becomes true:

from je_auto_control import (
    default_trigger_engine, ImageAppearsTrigger,
    WindowAppearsTrigger, PixelColorTrigger, FilePathTrigger,
)

default_trigger_engine.add(ImageAppearsTrigger(
    trigger_id="", script_path="scripts/click_ok.json",
    image_path="templates/ok_button.png", threshold=0.85, repeat=True,
))
default_trigger_engine.start()

Run History

Every run from the scheduler, trigger engine, hotkey daemon, REST API, and manual GUI replay is recorded to ~/.je_auto_control/history.db. Errors automatically attach a screenshot under ~/.je_auto_control/artifacts/run_{id}_{ms}.png for post-mortem.

from je_auto_control import default_history_store

for run in default_history_store.list_runs(limit=20):
    print(run.id, run.source, run.status, run.artifact_path)

The GUI Run History tab exposes filter/refresh/clear and double-click-to-open on the artifact column.

Report Generation

import je_auto_control

# Enable test recording first
je_auto_control.test_record_instance.set_record_enable(True)

# ... perform automation actions ...
je_auto_control.set_mouse_position(100, 200)
je_auto_control.click_mouse("mouse_left")

# Generate reports
je_auto_control.generate_html_report("test_report")   # -> test_report.html
je_auto_control.generate_json_report("test_report")   # -> test_report.json
je_auto_control.generate_xml_report("test_report")    # -> test_report.xml

# Or get report content as string
html_string = je_auto_control.generate_html()
json_string = je_auto_control.generate_json()
xml_string = je_auto_control.generate_xml()

Reports include: function name, parameters, timestamp, and exception info (if any) for each recorded action. HTML reports display successful actions in cyan and failed actions in red.

Remote Automation (Socket / REST)

Two servers are available — a raw TCP socket and a stdlib HTTP/REST server. Both default to 127.0.0.1; binding to 0.0.0.0 is an explicit, documented opt-in.

import je_auto_control as ac

# TCP socket server (default: 127.0.0.1:9938)
ac.start_autocontrol_socket_server(host="127.0.0.1", port=9938)

# REST API server (default: 127.0.0.1:9939)
ac.start_rest_api_server(host="127.0.0.1", port=9939)
# Endpoints:
#   GET  /health           liveness probe
#   GET  /jobs             scheduler job list
#   POST /execute          body: {"actions": [...]}

Client example:

import socket
import json

sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
sock.connect(("localhost", 9938))

# Send an automation command
command = json.dumps([
    ["AC_set_mouse_position", {"x": 500, "y": 300}],
    ["AC_click_mouse", {"mouse_keycode": "mouse_left"}]
])
sock.sendall(command.encode("utf-8"))

# Receive response
response = sock.recv(8192).decode("utf-8")
print(response)
sock.close()

Plugin Loader

Drop .py files defining top-level AC_* callables into a directory, then register them as executor commands at runtime:

from je_auto_control import (
    load_plugin_directory, register_plugin_commands,
)

commands = load_plugin_directory("./my_plugins")
register_plugin_commands(commands)

# Now usable from any JSON action script:
# [["AC_greet", {"name": "world"}]]

Warning: Plugin files execute arbitrary Python on load. Only load from directories you control.

Shell Command Execution

import je_auto_control

# Using the default shell manager
je_auto_control.default_shell_manager.exec_shell("echo Hello")
je_auto_control.default_shell_manager.pull_text()  # Print captured output

# Or create a custom ShellManager
shell = je_auto_control.ShellManager(shell_encoding="utf-8")
shell.exec_shell("ls -la")
shell.pull_text()
shell.exit_program()

Screen Recording

import je_auto_control
import time

# Method 1: ScreenRecorder (manages multiple recordings)
recorder = je_auto_control.ScreenRecorder()
recorder.start_new_record(
    recorder_name="my_recording",
    path_and_filename="output.avi",
    codec="XVID",
    frame_per_sec=30,
    resolution=(1920, 1080)
)
time.sleep(10)
recorder.stop_record("my_recording")

# Method 2: RecordingThread (simple single recording, outputs MP4)
recording = je_auto_control.RecordingThread(video_name="my_video", fps=20)
recording.start()
time.sleep(10)
recording.stop()

Callback Executor

Execute an automation function and trigger a callback upon completion:

import je_auto_control

def my_callback():
    print("Action completed!")

# Execute set_mouse_position then call my_callback
je_auto_control.callback_executor.callback_function(
    trigger_function_name="AC_set_mouse_position",
    callback_function=my_callback,
    x=500, y=300
)

# With callback parameters
def on_done(message):
    print(f"Done: {message}")

je_auto_control.callback_executor.callback_function(
    trigger_function_name="AC_click_mouse",
    callback_function=on_done,
    callback_function_param={"message": "Click finished"},
    callback_param_method="kwargs",
    mouse_keycode="mouse_left"
)

Package Manager

Dynamically load external Python packages into the executor at runtime:

import je_auto_control

# Add all functions/classes from a package to the executor
je_auto_control.package_manager.add_package_to_executor("os")

# Now you can use os functions in JSON action scripts:
# ["os_getcwd", {}]
# ["os_listdir", {"path": "."}]

Project Management

Scaffold a project directory structure with template files:

import je_auto_control

# Create a project structure
je_auto_control.create_project_dir(project_path="./my_project", parent_name="AutoControl")

# This creates:
# my_project/
# └── AutoControl/
#     ├── keyword/
#     │   ├── keyword1.json        # Template action file
#     │   ├── keyword2.json        # Template action file
#     │   └── bad_keyword_1.json   # Error handling template
#     └── executor/
#         ├── executor_one_file.py  # Execute single file example
#         ├── executor_folder.py    # Execute folder example
#         └── executor_bad_file.py  # Error handling example

Window Management

Send events directly to specific windows (Windows and Linux only):

import je_auto_control

# Send keyboard event to a window by title
je_auto_control.send_key_event_to_window("Notepad", keycode="a")

# Send mouse event to a window handle
je_auto_control.send_mouse_event_to_window(window_handle, mouse_keycode="mouse_left", x=100, y=50)

GUI Application

Launch the built-in graphical interface (requires [gui] extra):

import je_auto_control
je_auto_control.start_autocontrol_gui()

Or from the command line:

python -m je_auto_control

Command-Line Interface

AutoControl can be used directly from the command line:

# Execute a single action file
python -m je_auto_control -e actions.json

# Execute all action files in a directory
python -m je_auto_control -d ./action_files/

# Execute a JSON string directly
python -m je_auto_control --execute_str '[["AC_screenshot", {"file_path": "test.png"}]]'

# Create a project template
python -m je_auto_control -c ./my_project

A richer subcommand CLI built on the headless APIs:

# Run a script, optionally with variables, and/or a dry-run
python -m je_auto_control.cli run script.json
python -m je_auto_control.cli run script.json --var name=alice --dry-run

# List scheduler jobs
python -m je_auto_control.cli list-jobs

# Start the socket or REST server
python -m je_auto_control.cli start-server --port 9938
python -m je_auto_control.cli start-rest   --port 9939

--var name=value is parsed as JSON when possible (so count=10 becomes an int), otherwise treated as a string.

Platform Support

Platform	Status	Backend	Notes
Windows 10 / 11	Supported	Win32 API (ctypes)	Full feature support
macOS 10.15+	Supported	pyobjc / Quartz	Action recording not available; `send_key_event_to_window` / `send_mouse_event_to_window` not supported
Linux (X11)	Supported	python-Xlib	Full feature support
Linux (Wayland)	Not supported	—	May be added in a future release
Raspberry Pi 3B / 4B	Supported	python-Xlib	Runs on X11

Development

Setting Up

git clone https://github.com/Intergration-Automation-Testing/AutoControl.git
cd AutoControl
pip install -r dev_requirements.txt

Running Tests

# Unit tests
python -m pytest test/unit_test/

# Integration tests
python -m pytest test/integrated_test/

Project Links

Homepage: https://github.com/Intergration-Automation-Testing/AutoControl
Documentation: https://autocontrol.readthedocs.io/en/latest/
PyPI: https://pypi.org/project/je_auto_control/

License

MIT License © JE-Chen. See Third_Party_License.md for the licenses of bundled and optional third-party dependencies.

Project details

These details have not been verified by PyPI

Project links

Development Status
- 2 - Pre-Alpha
Environment
Operating System
- OS Independent
Programming Language
- Python :: 3.10

Release history Release notifications | RSS feed

0.0.184

Apr 28, 2026

0.0.183

Apr 28, 2026

0.0.182

Apr 27, 2026

0.0.181

Apr 26, 2026

This version

0.0.180

Apr 25, 2026

0.0.179

Apr 4, 2026

0.0.178

Dec 14, 2025

0.0.177

Nov 9, 2025

0.0.176

Oct 12, 2025

0.0.175

Sep 30, 2025

0.0.174

Sep 27, 2025

0.0.173

Sep 23, 2025

0.0.172

Sep 23, 2025

0.0.171

Jul 19, 2025

0.0.170

Jul 11, 2025

0.0.169

Jul 11, 2025

0.0.168

Apr 3, 2025

0.0.167

Mar 28, 2025

0.0.166

Mar 25, 2025

0.0.165

Mar 25, 2025

0.0.164

Mar 23, 2025

0.0.163

Jan 5, 2025

0.0.162

Dec 7, 2024

0.0.161

Dec 4, 2024

0.0.160

Dec 4, 2024

0.0.159

Jul 22, 2024

0.0.158

Jul 16, 2024

0.0.157

Jul 11, 2024

0.0.156

Nov 13, 2023

0.0.155

Aug 15, 2023

0.0.154

Aug 14, 2023

0.0.153

Aug 3, 2023

0.0.152

Jul 17, 2023

0.0.151

Jul 17, 2023

0.0.150

Jul 17, 2023

0.0.149

Jun 19, 2023

0.0.148

Jun 19, 2023

0.0.147

Jun 19, 2023

0.0.146

Jun 19, 2023

0.0.145

Jun 19, 2023

0.0.144

Jun 13, 2023

0.0.142

Jun 13, 2023

0.0.141

Jun 1, 2023

0.0.140

May 28, 2023

0.0.139

May 21, 2023

0.0.138

May 18, 2023

0.0.137

May 2, 2023

0.0.136

Apr 29, 2023

0.0.135

Apr 25, 2023

0.0.134

Apr 11, 2023

0.0.133

Apr 10, 2023

0.0.132

Apr 10, 2023

0.0.131

Apr 10, 2023

0.0.130

Apr 10, 2023

0.0.129

Apr 8, 2023

0.0.128

Apr 7, 2023

0.0.127

Apr 7, 2023

0.0.126

Apr 6, 2023

0.0.125

Apr 5, 2023

0.0.124

Mar 31, 2023

0.0.123

Mar 31, 2023

0.0.122

Mar 20, 2023

0.0.121

Mar 20, 2023

0.0.119

Mar 16, 2023

0.0.118

Mar 6, 2023

0.0.117

Dec 21, 2022

0.0.116

Dec 21, 2022

0.0.115

Dec 20, 2022

0.0.114

Nov 16, 2022

0.0.113

Nov 16, 2022

0.0.112

Nov 16, 2022

0.0.111

Nov 16, 2022

0.0.110

Nov 9, 2022

0.0.109

Nov 9, 2022

0.0.108

Nov 2, 2022

0.0.107

Oct 27, 2022

0.0.106

Oct 24, 2022

0.0.105

Oct 17, 2022

0.0.104

Sep 26, 2022

0.0.103

Sep 24, 2022

0.0.102

Sep 23, 2022

0.0.101

Sep 10, 2022

0.0.99

Sep 6, 2022

0.0.97

Jul 18, 2022

0.0.96

Jul 17, 2022

0.0.95

Jul 12, 2022

0.0.94

Jul 6, 2022

0.0.93

Jun 28, 2022

0.0.92

Jun 28, 2022

0.0.91

Jun 1, 2022

0.0.90

May 24, 2022

0.0.89

May 6, 2022

0.0.88

Apr 20, 2022

0.0.87

Apr 12, 2022

0.0.86

Apr 7, 2022

0.0.85

Apr 6, 2022

0.0.84

Mar 29, 2022

0.0.83

Mar 28, 2022

0.0.82

Mar 28, 2022

0.0.81

Mar 20, 2022

0.0.80

Mar 19, 2022

0.0.79

Mar 15, 2022

0.0.78

Dec 20, 2021

0.0.77

Dec 6, 2021

0.0.76

Dec 3, 2021

0.0.75

Dec 1, 2021

0.0.74

Dec 1, 2021

0.0.73

Nov 24, 2021

0.0.72

Nov 23, 2021

0.0.71

Nov 19, 2021

0.0.70

Nov 18, 2021

0.0.69

Nov 18, 2021

0.0.68

Nov 18, 2021

0.0.67

Nov 18, 2021

0.0.66

Nov 17, 2021

0.0.65

Nov 16, 2021

0.0.64

Nov 14, 2021

0.0.63

Nov 12, 2021

0.0.62

Nov 12, 2021

0.0.61

Nov 12, 2021

0.0.52

Nov 9, 2021

0.0.51

Nov 8, 2021

0.0.50

Oct 22, 2021

0.0.49

Oct 16, 2021

0.0.48

Oct 16, 2021

0.0.47

Oct 16, 2021

0.0.46

Oct 15, 2021

0.0.45

Oct 15, 2021

0.0.44

Oct 14, 2021

0.0.43

Oct 14, 2021

0.0.42

Oct 13, 2021

0.0.41

Oct 13, 2021

0.0.40

Oct 13, 2021

0.0.39

Sep 15, 2021

0.0.38

Sep 15, 2021

0.0.37

Sep 15, 2021

0.0.36

Sep 9, 2021

0.0.35

Aug 25, 2021

0.0.34

Aug 24, 2021

0.0.33

Aug 20, 2021

0.0.32

Aug 19, 2021

0.0.31

Aug 18, 2021

0.0.30

Aug 18, 2021

0.0.29

Aug 16, 2021

0.0.28

Aug 16, 2021

0.0.27

Aug 15, 2021

0.0.26

Aug 14, 2021

0.0.25

Jul 26, 2021

0.0.24

Jul 24, 2021

0.0.23

Jul 23, 2021

0.0.22

Jul 23, 2021

0.0.21

Jul 17, 2021

0.0.20

Jul 17, 2021

0.0.19

Jul 17, 2021

0.0.18

Jul 17, 2021

0.0.17

Jul 14, 2021

0.0.16

Jul 14, 2021

0.0.15

Jul 14, 2021

0.0.14

Jul 14, 2021

0.0.13

Jul 14, 2021

0.0.12

Jul 14, 2021

0.0.11

Jul 14, 2021

0.0.10

Jul 14, 2021

0.0.9

Jul 14, 2021

0.0.8

Jul 13, 2021

0.0.7

Jul 13, 2021

0.0.6

Jul 13, 2021

0.0.5

Jul 13, 2021

0.0.3

Jul 8, 2021

0.0.2

Jul 2, 2021

0.0.1

Jul 1, 2021

0.0.0

Jul 26, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

je_auto_control-0.0.180.tar.gz (219.3 kB view details)

Uploaded Apr 25, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

je_auto_control-0.0.180-py3-none-any.whl (275.8 kB view details)

Uploaded Apr 25, 2026 Python 3

File details

Details for the file je_auto_control-0.0.180.tar.gz.

File metadata

Download URL: je_auto_control-0.0.180.tar.gz
Upload date: Apr 25, 2026
Size: 219.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for je_auto_control-0.0.180.tar.gz
Algorithm	Hash digest
SHA256	`5e5bc942f55ee3b7cfa1ea335724aca6673a512efcb11fe3875180ac3a46e78a`
MD5	`390ba40980fb066ccae8fa4787d0a239`
BLAKE2b-256	`accf9e7a6348427c14dbcee44c4c64868eaf9fff1ffe856f5b7dc9ed996b1936`

See more details on using hashes here.

File details

Details for the file je_auto_control-0.0.180-py3-none-any.whl.

File metadata

Download URL: je_auto_control-0.0.180-py3-none-any.whl
Upload date: Apr 25, 2026
Size: 275.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for je_auto_control-0.0.180-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5380170a0ab84ca7c7ed95c07c655e6e1a2668b6ea311da00e2c267485b1ad9a`
MD5	`5baa24132a802e2e44a39879d16328ae`
BLAKE2b-256	`73f077b255ce2e54a003308e1388bacde143e94391beb35b4297b789c73b9824`

See more details on using hashes here.

je-auto-control 0.0.180

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

AutoControl

Table of Contents

Features

Architecture

Installation

Basic Installation

With GUI Support (PySide6)

Linux Prerequisites

Requirements

Dependencies

Quick Start

Mouse Control

Keyboard Control

Image Recognition

Accessibility Element Finder

AI Element Locator (VLM)

OCR (Text on Screen)

Clipboard

Screenshot

Action Recording & Playback

JSON Action Scripting

MCP Server (Use AutoControl from Claude)

Scheduler (Interval & Cron)

Global Hotkey Daemon

Event Triggers

Run History

Report Generation

Remote Automation (Socket / REST)

Plugin Loader

Shell Command Execution

Screen Recording

Callback Executor

Package Manager

Project Management

Window Management

GUI Application

Command-Line Interface

Platform Support

Development

Setting Up

Running Tests

Project Links

License

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes