Autonomous AI Testing Agent with multi-agent architecture
Project description
HAINDY
Give coding agents computer use.
HAINDY lets coding tools like Claude Code, Codex CLI, and OpenCode interact with real desktop and mobile apps by seeing the screen, clicking, typing, scrolling, and validating flows. Use it when your agent needs to work with a real UI instead of a DOM or selector tree.
pip install haindy
haindy setup
After setup, open your coding agent and use the haindy skill against a live app.
Agent integration
HAINDY ships with bundled skills that haindy setup installs automatically for detected AI CLIs.
Supported CLIs:
- Claude Code
- Codex CLI
- OpenCode
If one of those CLIs is installed, haindy setup copies the HAINDY skill into the agent's skill directory and points the agent at the setup flow. For other coding agents, you can still use HAINDY by prompting them directly with the examples below.
Try it now
Open your coding agent and use the haindy skill against a running desktop app, Android emulator or device, or iOS simulator or device.
If you are running an app locally, try prompts like:
- "Use HAINDY to do exploratory testing on my app."
- "Use HAINDY to test creating a new account."
- "Use HAINDY to check whether the login flow works end to end."
- "Use HAINDY to explore the settings screen and report whether notifications can be toggled."
Your agent will start the session, interact with the UI, and return screenshots and structured results.
CLI usage
HAINDY can also be driven directly from the command line when you want explicit command-by-command control.
Start a session, read the returned session_id, and pass it explicitly to later commands:
haindy session new --desktop
haindy screenshot --session <SESSION_ID>
haindy act "click the Login button" --session <SESSION_ID>
haindy session close --session <SESSION_ID>
For mobile:
haindy session new --android --android-serial emulator-5554
haindy session new --ios --ios-udid <UDID>
For session hygiene:
haindy session prune --older-than 7
Every command returns structured JSON. A typical response looks like this:
{
"session_id": "8f4d2c1e-7c2d-4d92-a0bc-3d0a9c6c1b5e",
"run_id": null,
"command": "screenshot",
"status": "success",
"response": "Screenshot captured.",
"screenshot_path": "/absolute/path/to/screenshot.png",
"meta": {
"exit_reason": "completed",
"duration_ms": 0,
"actions_taken": 0
}
}
Under the hood, each action goes through a computer-use AI provider (OpenAI, Google Gemini, or Anthropic Claude) that takes a screenshot, reasons about the UI, and performs real OS-level input -- mouse, keyboard, scroll -- against the actual application. No DOM hooks, no selectors, no browser automation.
Supported platforms
| Platform | Automation method |
|---|---|
| Linux/X11 | uinput + xdotool + ffmpeg |
| macOS | pynput + mss |
| Android | ADB |
| iOS | idb |
act vs test vs explore
act-- execute a single action ("click the submit button", "type hello into the search field")test-- dispatch a multi-step scenario with outcome validation, then polltest-statusexplore-- dispatch an open-ended goal, then pollexplore-status
Use test when the scenario is backed by written requirements, a test plan, wireframes, or other explicit documentation and you want structured execution plus validation. Use explore when the goal is clear but the path is not, or when you are working from product knowledge rather than supporting docs. Use act when you want tight step-by-step control and to inspect the screen after each command.
Session variables
Store values your agent can reference across commands:
haindy session set USERNAME alice@example.com --session <ID>
haindy session set PASSWORD --value-file credentials.txt --secret --session <ID>
haindy session vars --session <ID>
Run tests from requirements
HAINDY also includes a pipeline of specialized AI agents that can plan and execute tests autonomously from a requirements file.
Requirements -> Scope Triage -> Test Planner -> Situational Agent -> Test Runner -> Report
haindy run --plan requirements.txt --context context.txt
haindy run --mobile --plan requirements.txt --context context.txt # Android
haindy run --ios --plan requirements.txt --context context.txt # iOS
This produces an HTML report with screenshots, pass/fail results, and a JSONL execution log.
Configuration
Credentials
haindy auth login openai # stored in system keychain
haindy auth login openai-codex # OAuth-based login
haindy auth login google
haindy auth login anthropic
haindy auth status # verify
Providers
HAINDY uses two providers independently: one for planning/analysis, one for computer-use actions.
haindy provider set openai # planning/analysis
haindy provider set-computer-use google # computer-use
Settings file
Create ~/.haindy/settings.json for persistent non-secret configuration:
{
"agent": { "provider": "openai" },
"computer_use": { "provider": "google" },
"openai": { "model": "gpt-5.4", "computer_use_model": "gpt-5.4" },
"google": { "model": "gemini-3-flash-preview", "computer_use_model": "gemini-3-flash-preview" },
"anthropic": { "model": "claude-sonnet-4-6", "computer_use_model": "claude-sonnet-4-6" },
"execution": {
"automation_backend": "desktop",
"actions_action_timeout_seconds": 600
},
"logging": { "level": "INFO" }
}
Environment variables override all other sources. Timeout settings use seconds. In settings.json, use execution.actions_action_timeout_seconds; the older execution.actions_action_timeout_ms key is only accepted as a legacy read-time alias. See .env.example for the full list.
Platform prerequisites
| Platform | Requirements |
|---|---|
| Linux/X11 | ffmpeg, xdotool, xclip, /dev/uinput access |
| macOS | Grant Accessibility + Screen Recording to your terminal (System Settings > Privacy & Security) |
| Android | adb installed, device/emulator reachable |
| iOS (macOS) | brew install idb-companion, device paired |
haindy doctor checks all of these for you. See docs/RUNBOOK.md for detailed setup.
Development
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.lock
pip install -e ".[dev]"
ruff check . # lint
ruff format --check . # format check
mypy haindy # type check
pytest # tests
Architecture
| Directory | Purpose |
|---|---|
haindy/tool_call_mode/ |
Tool-call CLI, daemon, IPC, session state |
haindy/agents/computer_use/ |
Multi-provider computer-use session orchestrator |
haindy/agents/ |
Scope triage, test planner, situational, action, and test runner agents |
haindy/desktop/ |
Linux/X11 automation (uinput, xdotool, ffmpeg) |
haindy/macos/ |
macOS automation (pynput, mss) |
haindy/mobile/ |
Android (ADB) and iOS (idb) automation |
haindy/config/ |
Settings, env vars, settings file loader |
haindy/orchestration/ |
Multi-agent workflow coordination |
haindy/monitoring/ |
JSONL logging, HTML report generation |
Reporting issues
Both batch and tool-call modes emit a pre-filled GitHub issue URL with run
context (HAINDY version, platform, command, exit reason, truncated error).
Batch mode prints it at the end of a run; tool-call mode adds a feedback_url
field to the JSON envelope on failure. Set HAINDY_NO_FEEDBACK_URL=1 to
suppress it.
Contributing
See CONTRIBUTING.md for development guidelines and how to submit changes.
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file haindy-0.3.0.tar.gz.
File metadata
- Download URL: haindy-0.3.0.tar.gz
- Upload date:
- Size: 487.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8601bbffa0265c52992251b9ada070f9582ce0fb29a7d0cf80b65a24fce42a8b
|
|
| MD5 |
6b58166cd301797613e4e9591e51645e
|
|
| BLAKE2b-256 |
00b7890a61178fd520f60b07f31589838b4727368515f7b5d75513bee2ae1675
|
Provenance
The following attestation bundles were made for haindy-0.3.0.tar.gz:
Publisher:
release.yml on Haindy/haindy
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
haindy-0.3.0.tar.gz -
Subject digest:
8601bbffa0265c52992251b9ada070f9582ce0fb29a7d0cf80b65a24fce42a8b - Sigstore transparency entry: 1317068826
- Sigstore integration time:
-
Permalink:
Haindy/haindy@29d1a129af6734995222999068774b9960dcd435 -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/Haindy
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@29d1a129af6734995222999068774b9960dcd435 -
Trigger Event:
push
-
Statement type:
File details
Details for the file haindy-0.3.0-py3-none-any.whl.
File metadata
- Download URL: haindy-0.3.0-py3-none-any.whl
- Upload date:
- Size: 433.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
89e75ad44528d7f9cbdd691b0cfbe30a84ecafc23988eb85666889888f3be3ba
|
|
| MD5 |
c78fe7e6e807aafdde4465f9e2561d5f
|
|
| BLAKE2b-256 |
dbe8d3b25b901971c39766318c0a0fb498a5c69583cd0585f13fcb7cb57b70a2
|
Provenance
The following attestation bundles were made for haindy-0.3.0-py3-none-any.whl:
Publisher:
release.yml on Haindy/haindy
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
haindy-0.3.0-py3-none-any.whl -
Subject digest:
89e75ad44528d7f9cbdd691b0cfbe30a84ecafc23988eb85666889888f3be3ba - Sigstore transparency entry: 1317068829
- Sigstore integration time:
-
Permalink:
Haindy/haindy@29d1a129af6734995222999068774b9960dcd435 -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/Haindy
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@29d1a129af6734995222999068774b9960dcd435 -
Trigger Event:
push
-
Statement type: