One CLI for SUSATest — autonomous QA, LLM observability, and production SRE for any product
Project description
susatest-agent
One CLI for SUSATest — autonomous QA, LLM observability, and production SRE for any product, in your terminal.
Install
pip install susatest-agent
Quick start
# 1. Sign in once (opens your browser)
susatest-agent login
# 2. Instrument your existing app with one command — zero SDK install
susatest-agent run -- python app.py
# (was: python app.py)
# 3. Open the dashboard to see traces, evals, and findings
susatest-agent dashboard
After susatest-agent login, every other command picks up your saved JWT from ~/.susatest/credentials. No more --api-key on every invocation.
Commands
Auth
| Command | What it does |
|---|---|
susatest-agent login |
OAuth device-code flow. Opens browser, you sign in once, JWT saved locally with 30-day expiry. |
susatest-agent logout |
Revokes the JWT server-side and clears local credentials. |
susatest-agent dashboard |
Opens the SUSATest dashboard in your default browser, pre-authenticated. |
Instrument your own app (zero code touch)
| Command | What it does |
|---|---|
susatest-agent run -- <your-command> |
Wraps your existing app launch with OpenTelemetry auto-instrumentation. Every OpenAI / Anthropic / LangChain / etc. LLM call your app makes auto-streams to your susaeval deploy as traces — no pip install susaeval, no susaeval.init(), no with session() wrappers. Python supported in 0.5.0; Node and Java coming. Example: susatest-agent run -- python app.py |
The wrapper detects the language from the wrapped command and routes through opentelemetry-instrument. First run prompts you to install the OTel libs (single pip install line printed) — we never install behind your back. After install, instrumentation is seamless on every subsequent run.
Testing your app
| Command | What it does |
|---|---|
susatest-agent test <apk-or-url> |
Upload an APK or point at a web URL; SUSATest runs an autonomous exploration test with the persona you pick and returns an HTML / JSON / JUnit report. |
susatest-agent connect |
Bridge a locally-attached Android device to the cloud platform — tests then run against your physical phone. |
susatest-agent prepare-device |
One-time setup for a device that will run agent-side network capture (CA cert, frida, marker). |
susatest-agent fleet-prep |
Bulk-bootstrap a phone for the fleet — frida + CA + marker + SUSA SMS forwarder + phone-vault registration in one shot. |
susatest-agent connect-windows |
Connect this Windows machine to SUSATest as a desktop-test agent. |
Strategist (autonomous QA brain)
| Command | What it does |
|---|---|
susatest-agent strategist list |
List your strategists (each one is bound to a product surface). |
susatest-agent strategist run <id> [--action gap_explore] [--wait] |
Fire an on-demand cognition cycle. With --wait the CLI blocks until the cycle row lands. |
susatest-agent strategist status <id> |
Show the strategist's state plus the last 5 cycle summaries. |
Opsy (production SRE signals)
| Command | What it does |
|---|---|
susatest-agent opsy list-incidents [--status open --severity high] |
List incidents opsy has observed in production. |
susatest-agent opsy list-alerts [--status fired] |
List firing alerts. |
susatest-agent opsy status |
Health / status summary. |
susatest-agent opsy fire-incident --title T --severity S [--services svc1 svc2] |
Create a synthetic incident — verifies the reactive-to-proactive loop. The next strategist cycle ingests it via opsy_signals. |
Susaeval (LLM observability)
| Command | What it does |
|---|---|
susatest-agent susaeval list-sessions [--gate-decision rollback] |
List recent LLM sessions. gate_decision=rollback surfaces sessions where eval judges flagged a regression. |
susatest-agent susaeval list-incidents |
List susaeval-side incidents. |
susatest-agent susaeval drift [--feature model --hours 24] |
Show distribution drift (PSI) on a feature over the last N hours. |
susatest-agent susaeval list-safety |
List safety / guardrail findings. |
Authentication
One susatest-agent login — one JWT — every subcommand.
After login, the saved JWT (~/.susatest/credentials, 30-day expiry) is honored by test, strategist, opsy, susaeval, dashboard, and the OTel exporter that run -- sets up. Server-side fan-out hands the right token to each backend; you never see OPSY_API_KEY or SUSAEVAL_API_KEY again.
Legacy paths still work for CI scripts and pinned older deploys:
--api-key <key>flag — overrides the saved JWT for one call.SUSA_API_KEYenv var — picked up by every subcommand if set.OPSY_API_KEY/SUSAEVAL_API_KEYenv vars — only needed if your CI pinned them before 0.4.2; new setups should leave them unset and uselogin.
Requirements
- Python 3.9+
- ADB installed (
adbin PATH) if you'll connect Android devices — comes with Android Studio
Known limitations
These exist today and are on the roadmap:
run --is Python-only in 0.5.x. Node and Java auto-instrumentation are on the roadmap. The wrapper recognises both today and prints clear next-step guidance instead of failing silently.- stdout/stderr → opsy log forwarding not yet auto-piped by the run wrapper. LLM traces flow to susaeval today; application log forwarding to opsy is on the roadmap.
- Three dashboards. Strategist / opsy / susaeval each have their own UI today. Unified nav is roadmap.
Resources
- Homepage: https://susatest.com
- Docs: https://susatest.com/docs
- Source / issues: https://github.com/monk0062006/SUSA
- Status: https://susatest.com/status
Changelog
0.5.1
- Docs: clarified that a single
susatest-agent loginJWT covers every subcommand (server-side fan-out is live as of 0.4.2). No code changes.
0.5.0
susatest-agent run -- <cmd>— auto-instrumentation wrapper for customer apps. Wraps your existing process launch with OpenTelemetry so every LLM call streams to susaeval as a trace, with zero source changes. Python supported today. Node + Java on the roadmap.- First-run install nudge — when OTel libs are missing in the customer's
env we print the exact
pip install ...line and exit. Never install behind their back.
0.4.2
- Server-side auth fan-out: a single
susatest-agent loginJWT now works acrosstest,strategist,opsy, andsusaevalsubcommands. No moreOPSY_API_KEY/SUSAEVAL_API_KEYenv juggling. Legacy env vars and--api-keyflags still honored for CI scripts.
0.4.1
- Updated README with full command surface (no functional changes vs 0.4.0).
0.4.0
- New
login,logout,dashboardcommands with OAuth device-code browser flow - New
strategist/opsy/susaevalsubcommands - Saved credentials at
~/.susatest/credentials—testandstrategistno longer require--api-key - Backward-compat:
--api-keyflag andSUSA_API_KEYenv var still honored
0.3.x
connect/prepare-device/fleet-prep/connect-windowsdevice bridging.
License
MIT.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file susatest_agent-0.5.3.tar.gz.
File metadata
- Download URL: susatest_agent-0.5.3.tar.gz
- Upload date:
- Size: 86.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
75c99fb69906c0c9871f148bddf18ca95a2751342f55096de7e5a14037984408
|
|
| MD5 |
89387e977f963a084238e044392e3c52
|
|
| BLAKE2b-256 |
6c0bb046a5c2900b024aa3049f27a5c82509b3a82549fc5ea8f92ad02e1b26c8
|
File details
Details for the file susatest_agent-0.5.3-py3-none-any.whl.
File metadata
- Download URL: susatest_agent-0.5.3-py3-none-any.whl
- Upload date:
- Size: 81.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
66da9122aa6e419df6333b1a99580d760d8b454367fbb644a582bbfe70b10fdb
|
|
| MD5 |
7781b17b0badf8d619580ebcd8757909
|
|
| BLAKE2b-256 |
b914f513b386964f9cacafc6fea943d1747d2c74a21a7918a19e91a4eac6ae3e
|