Sumo QA — a senior-QA-shaped MCP server for pre-coding QA planning, TDD scaffolding, mutation-testing follow-up, code review, and test-data discovery.
Project description
sumo-qa MCP
An MCP server that brings senior-QA discipline to AI coding assistants — test planning, TDD, mutation testing, code review.
[!IMPORTANT] sumo-qa is an advisor, not an oracle. Like any AI tool it can be wrong. Your judgment and your team's standards are the final word.
🚀 New here? 5-minute demo →
One install line, one prompt on your repo, the workflow runs on real code.
Why it exists
Ask a stock AI assistant to QA a change and you get the junior answer: "add unit tests, consider edge cases, maybe test performance." That's a checklist.
sumo-qa makes the agent work the way a senior QA does:
- Names 3–7 risks tied to specific files and lines, not categories
- Picks one design technique per risk from an ISTQB-grounded catalogue (boundary-value, decision-table, property-based, mutation)
- Runs your test suite fresh in the current turn before any "safe to merge" claim
- Holds TDD's red phase before any production code is written
- Keeps production code locked while strengthening tests against mutation survivors
- Won't ship a plan without measurable entry and exit criteria
The discipline lives in 14 skill files (1 router + 13 sub-skills). The host LLM follows them literally, and each one has an Iron Law and a HARD-GATE callout the LLM can't talk past. Skills route automatically from natural-language prompts; you don't need to remember to invoke them.
Install
python -m pip install sumo-qa && python -m sumo_qa.installer --claude-code
Other hosts: swap --claude-code for --vscode --workspace <path-to-repo>, --jetbrains, or drop the flag to configure every host on the machine. The python -m sumo_qa.installer form works even when the pip script directory is not on PATH yet.
On Windows PowerShell, use:
py -m pip install sumo-qa; if ($?) { py -m sumo_qa.installer --claude-code }
Restart your host or open a fresh chat afterwards.
Per-host flags, schema differences, and troubleshooting: docs/INSTALL.md. Want to run from a local clone with your team's own standards / knowledge packs editable in place? docs/INSTALL.md#install-from-a-local-clone.
Verify it's wired
In any host, ask:
load the QA classifications
You should get 10 names back: api_contract_change, business_logic_change, security_change, performance_change, frontend_change, infrastructure_change, test_change, docs_change, config_change, data_migration. If you do, you're wired.
Update
python -m pip install --upgrade sumo-qa && python -m sumo_qa.installer
Restart the host. The SessionStart hook re-injects the latest content; skills and knowledge refresh from the upgraded package.
What's included
%%{init: {'theme':'base', 'themeVariables': {
'fontFamily':'Charter, "Iowan Old Style", Georgia, serif',
'fontSize':'15px',
'primaryTextColor':'#1B1B1B',
'lineColor':'#1B1B1B'
}}}%%
flowchart LR
LLM{{"Host LLM"}}
subgraph Inputs ["sumo-qa content"]
direction TB
Knowledge[("Knowledge")]
Standards[("Standards")]
end
Skills["<b>Skills</b>"]
Output(["Output"])
Knowledge -- cited by --> Skills
Standards -- cited by --> Skills
LLM == follows ==> Skills
Skills == produces ==> Output
classDef host fill:#7A1F1F,stroke:#1B1B1B,stroke-width:2px,color:#FAF7F2
classDef skills fill:#FAF7F2,stroke:#1B1B1B,stroke-width:2.5px,color:#1B1B1B
classDef data fill:#F0EAE0,stroke:#8A7B5C,stroke-width:1.5px,color:#1B1B1B
classDef out fill:#E8EDDF,stroke:#3F4A2E,stroke-width:2px,color:#1B1B1B
classDef group fill:none,stroke:#8A7B5C,stroke-width:1px,color:#5C4D00,stroke-dasharray: 4 4
class LLM host
class Skills skills
class Knowledge,Standards data
class Output out
class Inputs group
linkStyle 0,1 stroke:#8A7B5C,stroke-width:1.2px,stroke-dasharray:5 4
linkStyle 2,3 stroke:#1B1B1B,stroke-width:2.5px
| Layer | What |
|---|---|
14 skills (skills/) |
Iron-Law procedures: deciding approach, preparing for work, TDD scaffolding, diff review, strengthening tests, finding test data, answering testing questions, repo strategy — plus the planning → parallel subagent execution → finishing chain. |
| 28 MCP entry points | 14 skill tools, 6 knowledge loaders, 4 test-data tools, 4 external-skill lifecycle tools. Thin operations, no inference. |
4 knowledge catalogues (knowledge/) |
Classifications, approaches, principles, techniques. The agent picks from these instead of recalling from training data. Editable as plain markdown. Specialty-tool picks are deliberately not catalogued — the discipline is observe the risk surface, web-search current options for the user's stack, cite when naming a tool. |
Host support
Every host calls the same MCP server and reads the same SKILL.md files. What differs is how each host exposes them — that's a host-API difference, not a sumo-qa choice.
These hosts are verified end-to-end with python -m sumo_qa.installer:
| Host | Slash | Setup |
|---|---|---|
| Claude Code | /sumo-qa-deciding-approach (hyphens) |
python -m sumo_qa.installer --claude-code |
| VS Code + Copilot (Agent mode, Claude Sonnet 4.5 or equivalent) | Natural language | python -m sumo_qa.installer --vscode --workspace <repo> writes .vscode/mcp.json |
| JetBrains AI Assistant | /sumo_qa_deciding_approach (underscores) |
One-time UI setup; python -m sumo_qa.installer --jetbrains prints the fields to paste |
| JetBrains Junie | Natural language | Drop the JSON python -m sumo_qa.installer --jetbrains prints into ~/.junie/mcp/sumo-qa.json (global) or <repo>/.junie/mcp/ (per project) |
In Claude Code, type / then sumo-qa- to see all 14 skills as hyphenated entries (symlinked into ~/.claude/skills/). The same skills are also registered through MCP with underscores (/sumo_qa_load_classifications, /sumo_qa_find_test_data); both routes call the same SKILL.md.
Natural language works everywhere. "Review my changes", "plan QA for this story", "load the QA classifications" — the agent routes by tool description. Slash and natural-language paths produce the same result.
Other MCP hosts (Cursor, Codex, OpenCode, etc.): pip install sumo-qa ships a standard stdio MCP server, so it should work with anything that speaks MCP. Follow your host's MCP-server setup docs and point it at the absolute path of the sumo-qa script. Not verified end-to-end by us, so we don't ship instructions.
See it in action
Ten transcripts showing the workflow on real code — diff reviews refusing to call safe-to-merge from stale CI, TDD cycles with the red output surfaced verbatim, mutation survivors walked one at a time, formal test plans gated on entry/exit criteria, and the case where the right answer is "no tests needed, stop here":
- tests/scenarios/worked-examples/ — start with 02 — review my changes for a representative end-to-end
- tests/scenarios/SCENARIOS.md — the scenario specs (prompt → expected shape → anti-patterns the skill prevents)
When sumo-qa doesn't fit
If your QA intent has no native fit (Playwright E2E, accessibility audits, k6 load testing, type checking), sumo-qa searches for an external skill through its MCP server, offers a [y/N] install gate, installs through the Skills CLI, then loads the installed SKILL.md back into the conversation.
%%{init: {'theme':'base', 'themeVariables': {
'fontFamily':'Charter, "Iowan Old Style", Georgia, serif',
'fontSize':'13px',
'primaryTextColor':'#1B1B1B',
'lineColor':'#1B1B1B'
}}}%%
flowchart LR
Intent(["QA intent<br/><i>no native fit</i>"])
Search["<b>search</b><br/><i>sumo_qa_search_external_skills</i>"]
Gate{"<b>[y/N]</b>"}
Install["<b>install</b><br/><i>sumo_qa_install_external_skill</i>"]
Locate["<b>locate & load</b><br/><i>check_installed · execute</i>"]
Out(["external SKILL.md<br/>in the conversation"])
Stop(["stop"])
Intent ==> Search ==> Gate
Gate -->|y| Install ==> Locate ==> Out
Gate -->|N| Stop
classDef io fill:#FAF7F2,stroke:#1B1B1B,stroke-width:2px,color:#1B1B1B
classDef step fill:#FAF7F2,stroke:#1B1B1B,stroke-width:2.5px,color:#1B1B1B
classDef gate fill:#7A1F1F,stroke:#1B1B1B,stroke-width:2px,color:#FAF7F2
classDef stop fill:#F0EAE0,stroke:#8A7B5C,stroke-width:1.5px,color:#1B1B1B
classDef done fill:#E8EDDF,stroke:#3F4A2E,stroke-width:2px,color:#1B1B1B
class Intent io
class Search,Install,Locate step
class Gate gate
class Stop stop
class Out done
- The host does not run
npxdirectly;sumo_qa_search_external_skills,sumo_qa_check_external_skill_installed,sumo_qa_install_external_skill, andsumo_qa_execute_external_skillown the lifecycle. - Search returns the Skills CLI's text output verbatim (ANSI stripped); the host LLM reads it as the user would. No structured parser to drift out of date.
- Node.js is required for the Skills CLI. If
npxis missing, the MCP tool returns an actionable error and stops. It doesn't elevate via sudo. - The external skill handles tool-specific setup, while sumo-qa keeps the confirmation gates, test evidence, and risk-to-test mapping.
License
Apache 2.0. See NOTICE for attribution requirements that apply to forks and redistributors.
More docs
- AGENTS.md — AI-agent bootstrap and per-host setup
- docs/ARCHITECTURE.md — three layers, host delivery, knowledge authority
- docs/SKILLS.md — every skill with its Iron Law
- docs/TOOLS.md — every MCP entry point
- docs/INSTALL.md — per-host install detail and troubleshooting
- docs/CONTENT-FORMATS.md — schemas + worked examples for adding team standards, knowledge, change rules, and test data (incl. swapping ISTQB out)
- docs/CONFIGURATION.md — env vars
- docs/DEVELOPMENT.md — local dev
- docs/TEST-DATA.md — known-good test-data catalogue
- docs/PERSONA.md — optional Sumo-sensei voice (off by default)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sumo_qa-0.6.2.tar.gz.
File metadata
- Download URL: sumo_qa-0.6.2.tar.gz
- Upload date:
- Size: 1.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
da9c7293ad5c0414c81d58676bac995b158affdaa113568128718fae8e4fdc41
|
|
| MD5 |
00fa38522fad2271dd756fa9ad685e98
|
|
| BLAKE2b-256 |
ddfe6d5d323ba3b74ac6e6bf7e5c222e667238dd3b763707019325997f8f3521
|
Provenance
The following attestation bundles were made for sumo_qa-0.6.2.tar.gz:
Publisher:
release.yml on sumithr/sumo-qa
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
sumo_qa-0.6.2.tar.gz -
Subject digest:
da9c7293ad5c0414c81d58676bac995b158affdaa113568128718fae8e4fdc41 - Sigstore transparency entry: 1582640481
- Sigstore integration time:
-
Permalink:
sumithr/sumo-qa@9b59cc650e2362cf5262596b285b4a653221c927 -
Branch / Tag:
refs/tags/v0.6.2 - Owner: https://github.com/sumithr
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@9b59cc650e2362cf5262596b285b4a653221c927 -
Trigger Event:
push
-
Statement type:
File details
Details for the file sumo_qa-0.6.2-py3-none-any.whl.
File metadata
- Download URL: sumo_qa-0.6.2-py3-none-any.whl
- Upload date:
- Size: 142.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fb9ff85ebfd0bb3194c07218294bcf5a1655fe44f189cfa835b4e38bd969218b
|
|
| MD5 |
7d50de1c8b87d56db3958cccb482b4af
|
|
| BLAKE2b-256 |
bfc299c633367b855a04ea8326ff2bf87a117dec3c7be30f061c04644dbe17ff
|
Provenance
The following attestation bundles were made for sumo_qa-0.6.2-py3-none-any.whl:
Publisher:
release.yml on sumithr/sumo-qa
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
sumo_qa-0.6.2-py3-none-any.whl -
Subject digest:
fb9ff85ebfd0bb3194c07218294bcf5a1655fe44f189cfa835b4e38bd969218b - Sigstore transparency entry: 1582640658
- Sigstore integration time:
-
Permalink:
sumithr/sumo-qa@9b59cc650e2362cf5262596b285b4a653221c927 -
Branch / Tag:
refs/tags/v0.6.2 - Owner: https://github.com/sumithr
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@9b59cc650e2362cf5262596b285b4a653221c927 -
Trigger Event:
push
-
Statement type: