Local-first security recon that briefs your AI coding agent: facts + tailored probe scripts, code-in / artifacts-out. No LLM, no server, no running app.

These details have not been verified by PyPI

Project description

websec-validator

Local-first security recon that briefs your AI coding agent. It does the deterministic half — read the repo, map the full attack surface, run + de-duplicate the static scanners, and stage a probe library tailored to what it found — then hands your agent (Claude Code, Codex, Gemini, Cursor) a marching-orders briefing. Code in, artifacts out. No LLM in the tool, no server, no running app required.

It is not an autonomous scanner and not a SaaS. It's the missing front-half: the thing that turns a repo into a precise, fact-grounded security brief an AI agent (with a human in the loop) can act on — an auto-filled, repo-aware version of a senior pentester's "here's what to test and how" handoff. How it works + the reasoning behind every check: docs/METHODOLOGY.md.

Quickstart — just point it at your repo

Simplest: tell your AI agent. In Claude Code (or any coding agent), open your project and say:

"Install and run the security tool at github.com/raccioly/websec-validator on this repo, then follow its briefing."

It installs, runs, and walks the findings with you. There's nothing to host and no website — it's local. The four ways to get there, all ending in the same AGENT-BRIEFING.md your agent acts on:

Path	One-time setup	Then
Tell your agent (simplest)	—	say the line above
CLI (a terminal)	`pipx install websec-validator`	`websec run /path/to/your/app`
Claude Code plugin (slash)	`/plugin marketplace add raccioly/websec-validator` → `/plugin install websec-validator@websec-plugins`	invoke the security-pass skill, or just ask
Docker (no install)	`docker build -t websec-validator .`	`docker run --rm --user "$(id -u):$(id -g)" -v "$PWD:/scan" websec-validator run /scan --out /scan/websec-out`

➡️ Want the reasoning behind every check? Read docs/METHODOLOGY.md — what each test does and why.

Install

pipx install websec-validator   # from PyPI
brew install noir               # OWASP Noir — the route engine (50+ frameworks); regex fallback if absent
websec --version

Until the first PyPI release publishes (or for bleeding-edge), install straight from source instead: pipx install git+https://github.com/raccioly/websec-validator (or from a clone: pipx install .).

Requires Python 3.11+ (on stock macOS, python3 is often 3.9 — use pipx, which picks a newer interpreter, or install via Homebrew/pyenv). Zero Python runtime dependencies: it shells out to scanners (Trivy, Gitleaks, Semgrep/OpenGrep, Checkov, Prowler) and Noir when present, reports what's missing, and never hard-fails if a tool is absent.

Or run via Docker (everything bundled, zero install)

No need to install Noir or any scanner — the image bundles them all (arch-aware, amd64 + arm64):

docker build -t websec-validator .
docker run --rm --user "$(id -u):$(id -g)" -v "$PWD:/scan" websec-validator run /scan --out /scan/websec-out

The image carries Noir + Trivy + Gitleaks + Semgrep + Checkov; mount your repo at /scan and the artifacts land in /scan/websec-out.

Usage

websec run ./my-app           # ← the one command: recon + stage tailored probes + emit the briefing
websec ./my-app               # same thing — a bare path defaults to `run`
websec run ./my-app --scan    # …and also execute the available static scanners
websec doctor ./my-app        # (optional) which scanners are installed?

Then point your agent at the output: "Read websec-out/AGENT-BRIEFING.md and follow it."

That's the whole user surface: run (plus the optional, advanced dynamic live-probing step below). recon/proof/calibrate exist for developing the tool itself and are hidden from --help — you never need them.

What it extracts (16 deterministic extractors, no LLM)

	Dimension	Notable output
stack	languages, frameworks, datastores	monorepo-aware (aggregates every manifest)
routes	every endpoint via OWASP Noir	method · path · typed params · code path
auth	scheme + login surface + insecure-default signing secrets	multi-scheme; flags a hard-coded `JWT_SECRET \|\| 'dev-secret'` fallback (forgeable JWT)
authz	access-control map	guard coverage + write endpoints with no visible guard + roles
tenant	multi-tenancy key candidates	the BOLA boundary, by frequency
password_policy	cross-route consistency + reuse/history	complexity drift across routes + a set-password path that hashes without a reuse check
surface	15 sink classes + redirect-SSRF	user-input-gated sinks (incl. mass-assignment via object spread) + var-arg SSRF + error-disclosure + follows-redirects-without-per-hop-guard
upload_security	unrestricted upload + unsafe serve	deny-list-only, stored-name-from-filename, trust-client-MIME, accept-SVG, serve without `nosniff`
schemas	data models + privileged fields	Pydantic/SQLAlchemy/Django/Prisma/Mongoose/TypeORM/Zod → `role`/`isAdmin`/`groupId` for mass-assignment targeting
iac_ci	IaC + CI/CD	GHA injection, unpinned actions, tfstate, CDK AppSync `API_KEY` anonymous-default-auth + WAF-as-control smell
client_exposure	browser leakage	public-var secrets by name + value-shape (`da2-…`) + CDK build-injection, server-secret-in-client, source maps
client_integrity	tamperable display (client trust boundary) + WS auth model	any security-critical sink value (address/IBAN/2FA-seed/API-key/webhook) the user reads or copies, without strict CSP / out-of-band anchor + client-tamper-vector, grindable-fingerprint, over-claimed-control, the CSWSH determinant
transport_security	CSP + HSTS header baseline	missing/weak CSP, inline event handlers, partial HSTS (set on /api but not the HTML page)
pii_exposure	unmasked PII at the output boundary	`res.json(rawEntity)` with PII + a masking control defined but with zero live call sites (value-shape, not field-name)
graphql	GraphQL surface	introspection (AppSync `introspectionConfig: DISABLED`-aware) / playground / depth-limit + AppSync subscription-authz (cross-group BOLA)
integrations	third-party + webhooks + outbound-action endpoints	unsigned webhooks + email/SMS/push handlers with no auth or IP-only rate-limit + redundant secret-fetch

Plus derived targeting — IDOR / SSRF / open-redirect / upload / write / auth-endpoint candidates — so probes get pointed at the exact endpoints, not fired blindly.

What you get (`websec-out/`)

Artifact	What it is
`AGENT-BRIEFING.md`	The product. Marching orders: detected surface, the access-control map, targeting, findings, the method, and the staged probe list.
`FACTS.json`	The full structured recon.
`findings.json`	Static scanner results, de-duplicated across tools and severity-ranked (with `--scan`).
`findings-ledger.json` / `REPORT.md`	The traceable ledger: each finding with an evidence chain, CWE/ASVS/OWASP-API citation, remediation, and a calibrated `P(real)` (measured real-vuln rate + 95% CI + sample size).
`probes/`	The probe scripts selected + staged for this app (BOLA, JWT, SSRF, mass-assignment…).

The flow

🔧 websec (deterministic)              🤖 your agent + 🧑 you
─────────────────────────────────      ─────────────────────────────────
1. recon → full attack surface     →   confirm the tenant boundary + auth model
2. run + de-dup static scanners    →   triage real-vs-noise
3. stage tailored probes           →   fill placeholders, run vs a TEST instance
4. emit AGENT-BRIEFING.md           →   propose fixes, re-run to confirm, report back

Static recon + briefing need only the code. Running the probes needs a live test instance + test credentials (the human supplies them) — the tool itself never touches a running app.

Proof harness

websec proof clones a vuln-app corpus (VAmPI, NodeGoat, DVGA) and scores whether recon surfaces each app's documented attack surface — a deterministic, CI-trackable proxy (currently 10/10). The real kill-criterion (does the briefing lift an agent's bug-finding vs a generic prompt?) is the manual A/B in corpus/PROOF-PROTOCOL.md.

Calibrated confidence

websec calibrate runs the ledger against the labeled corpus, measures how often each (attack-class, confidence) bucket is a real documented vuln, and writes calibration.json (shipped + applied at runtime). Each finding then carries P(real) with a 95% Wilson confidence interval and the sample size n — so "MEDIUM" stops being a vibe and becomes "real ~57% of the time on the corpus (CI 43–70%, n=51)". A finding that matches no documented vuln counts as a false positive (the corpus is well-documented). Honest caveats: the corpus is deliberately vulnerable, so the rates skew optimistic for clean production code, and small samples mean wide intervals — the CI is the headline, not the point estimate, and both tighten as the corpus grows. With thin data a bucket falls back to the per-label aggregate, then to a clearly-flagged uncalibrated prior. No ML, no deps — binomial proportion + Wilson interval; the structure upgrades to isotonic regression if a large labeled set ever exists.

It self-improves. websec dynamic is an oracle: a write that executes unauthenticated is a confirmed real vuln, and a recon-flagged endpoint that turns out auth-enforced is a confirmed false positive. Every dynamic run folds those confirmed labels into a local overlay (~/.cache/websec-validator/, gitignored, never shipped) that's merged on top of the public table — so the numbers personalize to your apps the more you run it, with no extra step and nothing leaving your machine. To label by hand instead, feed a {attack_class, confidence, is_real} file to websec calibrate --ingest.

Dynamic phase (v2 — read-only so far)

When you have a running TEST instance, websec dynamic mints role tokens and runs the probes the static recon pointed at. v1 is read-only: authenticated cross-tenant BOLA on the group-scoped GET endpoints recon discovered.

cp dynamic-config.example.json dynamic-config.json    # TEST target + role creds (gitignored)
websec run ./my-app                                    # static recon → websec-out/FACTS.json
websec dynamic --config dynamic-config.json --facts websec-out/FACTS.json
# → "14/14 cross-tenant GET reads blocked — all isolated"   (or 🚨 LEAK with the exact endpoint)

Never point it at production. Write-verb BOLA, JWT/auth attacks, and a ZAP/Nuclei two-role diff are the next dynamic probes (explicitly gated — they mutate).

Validated on

A production Next.js app, a large Express/AWS monorepo, and the VAmPI / NodeGoat / DVGA vuln-app corpus — independently reproducing a hand-done pentest's findings (tenant boundary, SSRF, file upload, cross-tenant BOLA, role/authz gaps).

Tests

python3 -m unittest discover -s tests    # stdlib only, no Noir/network — 135 tests

Releasing (maintainer)

Published to PyPI via Trusted Publishing (OIDC — no API token in the repo). To cut a release:

# 1. bump the version in pyproject.toml (e.g. 0.2.1 → 0.2.2)
# 2. tag it and push — the tag must match pyproject's version (CI verifies):
git tag v0.2.2 && git push origin v0.2.2
# → publish.yml builds, INSTALLS + smoke-tests the wheel (version match,
#   calibration ships, a real `websec run`), then publishes. A bad build fails
#   CI instead of reaching PyPI — so you never have to yank after the fact.

One-time PyPI setup (before the first release): on pypi.org → Account → Publishing → Add a pending publisher with project websec-validator, owner raccioly, repo websec-validator, workflow publish.yml, environment pypi. The project is created on the first successful publish.

Two independent channels, two update mechanisms: the CLI ships to PyPI (semver releases, pip install --upgrade); the Claude Code plugin ships from git (tracks latest commit, refreshed via /plugin marketplace update).

Status / roadmap

Done: 15-extractor recon (incl. schema/entity → mass-assignment targeting, the AWS-CDK / managed-AppSync / VTL boundary, upload-security + PII-output-boundary + redirect-SSRF

password-reuse classes, and a man-in-the-browser / tamperable-display class), cross-tool de-dup + bundled Semgrep rules, tailored probe staging, agent briefing, traceable findings ledger with calibrated confidence (CJE — Wilson CIs), proof harness, test suite, Docker bundle (all scanners + Noir, arch-aware), dynamic phase v1 (authenticated read-only cross-tenant BOLA — validated live, reproduced a hand-pentest's 14/14). Validated against the REF-PENTEST pen test + retest (incl. correcting two findings the retest disproved: AppSync introspection is disablable engine-level, and API_KEY-default is anonymous-auth, not CSWSH). Next: dynamic write-verb BOLA + JWT/auth probes + ZAP/Nuclei two-role diff (gated, they mutate), calibration on hand-labeled real repos (more representative base rate), ASVS index lookup, optional model-SDK adapters for no-agent fallback.

Using it as a Claude Code skill / plugin

This repo is a Claude Code plugin. Install it once —

/plugin marketplace add raccioly/websec-validator
/plugin install websec-validator@websec-plugins

— and the bundled security-pass skill (skills/security-pass/SKILL.md) lets you just ask, in plain English, for a security pass: it runs websec, reads the briefing, and works the findings with you. For other agents the universal interface is unchanged: run the CLI, read AGENT-BRIEFING.md.

Install gotchas (field-tested):

The install id is plugin@marketplace — websec-validator@websec-plugins (the marketplace name from .claude-plugin/marketplace.json), not @websec-validator (the repo).
The plugin only delivers the instructions; the actual scanning is a separate Python CLI (websec). The skill's Step 0 installs it (pipx install websec-validator) if it's missing.
/plugin … only works in the terminal CLI. In the Claude app / Agent SDK (no /plugin), configure it in .claude/settings.json instead:
```
{
  "extraKnownMarketplaces": {
    "websec-plugins": { "source": { "source": "github", "repo": "raccioly/websec-validator" } }
  },
  "enabledPlugins": { "websec-validator@websec-plugins": true }
}
```
This registers + enables the plugin but does not auto-fetch it — the first download still needs the CLI (/plugin install websec-validator@websec-plugins) once. (Project .claude/settings.json for a team; ~/.claude/settings.json for just you.)

Credits

Methodology + probe library are distilled from a real authenticated penetration-testing pass. This tool productizes that hand-written methodology into something an AI agent can run on any repo.

License

MIT © Ricardo Accioly

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.8.1

Jun 23, 2026

0.8.0

Jun 22, 2026

0.7.0

Jun 22, 2026

0.6.3

Jun 19, 2026

0.6.2

Jun 19, 2026

0.6.1

Jun 17, 2026

This version

0.6.0

Jun 17, 2026

0.5.0

Jun 12, 2026

0.4.2

Jun 10, 2026

0.4.1

Jun 10, 2026

0.4.0

Jun 10, 2026

0.3.0

Jun 7, 2026

0.2.9

Jun 1, 2026

0.2.8

Jun 1, 2026

0.2.7

Jun 1, 2026

0.2.6

Jun 1, 2026

0.2.5

Jun 1, 2026

0.2.4

May 31, 2026

0.2.3

May 31, 2026

0.2.2

May 31, 2026

0.2.1

May 31, 2026

0.2.0 yanked

May 31, 2026

Reason this release was yanked:

new udpate

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

websec_validator-0.6.0.tar.gz (176.8 kB view details)

Uploaded Jun 17, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

websec_validator-0.6.0-py3-none-any.whl (182.2 kB view details)

Uploaded Jun 17, 2026 Python 3

File details

Details for the file websec_validator-0.6.0.tar.gz.

File metadata

Download URL: websec_validator-0.6.0.tar.gz
Upload date: Jun 17, 2026
Size: 176.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for websec_validator-0.6.0.tar.gz
Algorithm	Hash digest
SHA256	`f5671b79ff1d4d623a86a7fa612fea5a3c71cfaaa67e0c6ea8f48d8f365dc2b5`
MD5	`e1315ad74101402e29ace5888b56399e`
BLAKE2b-256	`2f8f28ac2dfc1311bb8fda4ff66b0906e467dc0331e5b7358604b5836c8f3bc2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for websec_validator-0.6.0.tar.gz:

Publisher: publish.yml on raccioly/websec-validator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: websec_validator-0.6.0.tar.gz
- Subject digest: f5671b79ff1d4d623a86a7fa612fea5a3c71cfaaa67e0c6ea8f48d8f365dc2b5
- Sigstore transparency entry: 1854163804
- Sigstore integration time: Jun 17, 2026
Source repository:
- Permalink: raccioly/websec-validator@def44748fd41c013dfd30684afcc96f86e3baa2d
- Branch / Tag: refs/tags/v0.6.0
- Owner: https://github.com/raccioly
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@def44748fd41c013dfd30684afcc96f86e3baa2d
- Trigger Event: push

File details

Details for the file websec_validator-0.6.0-py3-none-any.whl.

File metadata

Download URL: websec_validator-0.6.0-py3-none-any.whl
Upload date: Jun 17, 2026
Size: 182.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for websec_validator-0.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6392689f32c5b9cc217d5a870ed2afc0cdec1e84b027c4b35c6a7521cf9c8d9d`
MD5	`93c99745ac5222a76205dd35d278ee5b`
BLAKE2b-256	`f940a28a8967aa542a5cba2d6a59c2282ee7f5a7d19e1edec0baa8d08dc6aac2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for websec_validator-0.6.0-py3-none-any.whl:

Publisher: publish.yml on raccioly/websec-validator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: websec_validator-0.6.0-py3-none-any.whl
- Subject digest: 6392689f32c5b9cc217d5a870ed2afc0cdec1e84b027c4b35c6a7521cf9c8d9d
- Sigstore transparency entry: 1854164095
- Sigstore integration time: Jun 17, 2026
Source repository:
- Permalink: raccioly/websec-validator@def44748fd41c013dfd30684afcc96f86e3baa2d
- Branch / Tag: refs/tags/v0.6.0
- Owner: https://github.com/raccioly
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@def44748fd41c013dfd30684afcc96f86e3baa2d
- Trigger Event: push

websec-validator 0.6.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

websec-validator

Quickstart — just point it at your repo

Install

Or run via Docker (everything bundled, zero install)

Usage

What it extracts (16 deterministic extractors, no LLM)

What you get (websec-out/)

The flow

Proof harness

Calibrated confidence

Dynamic phase (v2 — read-only so far)

Validated on

Tests

Releasing (maintainer)

Status / roadmap

Using it as a Claude Code skill / plugin

Credits

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

What you get (`websec-out/`)