Cross-project vulnerability scanning CLI: dependency CVE, SAST, secrets, DAST, IAST, container, SBOM, API fuzz, and supply chain integrity — one entry point.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

suetsugushunsuke

These details have not been verified by PyPI

Project description

secscan

Cross-project vulnerability scanning CLI for Web (Node/TypeScript) and Python projects.

One secscan command runs dependency CVE, SAST, and secret-detection scanners over a project root, normalizes their output into a single report, and decides pass/fail against a configurable severity threshold. Findings can be acknowledged via a versioned baseline file with audit metadata.

secscan all --path .

What's in the box

Subcommand	Tool wrapped	What it checks
`secrets`	gitleaks v8+	hard-coded API keys, tokens, credentials
`deps`	npm / pnpm / yarn / pip-audit	declared dependencies with known CVEs/GHSAs
`sast`	semgrep	source-level vulnerability patterns
`dast`	OWASP ZAP (Docker)	live HTTP target probing — baseline + active (Phase 2-J)
`config`	Trivy (Docker)	IaC: k8s manifests, Terraform, Dockerfile, Helm (Phase 2-L)
`image`	Trivy (Docker, image mode)	container image CVEs — OS pkg + language pkg vulns (Phase 2-M)
`sbom`	Syft + Grype (Docker)	SBOM-based CVE matching: scan a directory / OCI image / existing SBOM file (Phase 2-N)
`apifuzz`	Schemathesis (Docker)	OpenAPI fuzzing — sends auto-generated edge-case requests to a live API to find input-validation / spec-conformance / auth bugs (Phase 2-O)
`iast`	pyrasp (operator-supplied)	runtime IAST harness — spawns the operator's app subprocess, sends canary probes, parses pyrasp event log. CLI-only, NEVER in `secscan all` (Phase 2-P)
`supply`	Sigstore cosign + lockfile parsers	container image signature verification (keyless) + lockfile self-consistency (npm / pip / uv) — supply chain integrity gate (Phase 2-Q)
`all`	every registered scanner	secrets + deps + sast + config (and dast/image/sbom/apifuzz/supply when their targets are configured). IAST is excluded — see Phase 2-P notes.
`baseline`	(self)	manage known-issue suppression file

Install

Requires Python 3.11 or newer.

Homebrew (macOS / Linux)

brew install shunsukesuetsugu-prog/secscan/secscan
secscan --version          # → secscan 0.19.0

Tap repo: shunsukesuetsugu-prog/homebrew-secscan.

Scoop (Windows)

scoop bucket add secscan https://github.com/shunsukesuetsugu-prog/scoop-secscan
scoop install secscan
secscan --version          # → secscan 0.19.0

Bucket repo: shunsukesuetsugu-prog/scoop-secscan.

From PyPI (any platform with pip)

pip install shun-secscan          # installs the ``secscan`` command
pip install "shun-secscan[sast,deps]"  # + bundled semgrep / pip-audit

The PyPI distribution name is shun-secscan (the bare secscan and the py- prefix variant were both blocked by PyPI's similarity gate against an unrelated secscan-cli package — see the note in pyproject.toml). The CLI command and Python import name are still secscan. Homebrew formula and Scoop manifest both alias this back to secscan so end-users see one name regardless of the install path.

From source

# 1. Clone the repo
git clone <repo>
cd secscan

# 2. Create an isolated environment
pyenv local 3.11.9            # or any 3.11+
python -m venv .venv

# 3. Install secscan plus the scanner extras you need:
.venv/bin/pip install -e ".[dev,sast,deps]"
#   - "sast" pulls in semgrep
#   - "deps" pulls in pip-audit
#   - "dev"  pulls in pytest, ruff, mypy
#
# Or, for a minimal install with no Python-side scanners:
# .venv/bin/pip install -e .

External binaries — not installed by pip:

gitleaks (required for secscan secrets): brew install gitleaks, or download from https://github.com/gitleaks/gitleaks/releases.
npm (required for secscan deps on npm projects): comes with Node.js (≥ v7; v6 is rejected explicitly with a clear error).
pnpm (required for secscan deps on pnpm projects): see https://pnpm.io/installation.

The scanner reports a precise, install-aware error if a required tool is missing — it never silently exits 0.

Quick start

# Scan everything secscan knows about (parallel by default — Phase 2-X).
secscan all --path .

# Force serial execution when SIGINT responsiveness matters or for
# easier debugging. Output is byte-identical between modes.
secscan all --path . --no-parallel

# Tune the parallel thread pool (default: min(cpu_count, scanners, 8)).
secscan all --path . --max-workers 4

# One scanner at a time.
secscan secrets --path .
secscan deps    --path .
secscan sast    --path .

# Custom failure threshold (default: high).
secscan deps --fail-on critical

# Disable the baseline for this run.
secscan deps --no-baseline

# Single-line, CI-friendly output.
secscan all --quiet

# Machine-readable output formats.
secscan all --format json                   # secscan-json v1 to stdout
secscan all --format sarif > scan.sarif     # SARIF 2.1.0 to file via shell
secscan all --format sarif --output scan.sarif

# Include baseline-suppressed findings in SARIF (default: excluded).
secscan all --format sarif --sarif-include-suppressed

Output formats

Format	Use case
`text`	Default. Human-readable terminal report. Color when stdout is a TTY.
`json`	secscan-json v1. Stable structured schema with counts, errors, suppressed.
`sarif`	SARIF 2.1.0 with per-scanner runs. Ready for GitHub Code Scanning upload.

--quiet is text-only; combining it with --format=json or --format=sarif is a CLI error (the structured formats already produce single-document output that's safe to parse).

SARIF caveats worth knowing:

One run per scanner that actually executed. Skipped scanners are not emitted as empty success runs — uploading those would mark previously- reported alerts on the missing scanner as fixed.
Baseline-suppressed findings are excluded by default (GitHub Code Scanning doesn't reliably honor SARIF suppressions). Opt in with --sarif-include-suppressed for archive / non-GitHub viewers.
Finding.raw and Finding.raw_fingerprint are never in the output (the latter can embed upstream-tool paths that bypass secscan's path-stripping).
No source snippet or artifact contents is emitted.


### Exit codes

| Code | Meaning                                                          |
| ---- | ---------------------------------------------------------------- |
| 0    | scan completed; findings at-or-above `--fail-on` were suppressed by baseline or none existed |
| 1    | scan completed; one or more findings crossed the `--fail-on` threshold |
| 2    | scan did **not** complete (tool failure, missing config, missing required lockfile, malformed baseline, etc.) |
| 130  | interrupted by user (SIGINT)                                     |

Codes 1 and 2 never overlap. CI should treat 2 as "inconclusive" — never
as "clean" — because something prevented the scan from running fully.
`secscan all` upgrades exit code from 0 → 2 when a scanner errored or when
a registered scanner wasn't actually run (e.g. missing tool that wasn't
explicitly skipped).

### Severity normalization

All scanners emit findings on the same scale:

`CRITICAL > HIGH > MEDIUM > LOW > INFO > UNKNOWN`

`UNKNOWN` is what we use when an upstream tool gives us no severity at
all — most pip-audit findings, for instance, since pip-audit doesn't
include severity in its JSON output. Whether `UNKNOWN` participates in
the `--fail-on` threshold is configurable per scanner; see
[Configuration](#configuration).

`--fail-on=none` (alias for the sentinel `NEVER`) means "never fail the
build" — useful for purely-informational runs.

## Configuration: `.secscan.toml`

Drop a `.secscan.toml` at the project root. secscan also walks parents.

```toml
[scan]
fail_on = "high"               # critical | high | medium | low | none
skip = []                      # e.g. ["sast"] to skip a scanner in `all`
# Per-scanner timeouts live under [deps] / [sast] / [secrets] — there is
# no whole-run timeout in MVP.

[scan.severity_unknown_policy]
# How findings with severity=UNKNOWN are treated, per scanner:
#   "warn"   – counted in display but never cross the threshold
#   "fail"   – treated as the fail-on threshold (i.e. they DO cross)
#   "ignore" – still displayed, but excluded from threshold check
deps    = "warn"
sast    = "warn"
secrets = "fail"

[deps]
allow_missing_lockfile  = false  # error when npm/pnpm has no lockfile
ignore_dev_dependencies = false  # adds --omit=dev (npm) / --prod (pnpm)
timeout_seconds         = 300

[sast]
# Each entry becomes a separate `--config` flag to semgrep.
# Default-safe set: registry shorthand (p/..., r/...) AND paths under
# the scan root. Any other value (arbitrary URL, out-of-tree absolute
# path) is REJECTED unless `allow_unverified_configs = true` below.
semgrep_config = ["p/python", "p/javascript", "p/typescript", "p/owasp-top-ten"]
allow_unverified_configs = false  # opt-in for arbitrary URLs / outside paths
timeout_seconds = 900

[secrets]
timeout_seconds = 300
# Note: secret redaction is mandatory; there is no opt-out switch.

[baseline]
path = ".secscan/baseline.json"  # resolved relative to THIS file's dir
default_expiry_days = 90

# Per-scanner severity overrides for noisy/critical rules.
# Keys are rule_ids; values follow the severity scale (no `none`).
[severity_overrides.secrets]
"aws-access-token" = "CRITICAL"

All keys are typed and unknown keys raise an error. A typo'd severity_overide (missing r) is rejected loudly rather than silently doing nothing.

Baseline workflow

Findings you decided to accept are written to a JSON baseline:

# Acknowledge every current finding (use sparingly).
secscan baseline accept --all --reason "initial baseline, tracked in TICKET-123"

# Acknowledge specific findings by fingerprint.
secscan baseline accept --fingerprint abc123 --fingerprint def456 \
    --reason "false positive: confirmed test fixture"

# List current entries.
secscan baseline list

# Remove expired entries (the file is never auto-pruned).
secscan baseline prune

Each baseline entry records accepted_by, reason (non-empty), added_at, expires_at, secscan_version, and a raw_fingerprint when the upstream tool provided one. The CLI refuses to:

write the baseline in CI (SECSCAN_CI=1).
accept any finding when any scanner errored during the scan — a partial scan must not produce a baseline that hides real findings.
accept fingerprints that aren't in the current scan output.

Expired entries do not auto-suppress — they re-surface as warnings on the next run so the lapse is loud, not silent.

CI integration

GitHub Actions example:

- name: Set up Python
  uses: actions/setup-python@v5
  with:
    python-version: "3.11"

- name: Install secscan + scanners
  run: |
    pip install -e ".[sast,deps]"
    brew install gitleaks  # macOS runners

- name: Run secscan
  env:
    SECSCAN_CI: "1"  # blocks `baseline accept` from running
  run: secscan all --quiet --fail-on high

Exit 1 → step fails. Exit 2 → step fails (and the log shows the partial scan warning so the operator knows why).

DAST (OWASP ZAP)

The dast subcommand runs an OWASP ZAP baseline scan against a live HTTP target via Docker. It is opt-in: secscan all only includes DAST when dast.target is set in .secscan.toml (or --target was passed on the CLI).

# Smoke a staging deployment.
secscan dast \
  --target https://staging.example.com/ \
  --zap-image zaproxy/zap-stable@sha256:<verified-digest> \
  --format sarif --output zap.sarif

Required CLI flags / config:

Flag	Equivalent config key	Purpose
`--target <URL>`	`dast.target`	HTTP/HTTPS URL to probe (mandatory).
`--zap-image <ref>`	`dast.image`	OCI image ref with `@sha256:` digest pinning.
`--ajax-spider`	`dast.ajax_spider`	Enable ZAP's AJAX spider (slower; JS-heavy targets).
`--zap-config-file`	`dast.config_file`	ZAP context file path inside the container.
`--zap-network`	`dast.network_mode`	`bridge` (default) or `host`.
`--auth-header`	`dast.auth_headers`	HTTP header injected into every ZAP request (Phase 2-K, repeatable).

Hard requirements baked into the implementation:

Image digest pinning is mandatory. An image without @sha256:<64 hex> is rejected before docker is invoked, and a leading - is also rejected to make argv injection structurally impossible.
Argv shape is fixed. docker run … -- <image> <cmd> — the -- separator is always present so the image value cannot be flag-interpreted under any future refactor.
No host leakage in external output. SARIF / JSON / text reports emit a normalized relative URI dast/<urlencoded-path>; the target host is never echoed into location.uri.
--cap-drop=ALL + --network=bridge by default. Operators can opt into --network=host explicitly when targeting a service that is only bound to the host namespace.
Findings are deduped on (pluginid, path, query-keys, param) and carry a coarse (pluginid, path) alias, so a single baseline accept suppresses both the param-bearing and param-less variants of the same advisory.

Authenticated DAST (Phase 2-K)

Many real-world bugs only surface behind a login. --auth-header forwards a static HTTP header — typically a JWT bearer token your test harness obtained out of band — into every request ZAP sends:

# 1. Obtain a token from your auth flow (curl / your test rig).
TOKEN=$(curl -s -X POST https://staging.example.com/login \
  -H 'content-type: application/json' \
  -d '{"email":"qa@example.com","password":"…"}' | jq -r .token)

# 2. Hand it to secscan dast.
secscan dast \
  --target https://staging.example.com/ \
  --zap-image zaproxy/zap-stable@sha256:<verified-digest> \
  --auth-header "Authorization: Bearer $TOKEN"

--auth-header is repeatable — pass it multiple times for multi-header auth schemes (e.g. Authorization: Bearer … + X-Tenant: acme). Internally each header becomes one entry in ZAP's replacer.full_list config; secscan single-quotes every key=value pair so values containing spaces (every Bearer token) survive ZAP's whitespace tokenisation.

Header value rules (enforced by the validator, all rejections produce a clear DastInputError at argv-build time):

Must contain : between name and value.
Name must be a RFC 7230 token — =, ,, ( and other ZAP -z syntax sigils are rejected so the value cannot break out of the replacer key/value position.
Value must be non-empty, printable, and free of CR / LF (classic header smuggling defence) and single quotes (we use single quotes to wrap the key=value pair, so an embedded ' would close the wrap early).
--auth-header values flow through secscan's redactor in logs; the raw token is never written to stdout or the report.

ZAP image digest rotation

The pinned default in src/secscan/scanners/dast/_pinned.py ships with an all-zero digest sentinel — the scanner will run, but Docker will refuse to pull the image. Operators are expected to pin a verified digest the first time they enable DAST:

docker pull zaproxy/zap-stable:2.15.0
docker inspect --format='{{index .RepoDigests 0}}' zaproxy/zap-stable:2.15.0
# zaproxy/zap-stable@sha256:<digest>

Set the verified digest under [dast].image in .secscan.toml (or pass it via --zap-image).

Container image scan (Trivy, Phase 2-M)

secscan image scans one or more built OCI images for known CVEs in OS packages (alpine/debian apt/apk/yum DBs) AND in language packages embedded in the image (npm/pip/gem/etc.). It catches the class of vulnerability that the source-tree scanners (deps/sast/secrets) cannot see — for example, an old openssl shipped in your base image even though your requirements.txt itself is clean.

# Scan a single image. Digest pin (@sha256:...) is mandatory.
secscan image \
  --image alpine@sha256:451eee8bedcb2f029756dc3e9d73bab0e7943c1ac55cff3a4861c52a0fdd3e98 \
  --format sarif --output image.sarif

# Or via config + secscan all
# .secscan.toml:
# [image]
# refs = [
#   "alpine@sha256:451eee...",
#   "my-corp/api@sha256:abcdef...",
# ]
# platform = "linux/amd64"   # default

Required CLI flags / config:

Flag	Equivalent config key	Purpose
`--image <ref>`	`[image].refs`	target OCI image (repeatable). Digest pinning is mandatory.
`--trivy-image <ref>`	`[image].image`	OCI image ref of the Trivy scanner container.
`--platform <os/arch>`	`[image].platform`	docker `--platform` (default `linux/amd64`).

Hard requirements baked into the implementation:

Both image refs are digest-pinned. The Trivy scanner image AND every target image must be <repo>[:tag]@sha256:<64 hex>. A bare alpine:3.10 is rejected with a clear error before docker is invoked. This is the same posture as DAST (Phase 2-D) and config (Phase 2-L) — secscan never invokes docker against a mutable tag.
--platform is forced on both layers (docker + Trivy CLI). Without an explicit platform, a multi-arch OCI index digest resolves to different per-arch manifests on different hosts, silently changing what got scanned. We default to linux/amd64; override per-deployment via config.
--cap-drop=ALL --security-opt=no-new-privileges always.
--network=bridge is mandatory here. Trivy must reach the registry to pull the target image; --network=none is impossible in image mode. The DAST scanner made the same trade-off. host networking is NOT allowed.
Opt-in like DAST. secscan all only runs the image scanner when [image].refs is non-empty (or --image is on the CLI). secscan image with zero refs fails loud (exit 2) rather than exiting 0 with no findings — a false-green that would let CI report "image scan clean" when nothing was actually scanned.
Findings are deduped on (CVE-ID, package, version, location). Scanning two images that ship the same vulnerable package produces ONE finding, not two — so a baseline accept on the CVE silences it across all targets at once.

Deterministic vs online Trivy DB

By default, every secscan image invocation pulls the latest Trivy vulnerability DB from ghcr.io. For reproducible bench runs (and any offline CI runner), bench/run.py --image-bench pre-seeds a named docker volume with the DB once and then mounts it read-only + --skip-db-update for each scan. The end-user CLI doesn't expose a cache-volume flag — it's a bench/CI plumbing concern, not an everyday operator setting.

SBOM scan (Syft + Grype, Phase 2-N)

secscan sbom covers the gap between deps (lockfile-only) and image (built-image-only): it can scan a local directory (e.g. a pip install-ed venv), an OCI image in a registry, or an already-existing SBOM file (CycloneDX / SPDX JSON). Two containers run in sequence — Syft generates the SBOM, Grype matches it against its vulnerability DB — connected by a short- lived named docker volume.

# Scan a venv on disk (Syft + Grype pipeline)
secscan sbom \
  --target /opt/myapp/.venv \
  --syft-image anchore/syft@sha256:... \
  --grype-image anchore/grype@sha256:...

# Scan an OCI image (--platform forwarded to Syft for multi-arch index)
secscan sbom \
  --target alpine@sha256:451eee...

# Scan an existing CycloneDX SBOM file (skips Syft, runs only Grype)
secscan sbom --target ./inventory/sbom.cdx.json

# Multiple targets — finding-level dedup keeps the report clean
secscan sbom \
  --target /opt/app1 \
  --target alpine@sha256:... \
  --target ./suppliers/vendor-x.cdx.json

CLI flags / config:

Flag	Equivalent config key	Purpose
`--target <T>`	`[sbom].targets`	directory / image ref / SBOM file (repeatable).
`--syft-image <ref>`	`[sbom].syft_image`	digest-pinned Anchore Syft image.
`--grype-image <ref>`	`[sbom].grype_image`	digest-pinned Anchore Grype image.
`--platform <os/arch>`	`[sbom].platform`	platform passed to Syft for image targets (default `linux/amd64`).
`--unsafe-allow-targets-outside-scan-root`	(CLI only)	bypass scan-root confinement for CLI targets. Does NOT affect config targets (security).

Hard requirements / security pins:

Path targets are confined to the scan root by default. A config target outside the scan root is rejected; the --unsafe-allow-targets-outside-scan-root flag is CLI-only and unconfines ONLY CLI targets. There is no config escape hatch — this is a deliberate split so an attacker-controlled .secscan.toml cannot couple with a CLI flag to bind-mount /etc (or any other host path) into the Syft container.
OCI image refs are digest-pinned. alpine:3.10 is rejected; alpine@sha256:<64 hex> is required. Same posture as DAST and image scanners.
2-step pipeline uses a labeled named volume. Each run creates secscan-sbom-<32 hex> via secrets.token_hex(16), cleans it up in try/finally. A SIGKILL or host crash mid-scan can leave the volume orphaned; sweep periodically with docker volume prune -f --filter label=secscan-tmp=1.
SBOM size cap (32 MiB) applies to both operator-supplied SBOM files AND Syft's output, so a hostile SBOM cannot OOM Grype.
Top-level symlinks refused. A --target that's a symlink is rejected outright (defence in depth against scan-root escape via symlink swap).

Cross-engine comparison with `image`

secscan image (Trivy) and secscan sbom (Grype) can scan the same OCI image and produce overlapping findings. This is by design — the two engines use different advisory DBs and CPE matching strategies, so a CVE present in one and absent in the other is a meaningful signal worth investigating. secscan keeps both findings (scanner name is part of the fingerprint), so a baseline accept on one does NOT silence the other.

OpenAPI fuzzing (Schemathesis, Phase 2-O)

secscan apifuzz runs Anchore Schemathesis against a live API. Schemathesis reads your OpenAPI spec and auto- generates property-based test cases that exercise:

Server crashes (5xx responses → not_a_server_error, HIGH)
Spec drift (response shape doesn't match the schema → response_schema_conformance / status_code_conformance, MEDIUM)
Auth bypasses (endpoints accept anonymous requests despite the spec marking them as protected → ignored_auth, HIGH)
Resource lifecycle bugs (use-after-free on deleted ids → use_after_free, HIGH)
Input validation gaps (malformed/extreme inputs the spec marks invalid get accepted → negative_data_rejection, MEDIUM)

This is the business-logic / input-validation complement to DAST (Phase 2-D, HTTP-layer ZAP probes) — the two are deliberately separate scanners because the bug classes barely overlap.

# Schema from a live URL (Schemathesis fetches it)
secscan apifuzz \
  --api-url https://staging.example.com/api/v3 \
  --schema https://staging.example.com/api/v3/openapi.json

# Schema from a local file (bind-mounted RO)
secscan apifuzz \
  --api-url https://staging.example.com/api/v3 \
  --schema ./openapi.yaml \
  --auth-header "Authorization: Bearer $JWT"

# Active mode (DESTRUCTIVE — see "active mode" below)
secscan apifuzz \
  --api-url https://staging.example.com/api/v3 \
  --schema ./openapi.yaml \
  --mode active --allow-active \
  --max-examples 50 --seed 42

Required flags / config:

Flag	Equivalent config key	Purpose
`--api-url <URL>`	`[apifuzz].api_url`	live API base URL. No query/fragment/userinfo.
`--schema <P>`	`[apifuzz].schema`	OpenAPI source: `http(s)://` URL or local file.
`--mode {baseline,active}`	`[apifuzz].mode`	`baseline` = GET/HEAD/OPTIONS; `active` = all methods.
`--allow-active`	(CLI-only — never config)	Second opt-in for `--mode=active`.
`--auth-header "Name: Value"`	`[apifuzz].headers`	repeatable. Same validator as Phase 2-K DAST.
`--max-examples N`	`[apifuzz].max_examples`	Hypothesis test cases per operation (default 25).
`--seed N`	`[apifuzz].seed`	Fixed seed for reproducible runs.

Active mode requires a second opt-in

--mode=active enables fuzzing of mutating methods (POST / PUT / PATCH / DELETE). Schemathesis will create users, orders, and other side-effect-bearing records, and may DELETE existing ones if the spec lists those endpoints.

To prevent an attacker-controlled .secscan.toml from silently turning on destructive fuzzing, allow_active cannot be set from config. The operator must additionally pass --allow-active on the CLI. mode = "active" in config + no --allow-active on the CLI → exit 2 with a clear error before any docker call.

Do NOT point active mode at production.

Security pins baked in

Schema FILE targets are confined to the scan root (config-origin always, CLI-origin unless --unsafe-allow-schema-outside-scan-root is set). Mirrors Phase 2-N's per-origin discipline.
--max-redirects 0 is mandatory — Schemathesis cannot follow 3xx redirects out of the operator-declared --api-url scope.
The parser checks each request URI in the NDJSON report against --api-url scheme+host. URIs outside the scope surface as warnings (visible even when the scenario "passed" the API's checks) so the operator notices if a schema's servers: list caused Schemathesis to touch a different host.
--generation-database :memory: keeps Hypothesis examples in memory only; bench runs are reproducible given a fixed seed.
--output-sanitize true masks token-shaped values in Schemathesis's own console output.
2-step pipeline uses a named volume secscan-apifuzz-<32 hex> with try/finally cleanup. Sweep orphaned volumes (SIGKILL recovery) with docker volume prune -f --filter label=secscan-tmp=1.

IAST harness (pyrasp, Phase 2-P)

secscan iast is a runtime IAST test harness. Unlike every other secscan scanner, it does NOT wrap a docker container — it spawns your application as a subprocess (under your shell credentials), sends a curated set of canary HTTP probes, and parses the pyrasp event log your app produces.

# Smoke a Flask app instrumented with pyrasp.
secscan iast \
  --command "python -m flask --app app run --host=127.0.0.1 --port=5050" \
  --probe-url http://127.0.0.1:5050 \
  --pyrasp-log /path/to/scan-root/.secscan-pyrasp.json

Operator setup (one-time)

secscan does NOT install or configure pyrasp. The harness expects your app to be already instrumented:

# requirements.txt — pin the version pin in src/secscan/scanners/iast/_pinned.py
# for cross-engine compatibility with the bench parser fixture.
pyrasp==0.8.0

# app.py — instrument before any route handler runs.
import os
import pyrasp
from flask import Flask

app = Flask(__name__)
pyrasp.init(
    app,
    conf={
        # secscan's harness sets these env vars at spawn time.
        "log_file": os.environ.get("SECSCAN_PYRASP_LOG", "/tmp/pyrasp.json"),
        # Record the run-id header so secscan can filter for the
        # current run's events.
        "log_headers": ["X-Secscan-Run-Id", "X-Secscan-Probe-Id"],
        # Don't actually block requests — we just want event
        # detection. Production deployments may want enforce=True
        # but that's outside secscan's concern.
        "enforce": False,
    },
)

The harness sets two environment variables when spawning the app:

SECSCAN_PYRASP_LOG — the path passed via --pyrasp-log. Your app should write pyrasp events here. The path MUST be inside the scan root and MUST NOT pre-exist (stale events would poison the parse).
SECSCAN_RUN_ID — a 32-hex-char identifier the harness also injects as X-Secscan-Run-Id on every probe request. The parser keeps ONLY events tagged with this run-id (Codex Phase 2-P design review MUST-FIX #5 — defends against stale event log pollution).

CLI flags

Flag	Purpose
`--command <argv>`	Shell-style command to spawn (shlex.split + Popen shell=False). CLI-only.
`--probe-url <URL>`	Loopback-only base URL to probe. Hostnames resolve to a literal IP at validation time (DNS-rebind defence). CLI-only.
`--pyrasp-log <PATH>`	Path the app writes events to. Confined to scan root; must not pre-exist. CLI-only.
`--allow-risky-probes`	Opt in to AWS IMDS / time-based blind SQLi / sleep-based RCE payloads. Default off. CLI-only.
`--app-ready-timeout N`	How long to wait for the app to accept TCP on the probe URL (default 60s).

Security architecture (why IAST is uniquely CLI-only)

The IAST harness is the only secscan scanner that:

Spawns operator code under operator credentials. Every other scanner runs in a hardened docker container with --cap-drop=ALL and a read-only mount. IAST runs your app.
Has NO config-file entry point. Writing [iast] in your .secscan.toml is a hard error (exit 2, "CLI-only"). A tampered config file cannot inject [iast].command = "rm -rf /" — the config parser refuses to populate ProjectConfig.iast from disk.
Is excluded from secscan all. Even a fully-populated [iast] block in config + correct CLI args would not pull IAST into secscan all. The only legal entry point is secscan iast directly. This makes IAST runs an interactive operator decision, never an automated CI step that could be triggered by a PR.
Pins the probe URL at validation time. Hostnames like localhost are resolved once to a literal IPv4/IPv6 loopback address; subsequent probes connect to the IP, not the hostname. This closes the DNS-rebind TOCTOU window (Codex Phase 2-P diff review MUST-FIX).
SIGKILLs the process group, not just the leader. Flask's reloader and gunicorn workers are descendants of the spawned process. The harness uses start_new_session=True + killpg so SIGTERM → grace → SIGKILL hits the whole tree. The SIGKILL step runs unconditionally even if the leader has already exited (Codex Phase 2-P diff review MUST-FIX).

Probe payloads

10 default "safe" canary payloads (SQLi, XSS, RCE, SSRF, traversal, NoSQLi). All are non-destructive — no DROP TABLE, no cat /etc/passwd, no IMDS endpoints. Three additional "risky" payloads (IMDS, blind sleep) are gated behind --allow-risky-probes.

These are canary probes for event triggers, not fuzzing. secscan's value here is the harness (subprocess lifecycle + probe HTTP + pyrasp event correlation), not the breadth of attack patterns — if you need real-world adversarial fuzzing, use secscan apifuzz (Phase 2-O) for HTTP-layer fuzzing.

Detection-rate benchmark

bench/run.py measures how much of a curated known-vulnerable corpus secscan detects and how it compares to single-tool baselines.

.venv/bin/python bench/run.py            # runs every available scanner

Latest results on this machine (re-run locally for an up-to-date snapshot; see bench/report.md for the full table):

Curated fixtures (in-sample — we author these, so 100% is the

expected ceiling)

Scanner	Fixture	Recall	vs single tool
deps	npm-vulnerable	100%	= npm audit ✅
deps	pip-vulnerable	100%	= pip-audit (deduped) ✅
sast	python	100%	≥ semgrep ✅
sast	javascript	100%	= semgrep ✅
secrets	synthetic	100%	= gitleaks ✅
dast	juice-shop	100%	= zap-baseline ✅

External benchmarks (third-party, out-of-sample — added in

Phase 2-I to surface over-fit)

Scanner	Fixture	Recall	Notes
dast	webgoat	100% (6/6)	second DAST target validates lifecycle generalises
external/secrets	gitleaks-corpus	100% (2/2)	parity with gitleaks on its own testdata
external/sast	pygoat	100% (9/9)	Python OWASP Top 10
external/sast	nodegoat	80% (8/10)	JS XSS (.ejs templates) + broken auth still escape

Overall: 45/47 = 95.7% across 11 benchmarks, false positives: 0, ≥ best single tool: 4/4. Run with --dast --external to include all third-party benchmarks (requires docker + git; adds 5-10 minutes for the clone-and-scan cycle).

The 80% on NodeGoat is the headline honest number — secscan catches 8 of 10 expected CWE categories on a benchmark we did NOT design or tune to. The two misses (CWE-79 in .ejs templates, CWE-287 broken authentication) reflect genuine limits of static analysis on those patterns, not gaps in the scanner integration layer.

Phase 2-F + 2-G tuning (post-initial benchmark) lifted recall from 54.5% → 86.7% → 100% by:

p/default ruleset added to the default semgrep family (JavaScript SAST 0% → 100%; Python SAST 25% → 50%).
gitleaks tempfile fix — switched from --report-path=/dev/stdout (which gitleaks refuses on macOS) to a 0700 tempfile the scanner manages with 0600 chmod + try/finally cleanup. A real cross-platform bug the bench surfaced.
pip-audit comparison dedup by (package, advisory_id) (pip-audit ships some advisories twice from different source DBs; secscan dedups, so a fair comparison must too).
Bundled secscan rules (secscan:extra sentinel) shipping under src/secscan/rules/ — Python yaml.load without SafeLoader (CWE-502), hard-coded credentials in named variables (CWE-798), MongoDB $where-template-string injection (CWE-943, NodeGoat A1), and template-engine auto-escape disabled (CWE-79, NodeGoat A3 — added post- v0.17.0 after the comprehensive bench surfaced the gap). The sentinel passes the _reject_unsafe_configs safety gate (it points at our own code, not user input) but is otherwise treated like any path-based ruleset. Lifted Python SAST recall 50% → 100% and NodeGoat external SAST 80% → 100%.

What's still scope-limited rather than a recall miss:

Real-codebase FP rate: the curated safe_* borderline fixtures stay clean, but they don't represent the full diversity of real source trees. Adding p/security-audit for even broader CWE coverage is opt-in via [sast].semgrep_config.
CWE-287 (Broken Authentication) is fundamentally out of scope for SAST — the lab patterns (session management, account lockout, password reset flow) are architectural and not detectable by pattern matching. NodeGoat's bench/fixtures/external/nodegoat/expected.json lists CWE-287 under out_of_scope_cwes with a pointer to the actual coverage paths:
- Dynamic auth-bypass sub-classes (CWE-306, 307, 384, 565) — covered by Phase 2-J DAST active mode (ZAP plugin 40038 "Bypassing 403", implemented in bench/fixtures/dast/juice- shop/expected.json expected_active_findings) and Phase 2-P IAST harness (pyrasp ignored_auth check).
- Pure-policy sub-classes (CWE-521 password complexity, CWE-613 session timeout) — remain undetectable by every scanner (no code pattern, no runtime observation surfaces them).

Phase 2-H added the DAST measurement (bench/run.py --dast). Lifecycle: bring up OWASP Juice Shop on 127.0.0.1:3000 → secscan dast internally orchestrates a docker volume + Alpine helper chown + zap-baseline.py scan + report extraction → compare detected ZAP pluginids against bench/fixtures/dast/juice-shop/ expected.json. The --dast flag is opt-in because the full lifecycle takes 2-4 minutes; the fast bench/run.py (no flag) skips it.

See bench/README.md for the full methodology, retraction policy, and how to add new fixtures.

Security posture

A few invariants worth knowing about if you're auditing the tool itself:

Subprocess execution. Every external tool is invoked via subprocess.run(shell=False) with an argv list — no shell interpolation.
Secrets never enter secscan's address space. gitleaks is started with --redact=100. The Finding.raw payload force-overrides Secret/Match fields with [REDACTED] even if the upstream tool somehow returned them unredacted. Author / Email / Commit / Message fields are dropped via a whitelist.
Scanner output paths are re-verified. A scanner that reports /etc/passwd or ../../etc/passwd has the path stripped before any Finding is rendered.
stderr / messages are redacted before truncation. AWS keys, GitHub tokens, JWTs, npm _authToken= lines, URL basic-auth credentials, PyPI tokens, and pip index-url= lines are scrubbed.
semgrep_config is gated. Only registry shorthand (p/, r/) and paths under the scan root are accepted by default. Arbitrary URLs and out-of-tree absolute paths require explicit opt-in ([sast].allow_unverified_configs = true). An untrusted PR that modifies .secscan.toml cannot point semgrep at a malicious ruleset.
Baseline tamper resistance. Malformed baseline files raise BaselineError, exit 2; a corrupt baseline cannot silently disable suppression. Entries match on the full (fingerprint, scanner, rule_id) tuple, so a hash collision across scanners cannot silence the wrong finding.

For a deeper look, the source modules carry inline rationale tied to the specific Codex review iteration that motivated each invariant.

Status

Phase	Scope	Status
1A	Common base + `secrets` (gitleaks)	done
1B	`deps` (npm / pnpm / pip-audit)	done
1C	`sast` (semgrep)	done
1D	docs + final review	done (v0.1.0)
2-A	JSON / SARIF output	done (v0.2.0)
2-B	monorepo / workspaces (pnpm + npm)	done (v0.3.0)
2-C	uv per-member audit + Yarn Berry workspaces	done (v0.4.0)
2-D	DAST (OWASP ZAP, Docker)	done (v0.5.0)
2-E	detection-rate benchmark (bench/)	done (v0.6.0)
2-F	tune defaults to improve bench recall (54.5% → 86.7%)	done (v0.7.0)
2-G	bundled secscan semgrep rules (recall 86.7% → 100%)	done (v0.8.0)
2-H	DAST 自動計測 (Juice Shop + docker volume lifecycle)	done (v0.9.0)
2-I	外部 benchmark (NodeGoat / PyGoat / WebGoat / gitleaks corpus)	done (v0.10.0)
2-J	ZAP active scan opt-in (`--mode=active` + `--dast-active`)	done (v0.11.0)
2-L	Trivy config scan (IaC/k8s/Docker/Helm)	done (v0.12.0)
2-K	ZAP auth-flow via HTTP header injection (`--auth-header`)	done (v0.13.0)
2-M	container image CVE scan (`secscan image`, Trivy image mode)	done (v0.14.0)
2-N	SBOM-based CVE scan (`secscan sbom`, Syft + Grype 2-step pipeline)	done (v0.15.0)
2-O	OpenAPI fuzzing (`secscan apifuzz`, Schemathesis)	done (v0.16.0)
2-P	IAST harness (`secscan iast`, pyrasp-aware)	done (v0.17.0)
2-W	Windows full support (cross-platform: Linux + macOS + Windows)	done (v0.18.0)
2-Q	Supply chain integrity (`secscan supply`, cosign + lockfile self-consistency)	done (v0.19.0)

Development

.venv/bin/pytest               # 648 unit tests + 2 integration (skipped without the binaries)
.venv/bin/ruff check src/ tests/ bench/
.venv/bin/mypy --strict src/secscan
.venv/bin/python bench/run.py  # detection-rate benchmark (see bench/README.md)

Integration tests against real gitleaks / semgrep are gated by pytest.mark.integration and skip cleanly when the binary is not on PATH. They produce a useful smoke check during development; CI may choose to install the tools and run them, or skip.

Monorepos / workspaces

secscan deps understands the four most-common JavaScript / Python workspace configurations and audits each member separately:

Workspace format	Detection	Per-member audit
pnpm	`pnpm-workspace.yaml` (`packages:` glob, `!` excludes supported)	`pnpm audit --filter <name>`
npm	`package.json#workspaces` (array or `{packages: [...]}`)	`npm audit --workspace <name>`
uv (Python)	`[tool.uv.workspace] members` + `exclude` in `pyproject.toml`	`uv export --locked --no-emit-local --package <name>` → pip-audit on the resulting requirements file (Phase 2-C-1)
yarn (Berry)	`package.json#packageManager: "yarn@2+"` / `__metadata:` in yarn.lock / `.yarnrc.yml`	`yarn workspace <name> npm audit --json --recursive` (Phase 2-C-2)
yarn (Classic)	`# yarn lockfile v1` header	unsupported, warning emitted (upgrade to Berry or switch to npm/pnpm)

Properties that make this safe for monorepos:

The audit always runs from the repo root so the authoritative root lockfile is used; --workspace / --filter scope the result to one member.
Each member gets a distinct fingerprint (deps-ws: prefix + member name). The same lodash advisory in packages/api and packages/web produces two separate baseline entries — accepting one does NOT silence the other.
File paths in findings are rewritten to repo-root-relative form (e.g. packages/api/src/leak.py), so reports stay consistent regardless of which member the scanner ran in.
Symlinked workspace members, glob patterns with .. / absolute paths / URI schemes, and member name strings containing pnpm selector grammar (!, *, ^, ~, ...) are all refused with a warning rather than included.
An empty or malformed workspace config does NOT silently disable the repo-root scan — single-project deps detection still runs.
Workspace member counts above 50 emit an informational warning; above 100 the list is truncated with a separate warning.

Secrets and SAST scanners are not workspace-aware; they always scan the whole repo root as a single unit (file content doesn't follow ecosystem boundaries).

Cross-tool baseline compatibility

If you migrate between package managers (e.g. yarn → pnpm, npm → yarn Berry), an existing baseline keeps working: secscan picks the canonical advisory identifier in the same order across npm / pnpm / yarn — GHSA → CVE → URL → numeric id — so the same advisory in any of those reports hashes to the same fingerprint.

Threat model note for Yarn Berry

.yarnrc.yml may set yarnPath to an arbitrary JavaScript file that yarn then executes. secscan treats the yarn CLI (and the binary it points at) as trusted, same way it trusts npm, pnpm, pip-audit, and semgrep. Do not run secscan deps against an untrusted project root.

Windows support (Phase 2-W)

secscan v0.18.0 runs natively on Linux, macOS, and Windows. The pip install shun-secscan flow is identical across OSes; CI tests all three via a GitHub Actions matrix.

What works the same on every OS

All 9 scanners (secrets, deps, sast, dast, config, image, sbom, apifuzz, iast).
Docker bind mounts: host paths are auto-converted to the /c/Users/... Unix-style form Docker Desktop expects on Windows (secscan.portability.to_docker_host_path). The operator passes a native Windows path; secscan transforms it.
Loopback URL validation accepts localhost, 127.0.0.1, [::1] on every OS.
gitleaks, semgrep, pip-audit, uv, npm, yarn — every external CLI secscan ships against has a Windows binary.

IAST on Windows: best-effort cleanup

The IAST harness (secscan iast) has a Windows-specific asymmetry, by design:

POSIX (Linux/macOS): spawn-time start_new_session=True
- cleanup-time os.killpg(pgid, SIGTERM) → grace → SIGKILL. The process group is killed atomically — Flask's reloader and gunicorn workers cannot orphan-survive.
Windows: spawn-time CREATE_NEW_PROCESS_GROUP + cleanup- time psutil.children(recursive=True) + terminate() (which on Windows is TerminateProcess, the OS-level force-kill — NOT SIGTERM-equivalent). The harness walks the parent- child tree and kills each process individually. A descendant that gets parent_pid=0 (detached, e.g. via the DETACHED_ PROCESS flag) can survive the cleanup.

In practice this means:

Regular Flask / FastAPI apps shut down cleanly on Windows (the reloader child is a normal descendant).
Apps that intentionally detach themselves (Windows services, uvicorn workers spawned via spawn start method) may leak on Windows. Use POSIX (Linux/macOS, or WSL2 on Windows) for those scenarios.

A Windows Job Object-based hard boundary is a possible future enhancement; v0.18.0 deliberately accepts the best-effort semantics to keep the scanner adapter simple.

Path conventions

secscan accepts Windows paths in their native form (C:\Users\foo) on the CLI and in .secscan.toml.
All Docker bind mount sites convert the path to /c/Users/foo before passing to docker -v.
as_posix() is used for display so reports remain consistent across OSes.

Cross-OS testing

# All three OSes are exercised in CI via:
# .github/workflows/ci.yml — matrix [ubuntu-latest, macos-latest, windows-latest]
# Unit tests cover the path-helper logic with simulated
# IS_WINDOWS=True / False so the Windows-branch code is
# exercised on POSIX hosts too.

Known limitations

No DAST yet — secscan dast is reserved for a future release.
npm v6 audit output is explicitly not parsed; the scanner rejects it with an instructive error telling the user to upgrade to npm v7+.
pip-audit cannot consume uv.lock / pdm.lock directly. The scanner errors out with a hint to uv export / pdm export to requirements.txt first. (uv workspaces are handled per member automatically via uv export --locked.)
Yarn Classic (v1) is unsupported; upgrade to Yarn Berry (v2+) or switch to pnpm / npm.

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

suetsugushunsuke

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.20.0

May 25, 2026

0.19.0

May 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

shun_secscan-0.20.0.tar.gz (240.8 kB view details)

Uploaded May 25, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

shun_secscan-0.20.0-py3-none-any.whl (276.6 kB view details)

Uploaded May 25, 2026 Python 3

File details

Details for the file shun_secscan-0.20.0.tar.gz.

File metadata

Download URL: shun_secscan-0.20.0.tar.gz
Upload date: May 25, 2026
Size: 240.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for shun_secscan-0.20.0.tar.gz
Algorithm	Hash digest
SHA256	`c4cc50a4b11f792a3ef5e6c8d13303edae067f169dca3e923c9a2488085b14ef`
MD5	`a2b6b2917c2fe92082877d90812cea24`
BLAKE2b-256	`78a143ef3650e8e9c70915b8986ffec4d893b17007be0c669f0d0ce43dff228a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for shun_secscan-0.20.0.tar.gz:

Publisher: publish.yml on shunsukesuetsugu-prog/secscan

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: shun_secscan-0.20.0.tar.gz
- Subject digest: c4cc50a4b11f792a3ef5e6c8d13303edae067f169dca3e923c9a2488085b14ef
- Sigstore transparency entry: 1628857767
- Sigstore integration time: May 25, 2026
Source repository:
- Permalink: shunsukesuetsugu-prog/secscan@43c408113a00ef3538d04dc8fc483d5f24ef37f6
- Branch / Tag: refs/heads/main
- Owner: https://github.com/shunsukesuetsugu-prog
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@43c408113a00ef3538d04dc8fc483d5f24ef37f6
- Trigger Event: workflow_dispatch

File details

Details for the file shun_secscan-0.20.0-py3-none-any.whl.

File metadata

Download URL: shun_secscan-0.20.0-py3-none-any.whl
Upload date: May 25, 2026
Size: 276.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for shun_secscan-0.20.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e1173d5f1d945b57534aa4909ff2b5f3b6bd315fc5bd322ae40bc9e254980389`
MD5	`783d675c3aacbf4ab9cbede819cc5a80`
BLAKE2b-256	`f7f304f422c877f6f8335c9a6d2f34e53cd164a1e0e54726e709bf6d1490a300`

See more details on using hashes here.

Provenance

The following attestation bundles were made for shun_secscan-0.20.0-py3-none-any.whl:

Publisher: publish.yml on shunsukesuetsugu-prog/secscan

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: shun_secscan-0.20.0-py3-none-any.whl
- Subject digest: e1173d5f1d945b57534aa4909ff2b5f3b6bd315fc5bd322ae40bc9e254980389
- Sigstore transparency entry: 1628857821
- Sigstore integration time: May 25, 2026
Source repository:
- Permalink: shunsukesuetsugu-prog/secscan@43c408113a00ef3538d04dc8fc483d5f24ef37f6
- Branch / Tag: refs/heads/main
- Owner: https://github.com/shunsukesuetsugu-prog
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@43c408113a00ef3538d04dc8fc483d5f24ef37f6
- Trigger Event: workflow_dispatch

shun-secscan 0.20.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

secscan

What's in the box

Install

Homebrew (macOS / Linux)

Scoop (Windows)

From PyPI (any platform with pip)

From source

Quick start

Output formats

Baseline workflow

CI integration

DAST (OWASP ZAP)

Authenticated DAST (Phase 2-K)

ZAP image digest rotation

Container image scan (Trivy, Phase 2-M)

Deterministic vs online Trivy DB

SBOM scan (Syft + Grype, Phase 2-N)

Cross-engine comparison with image

OpenAPI fuzzing (Schemathesis, Phase 2-O)

Active mode requires a second opt-in

Security pins baked in

IAST harness (pyrasp, Phase 2-P)

Operator setup (one-time)

CLI flags

Security architecture (why IAST is uniquely CLI-only)

Probe payloads

Detection-rate benchmark

Curated fixtures (in-sample — we author these, so 100% is the

External benchmarks (third-party, out-of-sample — added in

Security posture

Status

Development

Monorepos / workspaces

Cross-tool baseline compatibility

Threat model note for Yarn Berry

Windows support (Phase 2-W)

What works the same on every OS

IAST on Windows: best-effort cleanup

Path conventions

Cross-OS testing

Known limitations

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Cross-engine comparison with `image`