DAST + SAST secret scanner with live verification, source-map parsing, and CI-native reporting

Maintainers

m14r41

scan4secrets

Find leaked credentials in source trees, running web apps, and CI logs. Verify them live against vendor APIs. Output SARIF for code-scanning dashboards, JSONL for SOAR pipelines, or Excel/PDF/HTML for client reports.

Why scan4secrets

Existing tools (gitleaks, trufflehog, detect-secrets) are great at SAST on git trees but stop there. scan4secrets fills the gaps they don't cover:

Capability	gitleaks	trufflehog	detect-secrets	scan4secrets
SAST secret detection	Y	Y	Y	Y
DAST live web crawl	-	-	-	Y
JS source-map parsing	-	-	-	Y
JS endpoint extraction	-	-	-	Y
HTTP-header secret scan	-	-	-	Y
Live token verification	-	Y	-	Y
SARIF output	Y	-	-	Y
Excel / PDF / HTML reports	-	-	-	Y
Entropy gate + allowlist	Y	Y	Y	Y
YAML rules schema	- (TOML)	-	-	Y
Authenticated DAST (cookie/header/proxy)	n/a	n/a	n/a	Y

It is a complement to gitleaks, not a replacement. Use both: gitleaks in pre-commit + CI for git-history SAST, scan4secrets for live DAST against staging/production.

Install

# from source
git clone https://github.com/m14r41/scan4secrets
cd scan4secrets
pip install -e .

# OR via pipx
pipx install git+https://github.com/m14r41/scan4secrets

# OR Docker
docker run --rm -v "$(pwd)":/scan ghcr.io/m14r41/scan4secrets:latest --path /scan

After install, the scan4secrets command is on your PATH.

Quick start

# SAST: scan a local directory
scan4secrets --path /code

# DAST: crawl a live target
scan4secrets --url https://staging.example.com --threads 32

# DAST runs ALL bundled wordlists by default (1279 paths: /.env, /wp-config.php, /backup.zip, ...)
scan4secrets --url https://target.com

# Use YOUR OWN wordlist file (replaces the bundled set)
scan4secrets --url https://target.com --wordlist /path/to/my-paths.txt

# Combine multiple custom wordlist files
scan4secrets --url https://target.com --wordlist seclists/Common.txt internal-paths.txt

# Restrict to specific bundled wordlists by stem
scan4secrets --url https://wp.example.com --wordlist-only wordpress common env

# Turn wordlist seeding off entirely (only follow live links)
scan4secrets --url https://target.com --no-wordlist

# Full audit with verification + HTML report
scan4secrets --path . --url https://app.example.com \
    --verify --report html sarif json \
    --output reports/audit-$(date +%F)

# Authenticated DAST with proxy (works with Burp / ZAP)
scan4secrets --url https://app.example.com \
    --cookie "session=abc123" \
    --header "X-Tenant: acme" \
    --proxy http://127.0.0.1:8080

# CI gate (exit 1 if anything >= high)
scan4secrets --path . --report sarif --fail-on high \
    --output reports/scan
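The CI gate in the last command can be pictured as a severity threshold check. The following is a minimal sketch of that idea, not the tool's actual implementation: the severity names, their ordering, the `severity` field, and the `exit_code` helper are all assumptions made for illustration.

```python
# Assumed severity ladder, lowest to highest; the tool's real levels may differ.
SEVERITY_ORDER = ["info", "low", "medium", "high", "critical"]


def exit_code(findings, fail_on):
    """Return 1 if any finding is at or above the fail_on severity, else 0."""
    threshold = SEVERITY_ORDER.index(fail_on)
    for finding in findings:
        # "severity" is a hypothetical field name for this sketch.
        if SEVERITY_ORDER.index(finding["severity"]) >= threshold:
            return 1
    return 0


# A medium finding does not trip a "high" gate; a critical one does.
passes = exit_code([{"severity": "medium"}], "high")
fails = exit_code([{"severity": "critical"}], "high")
```

A CI job would typically pass a value like this to `sys.exit()`, so that a non-zero result fails the pipeline step.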

What it detects

170+ rules covering:

Cloud: AWS, GCP, Azure, DigitalOcean, Heroku, Linode, Vultr, Hetzner, Alibaba, IBM Cloud, Oracle Cloud, Render, Vercel, Netlify, Fly.io
CDN / edge: Cloudflare (API token + Origin CA), Fastly, Cloudinary, Akamai EdgeGrid, BunnyCDN
Source control: GitHub (classic / fine-grained / OAuth / App / refresh / deploy key), GitLab, Bitbucket
CI/CD: CircleCI, Travis, Buildkite, Jenkins, ArgoCD, Pulumi, Snyk, Doppler
Payments: Stripe, Square, PayPal/Braintree, Razorpay, Plaid, Adyen, Paddle, LemonSqueezy, Coinbase, Binance
E-commerce: Shopify (private app / shared secret / custom app / partner), WooCommerce REST
Messaging: Slack (5 token types + webhook), Discord (bot + webhook), Twilio, Telegram, Microsoft Teams webhook, Zoom JWT, Vonage/Nexmo
SMS / carriers: MessageBird, Plivo
AI/ML: OpenAI, Anthropic, Hugging Face, Replicate, Cohere, Pinecone, Mistral, Groq, Perplexity, DeepL, AssemblyAI, ElevenLabs, Stability AI
Email / marketing: SendGrid, Mailgun, Mailchimp, Postmark, Resend, Mailjet, Klaviyo, ConvertKit, Customer.io
Monitoring: Datadog, Sentry (DSN + org-auth-token), New Relic, Grafana (service-account + Cloud), LaunchDarkly (SDK + mobile), Honeycomb, Rollbar, Bugsnag, Splunk HEC, PagerDuty
DevOps / registries: Docker Hub, Docker registry auth, NPM, PyPI, RubyGems, crates.io, JFrog Artifactory, Terraform Cloud, HashiCorp Vault, HashiCorp Cloud
Auth / identity: Auth0, Okta, Clerk, WorkOS, Stytch, Atlassian / Jira, Frontegg, Keycloak
Productivity SaaS: Notion, Linear, Airtable, Asana, ClickUp, Typeform, Calendly, Zendesk, Intercom
Mobile / push: Firebase Cloud Messaging, Expo, OneSignal, Microsoft AppCenter
Data / ML platforms: Databricks, Snowflake, Algolia
Mapping: Mapbox (pk / sk), HERE Maps
Blockchain / Web3: Infura, Alchemy, Etherscan, WalletConnect, QuickNode
Storage: Backblaze B2 (KeyID + appKey)
Networking / VPN: Tailscale (auth + API)
QA / browser testing: BrowserStack, Sauce Labs, Percy
Connection strings: PostgreSQL, MySQL, MongoDB (incl. srv), Redis, AMQP
Webhooks: Zapier, IFTTT, Meta / Facebook Graph
Auth tokens: JWT, HTTP Basic in URLs
Crypto: RSA / EC / OPENSSH / PGP private keys, SSH public keys, Cloudflare Origin CA, GitHub deploy keys
Contextual fallbacks: quoted/unquoted high-entropy strings, hex tokens, UUIDs near credential names

See docs/RULES.md for the full reference and how to add custom rules.
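The contextual fallbacks rely on entropy to separate random-looking tokens from ordinary words. As a rough illustration, Shannon entropy over characters can serve as such a gate. This is a sketch under assumptions: the `looks_random` helper and its thresholds (`min_length=20`, `min_entropy=3.5`) are hypothetical and are not taken from the scan4secrets rule set.

```python
import math
from collections import Counter


def shannon_entropy(text):
    """Shannon entropy of text, in bits per character."""
    if not text:
        return 0.0
    counts = Counter(text)
    total = len(text)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def looks_random(candidate, min_length=20, min_entropy=3.5):
    """Entropy gate: long enough and varied enough to be worth reporting.

    Both thresholds are illustrative assumptions, not the tool's defaults.
    """
    return len(candidate) >= min_length and shannon_entropy(candidate) >= min_entropy
```

A repeated dictionary word such as `passwordpasswordpassword` scores low because few distinct characters carry most of the mass, while a token made of 20 distinct characters scores log2(20), roughly 4.3 bits per character.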

Live verification

With --verify, scan4secrets makes one HTTP request per detected token to the vendor API to confirm whether the credential is still live:

Rule	Probe	Success
`github-pat-classic` / `github-pat-fine-grained`	`GET https://api.github.com/user`	HTTP 200
`stripe-secret-live`	`GET https://api.stripe.com/v1/charges?limit=1`	HTTP 200
`slack-bot-token`	`POST https://slack.com/api/auth.test`	HTTP 200
`openai-key`	`GET https://api.openai.com/v1/models`	HTTP 200

Each finding gets verified=true|false|null in every output format. A verified token is incident-grade evidence; an unverified one is a hypothesis.
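To make the probe table concrete, here is a minimal sketch of a GitHub check that maps one request onto the three verified states. It is an illustration, not the tool's verifier: the function names, the injectable `fetch` parameter, and the reading of null as "the probe could not run" are assumptions.

```python
import urllib.error
import urllib.request


def http_status(url, headers):
    """Send one GET and return the HTTP status code, or None on network failure."""
    request = urllib.request.Request(url, headers=headers, method="GET")
    try:
        with urllib.request.urlopen(request, timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # the server answered, just not with a 2xx
    except OSError:
        return None  # DNS failure, refused connection, timeout, ...


def verify_github_token(token, fetch=http_status):
    """Return True (live), False (rejected), or None (probe could not run)."""
    status = fetch(
        "https://api.github.com/user",
        {"Authorization": f"Bearer {token}", "User-Agent": "verify-sketch"},
    )
    if status is None:
        return None  # assumed meaning of verified=null
    return status == 200
```

Passing `fetch` as a parameter keeps the mapping testable without network access. Note that this sketch treats every non-200 answer as false; a production verifier would likely want to distinguish rate limiting from a rejected credential.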

See docs/VERIFICATION.md for the full vendor list and how to add probes.

Reports

scan4secrets --path . --report sarif json jsonl csv html excel pdf --output reports/run

Format	Best for
`sarif`	GitHub Code Scanning, GitLab Security Dashboard, Sonar, Defect Dojo
`json`	Tooling integrations, post-processing
`jsonl`	SIEM/SOAR pipelines (Splunk, Datadog, Sentinel)
`csv`	Spreadsheet triage
`html`	Sortable / filterable / colored UI for client review
`excel`	Pivot tables and exec summaries
`pdf`	Compliance evidence packets
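For readers unfamiliar with SARIF, the sketch below shows the minimal shape of a SARIF 2.1.0 log that code-scanning dashboards accept. It is not the tool's reporter: the input keys (`rule`, `file`, `line`, `redacted`), the fixed `error` level, and the `to_sarif` helper are assumptions chosen for illustration.

```python
import json


def to_sarif(findings):
    """Build a minimal SARIF 2.1.0 log from hypothetical finding dicts."""
    results = []
    for finding in findings:
        results.append({
            "ruleId": finding["rule"],
            "level": "error",  # assumed; a real reporter would map severity
            "message": {"text": f"Possible secret: {finding['redacted']}"},
            "locations": [{
                "physicalLocation": {
                    "artifactLocation": {"uri": finding["file"]},
                    "region": {"startLine": finding["line"]},
                }
            }],
        })
    return {
        "$schema": "https://json.schemastore.org/sarif-2.1.0.json",
        "version": "2.1.0",
        "runs": [{
            "tool": {"driver": {"name": "scan4secrets"}},
            "results": results,
        }],
    }


sample = to_sarif([
    {"rule": "github-pat-classic", "file": "src/app.py", "line": 12,
     "redacted": "ghp_****wxyz"},
])
sarif_text = json.dumps(sample, indent=2)
```

Only the redacted value goes into the message, which keeps the log safe to upload to a shared dashboard.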

Secrets are redacted by default (abcd****wxyz). Use --unsafe-show only when reports are stored securely.
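The redacted form shown above keeps the first and last four characters and masks the middle. A minimal sketch of that masking follows; the `redact` helper, its parameters, and the handling of very short secrets are assumptions, not the tool's code.

```python
def redact(secret, keep=4, mask="****"):
    """Keep the first and last `keep` characters and mask everything between."""
    if len(secret) <= keep * 2:
        # Assumed behavior: too short to reveal either edge safely.
        return mask
    return f"{secret[:keep]}{mask}{secret[-keep:]}"
```

For example, `redact("abcd1234567wxyz")` yields `abcd****wxyz`, which matches the format in the reports while hiding the secret's length.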

DAST details

The crawler:

Honors scope (same eTLD+1 by default; --strict-host for exact host)
Runs concurrently (--threads N, default 16)
Sends a custom User-Agent, optional headers, cookies, and routes through your proxy (Burp / ZAP friendly)
Parses .js.map files and scans every embedded source (catches secrets hidden inside production source maps that no SAST sees)
Extracts string-literal endpoints from .js files and probes them
Scans response headers as well as body
Seeds path-guess wordlists by default: every DAST run probes 1279 sensitive paths (.env, .git/config, wp-config.php, phpinfo.php, backup.zip, composer.json, source maps, admin panels, API docs, ...). Restrict with --wordlist-only NAME ... or disable with --no-wordlist.
Caps the crawl at --max-urls and --max-depth so you can't accidentally DoS a target
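The source-map step in the list above works because a version 3 source map can embed the original files in its `sourcesContent` array, parallel to `sources`. The sketch below illustrates the idea with a single pattern. It is not the tool's parser: `scan_source_map` is a hypothetical helper, and the AWS access key ID regex stands in for the full rule set.

```python
import json
import re

# Illustrative pattern only: AWS access key IDs are "AKIA" plus 16 characters.
AWS_KEY_ID = re.compile(r"\bAKIA[0-9A-Z]{16}\b")


def scan_source_map(map_text, pattern=AWS_KEY_ID):
    """Yield (source_name, line_number, match) for hits in embedded sources."""
    source_map = json.loads(map_text)
    names = source_map.get("sources", [])
    contents = source_map.get("sourcesContent") or []
    for index, content in enumerate(contents):
        if content is None:  # the format allows null for sources not embedded
            continue
        name = names[index] if index < len(names) else f"<source {index}>"
        for line_number, line in enumerate(content.splitlines(), start=1):
            for match in pattern.finditer(line):
                yield name, line_number, match.group(0)


demo_map = json.dumps({
    "version": 3,
    "sources": ["src/config.js", "src/empty.js"],
    "sourcesContent": ['// cfg\nconst k = "AKIAIOSFODNN7EXAMPLE";\n', None],
})
demo_hits = list(scan_source_map(demo_map))
```

Reporting the original source name and line, rather than the position in the minified bundle, points reviewers at the file where the secret was actually written.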

Wordlists are stack-specific: common, env, wordpress, php-laravel-symfony-drupal, Python-Django-Flask, Node.js-Express-JS, React-Next.js-Vite-Frontend, Docker-Compose-Kubernetes, CloudProvider-Service, Keys-SSH-Certificate, OtherConfig-CI-DevOps, backup-files, admin-panels, api-paths, database-dumps. Pass any of these stems to --wordlist-only.
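Wordlist seeding amounts to joining each path onto the target and capping the result. The following sketch shows one way that could look. It is an assumption-laden illustration: `seed_urls`, the `max_urls` default, and the skipping of blank or `#`-prefixed lines are hypothetical, not documented behavior of the tool.

```python
from urllib.parse import urljoin


def seed_urls(base_url, paths, max_urls=1000):
    """Turn wordlist paths into absolute URLs, deduplicated and capped."""
    root = base_url if base_url.endswith("/") else base_url + "/"
    seen = set()
    seeds = []
    for raw in paths:
        path = raw.strip()
        if not path or path.startswith("#"):
            continue  # assumed: blank lines and comments are ignored
        url = urljoin(root, path.lstrip("/"))
        if url not in seen:
            seen.add(url)
            seeds.append(url)
        if len(seeds) >= max_urls:
            break  # mirrors the idea of a --max-urls style cap
    return seeds
```

Stripping the leading slash and joining against a root that ends in `/` makes `/.env` and `.env` resolve to the same URL, so duplicate entries across combined wordlists are requested only once.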

CI / pre-commit

A .pre-commit-hooks.yaml is shipped, so the hook can be referenced from your .pre-commit-config.yaml:

repos:
  - repo: https://github.com/m14r41/scan4secrets
    rev: v2.1.0
    hooks:
      - id: scan4secrets

GitHub Actions:

- uses: actions/checkout@v4
- run: pip install scan4secrets
- run: scan4secrets --path . --report sarif --output results --fail-on high
- uses: github/codeql-action/upload-sarif@v3
  if: always()
  with: { sarif_file: results.sarif }

Documentation

docs/ARCHITECTURE.md — package layout, data flow, extension points
docs/RULES.md — rule schema, examples, writing custom rules
docs/VERIFICATION.md — how live verification works, adding new vendors
docs/CHANGELOG.md — what's new in v2 vs v1
docs/GAP_ANALYSIS.md — empirical comparison vs v1 and gitleaks

Benchmark

Tested on Plazmaz/leaky-repo (seeded with real-format secrets) and on expressjs/express (clean OSS code).

Tool	leaky-repo (findings)	benign express (FPs)
scan4secrets v1	35 (~22 TPs, ~13 FPs)	27
gitleaks	22	0
scan4secrets v2	23 (all TPs, incl. SSH/PEM/Docker keys v1 missed)	0

On the benign express code, v2 reports 0 false positives (vs v1's ~13% per-file rate), and it captures the high-value secret classes (private keys, Docker registry auth) that v1 was structurally incapable of detecting.

Contributing

Add a rule: edit scan4secrets/config/rules.yaml
Add a verifier: extend the verify: block in the rule
Add a reporter: drop a module under scan4secrets/reporters/ and register in __init__.py

Run tests: pytest -q (planted-secret fixtures under tests/fixtures/)

License

MIT — see LICENSE.

Built by @M14R41.

Release history

Version	Date
2.2.0	Jul 3, 2026
2.1.3	Jun 23, 2026
2.1.2	Jun 23, 2026
2.1.1 (this version)	Jun 23, 2026
2.1.0	Jun 23, 2026

Download files

Source Distribution

scan4secrets-2.1.1.tar.gz (48.4 kB)

Uploaded Jun 23, 2026 Source

Built Distribution

scan4secrets-2.1.1-py3-none-any.whl (51.0 kB)

Uploaded Jun 23, 2026 Python 3

File details

Details for the file scan4secrets-2.1.1.tar.gz.

File metadata

Download URL: scan4secrets-2.1.1.tar.gz
Upload date: Jun 23, 2026
Size: 48.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for scan4secrets-2.1.1.tar.gz
Algorithm	Hash digest
SHA256	`4926d501d427e1c1547428975cc413bd82bb0cbe8565b80e43893317639a8200`
MD5	`fcffe95841612ef0af3ee93172a8b940`
BLAKE2b-256	`26e31f797bf6c19514e64442d969b77d9be60d008a491d5cc582fba48b0c5f81`

Provenance

The following attestation bundles were made for scan4secrets-2.1.1.tar.gz:

Publisher: release.yml on m14r41/scan4secrets

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: scan4secrets-2.1.1.tar.gz
- Subject digest: 4926d501d427e1c1547428975cc413bd82bb0cbe8565b80e43893317639a8200
- Sigstore transparency entry: 1928466143
- Sigstore integration time: Jun 23, 2026
Source repository:
- Permalink: m14r41/scan4secrets@aca3e73bb858a147ab78e896ac07fcfd8e4fa700
- Branch / Tag: refs/tags/v2.1.1
- Owner: https://github.com/m14r41
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@aca3e73bb858a147ab78e896ac07fcfd8e4fa700
- Trigger Event: push

File details

Details for the file scan4secrets-2.1.1-py3-none-any.whl.

File metadata

Download URL: scan4secrets-2.1.1-py3-none-any.whl
Upload date: Jun 23, 2026
Size: 51.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for scan4secrets-2.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e2d595bb0c4863799adf2c440f916c2cb3993a363f6fdbfa7bd97747e02473c9`
MD5	`ca984a28399fe70eb03d5b1cac87c6ee`
BLAKE2b-256	`de03bd27ee89a40632fe6eac7710954c944a996a802a7ecef5228e6a7492a029`

Provenance

The following attestation bundles were made for scan4secrets-2.1.1-py3-none-any.whl:

Publisher: release.yml on m14r41/scan4secrets

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: scan4secrets-2.1.1-py3-none-any.whl
- Subject digest: e2d595bb0c4863799adf2c440f916c2cb3993a363f6fdbfa7bd97747e02473c9
- Sigstore transparency entry: 1928466275
- Sigstore integration time: Jun 23, 2026
Source repository:
- Permalink: m14r41/scan4secrets@aca3e73bb858a147ab78e896ac07fcfd8e4fa700
- Branch / Tag: refs/tags/v2.1.1
- Owner: https://github.com/m14r41
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@aca3e73bb858a147ab78e896ac07fcfd8e4fa700
- Trigger Event: push
