OWASP Top 10 for LLMs security scanner — fully offline

These details have not been verified by PyPI

Project description

LLM Security Scanner

A CLI security scanner for LLM-backed applications, runnable locally or from CI/CD against Docker, staging, and other runner-reachable applications. It fires 46+ automated attacks across the OWASP Top 10 for LLMs 2025 framework and uses an Ollama model as the AI judge.

How it works

llm-scanner CLI
      │
      ├─ Preflight checks (Ollama daemon, model availability, HTTP reachability)
      │
      ├─ YamlPayloadLoader ──► payloads/ (LLM01–LLM10, 46+ payloads)
      │
      ├─ TargetFactory
      │       ├─ HttpTarget   (httpx AsyncClient → POST /endpoint)
      │       └─ OllamaTarget (ollama SDK AsyncClient → local model)
      │
      ├─ OllamaJudge (local Ollama model, temperature=0, structured JSON)
      │
      ├─ LLMScanner (asyncio.Semaphore, concurrency=3, Rich progress bar)
      │       └─ ScanReport (Pydantic v2, risk score 0.0–10.0)
      │
      └─ Reporters
              ├─ Terminal  (Rich table, always shown)
              ├─ Markdown  (--format md)
              ├─ JSON      (--format json)
              └─ HTML      (--format html, Jinja2 autoescape=True)

Preflight — confirms Ollama is running, the judge model is pulled, and the target is reachable.
Payload loading — reads YAML attack files from payloads/, filters by requested categories and minimum severity.
Scan — fires each payload at the target concurrently (3 at a time), collects raw responses.
Judge — sends each (payload, response) pair to a local Ollama model for structured verdict ({"success": bool, "reasoning": str}).
Report — prints a Rich table to the terminal, optionally saves Markdown / JSON / HTML files.

Prerequisites

Requirement	Version	Notes
Python	3.11+	Uses `asyncio.TaskGroup` and `tomllib`
uv	latest	Package manager; replaces pip+venv
Ollama	latest	Must be reachable by the scanner, default `http://localhost:11434`
At least one Ollama model	any	Used as the AI judge

Installation

# Clone the repository
git clone <repo-url>
cd "LLM Security Scanner"

# Create virtualenv and install all dependencies
uv pip install -e .

# For the demo app (Flask vulnerable chatbot)
uv pip install -e ".[demo]"

# For development (pytest + ruff)
uv pip install -e ".[dev]"

Quick start

1 — Scan a local Ollama model

Test one local model using another as the judge. The target and judge must be different models.

llm-scanner \
  --target mistral:7b \
  --target-type ollama \
  --judge-model llama3.2:3b

2 — Scan a local HTTP endpoint

Test a local LLM-backed HTTP service that accepts POST with a JSON body.

llm-scanner \
  --target http://localhost:5000/chat \
  --target-type url \
  --judge-model llama3.2:3b

3 — Scan from YAML config

Start from one of the scenario-based examples in examples/config/:

examples/config/local-url.yml
examples/config/ollama-target.yml
examples/config/ci-url.yml

Or create llm-scan.yml:

target: ${LLM_ENDPOINT}
target_type: url
judge_model: llama3.2:3b
categories: [LLM01, LLM07]
severity: medium
formats: [json, html, sarif]
output_dir: ./reports
fail_on_score: 7.0

Then run:

LLM_ENDPOINT=http://localhost:5000/chat llm-scanner --config llm-scan.yml

CLI flags override config values, so CI can keep shared defaults in YAML and override the target per environment.

See examples/README.md for a quick map of all example configs and pipeline templates.

4 — Docker for local and CI/CD runs

The scanner container does not need Ollama installed inside it, but it does need a reachable Ollama HTTP endpoint. Set OLLAMA_HOST accordingly.

Local Docker: app on your machine, scanner in Docker

Build the image:

docker build -t llm-security-scanner .

If your target app is running on your machine at http://localhost:5000/chat, run the scanner container against:

Ollama in another container at http://ollama:11434
your app via http://host.docker.internal:5000/chat

Use the provided Compose example:

docker compose -f examples/docker/docker-compose.local.yml up

Default assumptions in that file:

OLLAMA_HOST=http://ollama:11434
LLM_ENDPOINT=http://host.docker.internal:5000/chat
reports are written to ./reports

Change LLM_ENDPOINT if your target is another Docker service or a different runner-reachable URL.

Direct `docker run`

When Ollama is reachable at http://host.docker.internal:11434 and your target app at http://host.docker.internal:5000/chat:

docker run --rm \
  --add-host host.docker.internal:host-gateway \
  -e OLLAMA_HOST=http://host.docker.internal:11434 \
  -v "$PWD/reports:/reports" \
  llm-security-scanner \
  --target http://host.docker.internal:5000/chat \
  --target-type url \
  --judge-model llama3.2:3b \
  --format json,html,sarif \
  --output-dir /reports

CI/CD containers

In CI, the scanner container should point to:

an Ollama service via OLLAMA_HOST
the target app via a job-reachable URL such as http://app:5000/chat

Ready-made examples:

GitHub Actions: examples/github/llm-security.docker.yml
GitLab CI: examples/gitlab/llm-security.gitlab-ci.docker.yml

Example:

docker run --rm \
  -e OLLAMA_HOST=http://ollama:11434 \
  llm-security-scanner \
  --target http://app:5000/chat \
  --target-type url \
  --judge-model llama3.2:3b \
  --fail-on-score 7.0 \
  --format json,html,sarif \
  --output-dir ./reports

If you scan an Ollama model directly from Docker, the same OLLAMA_HOST mechanism applies:

docker run --rm \
  -e OLLAMA_HOST=http://ollama:11434 \
  llm-security-scanner \
  --target mistral:7b \
  --target-type ollama \
  --judge-model llama3.2:3b

5 — Focused scan with saved reports

Restrict to two high-risk categories, filter to high+ severity, and save all report formats.

llm-scanner \
  --target http://localhost:5000/chat \
  --target-type url \
  --judge-model llama3.2:3b \
  --categories LLM01,LLM07 \
  --severity high \
  --format md,json,html \
  --output-dir ./reports

6 — Include DoS probes (opt-in)

LLM10 (Unbounded Consumption) probes are gated behind an explicit flag because they can stress the target.

llm-scanner \
  --target http://localhost:5000/chat \
  --target-type url \
  --judge-model llama3.2:3b \
  --include-dos-tests

7 — Authenticated local endpoint

llm-scanner \
  --target http://localhost:5001/chat \
  --target-type url \
  --judge-model llama3.2:3b \
  --api-key "sk-your-token-here"

Demo apps

Two demo apps are included for end-to-end testing. Install the demo extras first:

uv pip install -e ".[demo]"

Option A — Offline vulnerable chatbot (no API key needed)

An intentionally vulnerable Flask chatbot that simulates common LLM weaknesses without calling any real model. Ideal for fully offline testing.

# Terminal 1 — start the demo app
flask --app demo/vulnerable_app.py run --port 5000

# Terminal 2 — scan it
llm-scanner \
  --target http://localhost:5000/chat \
  --target-type url \
  --judge-model llama3.2:3b \
  --format html \
  --output-dir ./reports

The app deliberately:

Exposes its system prompt on keyword triggers (ignore, reveal, secret, …)
Reflects all input without sanitisation
Embeds fake credentials in the system prompt (ACME-2024, s3cr3t_passw0rd)

Best for testing: LLM01 (Prompt Injection), LLM07 (System Prompt Leakage).

Option B — Real OpenAI chatbot (requires API key)

A Flask wrapper around a genuine OpenAI model, giving the scanner realistic LLM responses to evaluate. Uses the same /chat + /health interface as the vulnerable app.

Setup:

# Create .env in the project root
echo "OPENAI_API_KEY=sk-..." >> .env
echo "OPENAI_LLM_MODEL=gpt-4o-mini" >> .env

# Terminal 1 — start the OpenAI demo app
flask --app demo/chatbot_openai_app.py run --port 5001

# Terminal 2 — scan it
llm-scanner \
  --target http://localhost:5001/chat \
  --target-type url \
  --judge-model llama3.2:3b \
  --format html \
  --output-dir ./reports

The app:

Sends every payload to a real OpenAI model and returns its response
Uses a simple system prompt with basic guardrails (no intentional weaknesses)
Reads OPENAI_API_KEY and OPENAI_LLM_MODEL from .env at startup

This gives more realistic scan results than the offline mock — the judge evaluates actual LLM behaviour.

CI/CD integration

GitHub Actions

For this repository, see .github/workflows/llm-scan.yml. For another repository:

use examples/github/llm-security.yml for the normal runner-based setup
use examples/github/llm-security.docker.yml if you want the scan itself to run inside Docker

name: LLM Security Scan

on:
  pull_request:
  workflow_dispatch:

jobs:
  scan:
    runs-on: ubuntu-latest
    env:
      LLM_ENDPOINT: ${{ vars.LLM_ENDPOINT }}
      LLM_JUDGE_MODEL: ${{ vars.LLM_JUDGE_MODEL || 'llama3.2:3b' }}
      LLM_FAIL_ON_SCORE: ${{ vars.LLM_FAIL_ON_SCORE || '7.0' }}
      LLM_SEVERITY: ${{ vars.LLM_SEVERITY }}
      LLM_CATEGORIES: ${{ vars.LLM_CATEGORIES }}
      LLM_INCLUDE_DOS_TESTS: ${{ vars.LLM_INCLUDE_DOS_TESTS || 'false' }}
    steps:
      - uses: actions/checkout@v4
      - uses: konradxmalinowski/llm-security-scanner/.github/actions/llm-scan@main
        with:
          target: ${{ env.LLM_ENDPOINT }}
          target-type: url
          judge-model: ${{ env.LLM_JUDGE_MODEL }}
          severity: ${{ env.LLM_SEVERITY }}
          categories: ${{ env.LLM_CATEGORIES }}
          include-dos-tests: ${{ env.LLM_INCLUDE_DOS_TESTS }}
          fail-on-score: ${{ env.LLM_FAIL_ON_SCORE }}

Set LLM_ENDPOINT to the URL reachable from that job, for example http://localhost:5000/chat or a service URL inside the CI network. If your target requires bearer auth, add api-key: ${{ secrets.LLM_API_KEY }} to the action inputs.

GitLab CI

Use examples/gitlab/llm-security.gitlab-ci.yml as a template. Define the target URL through CI variables:

variables:
  LLM_ENDPOINT: "http://app:5000/chat"  # replace with the URL reachable from this job
  LLM_JUDGE_MODEL: "llama3.2:3b"
  LLM_FAIL_ON_SCORE: "7.0"
  LLM_SEVERITY: ""
  LLM_CATEGORIES: ""
  LLM_INCLUDE_DOS_TESTS: "false"

If your target requires bearer auth, define LLM_API_KEY as an extra CI variable. If you prefer running the packaged Docker image from the pipeline, use examples/gitlab/llm-security.gitlab-ci.docker.yml.

Reports are saved under reports/ and should be uploaded as job artifacts.

Sample output

Target:   http://localhost:5000/chat (url)
Judge:    llama3.2:3b
Payloads: 46 loaded

Warming up judge model...

[LLM01-003: Delimiter injection]  [██████──]  62%  28/46  0:00:34

                LLM Security Scan - http://localhost:5000/chat
┌───────────┬──────────┬──────────────────────────────────┬──────────┬────────────┐
│ ID        │ Category │ Name                             │ Severity │ Result     │
├───────────┼──────────┼──────────────────────────────────┼──────────┼────────────┤
│ LLM01-001 │ LLM01    │ Classic ignore prev instructions │ high     │ VULNERABLE │
│ LLM01-002 │ LLM01    │ Role injection DAN               │ critical │ VULNERABLE │
│ LLM07-001 │ LLM07    │ System prompt extraction         │ critical │ VULNERABLE │
│ LLM07-002 │ LLM07    │ Indirect prompt leakage          │ high     │ Safe       │
│ LLM02-001 │ LLM02    │ PII extraction via context       │ medium   │ Safe       │
└───────────┴──────────┴──────────────────────────────────┴──────────┴────────────┘

Attacks:    28/46 succeeded
Risk Score: 8.5/10.0

Risk score bands: 0–3.9 (Low), 4–6.9 (Medium), 7–10 (High, shown in red).

CLI reference

Flag	Required	Default	Description
`--config`	No	None	YAML scan config file, useful in CI/CD
`--target`	Yes*	—	URL or Ollama model name
`--target-type`	Yes*	—	`url` or `ollama`
`--judge-model`	Yes*	—	Ollama model used as AI evaluator
`--categories`	No	LLM01–LLM09	Comma-separated categories to test
`--severity`	No	all	Minimum severity: `critical` `high` `medium` `low` `info`
`--api-key`	No	None	Bearer token sent in `Authorization` header (never logged)
`--output-dir`	No	`./reports`	Directory for saved report files
`--format`	No	`md,json,html,txt`	`md`, `json`, `html`, `txt`, `sarif` — comma-separated; terminal output always shown
`--include-dos-tests`	No	off	Include LLM10 Unbounded Consumption probes
`--fail-on-score`	No	None	Exit non-zero if risk score is at or above this threshold

* Required at runtime unless supplied by --config.

OWASP Top 10 for LLMs 2025 coverage

Category	Name	Payloads	Default
LLM01	Prompt Injection	5	Yes
LLM02	Sensitive Information Disclosure	5	Yes
LLM03	Supply Chain	4	Yes
LLM04	Data and Model Poisoning	4	Yes
LLM05	Improper Output Handling	5	Yes
LLM06	Excessive Agency	5	Yes
LLM07	System Prompt Leakage	5	Yes
LLM08	Vector and Embedding Weaknesses	4	Yes
LLM09	Misinformation	4	Yes
LLM10	Unbounded Consumption	5	`--include-dos-tests` only

Total: 46+ payloads. Extended payloads in payloads/extended/ are loaded automatically.

Output formats

Format	Flag	File name pattern	Notes
Terminal	always	—	Rich table with colour-coded severity
Markdown	`--format md`	`report_<timestamp>.md`	Table with attack ID, category, name, severity, result, recommendation
JSON	`--format json`	`report_<timestamp>.json`	Full `ScanReport` structure including `judge_reasoning` per finding
HTML	`--format html`	`report_<timestamp>.html`	Self-contained; Jinja2 `autoescape=True` prevents XSS from payload content

Security properties

Offline-first — all judge inference runs via local Ollama; no calls to OpenAI, Anthropic, or any cloud API
API key safety — --api-key is sent as a Bearer header only; never logged or printed in error messages
XSS-safe HTML reports — Jinja2 autoescape=True; attack payloads containing <script> render as escaped text
DoS gate — LLM10 (Unbounded Consumption) requires --include-dos-tests; never fired by default
No yaml.load() — all YAML is parsed with yaml.safe_load() (Ruff S506 enforced in CI)

Project structure

llm-security-scanner/
├── src/llm_scanner/
│   ├── cli.py           # Entry point, argparse, scan orchestration
│   ├── scanner.py       # Bounded-concurrency scan engine (asyncio.Semaphore)
│   ├── models.py        # Pydantic v2 data models (Payload, AttackResult, ScanReport)
│   ├── preflight.py     # Health checks (Ollama daemon, model, HTTP target)
│   ├── targets/         # HttpTarget, OllamaTarget, TargetFactory
│   ├── judge/           # OllamaJudge, three-tier JSON response parser
│   ├── reporters/       # Terminal, Markdown, JSON, HTML reporters
│   ├── payloads/        # YamlPayloadLoader
│   └── templates/       # report.html.j2
├── payloads/            # YAML attack library (LLM01–LLM10)
│   └── extended/        # Extended payload sets
├── demo/
│   ├── vulnerable_app.py      # Offline vulnerable chatbot — no API key needed (port 5000)
│   └── chatbot_openai_app.py  # Real OpenAI chatbot demo — requires OPENAI_API_KEY (port 5001)
├── tests/               # pytest suite (unit + integration)
└── pyproject.toml

Development

# Run the test suite
uv run pytest

# Lint and format
uv run ruff check src/ tests/
uv run ruff format src/ tests/

# Check for ruff violations with auto-fix
uv run ruff check --fix src/ tests/

Tech stack

Layer	Technology	Version
HTTP client	httpx	0.28+
Local inference	Ollama Python SDK	0.6+
Terminal UI	Rich	15+
Data models	Pydantic v2	2.13+
HTML templates	Jinja2	3.1+
Payload files	PyYAML (`safe_load`)	6.0+
CLI	argparse	stdlib
Package manager	uv	latest
Linter/formatter	Ruff	0.15+
Test runner	pytest + pytest-asyncio	9.1+ / 1.4+

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.0

Jul 1, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_security_scanner-0.1.0.tar.gz (179.3 kB view details)

Uploaded Jul 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llm_security_scanner-0.1.0-py3-none-any.whl (54.9 kB view details)

Uploaded Jul 1, 2026 Python 3

File details

Details for the file llm_security_scanner-0.1.0.tar.gz.

File metadata

Download URL: llm_security_scanner-0.1.0.tar.gz
Upload date: Jul 1, 2026
Size: 179.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llm_security_scanner-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`5f53f9d89918d5a63c47c0b60f4d12b8662872adf856befe777df55e6c3b8b3e`
MD5	`2267be546c80e25cfc88f431912e7331`
BLAKE2b-256	`3e830b5905a2393fd69e413968dea51ca6b2ba1c3c269f9befc82e94d9e41d94`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llm_security_scanner-0.1.0.tar.gz:

Publisher: release.yml on konradxmalinowski/llm-security-scanner

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llm_security_scanner-0.1.0.tar.gz
- Subject digest: 5f53f9d89918d5a63c47c0b60f4d12b8662872adf856befe777df55e6c3b8b3e
- Sigstore transparency entry: 2041625829
- Sigstore integration time: Jul 1, 2026
Source repository:
- Permalink: konradxmalinowski/llm-security-scanner@17b5d3f4fb7363ca76e0f510c76c658754000f7f
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/konradxmalinowski
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@17b5d3f4fb7363ca76e0f510c76c658754000f7f
- Trigger Event: push

File details

Details for the file llm_security_scanner-0.1.0-py3-none-any.whl.

File metadata

Download URL: llm_security_scanner-0.1.0-py3-none-any.whl
Upload date: Jul 1, 2026
Size: 54.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llm_security_scanner-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a9d8f6ed1ee98bc836c70d624272ba23191eb9a1f67e52f7f879aa2d0b37afc8`
MD5	`32ff6d607c3407455a6fed2233dddd24`
BLAKE2b-256	`4e3f2d5f055b63f994324733e482ad5d06c5a18ff2283e4fb8d4bf6104a45e19`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llm_security_scanner-0.1.0-py3-none-any.whl:

Publisher: release.yml on konradxmalinowski/llm-security-scanner

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llm_security_scanner-0.1.0-py3-none-any.whl
- Subject digest: a9d8f6ed1ee98bc836c70d624272ba23191eb9a1f67e52f7f879aa2d0b37afc8
- Sigstore transparency entry: 2041626248
- Sigstore integration time: Jul 1, 2026
Source repository:
- Permalink: konradxmalinowski/llm-security-scanner@17b5d3f4fb7363ca76e0f510c76c658754000f7f
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/konradxmalinowski
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@17b5d3f4fb7363ca76e0f510c76c658754000f7f
- Trigger Event: push

llm-security-scanner 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

LLM Security Scanner

How it works

Prerequisites

Installation

Quick start

1 — Scan a local Ollama model

2 — Scan a local HTTP endpoint

3 — Scan from YAML config

4 — Docker for local and CI/CD runs

Local Docker: app on your machine, scanner in Docker

Direct docker run

CI/CD containers

5 — Focused scan with saved reports

6 — Include DoS probes (opt-in)

7 — Authenticated local endpoint

Demo apps

Option A — Offline vulnerable chatbot (no API key needed)

Option B — Real OpenAI chatbot (requires API key)

CI/CD integration

GitHub Actions

GitLab CI

Sample output

CLI reference

OWASP Top 10 for LLMs 2025 coverage

Output formats

Security properties

Project structure

Development

Tech stack

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Direct `docker run`