Scan web content for prompt injection, hidden instructions, and adversarial content targeting AI agents

These details have not been verified by PyPI

Project links

Project description

Palisade Scanner 🔍

Try it live on HuggingFace Spaces — scan any URL without installing anything.

Scan web content for prompt injection, hidden instructions, and adversarial content targeting AI agents.

AI agents browse the web, read documents, and consume external content. Adversaries hide instructions in invisible text, HTML metadata, encoded payloads, and zero-width characters — Palisade finds them all.

What makes Palisade unique

Capability	Palisade Scanner	Manual review	Generic scrapers
Hidden text detection	✅ 20+ CSS/HTML techniques	❌	❌
Injection pattern matching	✅ 100+ regexes, 5 categories	❌	❌
LLM-as-judge classifier	✅ understands adversarial intent	N/A	❌
Metadata analysis	✅ comments, JSON-LD, meta, data attrs	❌	❌
Exfiltration detection	✅ URLs, eval(), fetch(), redirects	❌	❌
MCPGuard policy generation	✅ auto-generate rules	❌	❌
CI/CD mode	✅ `--ci --threshold high`	❌	❌
Zero-width character detection	✅	❌	❌

Why

AI agents browse the web, read documents, and consume external content. Adversaries can hide instructions in:

Invisible text (white-on-white, zero font size, off-screen positioning)
HTML comments and metadata
Base64 encoded payloads
Zero-width character injections
Instructions disguised as product descriptions or reviews

This scanner finds them all and tells you what to do about it.

Quick Start

# Install
pip install palisade-scanner

# CLI: scan a URL
pis scan https://example.com
# or
palisade scan https://example.com

# Web UI: open the dashboard
pis web

# Docker
docker compose up
# → http://localhost:8000

Usage

CLI

# Scan a URL
pis scan https://example.com

# Scan a local file
pis scan --file suspicious.html

# Scan pasted text
pis scan --paste "<!-- ignore instructions -->"

# JSON output
pis scan https://example.com --format json

# CI/CD mode (exit code reflects risk)
pis scan https://example.com --ci --threshold high

# Generate MCPGuard policy rules
pis policies https://evil-site.com

API

# Scan via REST API
curl "http://localhost:8000/api/scan?url=https://example.com"

# HTML report
curl "http://localhost:8000/api/scan/https://example.com"

How It Works

Detection Layers

Layer	What It Detects
Hidden Text Detector	20+ CSS/HTML hiding techniques (display:none, visibility, opacity, color matching, off-screen, zero-width chars, HTML comments)
Injection Pattern Matcher	100+ regex patterns across 5 categories (jailbreak, role override, exfiltration, tool manipulation, impersonation)
Instruction Classifier	LLM-as-judge that understands adversarial intent (requires API key)
Metadata Analyzer	HTML comments, JSON-LD, meta tags, data attributes, `<noscript>`, `<template>`
Exfiltration Detector	URLs, endpoints, eval() patterns, redirect attempts, `fetch()` calls

Scoring

Risk Score: 0-100

Weighted formula:
  base = 100
  - critical * 25
  - high * 10
  - medium * 3
  - low * 1

Categories: none (0-5) → low (6-20) → medium (21-50) → high (51-80) → critical (81-100)

Architecture

User (CLI / Web / API)
        │
        ▼
PipelineOrchestrator
        │
        ├── Loader (URL / File / Paste / PDF)
        │
        ├── Detector Pipeline (parallel)
        │   ├── HiddenTextDetector
        │   ├── InjectionPatternMatcher
        │   ├── MetadataAnalyzer
        │   ├── ExfiltrationDetector
        │   └── InstructionClassifier (LLM)
        │
        ├── ScoringEngine
        │
        └── Reporters
            ├── JSON / Markdown / Simple
            ├── Policy Generator (MCPGuard)
            └── Web UI (HTMX)

Project Structure

src/scanner/
├── cli.py              # Typer CLI
├── api.py              # FastAPI web app
├── config.py           # Settings (env vars)
├── domain/
│   ├── models.py       # Pydantic models
│   └── scoring.py      # Risk score engine
├── loaders/
│   ├── url.py          # HTTP URL fetcher
│   ├── pdf.py          # PDF extractor
│   └── paste.py        # Raw text
├── detectors/
│   ├── hidden_text.py       # CSS/HTML hiding
│   ├── injection_patterns.py # 100+ regex patterns
│   ├── instruction_classifier.py  # LLM-as-judge
│   ├── metadata_analyzer.py # Comments/meta/tags
│   └── exfiltration.py     # Data theft patterns
├── pipeline/
│   └── orchestrator.py # Scan pipeline
├── reporters/          # JSON/MD/Simple output
├── policies/           # MCPGuard rule generation
└── utils/              # DOM helpers

Integration

MCPGuard

Generate rules compatible with MCPGuard:

pis scan https://evil-site.com --format mcpguard > rules.yaml
mcpguard load-rules rules.yaml

CI/CD

# .github/workflows/check-urls.yml
- name: Scan for prompt injection
  run: |
    pis scan ${{ matrix.url }} --ci --threshold medium

Roadmap

v0.1 — Scanner core: CLI, 5 detectors, scoring, policy generation
v0.2 — Live Monitor: scheduled re-scans, webhook alerts, diff detection
v0.3 — Agent Validator: Browser Use agent tests pages in real time
v0.4 — Content Safety Proxy: reverse proxy that strips injections
v0.5 — Reputation Engine: web of trust for agent-safe URLs
v0.6 — Red Team Lab: adversarial page generator + benchmark suite
v0.7 — Certification Pipeline: verified AgentSafe badges

Related Projects

MCPGuard — Runtime security proxy for MCP
MCPwn — Offensive security testing for MCP
MCPscop — Unified security dashboard

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.2

May 26, 2026

This version

0.1.1

May 26, 2026

0.1.0

May 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

palisade_scanner-0.1.1.tar.gz (74.6 kB view details)

Uploaded May 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

palisade_scanner-0.1.1-py3-none-any.whl (74.4 kB view details)

Uploaded May 26, 2026 Python 3

File details

Details for the file palisade_scanner-0.1.1.tar.gz.

File metadata

Download URL: palisade_scanner-0.1.1.tar.gz
Upload date: May 26, 2026
Size: 74.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for palisade_scanner-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`499d2eb5a2d7a0e0dda3be99e1efa68fed62b67e950c88e74e41f85baae6cb23`
MD5	`3efca679db60b510296ff57edb2e9263`
BLAKE2b-256	`8ec09b29d8323d0536f5d09516cead6e1ee9b0c2dea9b46fe7d702d666770352`

See more details on using hashes here.

File details

Details for the file palisade_scanner-0.1.1-py3-none-any.whl.

File metadata

Download URL: palisade_scanner-0.1.1-py3-none-any.whl
Upload date: May 26, 2026
Size: 74.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for palisade_scanner-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f2e6922fcde5314f4903c2c0404551865031c1237b929866ce278e99bf7a8099`
MD5	`ec7e3abc4d7843b2a02b0fc0accb7996`
BLAKE2b-256	`43be03d374dd6b2b3525563088604b5b828fa7f258f48a99f56ef431430e5039`

See more details on using hashes here.

palisade-scanner 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Palisade Scanner 🔍

What makes Palisade unique

Why

Quick Start

Usage

CLI

API

How It Works

Detection Layers

Scoring

Architecture

Project Structure

Integration

MCPGuard

CI/CD

Roadmap

Related Projects

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes