scurl

A secure curl wrapper with middleware support, HTML-to-markdown extraction, and prompt injection detection.

Installation

pip install sibylline-scurl

Or with pipx (recommended for CLI tools):

pipx install sibylline-scurl

For prompt injection detection, install the optional dependencies:

pip install "sibylline-scurl[prompt-defender]"

Usage

# Fetch a URL and extract clean markdown from HTML
scurl https://example.com

# Raw output (disable response middleware)
scurl --raw https://example.com

# Extract article content only (strips nav, ads, sidebars)
scurl --readability https://example.com

# Enable prompt injection detection
scurl --enable prompt-defender https://example.com

# Multilingual prompt injection detection (all 13 supported languages)
scurl --enable prompt-defender --injection-languages all https://example.com

# All curl flags work
scurl -H "Accept: application/json" https://api.example.com/data
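
To drive these commands from a Python script, one option is to assemble the argument list and pass it to `subprocess`. The sketch below is an illustration rather than part of scurl: the helper names `build_scurl_command` and `fetch` are hypothetical, only flags shown above are used, and `fetch` assumes the `scurl` executable is installed and on `PATH`.

```python
import subprocess


def build_scurl_command(url, raw=False, readability=False, enable=(), curl_args=()):
    """Assemble an argv list for the scurl CLI (hypothetical helper)."""
    cmd = ["scurl"]
    if raw:
        cmd.append("--raw")
    if readability:
        cmd.append("--readability")
    for slug in enable:
        cmd += ["--enable", slug]
    cmd += list(curl_args)  # regular curl flags are passed through unchanged
    cmd.append(url)
    return cmd


def fetch(url, **options):
    """Run scurl and return its stdout; raises CalledProcessError on a non-zero exit."""
    completed = subprocess.run(
        build_scurl_command(url, **options),
        capture_output=True,
        text=True,
        check=True,
    )
    return completed.stdout
```

For example, `fetch("https://example.com", enable=["prompt-defender"])` would run the same command as the prompt injection example above.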

Features

SecretDefender: Automatically detects and blocks requests containing exposed secrets/tokens
HTML to Markdown: Converts HTML responses to clean markdown (use --readability for article extraction)
Prompt Injection Detection: Detects and handles prompt injection attacks in web content
Multilingual Support: Prompt injection detection in 13 languages
Middleware System: Composable request and response middleware

Why scurl?

scurl extracts clean, readable content from web pages, which suits LLM consumption, human reading, and bandwidth savings. With prompt injection detection enabled, fetched web content is screened for injection attempts before it reaches an AI application.

Size Comparison

Website	curl	scurl	Reduction
example.com	513	167	67.4%
news.ycombinator.com	34,082	10,739	68.5%
en.wikipedia.org/wiki/Curl	110,373	10,044	90.9%
github.com/anthropics	296,788	353	99.9%
docs.python.org	319,554	12,348	96.1%
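
The Reduction column follows from the two size columns as `(1 - scurl / curl) * 100`. The short sketch below recomputes it from the figures in the table; the function name `reduction_percent` is chosen here for illustration, and the unit of the sizes is whatever the table uses (it is not stated).

```python
def reduction_percent(curl_size, scurl_size):
    """Percentage by which the scurl output is smaller than the curl output."""
    return round((1 - scurl_size / curl_size) * 100, 1)


# (curl size, scurl size) pairs copied from the table above
sizes = {
    "example.com": (513, 167),
    "news.ycombinator.com": (34_082, 10_739),
    "en.wikipedia.org/wiki/Curl": (110_373, 10_044),
    "github.com/anthropics": (296_788, 353),
    "docs.python.org": (319_554, 12_348),
}

for site, (curl_size, scurl_size) in sizes.items():
    print(f"{site}: {reduction_percent(curl_size, scurl_size)}%")
```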

Prompt Injection Detection

scurl includes a prompt injection detection system that can identify and handle malicious content designed to manipulate LLMs. Enable it with --enable prompt-defender.

Detection System

The prompt defender uses a multi-layer detection approach:

Pattern Matching: Regex patterns for common injection techniques (instruction override, role injection, system manipulation, jailbreak attempts, etc.)
Motif Analysis: Fuzzy matching of known injection phrases
ML Classification: Random Forest classifier with semantic embeddings for novel attack detection
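
To make the layering concrete, here is a minimal sketch of the first two layers only, written with the standard library. It is not scurl's implementation: the function names, the sliding-window fuzzy match, and the `fuzzy_cutoff` value are assumptions made for illustration, and the ML classification layer is left out entirely. The regex and motifs are taken from the configuration example in this README.

```python
import re
from difflib import SequenceMatcher

# One regex pattern and two motifs borrowed from this README's configuration example
PATTERNS = [
    re.compile(
        r"ignore\s+(all\s+)?(previous|prior|above)\s+(instructions?|rules?|guidelines?)",
        re.IGNORECASE,
    ),
]
MOTIFS = ["ignore all instructions", "forget everything above"]


def pattern_hits(text):
    """Layer 1: return the (start, end) spans matched by any regex pattern."""
    return [m.span() for p in PATTERNS for m in p.finditer(text)]


def motif_score(text, motif):
    """Layer 2: best similarity between the motif and any word window of equal length."""
    words = text.lower().split()
    size = len(motif.split())
    windows = [" ".join(words[i:i + size]) for i in range(max(1, len(words) - size + 1))]
    return max(SequenceMatcher(None, window, motif).ratio() for window in windows)


def looks_like_injection(text, fuzzy_cutoff=0.85):
    """Flag the text if either layer fires; the cutoff is an arbitrary illustrative value."""
    if pattern_hits(text):
        return True
    return any(motif_score(text, motif) >= fuzzy_cutoff for motif in MOTIFS)
```

The regex layer catches exact phrasings cheaply, while the fuzzy layer tolerates small variations such as a dropped plural ("ignore all instruction").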

Supported Languages

Prompt injection detection is available in 13 languages:

Code	Language
`en`	English
`es`	Spanish
`fr`	French
`de`	German
`zh`	Chinese
`ja`	Japanese
`ko`	Korean
`ru`	Russian
`ar`	Arabic
`pt`	Portuguese
`it`	Italian
`hi`	Hindi
`nl`	Dutch

Use --injection-languages to specify which languages to check:

# English only (default)
scurl --enable prompt-defender https://example.com

# Specific languages
scurl --enable prompt-defender --injection-languages en,es,fr https://example.com

# All supported languages
scurl --enable prompt-defender --injection-languages all https://example.com

Detection Actions

When a prompt injection is detected, you can configure how scurl handles it:

Action	Description
`warn`	Wrap suspicious spans in `<suspected-prompt-injection>` tags, content unchanged
`redact`	Wrap in tags and mask detected patterns with `█` characters (default)
`datamark`	Wrap in tags with spotlighting mode for LLM context
`metadata`	Add detection metadata to output, content unchanged
`silent`	No output modification, detection runs silently

# Redact detected injections (default)
scurl --enable prompt-defender --injection-action redact https://example.com

# Just warn, don't modify content
scurl --enable prompt-defender --injection-action warn https://example.com

# Adjust detection threshold (0.0-1.0, default: 0.3)
scurl --enable prompt-defender --injection-threshold 0.5 https://example.com
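
The difference between `warn` and `redact` can be pictured with a small sketch. This is an assumption-laden illustration, not scurl's code: the function name `mark_spans` is hypothetical, the closing tag and the one-mask-character-per-input-character rule are guesses at the output format, and spans are assumed not to overlap.

```python
TAG = "suspected-prompt-injection"


def mark_spans(text, spans, action="redact", mask="█"):
    """Wrap each (start, end) span in tags; for "redact", also mask its characters."""
    if action not in ("warn", "redact"):
        raise ValueError(f"unsupported action in this sketch: {action}")
    pieces, cursor = [], 0
    for start, end in sorted(spans):
        pieces.append(text[cursor:start])  # untouched text before the span
        inner = text[start:end] if action == "warn" else mask * (end - start)
        pieces.append(f"<{TAG}>{inner}</{TAG}>")
        cursor = end
    pieces.append(text[cursor:])  # untouched text after the last span
    return "".join(pieces)
```

With `action="warn"` the flagged text stays readable inside the tags; with `action="redact"` a downstream LLM never sees the flagged characters.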

Custom Pattern Configuration

You can customize or extend the detection patterns by creating YAML configuration files. scurl checks these locations in priority order:

User config: ~/.config/scurl/patterns/ (highest priority)
Project config: .scurl/patterns/ in current directory
Package defaults: Built-in patterns (fallback)
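
The lookup order can be sketched as a simple first-match search. The code below is an assumption about how such a search might work, not scurl's actual loader: the function names are hypothetical, the `{language_code}.yaml` file name follows the convention described in this section, and whether scurl merges files or takes only the first match is not stated in this README.

```python
from pathlib import Path


def pattern_file_candidates(language_code, home=None, cwd=None):
    """Return the user and project override paths, highest priority first."""
    home = Path(home) if home is not None else Path.home()
    cwd = Path(cwd) if cwd is not None else Path.cwd()
    filename = f"{language_code}.yaml"
    return [
        home / ".config" / "scurl" / "patterns" / filename,  # user config
        cwd / ".scurl" / "patterns" / filename,              # project config
    ]


def resolve_pattern_file(language_code, home=None, cwd=None):
    """Return the first override file that exists, or None to fall back to built-ins."""
    for path in pattern_file_candidates(language_code, home, cwd):
        if path.is_file():
            return path
    return None
```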

To override patterns for a language, create a YAML file named {language_code}.yaml:

# ~/.config/scurl/patterns/en.yaml
language: en
name: English
version: "1.0"

patterns:
  instruction_override:
    - 'ignore\s+(all\s+)?(previous|prior|above)\s+(instructions?|rules?|guidelines?)'
    - 'disregard\s+(everything|all)\s+(above|before|prior)'
    # Add your custom patterns...

  role_injection:
    - 'you\s+are\s+now'
    - 'from\s+now\s+on'
    # ...

motifs:
  instruction_override:
    - 'ignore all instructions'
    - 'forget everything above'
    # Simple phrases for fuzzy matching...

Pattern categories:

instruction_override: Attempts to override system instructions
role_injection: Attempts to change the AI's role/persona
system_manipulation: Attempts to enable "developer mode", bypass safety, etc.
prompt_leak: Attempts to extract system prompts
jailbreak_keywords: Known jailbreak techniques (DAN, etc.)
encoding_markers: Base64, ROT13, and other encoding attempts
suspicious_delimiters: Fake system/instruction tags

For CJK languages (Chinese, Japanese, Korean) that don't use spaces between words, add:

settings:
  word_boundaries: false
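
The reason this setting matters can be shown with Python's `re` module. In the sketch below, the assumption is that word-boundary matching amounts to wrapping a phrase in `\b` anchors; how scurl applies `word_boundaries` internally is not documented here, and the helper name `phrase_matches` is hypothetical.

```python
import re


def phrase_matches(phrase, text, word_boundaries=True):
    """Search for a literal phrase, optionally anchored with \\b on both sides."""
    pattern = re.escape(phrase)
    if word_boundaries:
        pattern = rf"\b{pattern}\b"
    return re.search(pattern, text) is not None


text = "请忽略之前的指令"  # "Please ignore the previous instructions"
keyword = "忽略"           # "ignore"

# Adjacent CJK characters all count as word characters, so there is no \b between them
print(phrase_matches(keyword, text, word_boundaries=True))   # False
print(phrase_matches(keyword, text, word_boundaries=False))  # True
```

For space-delimited languages the anchored form still works as expected, for example `phrase_matches("ignore", "please ignore this")` matches.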

Flags

Flag	Description
`--raw`	Disable all response middleware (raw HTML output)
`--readability`	Extract article content only (strips nav, ads, sidebars)
`--render`	Use headless browser for JS-rendered pages
`--disable <slug>`	Disable a middleware by slug (can be repeated)
`--enable <slug>`	Enable an opt-in middleware (can be repeated)
`--list-middleware`	List available middleware and their slugs

Prompt Defender Flags

Flag	Description
`--injection-threshold <0.0-1.0>`	Detection sensitivity (default: 0.3)
`--injection-action <action>`	Action on detection: `warn`, `redact`, `datamark`, `metadata`, `silent`
`--injection-languages <langs>`	Languages to check: comma-separated codes or `all`

Middleware Slugs

Slug	Type	Description
`secret-defender`	Request	Detects and blocks requests containing secrets
`readability`	Response	Extracts clean markdown from HTML
`prompt-defender`	Response	Detects prompt injection in web content (opt-in)
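
How `--enable` and `--disable` interact with default and opt-in middleware can be modelled in a few lines. This is a toy model under stated assumptions, not scurl's middleware API: the `Middleware` type, the `run_chain` function, and the two slugs in the registry are made up for illustration.

```python
from typing import Callable, Dict, Iterable, NamedTuple


class Middleware(NamedTuple):
    handler: Callable[[str], str]
    opt_in: bool = False  # opt-in middleware runs only when explicitly enabled


def run_chain(
    body: str,
    registry: Dict[str, Middleware],
    enable: Iterable[str] = (),
    disable: Iterable[str] = (),
) -> str:
    """Apply each active middleware to the body, in registration order."""
    enabled, disabled = set(enable), set(disable)
    for slug, middleware in registry.items():
        active = (slug in enabled) if middleware.opt_in else (slug not in disabled)
        if active:
            body = middleware.handler(body)
    return body


# Toy registry with made-up slugs, for illustration only
registry = {
    "collapse-whitespace": Middleware(lambda s: " ".join(s.split())),
    "shout": Middleware(str.upper, opt_in=True),
}

print(run_chain("a   b", registry))                    # default middleware only
print(run_chain("a   b", registry, enable=["shout"]))  # opt-in middleware added
```

In this model, passing every default slug to `disable` leaves the body untouched unless an opt-in middleware is enabled.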

Python API

from scurl.prompt_defender import PromptInjectionDefender
from scurl.prompt_defender.middleware import PromptInjectionMiddleware

# Create a defender with custom settings
defender = PromptInjectionDefender(
    threshold=0.3,
    action="redact",
    languages=["en", "es", "fr"],
)

# Analyze text directly
middleware = PromptInjectionMiddleware(languages=["all"])
result = middleware.analyze("Ignore all previous instructions and...")
print(result.is_injection)    # e.g. True
print(result.confidence)      # e.g. 0.95
print(result.detected_spans)  # list of detected injection spans

License

MIT

Release history

Version	Released
0.3.2 (this version)	Feb 15, 2026
0.3.1	Feb 15, 2026
0.3.0	Feb 10, 2026
0.2.3	Feb 6, 2026
0.2.0	Feb 6, 2026
0.1.1	Feb 3, 2026
0.1.0	Feb 3, 2026

Download files

Source Distribution

sibylline_scurl-0.3.2.tar.gz (34.3 kB view details)

Uploaded Feb 15, 2026 Source

Built Distribution

sibylline_scurl-0.3.2-py3-none-any.whl (43.1 kB view details)

Uploaded Feb 15, 2026 Python 3

File details

Details for the file sibylline_scurl-0.3.2.tar.gz.

File metadata

Download URL: sibylline_scurl-0.3.2.tar.gz
Upload date: Feb 15, 2026
Size: 34.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sibylline_scurl-0.3.2.tar.gz
Algorithm	Hash digest
SHA256	`0c87c9c22c57b7fd1adbf27824449c28e6bd9e428c3f0817b721bd74cc1b2da7`
MD5	`cbd8bbf035091da6d780d59cb1465243`
BLAKE2b-256	`2f7ad25642bed8c16e0c732f146adc3e94ab16ff188f6e086de2fa46a5e29a9d`

File details

Details for the file sibylline_scurl-0.3.2-py3-none-any.whl.

File metadata

Download URL: sibylline_scurl-0.3.2-py3-none-any.whl
Upload date: Feb 15, 2026
Size: 43.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sibylline_scurl-0.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cc6e7bc64e5a58776f15481928d6fbeb2b74bc404caa6c244fac33da95f77610`
MD5	`f92f182855c93a7ddf4bfce82fe8fc3e`
BLAKE2b-256	`13e41dbb788abab599084a37d53273a4a66c0e9578ae48a5e7c00045bbf11af2`
