Scan prompts for LLMs to ensure content safety and apply guardrails

Prompt Scanner

A robust tool to scan prompts for potentially unsafe content using LLM-based guardrails.

Current Version: 0.3.1 - Now with enhanced severity levels for better risk assessment!

Overview

Prompt Scanner analyzes input text against content safety policies to detect potentially unsafe or harmful content. It uses Large Language Models (LLMs) as content judges to provide more context-aware and nuanced content safety evaluations than simple pattern matching.

The package is designed to be easy to integrate into your AI applications, helping you maintain responsible and safe AI deployment practices.

Features

Multiple Provider Support: Uses OpenAI or Anthropic APIs for content safety evaluation
Comprehensive Safety Categories: Identifies content across various safety categories
Multi-category Detection: Supports detecting multiple policy violations in a single piece of content
Standardized Severity Levels: Categorizes risk with LOW, MEDIUM, HIGH, and CRITICAL severity levels
Prompt Injection Protection: Checks for prompt injection attacks and other security risks
Detailed Analysis: Returns structured responses with detailed reasoning
Performance Metrics: Includes token usage metrics
Customizable: Supports customizing the LLM model used for evaluation
Custom Guardrails: Add your own custom guardrails and content policy categories
Rich Command Line Interface: Scan prompts directly from the terminal with detailed output

What's New in 0.3.1

Enhanced Severity Levels: Added standardized severity assessment with LOW, MEDIUM, HIGH, and CRITICAL levels
Severity Feedback: Included detailed severity information in scan results, CLI output, and JSON responses
Improved LLM Prompts: Updated LLM evaluation prompts to include severity assessment
Category-Based Severity: Automatically assigns CRITICAL severity to particularly dangerous categories
See CHANGELOG.md for full details.
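
To illustrate how the four severity levels might be used downstream, here is a minimal sketch of a threshold check. The `Severity` enum and `should_block` helper are hypothetical stand-ins written for this example; they are not the package's own types, and the sketch assumes the level is available as one of the strings LOW, MEDIUM, HIGH, or CRITICAL.

```python
from enum import IntEnum


class Severity(IntEnum):
    """Hypothetical stand-in for the four levels named above (not the package's class)."""
    LOW = 1
    MEDIUM = 2
    HIGH = 3
    CRITICAL = 4


def should_block(level_name: str, threshold: Severity = Severity.HIGH) -> bool:
    """Return True when the reported level meets or exceeds the threshold."""
    return Severity[level_name.upper()] >= threshold


print(should_block("medium"))    # False
print(should_block("CRITICAL"))  # True
```

Using an ordered enum keeps the comparison explicit, so an application can tighten or relax its threshold without touching the scanning code.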

Quick Start

Installation

# Install the latest version
pip install prompt-scanner

# Or specify the version explicitly
pip install prompt-scanner==0.3.1

Basic Usage

from prompt_scanner import PromptScanner

# Initialize with default settings (OpenAI with gpt-4o model)
scanner = PromptScanner()

# Scan a text input for unsafe content
result = scanner.scan_text("What's the weather like today?")

# Check the safety status
if result.is_safe:
    print("Content is safe!")
else:
    print(f"Primary violation: {result.category.name}")
    print(f"Severity: {result.severity.level.value}")  # Now provides severity information
    print(f"Reasoning: {result.reasoning}")

Command Line Interface

After installation, you can use the prompt-scanner command:

# Basic usage
prompt-scanner --text "What's the weather like today?"

# With API key
prompt-scanner --openai-api-key "your-key" --text "Tell me about Mars"

# Read from a file
prompt-scanner --file input.txt

# Read from stdin
cat input.txt | prompt-scanner --stdin

# Use Anthropic instead of OpenAI
prompt-scanner --provider anthropic --text "Tell me about Mars"

# Get basic process information
prompt-scanner -v --text "What's the weather like today?"

# Get full detailed output including token usage
prompt-scanner -vv --text "What's the weather like today?"

# Output in JSON format
prompt-scanner --text "What's the weather like today?" --format json

# Use custom guardrails
prompt-scanner --text "Tell me a secret" --guardrail-file custom_guardrails.json

# Disable colored output
prompt-scanner --text "What's the weather like today?" --no-color
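
When the CLI is driven from another program, it can help to assemble the argument list in code rather than by string concatenation. The following sketch uses only flags that appear in the examples above; `build_scan_command` is a hypothetical helper, and it does not run the command itself.

```python
import shlex
from typing import List, Optional


def build_scan_command(
    text: str,
    provider: Optional[str] = None,
    output_format: Optional[str] = None,
    guardrail_file: Optional[str] = None,
    color: bool = True,
) -> List[str]:
    """Assemble an argv list for the prompt-scanner CLI from the flags shown above."""
    cmd = ["prompt-scanner", "--text", text]
    if provider:
        cmd += ["--provider", provider]
    if output_format:
        cmd += ["--format", output_format]
    if guardrail_file:
        cmd += ["--guardrail-file", guardrail_file]
    if not color:
        cmd.append("--no-color")
    return cmd


cmd = build_scan_command(
    "Tell me about Mars", provider="anthropic", output_format="json", color=False
)
# Print a shell-quoted form of the command for inspection.
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, capture_output=True, text=True)`; passing a list avoids shell quoting problems with user-supplied text.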

Adding Custom Guardrails

from prompt_scanner import PromptScanner

scanner = PromptScanner()

# Define a custom guardrail
custom_guardrail = {
    "type": "privacy",
    "description": "Prevents sharing of technical architecture details",
    "patterns": [
        {
            "type": "regex",
            "value": r"(AWS|Azure|GCP)\s+(access|secret)\s+key",
            "description": "Cloud provider access keys"
        }
    ]
}

# Add the custom guardrail to the scanner
scanner.add_custom_guardrail("technical_info_protection", custom_guardrail)
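
Before registering a regex guardrail, it can be worth checking what the pattern actually matches. The sketch below tests the pattern from the example with Python's standard `re` module and default flags; how the package itself applies the pattern (for instance, whether it ignores case) is not stated here, so treat this as an assumption.

```python
import re

# Same pattern as in the guardrail definition above.
CLOUD_KEY_PATTERN = re.compile(r"(AWS|Azure|GCP)\s+(access|secret)\s+key")

samples = [
    "Here is my AWS secret key: abc123",   # matches
    "Rotate the GCP access  key weekly",   # matches: \s+ allows repeated whitespace
    "aws secret key",                      # no match: default flags are case-sensitive
    "AWS key",                             # no match: 'access' or 'secret' is required
]

for sample in samples:
    print(bool(CLOUD_KEY_PATTERN.search(sample)), "-", sample)
```

If lowercase mentions should also be caught, compiling with `re.IGNORECASE` would be one option in standalone code.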

Using Decorators

from prompt_scanner import PromptScanner, PromptScanResult
from openai import OpenAI

scanner = PromptScanner()
client = OpenAI()

# Decorator that scans prompts before processing
@scanner.decorators.scan(prompt_param="user_input")
def generate_content(user_input):
    # This function will only run if the content is safe
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": user_input}
        ]
    )
    return response.choices[0].message.content

# Usage
result = generate_content(user_input="Tell me about space")
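
For readers curious how a decorator of this kind can work, here is a minimal, self-contained sketch. It is not the package's implementation: `scan_before_call`, `BlockedPromptError`, and the `no_injection` check are hypothetical, and raising an exception on unsafe input is only one possible behavior.

```python
import functools
import inspect
from typing import Callable


class BlockedPromptError(ValueError):
    """Hypothetical error raised when the checked argument is rejected."""


def scan_before_call(is_safe: Callable[[str], bool], prompt_param: str):
    """Build a decorator that checks one named argument before calling the function."""
    def decorator(func):
        signature = inspect.signature(func)

        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            # Resolve the argument by name, whether passed positionally or by keyword.
            bound = signature.bind(*args, **kwargs)
            bound.apply_defaults()
            prompt = bound.arguments[prompt_param]
            if not is_safe(prompt):
                raise BlockedPromptError(f"{prompt_param!r} failed the safety check")
            return func(*args, **kwargs)

        return wrapper

    return decorator


def no_injection(text: str) -> bool:
    """Toy check standing in for a real scanner call."""
    return "ignore previous instructions" not in text.lower()


@scan_before_call(no_injection, prompt_param="user_input")
def echo(user_input: str) -> str:
    return f"echo: {user_input}"


print(echo("Tell me about space"))             # positional argument
print(echo(user_input="Tell me about space"))  # keyword argument
```

Binding through `inspect.signature` lets the wrapper find the prompt regardless of how the caller passes it.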

Documentation

For detailed documentation, please see the docs directory.

Examples

The package includes example scripts to demonstrate functionality:

# Using default (OpenAI with gpt-4o)
python examples/content_scan_example.py

# Basic usage example
python examples/basic_usage.py

# Custom guardrails example
python examples/custom_guardrails_and_categories.py

# CLI usage examples
bash examples/installed_cli_examples.sh

# Run CLI without installation
bash examples/run_without_installation.sh

Quality

This package is built with quality in mind:

100% test coverage with thorough unit and integration tests
Well-documented API with detailed examples
Comprehensive error handling and validation
Support for multiple LLM providers

Configuration

See the Getting Started documentation for various ways to configure API keys.
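
As one illustration, an application could choose a provider based on which API key is present in the environment. `OPENAI_API_KEY` and `ANTHROPIC_API_KEY` are the variables read by the official OpenAI and Anthropic Python SDKs; whether Prompt Scanner reads them directly is an assumption here, as are the `pick_provider` helper and the "openai" provider name.

```python
import os
from typing import Mapping, Optional


def pick_provider(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return a provider name based on which API key variable is set, if any."""
    if env.get("OPENAI_API_KEY"):
        return "openai"
    if env.get("ANTHROPIC_API_KEY"):
        return "anthropic"
    return None


provider = pick_provider()
if provider is None:
    print("No API key found; set OPENAI_API_KEY or ANTHROPIC_API_KEY")
else:
    print(f"Using provider: {provider}")
```

Keeping keys in environment variables rather than in source code also avoids committing them to version control.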

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

1. Fork the repository
2. Create your feature branch (git checkout -b feature/amazing-feature)
3. Commit your changes (git commit -m 'Add some amazing feature')
4. Push to the branch (git push origin feature/amazing-feature)
5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Release history

0.3.1 (this version): Mar 30, 2025
0.3.0: Mar 30, 2025
0.2.0: Mar 30, 2025
0.1.0: Mar 30, 2025

