Skip to main content

A package for sentinel detectors and utilities

Project description

Prompt Sentinel

Table of Contents

Introduction

Prompt Sentinel is a Python library designed to protect sensitive data during interactions with language models (LLMs). It automatically detects and sanitizes confidential information—such as passwords, tokens, and secrets—before sending input to the LLM, reducing the risk of accidental exposure. Once a response is received, the original values are seamlessly restored. This makes the masking process transparent to the client, enabling smooth and uninterrupted interactions, including function calling.

Prompt Sentinel’s focuses on simplicity—it works out of the box without requiring complex proxy-based LLM deployments or infrastructure changes. It offers flexible detection options, ranging from regular expressions to trusted local or external LLMs, enabling accurate identification of sensitive data such as personally identifiable information (PII) and secrets. Both detection and restoration processes are easy to set up and highly configurable to suit a wide range of applications. To avoid redundant detection calls—especially when relying on LLM-based detection—Prompt Sentinel includes a smart caching mechanism that boosts efficiency and significantly reduces overhead.

Usage

Below are examples of how to use Prompt Sentinel in different LLM pipelines. For detailed examples, please refer to the examples directory in the repository.

Decorating an LLM Function Call

@sentinel(detector=LLMSecretDetector(...))
def call_llm(messages):
    # Call the LLM with sanitized messages
    return response

Wrapping an entire class

InstrumentedClass = instrument_model_class(BaseChatModel, detector=LLMSecretDetector(...), methods_to_wrap=['invoke', 'ainvoke', 'stream', 'astream'])
llm = InstrumentedClass()
response = llm.invoke(messages)

Features

  • Sensitive Data Detection:
    Use detectors like LLMSecretDetector and PythonStringDataDetector to identify sensitive or private data in your text.

  • Automatic Sanitization through secret masking:
    Mechanisms for replacing detected secrets with unique mask tokens (e.g., __SECRET_1__) so that the LLM operates on sanitized input.

  • Following the LLM returned output, the response is decoded to reinstate the original secrets.

  • Decorator Integration:
    Easily integrate secret sanitization into your LLM calling pipeline using the @sentinel decorator. Preprocess your messages before they reach the LLM and post-process the responses to decode tokens.

  • Caching:
    Implement caching for repeated detections on the same text to reduce redundant API calls and improve performance.

  • Async LLM calls Support async function/method LLM call decoration.

Installation

Install the package using pip (or include it in your project as needed):

pip install prompt-sentinel

Note: This package requires Python 3.7 or higher.

How It Works

Step-by-step flow:

  1. User Input
    The user submits a prompt containing potential secrets.

  2. Sanitize Input via @sentinel
    The decorator intercepts the prompt before it reaches the LLM.

  3. Detect Secrets with SecretDetector
    A detector scans the prompt for sensitive information like passwords, keys, or tokens.

  4. Replace Secrets with Tokens (e.g., __SECRET_1__)
    Each secret is replaced by a unique placeholder token and stored in a mapping.

  5. Send Sanitized Input to LLM
    The modified, tokenized prompt is passed to the language model.

  6. LLM Generates Response with Tokens
    The response from the model may include those placeholder tokens.

  7. Decode LLM Output using Secret Mapping
    Tokens are replaced with their original secrets using the stored mapping.

Customization

  • Detectors:
    You can implement your own secret detectors by extending the SecretDetector abstract base class. Check out the provided implementations in the sentinel_detectors module for guidance.

  • Context Management:
    Internally, a singleton context is used to persist secret mappings during LLM interaction and tool invocation. This ensures secrets encoded in the LLM prompt are automatically decoded before tool execution.

  • Caching:
    The detectors can use caching to avoid redundant API calls. In the provided implementation of LLMSecretDetector, caching is handled via an instance variable (_detect_cache).

Contributing

Contributions are welcome! Feel free to submit issues or pull requests on GitHub. When contributing, please follow the guidelines in our CONTRIBUTING.md.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

prompt_sentinel-0.1.4.tar.gz (12.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

prompt_sentinel-0.1.4-py3-none-any.whl (13.9 kB view details)

Uploaded Python 3

File details

Details for the file prompt_sentinel-0.1.4.tar.gz.

File metadata

  • Download URL: prompt_sentinel-0.1.4.tar.gz
  • Upload date:
  • Size: 12.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for prompt_sentinel-0.1.4.tar.gz
Algorithm Hash digest
SHA256 10e196a34425b487f21ad087cddb7f16a442c9fa84019a55c9e16227aa30621c
MD5 7f6321de579325babf4cdb43bd362fee
BLAKE2b-256 c680805d8829aad1d4d39c84ce5f8fb2e018d73eb468129ea2efcd55a423629b

See more details on using hashes here.

File details

Details for the file prompt_sentinel-0.1.4-py3-none-any.whl.

File metadata

File hashes

Hashes for prompt_sentinel-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 4e9f7f27031e7f64a91720874e1228ab91e002a491ecb4abb6e48f5300b2cf36
MD5 f3096099308b6454ca78859019f71d24
BLAKE2b-256 6e282e4fa064c32db6b04d3e761ba924dd4fce2ef17b179b91823de466b90582

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page