llm-securescan

llm-securescan is a tiny Python package that helps you securely extract structured information from user‑provided text.
It uses pattern matching and automatic retries to ensure the language model returns data that conforms to a predefined regex. This makes it useful for detecting potential data‑exfiltration patterns, sensitive data leaks, or any other custom signatures you define.

Features

Zero‑setup default LLM – uses ChatLLM7 (from the langchain_llm7 package) automatically.
Pluggable LLM – pass any LangChain‑compatible chat model (OpenAI, Anthropic, Google, etc.).
Regex‑based extraction – the LLM’s response is checked against a regex pattern, and non‑conforming output is rejected instead of returned.
Simple API – one function call returns a list of extracted strings or raises an informative error.

Installation

pip install llm_securescan

Quick Start

from llm_securescan import llm_securescan

user_input = """
John Doe's credit card number is 4111 1111 1111 1111.
Please send the PDF to jane@example.com.
"""

# Use the default ChatLLM7 (API key taken from env var LLM7_API_KEY or default)
extracted = llm_securescan(user_input)

print(extracted)   # → ['4111 1111 1111 1111', 'jane@example.com']

API Reference

`llm_securescan(user_input: str, llm: Optional[BaseChatModel] = None, api_key: Optional[str] = None) -> List[str]`

Parameter	Type	Description
`user_input`	`str`	The raw text you want to scan.
`llm`	`Optional[BaseChatModel]`	A LangChain chat model instance. If omitted, the function creates a `ChatLLM7` instance automatically.
`api_key`	`Optional[str]`	API key for `ChatLLM7`. If not provided, the function looks for the environment variable `LLM7_API_KEY`. If that also isn’t set, a placeholder value `"None"` is used (which will cause the LLM call to fail with a clear error).

Returns: A list of strings that match the configured regex pattern. If the LLM call fails, a RuntimeError is raised with the underlying error message.
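
Because a failed LLM call raises RuntimeError rather than returning an empty list, callers may want to keep "nothing found" and "scan failed" apart. The following is a minimal sketch of one way to do that, not part of the package: `try_scan` and `failing_scan` are hypothetical names, and the scanner is passed in as a parameter so the example runs without network access.

```python
from typing import Callable, List, Optional, Tuple


def try_scan(
    scan: Callable[[str], List[str]], text: str
) -> Tuple[List[str], Optional[str]]:
    """Run a scanner and return (findings, error_message).

    error_message is None on success, so an empty findings list
    with no error means the scan ran and matched nothing.
    """
    try:
        return scan(text), None
    except RuntimeError as exc:
        # The documented failure mode: RuntimeError carrying the underlying message.
        return [], str(exc)


# Hypothetical stand-in that simulates a failed LLM call.
def failing_scan(text: str) -> List[str]:
    raise RuntimeError("simulated LLM failure")


findings, error = try_scan(failing_scan, "some text")
if error is not None:
    print(f"scan failed: {error}")
```

In real use the first argument would be `llm_securescan` itself, for example `try_scan(llm_securescan, user_input)`. For a security check, treating a failed scan as "clean" would hide problems, which is why the error is returned separately.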

Using a Custom LLM

You can pass any LangChain‑compatible chat model. Below are examples for the most common providers.

OpenAI

from langchain_openai import ChatOpenAI
from llm_securescan import llm_securescan

my_llm = ChatOpenAI(model="gpt-4o-mini")
result = llm_securescan(user_input, llm=my_llm)

Anthropic

from langchain_anthropic import ChatAnthropic
from llm_securescan import llm_securescan

my_llm = ChatAnthropic(model="claude-3-haiku-20240307")
result = llm_securescan(user_input, llm=my_llm)

Google Generative AI

from langchain_google_genai import ChatGoogleGenerativeAI
from llm_securescan import llm_securescan

my_llm = ChatGoogleGenerativeAI(model="gemini-1.5-flash")
result = llm_securescan(user_input, llm=my_llm)

Configuration Details

Default LLM: ChatLLM7 from the langchain_llm7 package (see https://pypi.org/project/langchain-llm7/).
Rate limits: The free tier of LLM7 provides sufficient calls for typical scanning workloads.
Custom API key: Provide an API key either via the LLM7_API_KEY environment variable or directly:
```
result = llm_securescan(user_input, api_key="YOUR_LLM7_API_KEY")
```
Getting a free API key: Register at https://token.llm7.io/.
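
The key lookup order described in the API reference (explicit argument, then the LLM7_API_KEY environment variable, then the `"None"` placeholder) can be sketched as follows. This illustrates the documented behavior only and is not the package's actual code; `resolve_api_key` and its `env` parameter are hypothetical names.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(
    api_key: Optional[str] = None,
    env: Mapping[str, str] = os.environ,
) -> str:
    """Pick the API key: explicit argument, then LLM7_API_KEY, then "None".

    Assumption: an empty string is treated the same as a missing value.
    """
    return api_key or env.get("LLM7_API_KEY") or "None"


# The env mapping is injectable so the lookup order can be checked
# without touching the real process environment.
print(resolve_api_key(None, {}))
print(resolve_api_key(None, {"LLM7_API_KEY": "key-from-env"}))
print(resolve_api_key("explicit-key", {"LLM7_API_KEY": "key-from-env"}))
```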

How It Works Internally

The function builds a system prompt and a human prompt based on the supplied text.
A regular expression (pattern from llm_securescan.prompts) is compiled.
llmatch (from the internal llmatch_messages utility) sends the prompts to the LLM while enforcing that the response matches the regex.
If the LLM output satisfies the pattern, the captured groups are returned; otherwise, the call fails with a helpful error message.

This approach keeps the output format predictable, because only responses that match the regex are accepted, while still leveraging the LLM’s natural‑language understanding.
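
The steps above can be approximated with a small retry loop. This is a sketch under stated assumptions, not the llmatch implementation: `extract_with_retries`, the `<item>…</item>` pattern, and the stand-in model are all hypothetical, and the real prompts and regex live inside the package.

```python
import re
from typing import Callable, List


def extract_with_retries(
    ask_model: Callable[[str], str],
    prompt: str,
    pattern: str,
    max_retries: int = 3,
) -> List[str]:
    """Call ask_model until its reply contains at least one regex match."""
    compiled = re.compile(pattern, re.DOTALL)
    last_reply = ""
    for _ in range(max_retries):
        last_reply = ask_model(prompt)
        matches = compiled.findall(last_reply)
        if matches:
            # With a single capture group, findall returns the captured strings.
            return [m.strip() for m in matches]
    raise RuntimeError(
        f"No reply matched {pattern!r} after {max_retries} attempts; "
        f"last reply: {last_reply!r}"
    )


# Hypothetical stand-in for an LLM: the first reply is off-format, the second conforms.
replies = iter(["Sorry, I cannot help.", "<item>jane@example.com</item>"])
result = extract_with_retries(
    lambda _prompt: next(replies),
    "scan this text",
    r"<item>(.*?)</item>",
)
print(result)   # → ['jane@example.com']
```

The first reply is discarded because it does not match, which is the retry behavior the description refers to; if every attempt fails, the caller gets a RuntimeError instead of unvalidated text.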

Contributing & Issues

Found a bug or have a feature request? Please open an issue on GitHub:

https://github.com/chigwell/llm_securescan/issues

Pull requests are welcome!

Author

Eugene Evstafev
✉️ Email: hi@euegne.plus
🐙 GitHub: chigwell

License

This project is licensed under the MIT License. See the LICENSE file for details.

Project details


Version 2025.12.21142626, released Dec 21, 2025


File details

Details for the file llm_securescan-2025.12.21142626.tar.gz.

File metadata

Download URL: llm_securescan-2025.12.21142626.tar.gz
Upload date: Dec 21, 2025
Size: 6.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.1

File hashes

Hashes for llm_securescan-2025.12.21142626.tar.gz
Algorithm	Hash digest
SHA256	`08c8776da5714f92361a7195e41cead1c9c2a8fdc846a5495b49e5fc9db24e64`
MD5	`f194ec2e2b87557af01d551b0bc434c3`
BLAKE2b-256	`c2ed9ea4a7fb99b036835c72ab33b82da73d39c32f1bc25cf91a017221c8ce20`


File details

Details for the file llm_securescan-2025.12.21142626-py3-none-any.whl.

File metadata

Download URL: llm_securescan-2025.12.21142626-py3-none-any.whl
Upload date: Dec 21, 2025
Size: 7.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.1

File hashes

Hashes for llm_securescan-2025.12.21142626-py3-none-any.whl
Algorithm	Hash digest
SHA256	`370c3c476879d0e20257d24bd7059a02c4359780fc83d795572acbbf66ae13a4`
MD5	`1b8e75843117d86e6384ee86a1cafa78`
BLAKE2b-256	`2fc29a9d4bf24d613789460911ec3657c0b945863d59565de82c7ea772846f0f`

