Cognitive Security Middleware - The 'Electronic Stability Program' (ESP) for Large Language Models. Bidirectional containment system with defense-in-depth architecture, stateful tracking, and mathematical safety constraints.

These details have not been verified by PyPI

Project links

Project description

LLM Security Firewall

Bidirectional security framework for human/LLM interfaces implementing defense-in-depth architecture with multiple validation layers.

Version: 2.4.0 Python: >=3.12 License: MIT Status: Production

Overview

The system implements a stateful, bidirectional containment mechanism for large language models. It processes requests through sequential validation layers, applying mathematical constraints and stateful tracking to enforce safety boundaries.

The architecture follows a hexagonal pattern with Protocol-based Port/Adapter interfaces and dependency injection. Core business logic is separated from infrastructure concerns. Domain layer uses Protocol-based adapters (DecisionCachePort, DecoderPort, ValidatorPort) for framework independence.

Architecture

Bidirectional Processing Pipeline

The system operates in three directions:

Human → LLM (Input Protection)
- Normalization and sanitization
- Pattern matching and evasion detection
- Risk scoring and policy evaluation
- Session state tracking
LLM → Human (Output Protection)
- Evidence validation
- Tool call validation
- Output sanitization
- Truth preservation checks
Memory Integrity
- Session state management
- Drift detection
- Influence tracking

Core Components

Firewall Engine (src/llm_firewall/core/firewall_engine_v2.py)

Main decision engine
Risk score aggregation
Policy application
Unicode security analysis

Normalization Layer (src/hak_gal/layers/inbound/normalization_layer.py)

Recursive URL/percent decoding
Unicode normalization (NFKC)
Zero-width character removal
Directional override character removal

Pattern Matching (src/llm_firewall/rules/patterns.py)

Regex-based pattern detection
Concatenation-aware matching
Evasion pattern detection

Risk Scoring (src/llm_firewall/core/risk_scorer.py)

Multi-factor risk calculation
Cumulative risk tracking
Threshold-based decisions

Cache System (src/llm_firewall/cache/decision_cache.py)

Exact match caching (Redis)
Semantic caching (LangCache)
Hybrid mode support
Circuit breaker pattern
Fail-safe behavior (blocks on cache failure, prevents security bypass)

Adapter Health (src/llm_firewall/core/adapter_health.py)

Circuit breaker implementation
Health metrics tracking
Failure threshold management
Recovery timeout handling

Developer Adoption API (src/llm_firewall/guard.py)

Simple one-liner integration: guard.check_input(text), guard.check_output(text)
Backward compatible with existing API
See QUICKSTART.md for 5-minute integration guide

LangChain Integration (src/llm_firewall/integrations/langchain/callbacks.py)

FirewallCallbackHandler for LangChain chains
Automatic input/output validation
See examples/langchain_integration.py for usage

Dependencies

Core:

numpy>=1.24.0
scipy>=1.11.0
scikit-learn>=1.3.0
pyyaml>=6.0
blake3>=0.3.0
requests>=2.31.0
psycopg[binary]>=3.1.0
redis>=5.0.0
pydantic>=2.0.0
psutil>=5.9.0
cryptography>=41.0.0

Machine Learning:

sentence-transformers>=2.2.0
torch>=2.0.0
transformers>=4.30.0
onnx>=1.14.0
onnxruntime>=1.16.0

Installation

From PyPI (recommended):

pip install llm-security-firewall

For development (local installation):

pip install -e .

For development dependencies:

pip install -e .[dev]

Optional extras:

pip install llm-security-firewall[langchain]  # LangChain integration
pip install llm-security-firewall[dev]       # Development tools
pip install llm-security-firewall[monitoring] # Monitoring tools

Configuration

Cache Modes

Configure via CACHE_MODE environment variable:

exact (default): Redis exact match cache
semantic: LangCache semantic search
hybrid: Both caches in sequence

Redis Configuration

export REDIS_URL=redis://:password@host:6379/0
export REDIS_TTL=3600  # Optional: Cache TTL in seconds

For Redis Cloud:

export REDIS_CLOUD_HOST=host
export REDIS_CLOUD_PORT=port
export REDIS_CLOUD_USERNAME=username
export REDIS_CLOUD_PASSWORD=password

Examples

See the examples/ directory for complete integration examples:

quickstart.py - Simplest possible integration using guard.py API (< 10 lines)
langchain_integration.py - LangChain integration with FirewallCallbackHandler
minimal_fastapi.py - FastAPI middleware integration
quickstart_fastapi.py - Full FastAPI example with input/output validation

Run examples:

python examples/quickstart.py
python examples/langchain_integration.py
python examples/minimal_fastapi.py

Quick Start (Developer API):

from llm_firewall import guard

# One-liner input validation
decision = guard.check_input("user input text")
if decision.allowed:
    # Process request
    pass

Testing

Test suite includes unit tests, integration tests, and adversarial test cases.

pytest tests/ -v

With coverage:

pytest tests/ -v --cov=src/llm_firewall --cov-report=term

Known Limitations

False Positive Rate: Kids Policy false positive rate is approximately 20-25% (target: <5%)
Memory Usage: Current memory usage exceeds 300MB cap for adversarial inputs (measured: ~1.3GB)
Unicode Normalization: Some edge cases in mathematical alphanumeric symbol handling

Security Hardening

Implemented Measures

Multi-Tenant Isolation
- Session hashing via HMAC-SHA256(tenant_id + user_id + DAILY_SALT)
- Redis key isolation via ACLs and prefixes
Oscillation Defense
- CUSUM (Cumulative Sum Control Chart) algorithm
- Accumulative risk tracking across session turns
Parser Differential Protection
- StrictJSONDecoder with duplicate key detection
- Immediate exception on key duplication
Unicode Security
- Zero-width character detection and removal
- Directional override character detection
- Homoglyph normalization
Pattern Evasion Detection
- Concatenation-aware pattern matching
- Encoding anomaly detection
- Obfuscation pattern recognition

Performance Characteristics

P99 Latency: <200ms for standard inputs (measured)
Cache Hit Rate: 30-50% (exact), 70-90% (hybrid)
Cache Latency: <100ms (Redis Cloud), <1ms (local Redis)

Monitoring

MCP monitoring tools available for health checks and metrics:

firewall_health_check: Redis/Session health inspection
firewall_deployment_status: Traffic percentage and rollout phase
firewall_metrics: Real-time block rates and CUSUM scores
firewall_check_alerts: Critical P0 alerts
firewall_redis_status: ACL and connection pool health

Implementation Status

P0 Items (Critical):

Circuit breaker pattern: Implemented
False positive tracking: Implemented (rate: ~20-25%)
P99 latency metrics: Implemented (<200ms verified)
Cache mode switching: Implemented
Adversarial bypass detection: Implemented (0/50 bypasses in test suite)

P1 Items (High Priority):

Shadow-allow mechanism: Configuration-only
Cache invalidation strategy: TTL-based
Bloom filter parameters: Configurable

P2 Items (Medium Priority):

Concurrency model: Single-threaded
Progressive decoding: Not implemented
Forensic capabilities: Basic logging
STRIDE threat model: Partial

References

Architecture documentation: docs/SESSION_HANDOVER_2025_12_01.md (v2.4.0rc1)
Technical handover: docs/TECHNICAL_HANDOVER_2025_12_01.md (pre-v2.4.0rc1)
Test results: docs/TEST_RESULTS_SUMMARY.md
External review response: docs/EXTERNAL_REVIEW_RESPONSE.md
PyPI release report: docs/PYPI_RELEASE_REPORT_2025_12_02.md

License

MIT License

Author

Joerg Bollwahn Email: sookoothaii@proton.me

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

2.5.0

Dec 4, 2025

This version

2.4.0

Dec 2, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_security_firewall-2.4.0.tar.gz (859.2 kB view details)

Uploaded Dec 2, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llm_security_firewall-2.4.0-py3-none-any.whl (512.2 kB view details)

Uploaded Dec 2, 2025 Python 3

File details

Details for the file llm_security_firewall-2.4.0.tar.gz.

File metadata

Download URL: llm_security_firewall-2.4.0.tar.gz
Upload date: Dec 2, 2025
Size: 859.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for llm_security_firewall-2.4.0.tar.gz
Algorithm	Hash digest
SHA256	`af9cd7167c86f74578657f2eab047708a743784f75ab4d6cda520dc787c303b6`
MD5	`52b88b37fcb464c5008b43ba67953966`
BLAKE2b-256	`9c9071e07b0cdf360ad75ad2c923c17c451a45a93fe4f2eb4379530c4c1c0836`

See more details on using hashes here.

File details

Details for the file llm_security_firewall-2.4.0-py3-none-any.whl.

File metadata

Download URL: llm_security_firewall-2.4.0-py3-none-any.whl
Upload date: Dec 2, 2025
Size: 512.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for llm_security_firewall-2.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7bb94b919bdcf6511c6c34f813c3146aec46e0f841967536100602f4e515db4e`
MD5	`6c7dcc157ed8da395f628f9e8d2765d1`
BLAKE2b-256	`badb2b500925d1399a4786e41dabe682fc991509d2cd44c338edced62d755264`

See more details on using hashes here.

llm-security-firewall 2.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

LLM Security Firewall

Overview

Architecture

Bidirectional Processing Pipeline

Core Components

Dependencies

Installation

Configuration

Cache Modes

Redis Configuration

Examples

Testing

Known Limitations

Security Hardening

Implemented Measures

Performance Characteristics

Monitoring

Implementation Status

References

License

Author

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes