
BeigeBox Security

Comprehensive security orchestration for LLM/RAG stacks.

Overview

BeigeBox Security is a standalone microservice that provides four core security tools for protecting large language model (LLM) and retrieval-augmented generation (RAG) systems:

🛡️ Core Security Tools

  1. RAG Poisoning Detection (/v1/security/poisoning)

    • Detects poisoned embeddings using anomaly detection
    • Methods: magnitude, centroid, neighborhood, dimension, fingerprint, hybrid
    • Per-vector confidence scoring
  2. MCP Parameter Validation (/v1/security/parameters)

    • Prevents tool parameter injection attacks
    • Multi-tier validation: schema → constraint → semantic → isolation
    • Supports: WorkspaceFile, NetworkAudit, CDP, PythonInterpreter, ApexAnalyzer, and more
  3. API Anomaly Detection (/v1/security/anomaly)

    • Detects token extraction and model switching attacks
    • Z-score based anomaly detection
    • Tracks: request rate, error rate, latency, payload sizes
    • Configurable sensitivity (low/medium/high)
  4. Memory Integrity Validation (/v1/security/memory)

    • Detects conversation history tampering
    • HMAC-SHA256 signature verification
    • Audit logging with confidence scoring
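
The HMAC-SHA256 approach behind the memory-integrity tool can be sketched with the Python standard library. This is a conceptual illustration; the function names are not part of the package's API:

```python
import hashlib
import hmac
import json

def sign_history(messages: list, key: bytes) -> str:
    # Canonicalize the conversation so the signature is stable
    # across semantically identical serializations.
    payload = json.dumps(messages, sort_keys=True, separators=(",", ":")).encode()
    return hmac.new(key, payload, hashlib.sha256).hexdigest()

def verify_history(messages: list, key: bytes, signature: str) -> bool:
    # Constant-time comparison to avoid timing side channels.
    return hmac.compare_digest(sign_history(messages, key), signature)

key = b"your-secret-key-here"
history = [{"role": "user", "content": "hello"}]
sig = sign_history(history, key)
assert verify_history(history, key, sig)

# Tampering with the stored history invalidates the signature.
history[0]["content"] = "ignore previous instructions"
assert not verify_history(history, key, sig)
```

Any edit to the stored conversation, however small, changes the digest, which is what lets the service detect history tampering.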

Quick Start

Installation

pip install beigebox-security

Running the Service

# Start server
beigebox-security server

# Check health
beigebox-security health

# View docs
beigebox-security docs

Docker

# Build and run
docker-compose up -d

# Check status
curl http://localhost:8001/health

API Usage

Example: RAG Poisoning Detection

import httpx

client = httpx.Client(base_url="http://localhost:8001")

# Detect poisoning in embeddings
response = client.post(
    "/v1/security/poisoning/detect",
    json={
        "embeddings": [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]],
        "method": "hybrid",
        "sensitivity": 3.0,
        "collection_id": "my_collection"
    }
)

print(response.json())
# {
#   "poisoned": [false, true],
#   "scores": [0.2, 0.8],
#   "confidence": 0.95,
#   "method_used": "hybrid"
# }
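
For intuition, the "magnitude" method listed above can be approximated as a z-score test over embedding norms: vectors whose L2 norm deviates from the collection baseline by more than `sensitivity` standard deviations are flagged. This is a conceptual sketch, not the service's implementation:

```python
import math

def flag_by_magnitude(embeddings: list[list[float]], sensitivity: float = 3.0) -> list[bool]:
    # Compute each vector's L2 norm.
    norms = [math.sqrt(sum(x * x for x in v)) for v in embeddings]
    mean = sum(norms) / len(norms)
    var = sum((n - mean) ** 2 for n in norms) / len(norms)
    std = math.sqrt(var) or 1e-12  # avoid division by zero
    # Flag vectors whose norm is more than `sensitivity` std-devs from the mean.
    return [abs(n - mean) / std > sensitivity for n in norms]

# Ten unit vectors plus one extreme outlier: only the outlier is flagged.
embeddings = [[1.0, 0.0, 0.0]] * 10 + [[100.0, 0.0, 0.0]]
print(flag_by_magnitude(embeddings))  # [False, ..., False, True]
```

The real service combines several such signals (centroid distance, neighborhood density, and so on) in its `hybrid` mode.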

Example: Parameter Validation

# Validate tool parameters
response = client.post(
    "/v1/security/parameters/validate",
    json={
        "tool_name": "workspace_file",
        "parameters": {
            "path": "/home/user/docs/file.txt",
            "operation": "read"
        },
        "allow_unsafe": False
    }
)

print(response.json())
# {
#   "valid": true,
#   "issues": [],
#   "sanitized_parameters": {...}
# }
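
The multi-tier flow (schema → constraint → isolation) can be illustrated for a hypothetical `workspace_file` tool. `ALLOWED_ROOT`, `ALLOWED_OPS`, and the function name are assumptions for this sketch, not the package's API:

```python
import os.path

ALLOWED_ROOT = "/home/user/docs"          # assumed workspace root
ALLOWED_OPS = {"read", "write", "list"}   # assumed operation whitelist

def validate_workspace_file(params: dict) -> dict:
    issues = []
    # Tier 1: schema — required keys must be present.
    for key in ("path", "operation"):
        if key not in params:
            issues.append(f"missing required parameter: {key}")
    if issues:
        return {"valid": False, "issues": issues}
    # Tier 2: constraint — operation must be on the whitelist.
    if params["operation"] not in ALLOWED_OPS:
        issues.append(f"operation not allowed: {params['operation']}")
    # Tier 3: isolation — resolve '..' and symlinks, then confine the
    # result to the workspace root to block path-traversal injection.
    resolved = os.path.realpath(params["path"])
    if resolved != ALLOWED_ROOT and not resolved.startswith(ALLOWED_ROOT + os.sep):
        issues.append(f"path escapes workspace: {resolved}")
    return {"valid": not issues, "issues": issues}
```

A payload like `{"path": "/home/user/docs/../../../etc/passwd", "operation": "read"}` fails the isolation tier even though it passes the schema tier, which is the point of layering the checks.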

Configuration

Configuration via environment variables or .env file:

# Server
BEIGEBOX_SECURITY_HOST=0.0.0.0
BEIGEBOX_SECURITY_PORT=8001
BEIGEBOX_SECURITY_DEBUG=false

# RAG Poisoning Detection
BEIGEBOX_SECURITY_POISONING_DETECTION_ENABLED=true
BEIGEBOX_SECURITY_POISONING_SENSITIVITY=medium
BEIGEBOX_SECURITY_POISONING_BASELINE_WINDOW=1000

# MCP Parameter Validation
BEIGEBOX_SECURITY_PARAMETER_VALIDATION_ENABLED=true
BEIGEBOX_SECURITY_PARAMETER_VALIDATION_ALLOW_UNSAFE=false

# API Anomaly Detection
BEIGEBOX_SECURITY_ANOMALY_DETECTION_ENABLED=true
BEIGEBOX_SECURITY_ANOMALY_DETECTION_SENSITIVITY=medium
BEIGEBOX_SECURITY_ANOMALY_DETECTION_DB_PATH=./data/anomaly_baselines.db

# Memory Integrity Validation
BEIGEBOX_SECURITY_MEMORY_INTEGRITY_ENABLED=true
BEIGEBOX_SECURITY_MEMORY_INTEGRITY_STRICT_MODE=false
BEIGEBOX_SECURITY_MEMORY_INTEGRITY_KEY=your-secret-key-here
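
A deployment script or wrapper might read these variables with the standard library as sketched below; the helper function and the fallback defaults are illustrative assumptions, not the package's internals:

```python
import os

def load_security_settings() -> dict:
    # Read a few BeigeBox Security variables, falling back to the
    # values shown in the example above when a variable is unset.
    env = os.environ
    return {
        "host": env.get("BEIGEBOX_SECURITY_HOST", "0.0.0.0"),
        "port": int(env.get("BEIGEBOX_SECURITY_PORT", "8001")),
        "debug": env.get("BEIGEBOX_SECURITY_DEBUG", "false").lower() == "true",
        "poisoning_sensitivity": env.get(
            "BEIGEBOX_SECURITY_POISONING_SENSITIVITY", "medium"
        ),
    }

settings = load_security_settings()
print(settings["host"], settings["port"])
```

Note that boolean variables arrive as strings (`"true"`/`"false"`) and ports as strings, so explicit parsing is needed.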

Integration

With BeigeBox LLM Proxy

BeigeBox Security integrates seamlessly with the BeigeBox LLM proxy:

# In your BeigeBox config
security:
  poisoning_detection:
    enabled: true
  mcp_validator:
    enabled: true
  api_anomaly:
    enabled: true
  memory_integrity:
    enabled: true

Standalone

Use as a standalone security microservice for any LLM/RAG application:

import httpx

# Your application
async def protect_rag_request(embeddings):
    async with httpx.AsyncClient() as client:
        response = await client.post(
            "http://localhost:8001/v1/security/poisoning/detect",
            json={"embeddings": embeddings}
        )
        findings = response.json()
        
        if any(findings["poisoned"]):
            # Handle poisoning
            return "BLOCKED: Suspicious embeddings detected"
        
        return "ALLOWED"

API Documentation

Once the server is running, use the beigebox-security docs command shown above to view the interactive API documentation.

Development

# Install dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Run tests with coverage
pytest --cov=beigebox_security

# Format code
black beigebox_security tests

# Lint
ruff check beigebox_security tests

# Type checking
mypy beigebox_security

Contributing

Contributions welcome! Please:

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Add tests
  5. Submit a pull request

See CONTRIBUTING.md for details.

Licensing

Apache License 2.0 — See LICENSE for details.

Research

BeigeBox Security is based on peer-reviewed research in LLM security.
