
Security and moderation tools for the Jazzmine AI ecosystem


Jazzmine Security

Production-ready security and moderation toolkit for AI applications

Python 3.10+ | License: MIT

Jazzmine Security provides a comprehensive suite of tools for protecting AI applications from malicious inputs, toxic outputs, and unsafe content. Built with performance in mind, it combines Python flexibility with Rust speed through optimized bindings.

Features

Input Moderation

  • Jailbreak Detection: Identify and block prompt injection attacks
  • Toxic Content Detection: Multi-class toxicity classification with SHAP explainability
  • Batch Processing: High-throughput classification with GPU acceleration
  • HuggingFace Integration: Load pre-trained models directly from the Hub

Output Moderation

  • Response Validation: Ensure AI-generated content meets safety guidelines
  • Chunk-based Analysis: Handle long-form content with intelligent chunking
  • Confidence Scoring: Get detailed confidence metrics for each prediction

Content Sanitization

  • PDF Sanitization: Remove JavaScript, embedded files, and malicious content
  • CSV Sanitization: Prevent formula injection and XSS attacks
  • HTML Sanitization: Strip dangerous tags and attributes while preserving content
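Formula injection abuses cells that begin with characters such as `=`, `+`, `-`, or `@`, which spreadsheet software interprets as executable formulas when the file is opened. As an illustration of the idea (a minimal sketch, not the library's actual implementation), a neutralizer can prefix risky cells with a single quote so they are treated as plain text:

```python
import csv
import io

# Characters that can trigger formula evaluation in spreadsheet software
FORMULA_PREFIXES = ("=", "+", "-", "@", "\t", "\r")

def neutralize_cell(cell: str) -> str:
    """Prefix risky cells with a single quote so spreadsheets treat them as text."""
    if cell.startswith(FORMULA_PREFIXES):
        return "'" + cell
    return cell

def sanitize_csv_text(csv_text: str) -> str:
    """Read CSV text, neutralize every cell, and write it back out."""
    reader = csv.reader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.writer(out)
    for row in reader:
        writer.writerow([neutralize_cell(cell) for cell in row])
    return out.getvalue()

print(sanitize_csv_text("name,total\nalice,=SUM(A1:A9)\n"))
```

Note that the `-` prefix also catches negative numbers; that is deliberate, since `-2+3+cmd|...` style payloads are a known attack vector.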

Performance

  • Rust-Powered: Critical text processing operations accelerated with Rust
  • GPU Support: Automatic CUDA acceleration when available
  • Async Support: Non-blocking operations for high-concurrency environments
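The moderation calls shown later in this README are synchronous; in an async service you can keep the event loop responsive by pushing a blocking `classify()` call onto a worker thread. A minimal sketch with a stand-in classifier (substitute `moderator.classify` in real code):

```python
import asyncio

def blocking_classify(text: str) -> tuple[str, float]:
    """Stand-in for a blocking call such as moderator.classify(text)."""
    return ("LABEL_0", 0.99)

async def classify_async(text: str) -> tuple[str, float]:
    # asyncio.to_thread runs the blocking call in a worker thread,
    # leaving the event loop free to serve other requests.
    return await asyncio.to_thread(blocking_classify, text)

async def main() -> None:
    results = await asyncio.gather(*(classify_async(t) for t in ["hi", "hello"]))
    print(results)

asyncio.run(main())
```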

Installation

From PyPI (Recommended)

pip install jazzmine-security

With GPU Support

pip install jazzmine-security torch --index-url https://download.pytorch.org/whl/cu121

From Source

git clone https://github.com/yourorg/jazzmine-security.git
cd jazzmine-security
pip install .

Quick Start

Input Moderation

from jazzmine.security import JazzmineInputModerator
from jazzmine.logging import ConsoleLogger

# Initialize with HuggingFace model
logger = ConsoleLogger()
moderator = JazzmineInputModerator(
    "nourmedini1/jazzmine-input-safeguard-v2",
    logger=logger
)

# Classify single input
text = "How can I hack into a system?"
label, confidence = moderator.classify(text)

if label == "LABEL_1":  # Toxic/Jailbreak detected
    print(f"Warning: Blocked - Confidence {confidence:.2%}")
else:
    print(f"Safe: Confidence {confidence:.2%}")

# Batch processing
requests = [
    {"text": "Tell me a joke"},
    {"text": "How to bypass security"},
    {"text": "What's the weather like?"}
]
results = moderator.classify_batch(requests, batch_size=32)

Output Moderation

from jazzmine.security import JazzmineOutputModerator

# Initialize output validator
output_mod = JazzmineOutputModerator(
    "nourmedini1/jazzmine-response-validator-v2"
)

# Validate AI response
ai_response = "Here's how to create a secure password..."
label, confidence = output_mod.classify(ai_response)

if label == "LABEL_1":  # Unsafe content
    print("Response blocked due to safety concerns")
else:
    print("Response approved")

Content Sanitization

from jazzmine.security import (
    JazzminePDFSanitizer,
    JazzmineCSVSanitizer,
    JazzmineHTMLSanitizer
)

# Sanitize PDF
pdf_sanitizer = JazzminePDFSanitizer()
safe_pdf = pdf_sanitizer.sanitize("document.pdf")

# Sanitize CSV (prevent formula injection)
csv_sanitizer = JazzmineCSVSanitizer()
safe_csv = csv_sanitizer.sanitize("data.csv")

# Sanitize HTML
html_sanitizer = JazzmineHTMLSanitizer()
safe_html = html_sanitizer.sanitize("<script>alert('xss')</script><p>Safe content</p>")
# Output: "<p>Safe content</p>"

Toxicity Detection with Explainability

from jazzmine.security.toxic_content_detector import JazzmineToxicityDetector

# Initialize detector
detector = JazzmineToxicityDetector()

# Train on your data
detector.train(
    csv_path="training_data.csv",
    text_column="text",
    label_column="is_toxic"
)

# Make predictions
text = "This is a test message"
prediction = detector.predict(text)
print(f"Toxic: {prediction['is_toxic']}")
print(f"Confidence: {prediction['confidence']:.2%}")

# Get SHAP explanations
explanation = detector.explain(text, num_samples=100)
print(f"Top contributing features: {explanation['top_features']}")

Architecture

Jazzmine Security is built with a hybrid Python-Rust architecture:

  • Python Layer: High-level APIs, model management, ML workflows
  • Rust Layer: Text normalization, TF-IDF extraction, semantic analysis
  • HuggingFace Integration: Seamless model loading and caching
  • PyO3 Bindings: Zero-copy data transfer between Python and Rust

Models

Pre-trained Models on HuggingFace

  • Input Safeguard: nourmedini1/jazzmine-input-safeguard-v2

    • Detects jailbreaks, prompt injections, and malicious inputs
    • Fine-tuned on diverse attack patterns
  • Response Validator: nourmedini1/jazzmine-response-validator-v2

    • Validates AI-generated content for safety
    • Identifies unsafe, biased, or harmful outputs

Custom Models

You can train and use your own models:

from jazzmine.security.toxic_content_detector import JazzmineToxicityDetector

detector = JazzmineToxicityDetector()
detector.train("your_data.csv", text_column="text", label_column="label")
detector.save("my_custom_model")

# Later use
detector = JazzmineToxicityDetector()
detector.load("my_custom_model")

Configuration

Logging Integration

from jazzmine.logging import BaseLogger
from jazzmine.security import JazzmineInputModerator

class MyLogger(BaseLogger):
    def info(self, message: str, **kwargs):
        print(f"[INFO] {message}: {kwargs}")

moderator = JazzmineInputModerator(
    "nourmedini1/jazzmine-input-safeguard-v2",
    logger=MyLogger()
)

GPU Configuration

import torch

# Check GPU availability
if torch.cuda.is_available():
    print(f"Using GPU: {torch.cuda.get_device_name(0)}")
else:
    print("Using CPU")

# Models automatically use GPU when available

Chunking Configuration

moderator = JazzmineInputModerator("model-name")

# Adjust chunk size for long texts
moderator.chunk_size = 512  # tokens
moderator.overlap = 50      # token overlap between chunks
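For intuition, overlapping chunking slides a fixed-size window over the token sequence, stepping by `chunk_size - overlap` so adjacent chunks share context across the boundary. A simplified sketch (the library's actual chunker may tokenize and split differently):

```python
def chunk_tokens(tokens: list[str], chunk_size: int = 512,
                 overlap: int = 50) -> list[list[str]]:
    """Split a token list into overlapping windows of at most chunk_size tokens."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        if start + chunk_size >= len(tokens):
            break  # final window already covers the tail
    return chunks

tokens = [f"tok{i}" for i in range(1200)]
chunks = chunk_tokens(tokens, chunk_size=512, overlap=50)
print(len(chunks), len(chunks[0]))  # 3 512
```

Each chunk can then be classified independently and the per-chunk scores aggregated (e.g. max over chunks) into a single verdict.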

Testing

# Run all tests
pytest tests/

# Run with coverage
pytest --cov=jazzmine.security tests/

# Run specific test file
pytest tests/test_input_moderator.py

Performance

Benchmarks on an NVIDIA RTX 3090:

Operation                      Throughput      Latency (p50)   Latency (p99)
Input Moderation (batch=32)    450 texts/sec   71 ms           120 ms
Output Validation (batch=32)   420 texts/sec   76 ms           130 ms
Toxicity Detection             800 texts/sec   1.2 ms          5 ms
PDF Sanitization               15 docs/sec     65 ms           150 ms
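Numbers like these depend on hardware, batch size, and model, so it is worth reproducing them on your own setup. A generic harness that times repeated calls and reports p50/p99 latency, shown here with a stand-in workload (swap in e.g. `lambda t: moderator.classify(t)`):

```python
import statistics
import time

def benchmark(fn, payload, runs: int = 200) -> dict:
    """Time repeated calls to fn(payload) and report p50/p99 latency in ms."""
    latencies = []
    for _ in range(runs):
        start = time.perf_counter()
        fn(payload)
        latencies.append((time.perf_counter() - start) * 1000.0)
    # statistics.quantiles with n=100 yields the 1st..99th percentiles
    percentiles = statistics.quantiles(latencies, n=100)
    return {
        "p50_ms": percentiles[49],
        "p99_ms": percentiles[98],
        "throughput_per_sec": 1000.0 * runs / sum(latencies),
    }

# Stand-in workload; replace with a real moderation call
report = benchmark(lambda text: text.lower(), "sample input")
print(report)
```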

Contributing

We welcome contributions! Please see our Contributing Guide for details.

# Setup development environment
git clone https://github.com/yourorg/jazzmine-security.git
cd jazzmine-security
pip install -e ".[dev]"

# Build Rust components
cd bindings
maturin develop --release

# Run tests
pytest tests/

License

This project is licensed under the MIT License - see the LICENSE file for details.

Roadmap

  • Multi-language support (French, Arabic, Spanish)
  • Real-time monitoring dashboard
  • Additional sanitizers (JSON, XML, Markdown)
  • Model distillation for edge deployment
  • Integration with popular LLM frameworks (LangChain, LlamaIndex)

Made with care by the Jazzmine Team
