
Aisert 🚀

Assert-style validation library for AI outputs - ensure your LLMs behave exactly as expected

✨ Validate AI responses with confidence!
🔗 Fluent, chainable API for comprehensive AI output validation
🎯 From token counts to semantic similarity - production-ready validation

⚠️ Alpha Release - Currently in alpha. Feedback welcome!

Who Is This For? 👥

AI/ML Engineers - Just like you use assert statements in unit tests, use Aisert to validate LLM outputs in production
QA Engineers - Automated testing for AI responses, similar to how Selenium tests web UIs
DevOps Teams - Monitor AI model performance and catch regressions, like APM tools for traditional apps
Product Teams - Ensure AI features meet quality standards before reaching users

Features

  • Fluent Interface: Chain multiple validations with a beautiful, readable API
  • Multiple Validators: Schema, content, token count, and semantic similarity validation
  • Flexible Modes: Strict mode (raises exceptions) or non-strict mode (collects results)
  • Thread-Safe: Production-ready with proper concurrency handling and model caching
  • Multi-Provider Support: OpenAI, Anthropic, HuggingFace, and Google token counting
  • Extensible: Custom token validators via TokenValidatorBase inheritance
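
The exact `TokenValidatorBase` interface isn't documented here, so the sketch below is an illustration of the extensibility idea only: the base-class name comes from the feature list, but the stand-in definition and the `count_tokens` signature are assumptions, not Aisert's real API.

```python
from abc import ABC, abstractmethod

# Stand-in for aisert's TokenValidatorBase; the real interface may differ.
class TokenValidatorBase(ABC):
    @abstractmethod
    def count_tokens(self, text: str) -> int:
        """Return the number of tokens in `text`."""

class WhitespaceTokenValidator(TokenValidatorBase):
    """Toy provider that approximates tokens by whitespace splitting."""

    def count_tokens(self, text: str) -> int:
        return len(text.split())

validator = WhitespaceTokenValidator()
print(validator.count_tokens("Paris is the capital of France."))  # 6
```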

Prerequisites & Setup

System Requirements

  • Python >= 3.9
  • 1GB+ RAM (for semantic models)
  • 500MB+ disk space (model downloads)

API Keys (for token counting)

# Set environment variables for your providers
export OPENAI_API_KEY="your-openai-key"
export ANTHROPIC_API_KEY="your-anthropic-key"

Installation

pip install aisert

Quick Start

from aisert import Aisert, AisertConfig

# Configure for your AI model
config = AisertConfig(
    token_model="gpt-3.5-turbo",
    model_provider="openai"
)

# Validate AI response with fluent interface
result = (
    Aisert(content="Paris is the capital of France.", config=config)
    .assert_contains(["Paris", "France"])
    .assert_tokens(max_tokens=50)
    .assert_semantic_matches("France's capital", threshold=0.8)
    .collect()
)

print(f"Validation passed: {result.status}")
print(f"Details: {result.rules}")

Validation Types

Content Validation

# Check if response contains specific items
Aisert(content).assert_contains(["keyword1", "keyword2"])

# Check if response doesn't contain items
Aisert(content).assert_not_contains(["spam", "inappropriate"])

Token Count Validation

# Ensure response is within token limits (requires API call)
Aisert(content, config).assert_tokens(max_tokens=100)

Schema Validation

# Validate with Pydantic model
from pydantic import BaseModel

class UserModel(BaseModel):
    name: str
    age: int

Aisert(json_content).assert_schema(UserModel)

Semantic Similarity

# Check semantic similarity (first run loads model, then fast)
# Loading time varies by model - use lightweight models for speed
Aisert(content).assert_semantic_matches(
    expected="Information about artificial intelligence",
    threshold=0.75
)

Validation Patterns

Content Moderation (Instant)

# Fast validation - no model loading required
result = Aisert(content).assert_not_contains(["spam", "inappropriate"]).collect()

API Response Validation (Comprehensive)

# Full validation with all validators
result = (
    Aisert(content, config)
    .assert_contains(["required_info"])
    .assert_tokens(max_tokens=100)
    .assert_semantic_matches("expected meaning", 0.8)
    .collect()
)

Performance-Optimized (Selective)

# Use only needed validators for optimal performance
result = (
    Aisert(content, config)
    .assert_not_contains(["inappropriate"])
    .assert_tokens(max_tokens=200)
    .collect()
)

Validation Modes

Strict Mode (Default)

Raises exceptions immediately when validation fails:

try:
    Aisert(content).assert_contains(["required_term"])
except AisertError as e:
    print(f"Validation failed: {e}")

Non-Strict Mode

Collects all validation results without raising exceptions:

result = (
    Aisert(content)
    .assert_contains(["term1"], strict=False)
    .assert_tokens(100, strict=False)
    .collect()
)

if not result.status:
    print("Some validations failed:", result.rules)

Configuration

from aisert import AisertConfig

config = AisertConfig(
    token_model="gpt-4",           # Model for token counting
    model_provider="openai",       # Provider: "openai", "anthropic", "huggingface", "google"
    token_encoding=None,           # Custom encoding (OpenAI only)
    sentence_transformer_model="all-MiniLM-L6-v2"  # Semantic similarity model
)

Real-World Examples

API Response Validation

def validate_chatbot_response(response_text):
    return (
        Aisert(response_text, config)
        .assert_not_contains(["inappropriate", "harmful"])
        .assert_tokens(max_tokens=500)
        .assert_semantic_matches("helpful customer service", 0.7)
        .collect()
    )

Content Moderation

def moderate_content(user_content):
    moderation_result = (
        Aisert(user_content)
        .assert_not_contains(["spam", "offensive"], strict=False)
        .assert_tokens(max_tokens=1000, strict=False)
        .collect()
    )
    
    return moderation_result.status

Batch Processing

def validate_multiple_responses(responses):
    results = []
    for response in responses:
        result = (
            Aisert(response, config)
            .assert_contains(["required_info"], strict=False)
            .assert_tokens(200, strict=False)
            .collect()
        )
        results.append(result)
    return results

Exception Handling

Aisert provides specific exceptions for different validation types:

from aisert import AisertError
from aisert.exception import (
    SchemaValidationError,    # Schema validation failures
    ContainsValidationError,  # Content validation failures
    TokenValidationError,     # Token count validation failures
    SemanticValidationError   # Semantic similarity failures
)

try:
    Aisert(content).assert_schema(UserModel)
except SchemaValidationError as e:
    print(f"Schema validation failed: {e}")
except AisertError as e:
    print(f"General validation error: {e}")

Performance Notes 📝

  • First Run: Semantic validation is slower initially (model loading time varies by model) ⏳
  • Subsequent Runs: All validations are fast (<100ms) ⚡
  • Selective Usage: Use only needed validators for optimal performance 🎯
  • Model Caching: Models cached after first load for 10x+ speedup 🚀
  • Thread Safety: All validators use singleton pattern with proper locking 🔒
  • Default Config: Uses OpenAI gpt-3.5-turbo and all-MiniLM-L6-v2 by default ⚙️
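
The model-caching behavior described above can be illustrated with a small sketch; this is not Aisert's actual internals, just the load-once-then-reuse pattern that makes repeat validations fast.

```python
import functools

@functools.lru_cache(maxsize=None)
def load_model(name: str):
    """Expensive load happens once per model name; later calls hit the cache."""
    return {"name": name}  # placeholder for a real sentence-transformer load

a = load_model("all-MiniLM-L6-v2")
b = load_model("all-MiniLM-L6-v2")
print(a is b)  # True: the second call returned the cached object
```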

Troubleshooting 🔧

Common Issues

Model Loading Timeout

# Use lightweight model for faster loading
config = AisertConfig(
    sentence_transformer_model="paraphrase-MiniLM-L3-v2"  # Ultra-fast
)

API Key Errors

# Ensure environment variables are set
echo $OPENAI_API_KEY
echo $ANTHROPIC_API_KEY

Memory Issues

  • Semantic models use 100-500MB RAM
  • Use lightweight models on resource-constrained systems
  • Consider content-only validation for high-volume scenarios

Dependency Conflicts

# Install in clean environment
python -m venv aisert-env
source aisert-env/bin/activate  # or aisert-env\Scripts\activate on Windows
pip install aisert

Current Limitations ⚠️

  • Schema Validation: Only supports Pydantic models and TypeAdapter, not raw JSON schemas
  • Semantic Models: Limited to sentence-transformers compatible models
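
Until raw JSON schemas are supported, one workaround (a sketch, not part of Aisert) is a minimal manual check of required keys and expected types before falling back to `assert_schema` with a Pydantic model:

```python
import json

def check_required(payload: str, required: dict) -> bool:
    """Minimal stand-in for raw JSON-schema validation: verify that each
    required key is present and holds the expected Python type."""
    data = json.loads(payload)
    return all(
        key in data and isinstance(data[key], expected_type)
        for key, expected_type in required.items()
    )

print(check_required('{"name": "Ada", "age": 36}', {"name": str, "age": int}))  # True
```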

Supported Providers 🌐

  • OpenAI: tiktoken-based encoding with model-specific tokenizers
  • Anthropic: Native anthropic client token counting
  • HuggingFace: AutoTokenizer from transformers library
  • Google: genai client integration (experimental)
  • Custom: Extend TokenValidatorBase for additional providers
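
For example, a HuggingFace setup might look like the fragment below; the `model_provider` value comes from the Configuration section above, but the model name is purely illustrative.

```python
from aisert import AisertConfig

# Token counting via transformers' AutoTokenizer; "gpt2" is an example model name
config = AisertConfig(
    token_model="gpt2",
    model_provider="huggingface"
)
```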

License

MIT License

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
