# 🔍 Hallucination Detector
Production-ready hallucination detection for LLM outputs.
Detect, measure, and prevent AI hallucinations with confidence. Works with any LLM - OpenAI, Anthropic, open-source models, and more.
## ✨ Features
- 🎯 **Multi-Strategy Detection** - Semantic analysis, entity verification, fact-checking
- ⚡ **Production Ready** - Async support, batch processing, caching
- 📊 **Detailed Reports** - JSON, HTML, and Markdown output formats
- 🔌 **Zero Dependencies** - Core works standalone, optional enhancements available
- 🎨 **Simple API** - One line to detect, full control when needed
## 🚀 Quick Start

### Installation

```bash
# Basic installation (no dependencies)
pip install hallucination-detector

# Full installation with all features
pip install "hallucination-detector[full]"
```
### Basic Usage

```python
from hallucination_detector import HallucinationDetector

# Initialize detector
detector = HallucinationDetector()

# Detect hallucinations
result = detector.detect(
    response="The Eiffel Tower is located in Berlin, Germany.",
    context="The Eiffel Tower is a famous landmark in Paris, France."
)

# Check results
print(result.is_hallucination)  # True
print(result.confidence)        # 0.95
print(result.summary())
# ⚠️ 1 hallucination(s) detected
# Confidence: 95.0%
# Claims: 0/1 verified
```
## 📖 Documentation

### Detection Methods

#### Standard Detection

```python
from hallucination_detector import HallucinationDetector

detector = HallucinationDetector()

# With context
result = detector.detect(
    response="The CEO announced record profits of $5 billion.",
    context="The company reported annual revenue of $3 billion with modest growth."
)

# With source documents
result = detector.detect(
    response="According to the study, 80% of users prefer the new design.",
    sources=[
        "User survey results: 65% expressed preference for the new interface.",
        "Focus group findings indicated mixed reactions to the redesign."
    ]
)

# Domain-specific (stricter validation)
result = detector.detect(
    response="Take 500mg of aspirin daily for pain relief.",
    context="Recommended aspirin dosage is 75-100mg daily.",
    domain="medical",
    strict=True
)
```
#### Quick Check

```python
# Fast boolean check
if detector.quick_check(response, context):
    print("⚠️ Potential hallucination detected!")
```
#### Batch Processing

```python
# Process multiple items
results = detector.detect_batch([
    {"response": "...", "context": "..."},
    {"response": "...", "sources": ["...", "..."]},
    {"response": "...", "context": "...", "domain": "legal"},
])

for i, result in enumerate(results):
    print(f"Item {i}: {'❌' if result.is_hallucination else '✅'}")
```
### Async Support

```python
import asyncio

from hallucination_detector import AsyncHallucinationDetector

async def check_responses():
    detector = AsyncHallucinationDetector()

    # Single detection
    result = await detector.detect(response, context)

    # Batch detection
    results = await detector.detect_batch(items)
    return results

# Run
results = asyncio.run(check_responses())
```
### Configuration

```python
from hallucination_detector import HallucinationDetector, DetectorConfig

# Custom configuration
config = DetectorConfig(
    # Detection settings
    confidence_threshold=0.8,  # Higher = stricter
    min_claim_length=10,       # Minimum characters for a claim

    # Validation toggles
    enable_entity_validation=True,
    enable_temporal_validation=True,
    enable_numeric_validation=True,
    enable_semantic_validation=True,

    # Performance
    batch_size=32,
    max_workers=4,
    cache_embeddings=True,

    # Output
    include_explanations=True,
    include_suggestions=True,
    verbose=True
)

detector = HallucinationDetector(config=config)
```
### Understanding Results

```python
result = detector.detect(response, context)

# Basic info
result.is_hallucination  # bool - Any hallucinations found?
result.confidence        # float - Overall confidence (0-1)
result.total_claims      # int - Total claims extracted
result.verified_claims   # int - Claims that passed verification

# Detailed breakdown
result.hallucination_rate  # float - Percentage of hallucinated claims
result.severity_breakdown  # dict - Count by severity level
result.type_breakdown      # dict - Count by hallucination type

# Individual hallucinations
for h in result.hallucinations:
    print(f"Text: {h.text}")
    print(f"Type: {h.hallucination_type.name}")
    print(f"Severity: {h.severity.name}")
    print(f"Confidence: {h.confidence:.0%}")
    print(f"Explanation: {h.explanation}")
    print(f"Suggestion: {h.suggested_correction}")
```
### Hallucination Types

| Type | Description |
|---|---|
| `FACTUAL_ERROR` | Incorrect facts |
| `ENTITY_ERROR` | Wrong names, places, organizations |
| `TEMPORAL_ERROR` | Incorrect dates or times |
| `NUMERIC_ERROR` | Wrong numbers or statistics |
| `ATTRIBUTION_ERROR` | Misattributed quotes or claims |
| `FABRICATION` | Completely made-up information |
| `CONTRADICTION` | Self-contradicting statements |
| `CONTEXT_DRIFT` | Response deviates from context |
| `UNSUPPORTED_CLAIM` | Claims without evidence |
| `EXAGGERATION` | Overstated facts |
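As a sketch of how these type names might be used downstream, the snippet below filters detections to just the types a given application cares about. The helper and the plain dicts are illustrative stand-ins, not part of the package; the real library exposes the type via `h.hallucination_type.name` as shown above.

```python
# Hypothetical post-filter keyed on the type names from the table above.
# Plain dicts stand in for the library's hallucination objects.
NUMERIC_TYPES = {"NUMERIC_ERROR", "TEMPORAL_ERROR"}

def only_types(hallucinations, wanted):
    """Keep only detections whose type name is in `wanted`."""
    return [h for h in hallucinations if h["type"] in wanted]

detected = [
    {"type": "ENTITY_ERROR", "text": "Berlin"},
    {"type": "NUMERIC_ERROR", "text": "$5 billion"},
    {"type": "TEMPORAL_ERROR", "text": "in 1999"},
]
print([h["text"] for h in only_types(detected, NUMERIC_TYPES)])
# ['$5 billion', 'in 1999']
```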
### Severity Levels

| Level | Description |
|---|---|
| `LOW` | Minor inaccuracy, unlikely to cause issues |
| `MEDIUM` | Noticeable error, may mislead users |
| `HIGH` | Significant error, likely to cause problems |
| `CRITICAL` | Severe error, dangerous misinformation |
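One way to act on these levels is a simple triage policy: block on `CRITICAL`, queue `MEDIUM` and `HIGH` for review, pass everything else. The policy below is a hypothetical sketch, not library behavior; the dicts stand in for the documented objects, whose level is read via `h.severity.name`.

```python
# Illustrative triage over the severity names from the table above.
BLOCK = {"CRITICAL"}
REVIEW = {"HIGH", "MEDIUM"}

def triage(hallucinations):
    """Return 'block', 'review', or 'pass' for a list of detections."""
    severities = {h["severity"] for h in hallucinations}
    if severities & BLOCK:
        return "block"
    if severities & REVIEW:
        return "review"
    return "pass"

print(triage([{"severity": "LOW"}]))                        # pass
print(triage([{"severity": "LOW"}, {"severity": "HIGH"}]))  # review
print(triage([{"severity": "CRITICAL"}]))                   # block
```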
### Generating Reports

```python
from hallucination_detector import (
    DetectionReport,
    JSONReporter,
    HTMLReporter
)

# Create a report
report = DetectionReport(title="Daily Hallucination Check")

# Add results
for item in items:
    result = detector.detect(item["response"], item["context"])
    report.add_result(result)

# Print summary
print(report.summary())

# Export to JSON
json_reporter = JSONReporter(pretty=True)
json_reporter.save(report, "report.json")

# Export to HTML
html_reporter = HTMLReporter(theme="dark")
html_reporter.save(report, "report.html")
```
## 🔧 Advanced Usage

### Custom Validators

```python
from hallucination_detector import HallucinationDetector
from hallucination_detector.validators import FactValidator

# Add custom fact patterns
validator = FactValidator()
validator.add_false_pattern(r"COVID-19 is caused by 5G")

# Use in detector
detector = HallucinationDetector()
detector.fact_validator = validator
```
### Semantic Analysis

```python
from hallucination_detector.analyzers import SemanticAnalyzer

analyzer = SemanticAnalyzer(
    model_name="all-MiniLM-L6-v2",
    use_gpu=True,
    cache_embeddings=True
)

# Compute similarity
similarity = analyzer.compute_similarity(text1, text2)

# Check for contradictions
is_contradiction = analyzer.is_contradiction(claim, context)

# Find similar passages
matches = analyzer.find_most_similar(query, candidates, top_k=5)
```
### Entity Validation

```python
from hallucination_detector.analyzers import EntityAnalyzer

analyzer = EntityAnalyzer()

# Extract entities
entities = analyzer.extract("John works at Google in New York.")
for entity in entities:
    print(f"{entity.text}: {entity.entity_type}")
# John: PERSON
# Google: ORG
# New York: GPE

# Verify against context
is_valid = analyzer.verify_against_context(entity, context_entities)
```
## 🏗️ Integration Examples

### With OpenAI

```python
import openai

from hallucination_detector import HallucinationDetector

detector = HallucinationDetector()

# Get completion (legacy openai<1.0 interface shown)
response = openai.ChatCompletion.create(
    model="gpt-4",
    messages=[
        {"role": "system", "content": context},
        {"role": "user", "content": question}
    ]
)

# Check for hallucinations
result = detector.detect(
    response=response.choices[0].message.content,
    context=context
)

if result.is_hallucination:
    print("⚠️ Response may contain hallucinations")
    print(result.summary())
```
### With LangChain

```python
from langchain.llms import OpenAI

from hallucination_detector import HallucinationDetector

llm = OpenAI()
detector = HallucinationDetector()

def safe_generate(prompt: str, context: str) -> str:
    response = llm(prompt)
    result = detector.detect(response=response, context=context)

    if result.is_hallucination and result.confidence > 0.8:
        # Retry or flag for review
        return f"[FLAGGED] {response}"
    return response
```
### As Middleware

```python
from fastapi import FastAPI

from hallucination_detector import HallucinationDetector

app = FastAPI()
detector = HallucinationDetector()

@app.post("/generate")
async def generate(prompt: str, context: str):
    # Generate response (your LLM call)
    response = await generate_llm_response(prompt)

    # Check for hallucinations
    result = detector.detect(response=response, context=context)

    return {
        "response": response,
        "hallucination_check": {
            "passed": not result.is_hallucination,
            "confidence": result.confidence,
            "issues": len(result.hallucinations)
        }
    }
```
## 📊 Performance Tips

- **Enable Caching**: Set `cache_embeddings=True` for repeated texts
- **Use Batch Processing**: Process multiple items together
- **Adjust Thresholds**: Lower thresholds for faster, less accurate checks
- **Use Quick Check**: For high-volume, boolean-only needs
- **GPU Acceleration**: Enable `use_gpu=True` with sentence-transformers
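The batch-processing tip amounts to grouping items before handing them to `detect_batch`. A generic chunking helper can sketch this; it is not part of the library, and `batch_size` here simply mirrors the `DetectorConfig` setting shown earlier.

```python
def chunked(items, batch_size=32):
    """Yield successive slices of at most batch_size items."""
    for i in range(0, len(items), batch_size):
        yield items[i:i + batch_size]

# 70 items split into batches of 32
batches = list(chunked(list(range(70)), batch_size=32))
print([len(b) for b in batches])  # [32, 32, 6]
```

Each batch could then be passed to `detector.detect_batch(batch)` in turn, keeping memory bounded for large workloads.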
## 🤝 Contributing

Contributions are welcome! Please read our Contributing Guide.

```bash
# Clone repository
git clone https://github.com/pranaym/hallucination-detector
cd hallucination-detector

# Install dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Format code
black src tests
isort src tests
```
## 📄 License

MIT License - see LICENSE for details.
## 🙏 Acknowledgments

Created by Pranay M

Found this useful? Give it a ⭐ on GitHub!