Skip to main content

Gaussia - AI evaluation framework for measuring fairness, quality, and safety of AI models and assistants

Project description

Gaussia

AI evaluation framework for measuring fairness, quality, and safety of AI models and assistants.

Installation

pip install gaussia

With specific metric dependencies:

pip install gaussia[toxicity]    # Toxicity analysis
pip install gaussia[bias]        # Bias detection
pip install gaussia[metrics]     # All metrics
pip install gaussia[all]         # Everything

Quick Start

from gaussia import Retriever, Dataset, Batch
from gaussia.metrics import Context

# 1. Define your data source
class MyRetriever(Retriever):
    def load_dataset(self) -> list[Dataset]:
        return [
            Dataset(
                session_id="session-1",
                assistant_id="assistant-1",
                language="en",
                context="France is a country in Western Europe.",
                conversation=[
                    Batch(
                        qa_id="q1",
                        query="Where is France?",
                        assistant="France is located in Western Europe.",
                        ground_truth_assistant="France is a country in Western Europe.",
                    )
                ],
            )
        ]

# 2. Run a metric
metrics = Context.run(retriever=MyRetriever())

Metrics

Metric Description Install extra
Context Evaluates response alignment with provided context
Conversational Dialogue quality via Grice's maxims (memory, language, quality, quantity, relation, manner)
BestOf King-of-the-hill tournament comparison of multiple assistants
Agentic Agent evaluation with pass@K and tool correctness
Toxicity Cluster-based toxicity profiling with demographic and sentiment analysis [toxicity]
Bias Bias detection across protected attributes using guardians [bias]
Humanity Emotion, empathy, and human-like quality analysis [humanity]
Regulatory Compliance evaluation against regulatory documents [regulatory]
VisionSimilarity VLM description comparison via semantic similarity [vision]
VisionHallucination Hallucination detection in VLM outputs [vision]

Features

Guardians

Pluggable bias detection backends:

from gaussia.guardians import IBMGraniteGuardian, LLamaGuardGuardian

metrics = Bias.run(retriever=MyRetriever(), guardian=IBMGraniteGuardian())

Statistical Modes

Choose between frequentist and Bayesian aggregation:

from gaussia import FrequentistMode, BayesianMode

metrics = Context.run(retriever=MyRetriever(), statistical_mode=FrequentistMode())
metrics = Context.run(retriever=MyRetriever(), statistical_mode=BayesianMode())

Synthetic Data Generation

Generate evaluation datasets from documents:

from gaussia.generators import BaseGenerator, create_markdown_loader

loader = create_markdown_loader(path="./docs")
generator = BaseGenerator(context_loader=loader)
datasets = generator.generate()

Explainability

Token-level attribution analysis:

from gaussia.explainability import AttributionExplainer

explainer = AttributionExplainer(method="lime")
attributions = explainer.explain(text="Your input text")

Prompt Optimization

Optimize prompts using evolutionary and multi-objective strategies:

from gaussia.prompt_optimizer import GEPAOptimizer, MIPROv2Optimizer

Documentation

Full documentation available at docs.gaussia.ai.

Requirements

  • Python >= 3.11

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gaussia-1.0.0b1.tar.gz (770.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gaussia-1.0.0b1-py3-none-any.whl (813.4 kB view details)

Uploaded Python 3

File details

Details for the file gaussia-1.0.0b1.tar.gz.

File metadata

  • Download URL: gaussia-1.0.0b1.tar.gz
  • Upload date:
  • Size: 770.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gaussia-1.0.0b1.tar.gz
Algorithm Hash digest
SHA256 c16372d30acc2c1f2b6fecb9ac9d070351f4c213668c094da8da5b64416d643a
MD5 704aedba4a81d022b6d37b05f44d0d29
BLAKE2b-256 a809c355ecd3b8d510de583ed74555440bc97b7c59f736cb950f39d612c9a9d4

See more details on using hashes here.

File details

Details for the file gaussia-1.0.0b1-py3-none-any.whl.

File metadata

  • Download URL: gaussia-1.0.0b1-py3-none-any.whl
  • Upload date:
  • Size: 813.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gaussia-1.0.0b1-py3-none-any.whl
Algorithm Hash digest
SHA256 a9302d7302d31edb4d0e655aa5c9d2c9544f7cf1fde098b6b40890b7a435ae1f
MD5 5c463fc91e5405dcc7f413e7d507153c
BLAKE2b-256 36420471fccc0f3c1e7ac254a1d5568d3757886b5534f18471da5dccf7f31d6e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page