Python library for evaluating LLM outputs across multiple ethical dimensions and performance metrics using Azure AI Evaluation services.

These details have not been verified by PyPI

Project description

RAIT Connector

Python library for evaluating LLM outputs across multiple ethical dimensions and performance metrics using Azure AI Evaluation services.

Features

22 Evaluation Metrics across 8 ethical dimensions
Parallel Execution for faster evaluations
Automatic API Integration with RAIT services
Type-Safe with Pydantic models
Flexible Configuration via environment variables or direct parameters
Batch Processing with custom callbacks
Comprehensive Documentation with examples

Installation

pip install rait-connector

Or with uv:

uv add rait-connector

Quick Start

from rait_connector import RAITClient

# Initialize client
client = RAITClient()

# Evaluate a single prompt
result = client.evaluate(
    prompt_id="123",
    prompt_url="https://example.com/123",
    timestamp="2025-12-11T10:00:00Z",
    model_name="gpt-4",
    model_version="1.0",
    query="What is AI?",
    response="AI is artificial intelligence...",
    environment="production",
    purpose="monitoring"
)

print(f"Evaluation complete: {result['prompt_id']}")

Configuration

Environment Variables

Set required environment variables:

# RAIT API
export RAIT_API_URL="https://api.raitracker.com"
export RAIT_CLIENT_ID="your-client-id"
export RAIT_CLIENT_SECRET="your-client-secret"

# Azure OpenAI
export AZURE_OPENAI_ENDPOINT="https://your.openai.azure.com"
export AZURE_OPENAI_API_KEY="your-api-key"
export AZURE_OPENAI_DEPLOYMENT="your-deployment"

# Azure AD
export AZURE_CLIENT_ID="your-azure-client-id"
export AZURE_TENANT_ID="your-azure-tenant-id"
export AZURE_CLIENT_SECRET="your-azure-client-secret"

# Azure Resources
export AZURE_SUBSCRIPTION_ID="your-subscription-id"
export AZURE_RESOURCE_GROUP="your-resource-group"
export AZURE_PROJECT_NAME="your-project-name"
export AZURE_ACCOUNT_NAME="your-account-name"

Direct Configuration

Or pass configuration directly:

client = RAITClient(
    rait_api_url="https://api.raitracker.com",
    rait_client_id="your-client-id",
    rait_client_secret="your-secret",
    azure_openai_endpoint="https://your.openai.azure.com",
    azure_openai_api_key="your-key",
    azure_openai_deployment="gpt-4",
    # ... other parameters
)

Evaluation Metrics

RAIT Connector supports 22 metrics across 8 ethical dimensions:

Dimension	Metrics
Bias and Fairness	Hate and Unfairness
Explainability and Transparency	Ungrounded Attributes, Groundedness, Groundedness Pro
Monitoring and Compliance	Content Safety
Legal and Regulatory Compliance	Protected Materials
Security and Adversarial Robustness	Code Vulnerability
Model Performance	Coherence, Fluency, QA, Similarity, F1 Score, BLEU, GLEU, ROUGE, METEOR, Retrieval
Human-AI Interaction	Relevance, Response Completeness
Social and Demographic Impact	Sexual, Violence, Self-Harm

Batch Evaluation

Evaluate multiple prompts efficiently:

prompts = [
    {
        "prompt_id": "001",
        "prompt_url": "https://example.com/001",
        "timestamp": "2025-12-11T10:00:00Z",
        "model_name": "gpt-4",
        "model_version": "1.0",
        "query": "What is AI?",
        "response": "AI is...",
        "environment": "production",
        "purpose": "monitoring"
    },
    # ... more prompts
]

summary = client.evaluate_batch(prompts)
print(f"Completed: {summary['successful']}/{summary['total']}")

With Custom Callback

def on_complete(summary):
    print(f"Success: {summary['successful']}")
    print(f"Failed: {summary['failed']}")

client.evaluate_batch(prompts, on_complete=on_complete)

Calibration

Automatic Background Calibration

When you call evaluate(), the client automatically:

Checks the API for calibration prompts
If available, runs calibration in the background (once per model/version/environment)
Evaluates calibration prompts with pre-defined responses

This happens automatically - no manual intervention needed!

Collect Calibration Responses

Optionally pass an invoke_model callback to collect responses from your model for calibration prompts:

def invoke_my_model(prompt_text: str) -> str:
    return my_llm.generate(prompt_text)

result = client.evaluate(
    prompt_id="123",
    prompt_url="https://example.com/123",
    timestamp="2025-12-11T10:00:00Z",
    model_name="gpt-4",
    model_version="1.0",
    query="What is AI?",
    response="AI is artificial intelligence...",
    environment="production",
    purpose="monitoring",
    invoke_model=invoke_my_model,  # Automatically collects calibration responses
)

Parallel Execution

Control parallelism for faster evaluations:

result = client.evaluate(
    ...,
    parallel=True,
    max_workers=10  # Use 10 parallel workers
)

Documentation

Full documentation is available in the docs/ directory:

Requirements

Python 3.12+
Azure OpenAI access
RAIT API credentials

Development

Setup

Clone the repository:

git clone https://github.com/Responsible-Systems/rait-connector.git
cd rait-connector

Install dependencies:

uv sync --dev

Install pre-commit hooks:

uv tool install pre-commit
pre-commit install

Project Documentation

Serve docs locally:

uv run mkdocs serve

Build docs:

uv run mkdocs build

Releasing a New Version

Bump the version:
```
uv version --bump <major|minor|patch>
```
Build the package:
```
uv build
```
Publish to PyPI:
```
uv publish --token <PYPI_TOKEN>
```

Commit the version change:

git commit -am "Release version <version>"

Create a git tag:
```
git tag -a <version> <commit-hash> -m "Release version <version>"
```
[!NOTE] Use the commit hash of the release commit created in step 4.
Push the commit and tag:
```
git push && git push --tags
```

Contributing

Contributions are welcome! Please read our contributing guidelines.

Support

For issues and questions:

GitHub Issues: https://github.com/Responsible-Systems/rait-connector/issues

Changelog

See CHANGELOG.md for release history.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.6.0

Apr 21, 2026

0.5.0

Feb 27, 2026

0.4.0

Feb 18, 2026

This version

0.3.0

Feb 16, 2026

0.2.0

Jan 27, 2026

0.1.0

Dec 22, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rait_connector-0.3.0.tar.gz (150.0 kB view details)

Uploaded Feb 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

rait_connector-0.3.0-py3-none-any.whl (28.9 kB view details)

Uploaded Feb 16, 2026 Python 3

File details

Details for the file rait_connector-0.3.0.tar.gz.

File metadata

Download URL: rait_connector-0.3.0.tar.gz
Upload date: Feb 16, 2026
Size: 150.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.13

File hashes

Hashes for rait_connector-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`21caed9d72790da13e0c7971055ebe29a3f95f6baa781e5425108e6fed3effaa`
MD5	`4bb055721b207454464c895cdfbb3513`
BLAKE2b-256	`a791c1a5a099235d5b9d79d0bb6f6a2fe84502f7737966d4537154943181df4b`

See more details on using hashes here.

File details

Details for the file rait_connector-0.3.0-py3-none-any.whl.

File metadata

Download URL: rait_connector-0.3.0-py3-none-any.whl
Upload date: Feb 16, 2026
Size: 28.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.13

File hashes

Hashes for rait_connector-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`760cb7a236344877c8c4a19f067ef7b3203f5b306f7734201fb25d8963795c89`
MD5	`646c33b03f4c76e766e713fceb7cccaf`
BLAKE2b-256	`55518dd7708d8585ab5aedcb978c80b45b918b8e4aab46d9dc127e9916adddc9`

See more details on using hashes here.

rait-connector 0.3.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

RAIT Connector

Features

Installation

Quick Start

Configuration

Environment Variables

Direct Configuration

Evaluation Metrics

Batch Evaluation

With Custom Callback

Calibration

Automatic Background Calibration

Collect Calibration Responses

Parallel Execution

Documentation

Requirements

Development

Setup

Project Documentation

Releasing a New Version

Contributing

Support

Changelog

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes