Qualifire Python SDK

These details have not been verified by PyPI

Project links

Project description

Qualifire Python SDK

Coverage

Evaluate LLM outputs for quality, safety, and reliability

Documentation · Dashboard · PyPI

Installation
Quick Start
Available Checks
Usage Examples
Configuration
Response Format

Installation

pip install qualifire

Quick Start

from qualifire.client import Client

client = Client(api_key="your_api_key")

result = client.evaluate(
    input="What is the capital of France?",
    output="The capital of France is Paris.",
    hallucinations_check=True,
)

print(f"Score: {result.score}")  # 0-100
print(f"Flagged: {result.evaluationResults[0].results[0].flagged}")

Available Checks

Check	Description
`hallucinations_check`	Detect factual inaccuracies or hallucinations
`grounding_check`	Verify output is grounded in the provided context
`pii_check`	Detect personally identifiable information
`prompt_injections`	Identify prompt injection attempts
`content_moderation_check`	Check for harmful content (harassment, hate speech, dangerous content, sexual content)
`tool_use_quality_check`	Evaluate quality of tool/function calls
`syntax_checks`	Validate output syntax (JSON, SQL, etc.)
`assertions`	Custom assertions to validate against the output

Usage Examples

Basic Input/Output Evaluation

result = client.evaluate(
    input="Summarize this document about climate change.",
    output="Climate change is primarily caused by human activities...",
    hallucinations_check=True,
    grounding_check=True,
)

Message-Based Evaluation

Evaluate full conversation histories using the OpenAI message format:

from qualifire.types import LLMMessage

result = client.evaluate(
    messages=[
        LLMMessage(role="system", content="You are a helpful assistant."),
        LLMMessage(role="user", content="What is the capital of France?"),
        LLMMessage(role="assistant", content="The capital of France is Paris."),
    ],
    hallucinations_check=True,
)

Multi-Turn Conversations

Enable multi-turn mode for evaluating conversation context:

result = client.evaluate(
    messages=[
        LLMMessage(role="user", content="What is 2 + 2?"),
        LLMMessage(role="assistant", content="2 + 2 equals 4."),
        LLMMessage(role="user", content="And if you add 3 more?"),
        LLMMessage(role="assistant", content="4 + 3 equals 7."),
    ],
    hallucinations_check=True,
    grounding_multi_turn_mode=True,
)

Content Safety

result = client.evaluate(
    input="Write a story about friendship.",
    output="Once upon a time...",
    content_moderation_check=True,
    pii_check=True,
    prompt_injections=True,
)

Syntax Validation

from qualifire.types import SyntaxCheckArgs

result = client.evaluate(
    input="Return the user data as JSON.",
    output='{"name": "John", "age": 30}',
    syntax_checks={"json": SyntaxCheckArgs(args="strict")},
)

Custom Assertions

Define natural language assertions to validate against:

result = client.evaluate(
    input="List three fruits.",
    output="1. Apple\n2. Banana\n3. Orange",
    assertions=[
        "The output must contain exactly three items",
        "Each item must be a fruit",
        "Items must be numbered",
    ],
)

Tool Selection Quality

Evaluate whether the LLM selected the right tools with correct arguments:

from qualifire.types import LLMMessage, LLMToolCall, LLMToolDefinition

result = client.evaluate(
    messages=[
        LLMMessage(role="user", content="What's the weather in New York tomorrow?"),
        LLMMessage(
            role="assistant",
            content="Let me check that for you.",
            tool_calls=[
                LLMToolCall(
                    id="call_123",
                    name="get_weather",
                    arguments={"location": "New York", "date": "tomorrow"},
                )
            ],
        ),
    ],
    available_tools=[
        LLMToolDefinition(
            name="get_weather",
            description="Get weather forecast for a location",
            parameters={
                "type": "object",
                "properties": {
                    "location": {"type": "string"},
                    "date": {"type": "string"},
                },
                "required": ["location"],
            },
        ),
    ],
    tool_use_quality_check=True,
)

Pre-configured Evaluations

Run evaluations configured in the Qualifire Dashboard:

result = client.invoke_evaluation(
    evaluation_id="eval_abc123",
    input="User query here",
    output="LLM response here",
)

Model Modes

Control the speed/quality trade-off for each check:

from qualifire.types import ModelMode

result = client.evaluate(
    input="...",
    output="...",
    hallucinations_check=True,
    hallucinations_mode=ModelMode.QUALITY,  # SPEED | BALANCED | QUALITY
    grounding_check=True,
    grounding_mode=ModelMode.SPEED,
)

Configuration

Environment Variables

Variable	Description
`QUALIFIRE_API_KEY`	Your Qualifire API key
`QUALIFIRE_BASE_URL`	Custom API base URL (optional)

Client Options

client = Client(
    api_key="your_api_key",  # Or set QUALIFIRE_API_KEY env var
    base_url="https://...",  # Custom base URL (optional)
    debug=True,              # Enable debug logging
    verify=True,             # SSL certificate verification
)

Response Format

result = client.evaluate(...)

# Overall score (0-100)
result.score

# Evaluation status
result.status

# Detailed results per check
for item in result.evaluationResults:
    print(f"Check: {item.type}")
    for r in item.results:
        print(f"  {r.name}: {r.label} (score: {r.score})")
        print(f"  Reason: {r.reason}")
        print(f"  Flagged: {r.flagged}")

Example JSON Response

{
  "score": 95,
  "status": "completed",
  "evaluationResults": [
    {
      "type": "hallucinations",
      "results": [
        {
          "name": "hallucination_check",
          "label": "pass",
          "score": 100,
          "flagged": false,
          "reason": "The response is factually accurate and consistent with known information.",
          "claim": "The capital of France is Paris.",
          "quote": "The capital of France is Paris.",
          "confidence_score": 98
        }
      ]
    }
  ]
}

Requirements

Python 3.8+

License

MIT License - see LICENSE for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.17.0

Feb 10, 2026

0.15.0

Jan 30, 2026

0.14.0

Jan 29, 2026

0.13.0

Jan 18, 2026

0.12.0

Jan 7, 2026

0.11.0

Dec 23, 2025

0.10.2

Nov 12, 2025

0.10.1

Sep 17, 2025

0.10.0

Aug 24, 2025

0.9.0

May 26, 2025

0.8.0

May 13, 2025

0.7.0

Apr 26, 2025

0.6.7

Apr 26, 2025

0.6.1

Oct 31, 2024

0.5.13

Oct 23, 2024

0.5.12

Oct 23, 2024

0.5.11

Oct 22, 2024

0.5.10

Oct 22, 2024

0.5.9

Mar 19, 2024

0.5.8

Mar 12, 2024

0.5.5

Sep 26, 2023

0.5.3

Sep 26, 2023

0.5.0

Sep 18, 2023

0.3.0

Sep 13, 2023

0.2.0

Sep 13, 2023

0.1.0

Sep 12, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

qualifire-0.17.0.tar.gz (14.4 kB view details)

Uploaded Feb 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

qualifire-0.17.0-py3-none-any.whl (11.0 kB view details)

Uploaded Feb 10, 2026 Python 3

File details

Details for the file qualifire-0.17.0.tar.gz.

File metadata

Download URL: qualifire-0.17.0.tar.gz
Upload date: Feb 10, 2026
Size: 14.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for qualifire-0.17.0.tar.gz
Algorithm	Hash digest
SHA256	`dde013aaa426cdcd82164f38441930d6ee8202b56a3591d6f2e33b0f6e4463c7`
MD5	`5b41d3ee23cfe0ebc21381b824afb6e7`
BLAKE2b-256	`b5068a0e1fa3c39ebc7470242a6b73959caafb3aed1a1f0336ca7e980917f3ef`

See more details on using hashes here.

File details

Details for the file qualifire-0.17.0-py3-none-any.whl.

File metadata

Download URL: qualifire-0.17.0-py3-none-any.whl
Upload date: Feb 10, 2026
Size: 11.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for qualifire-0.17.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7cbcf76185403468068a31ad1d8a1327737ec42bfac58591e6c233433d3427b6`
MD5	`51bd04dd83051db99d283832ecde9e0b`
BLAKE2b-256	`abe19935604b14fc37c840dd9a97843b9038a399e9d1d1a14dca7d310f264ddb`

See more details on using hashes here.

qualifire 0.17.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Qualifire Python SDK

Table of Contents

Installation

Quick Start

Available Checks

Usage Examples

Basic Input/Output Evaluation

Message-Based Evaluation

Multi-Turn Conversations

Content Safety

Syntax Validation

Custom Assertions

Tool Selection Quality

Pre-configured Evaluations

Model Modes

Configuration

Environment Variables

Client Options

Response Format

Requirements

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes