Patronus Python SDK

The Patronus Python SDK is a Python library for systematic evaluation of Large Language Models (LLMs). Build, test, and improve your LLM applications with customizable tasks, evaluators, and comprehensive experiment tracking.

Note: This library is currently in beta and is not stable. The APIs may change in future releases.

Documentation

For detailed documentation, including API references and advanced usage, please visit our documentation.

Installation

pip install patronus

Quickstart

Tracing

import patronus

patronus.init()


# Wrap a function with the @traced() decorator.
@patronus.traced()
def main():
    perform()


def perform():
    # Or use the start_span context manager.
    with patronus.start_span("Performing action"):
        # Do work
        ...


if __name__ == "__main__":
    main()

Custom evaluations

from patronus import evaluator, init

init()


@evaluator
def iexact_match(actual: str, expected: str) -> bool:
    # Case-insensitive comparison that ignores leading and trailing whitespace.
    return actual.lower().strip() == expected.lower().strip()


def main():
    iexact_match("bonne nuit", "Bonne nuit")


if __name__ == "__main__":
    main()
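To see which inputs the comparison in iexact_match treats as equal, the same logic can be exercised without the SDK. This is a minimal sketch in plain Python: the function body is copied from the example above, the decorator is left out, and the extra test strings are illustrative assumptions.

```python
def iexact_match(actual: str, expected: str) -> bool:
    # Same comparison as the evaluator above, without the SDK decorator.
    return actual.lower().strip() == expected.lower().strip()


# Case and surrounding whitespace are ignored.
same_case_insensitive = iexact_match("bonne nuit", "Bonne nuit")
same_after_strip = iexact_match("  Bonne Nuit\n", "bonne nuit")

# Whitespace inside the string is still significant (two spaces vs one).
different_inner_space = iexact_match("bonne  nuit", "bonne nuit")

print(same_case_insensitive, same_after_strip, different_inner_space)
```

Only the ends of each string are stripped, so answers that differ in internal spacing or punctuation would need additional normalization before comparison.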

Patronus evaluations

from patronus import init
from patronus import RemoteEvaluator

init()

check_hallucinates = RemoteEvaluator("lynx", "patronus:hallucination")

resp = check_hallucinates.evaluate(
    task_input="What is the car insurance policy?",
    task_context=(
        """
        To qualify for our car insurance policy, you need a way to show competence
        in driving which can be accomplished through a valid driver's license.
        You must have multiple years of experience and cannot be graduating from driving school before or on 2028.
        """
    ),
    task_output="To even qualify for our car insurance policy, you need to have a valid driver's license that expires later than 2028."
)
print(resp.model_dump_json(indent=4))

Experiments

The Patronus Python SDK includes an experimentation framework for evaluating, comparing, and improving AI models. An experiment runs a task over each row of a dataset and scores the task's output with one or more evaluators. The example below combines a remote Patronus evaluator with a local function evaluator and exports the results to CSV.

from patronus.evals import evaluator, RemoteEvaluator
from patronus.experiments import run_experiment, Row, TaskResult, FuncEvaluatorAdapter


def my_task(row: Row, **kwargs):
    return f"{row.task_input} World"


# Reference remote Judge Patronus Evaluator with is-concise criteria.
# This evaluator runs remotely on Patronus infrastructure.
is_concise = RemoteEvaluator("judge", "patronus:is-concise")


@evaluator()
def exact_match(row: Row, task_result: TaskResult, **kwargs):
    # Compare against gold_answer, the expected-answer field the dataset below provides.
    print(f"{task_result.output=}  :: {row.gold_answer=}")
    return task_result.output == row.gold_answer


result = run_experiment(
    project_name="Tutorial Project",
    dataset=[
        {
            "task_input": "Hello",
            "gold_answer": "Hello World",
        },
    ],
    task=my_task,
    evaluators=[is_concise, FuncEvaluatorAdapter(exact_match)],
)

result.to_csv("./experiment.csv")
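As a rough mental model of the flow in the example above (run a task per row, score the output, export a table), the following plain-Python sketch may help. It is not the SDK's implementation: run_rows, the dict-based Row alias, and the CSV column names are hypothetical stand-ins, and it assumes the evaluator compares the task output with the row's gold_answer field.

```python
import csv
import io

Row = dict[str, str]  # hypothetical stand-in for the SDK's Row type


def my_task(row: Row) -> str:
    return f"{row['task_input']} World"


def exact_match(row: Row, output: str) -> bool:
    # Assumed comparison target: the row's gold_answer field.
    return output == row["gold_answer"]


def run_rows(dataset, task, evaluators):
    """Run the task on every row, then score its output with each evaluator."""
    results = []
    for row in dataset:
        output = task(row)
        record = {"task_input": row["task_input"], "output": output}
        for name, fn in evaluators.items():
            record[name] = fn(row, output)
        results.append(record)
    return results


dataset = [{"task_input": "Hello", "gold_answer": "Hello World"}]
results = run_rows(dataset, my_task, {"exact_match": exact_match})

# Write the collected records as CSV into an in-memory buffer.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=list(results[0].keys()))
writer.writeheader()
writer.writerows(results)
print(buffer.getvalue())
```

The sketch runs everything locally and sequentially. A remote evaluator such as the one in the example above would not fit this local-only model.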

Project details

Release history

Version	Release date	Notes
0.1.25	Jan 9, 2026
0.1.24	Oct 17, 2025
0.1.23	Sep 8, 2025
0.1.22	Aug 26, 2025
0.1.21	Aug 13, 2025
0.1.20	Aug 6, 2025
0.1.19	Aug 6, 2025
0.1.18	Jul 14, 2025
0.1.18a1	Jun 9, 2025	pre-release
0.1.17	Jun 18, 2025
0.1.16	May 22, 2025
0.1.16rc1	May 22, 2025	pre-release
0.1.5	May 12, 2025
0.1.5rc5	May 12, 2025	pre-release
0.1.5rc4	May 12, 2025	pre-release
0.1.5rc3	May 12, 2025	pre-release
0.1.5rc2	May 12, 2025	pre-release
0.1.5rc1	May 5, 2025	pre-release
0.1.4	May 5, 2025
0.1.4rc1	Apr 10, 2025	pre-release
0.1.3	Mar 31, 2025
0.1.2	Mar 21, 2025
0.1.1	Mar 17, 2025
0.1.0	Mar 13, 2025
0.1.0rc3	Mar 13, 2025	pre-release
0.1.0rc2	Mar 12, 2025	pre-release
0.1.0rc1	Mar 10, 2025	pre-release (this version)
0.0.18	Jan 27, 2025
0.0.17	Jan 13, 2025
0.0.16	Jan 2, 2025
0.0.15	Dec 17, 2024
0.0.15rc1	Dec 11, 2024	pre-release
0.0.14	Nov 7, 2024
0.0.13	Oct 30, 2024
0.0.13rc0	Oct 30, 2024	pre-release
0.0.12	Oct 30, 2024
0.0.11	Oct 28, 2024
0.0.10	Oct 25, 2024
0.0.9	Oct 22, 2024
0.0.8	Oct 1, 2024
0.0.7	Sep 10, 2024
0.0.6	Sep 3, 2024
0.0.3	Aug 30, 2024
0.0.2	Aug 27, 2024
0.0.1	Aug 16, 2024

Download files

Download the file for your platform.

Source Distribution

patronus-0.1.0rc1.tar.gz (38.9 kB)

Uploaded Mar 10, 2025 Source

Built Distribution

patronus-0.1.0rc1-py3-none-any.whl (49.5 kB)

Uploaded Mar 10, 2025 Python 3

File details

Details for the file patronus-0.1.0rc1.tar.gz.

File metadata

Download URL: patronus-0.1.0rc1.tar.gz
Upload date: Mar 10, 2025
Size: 38.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.0.0 CPython/3.12.8 Darwin/24.3.0

File hashes

Hashes for patronus-0.1.0rc1.tar.gz
Algorithm	Hash digest
SHA256	`6801233cf2ef3e9fe976dea78bf7e49f915e42be31741b100b9881c8db93069f`
MD5	`2a17f9c636655c46b307c61b7a232b6f`
BLAKE2b-256	`8a2d7624101da2ccff09a015ca49bb83cbbc55a3bcfaeed50e02a9e52fb66688`

File details

Details for the file patronus-0.1.0rc1-py3-none-any.whl.

File metadata

Download URL: patronus-0.1.0rc1-py3-none-any.whl
Upload date: Mar 10, 2025
Size: 49.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.0.0 CPython/3.12.8 Darwin/24.3.0

File hashes

Hashes for patronus-0.1.0rc1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e197f80be33d788164bfe91e3b323b34177709d0ae8b9d34d34a12e2dc6bf5d7`
MD5	`8c936edfe1add06b3cddbaaf8dae9986`
BLAKE2b-256	`f9422d60a34506b29e486cdc9c5a978639902c34aa91414b7cff1df4c4d6b860`
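The published SHA256 digests above can be used to check a downloaded file before installing it. The following is a minimal sketch using only the standard library. The helper names sha256_of and verify are hypothetical, and it assumes the files were downloaded under their original names into the current directory.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the tables above.
PUBLISHED_SHA256 = {
    "patronus-0.1.0rc1.tar.gz":
        "6801233cf2ef3e9fe976dea78bf7e49f915e42be31741b100b9881c8db93069f",
    "patronus-0.1.0rc1-py3-none-any.whl":
        "e197f80be33d788164bfe91e3b323b34177709d0ae8b9d34d34a12e2dc6bf5d7",
}


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so large downloads are not read into memory at once."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: Path) -> bool:
    """Return True only if the file name is known and its digest matches."""
    expected = PUBLISHED_SHA256.get(path.name)
    return expected is not None and sha256_of(path) == expected


if __name__ == "__main__":
    for name in PUBLISHED_SHA256:
        path = Path(name)
        if path.exists():
            print(name, "OK" if verify(path) else "MISMATCH")
        else:
            print(name, "not found in the current directory")
```

A mismatch means the file differs from the one that was uploaded, in which case it should be downloaded again rather than installed.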
