Evaluation Framework SDK

Project description

DeepEvalClient

A lightweight Python client for interacting with the Evaluation API. It provides convenient wrappers for text and schema evaluation endpoints, with support for background jobs and probabilistic execution.

Features

🔹 Text Evaluation – Run evaluations on plain text inputs.
🔹 Schema Evaluation – Evaluate structured inputs against schema-based metrics.
🔹 Background Jobs – Submit jobs asynchronously and process later.
🔹 Probabilistic Execution – Run evaluations with a configurable chance (e.g., A/B testing scenarios).
🔹 Robust Error Handling – Handles network errors and invalid JSON gracefully.
🔹 Configurable – Configure via constructor args, environment variables, or external settings module.

Installation

pip install rakam-eval-sdk

Usage

Basic Setup

from deepeval.client import DeepEvalClient
from deepeval.schema import TextInputItem, MetricConfig

client = DeepEvalClient(
    base_url="http://localhost:8080",
    api_token="your-api-key"
)

Text Evaluation

    client.maybe_text_eval_background(
                component="ocr",
                data=[
                    TextInputItem(

                        id="runtime evaluation", # identifiar (that can be unique). use same id in case you want to follow performance over time
                        input="...", # input given to ai component
                        output="...", # output of the ai component
                        # optional args/ condtional based on metrics passed
                        expected_output=["..."],
                        retrieval_context=[
                            ["..."]
                        ]

                    )
                ],
                metrics=[
                    ToxicityConfig(
                        # model="gpt-4.1",
                        threshold=0.2,
                        include_reason=False
                    ),
                    CorrectnessConfig(
                        steps=[
                            "You are evaluating text extracted from resumes and job descriptions using OCR.",
                            "1. Verify that the extracted text is coherent and free of major corruption (e.g., broken words, random characters).",
                            "2. Check whether key resume/job-related fields are preserved correctly (e.g., name, job title, skills, education, experience, company name, job requirements).",
                            "3. Ensure that important details are not missing or replaced with irrelevant content.",
                            "4. Ignore minor formatting issues (line breaks, spacing) as long as the information is readable and accurate.",
                            "5. Consider the output correct if it faithfully represents the resume or job description’s main information."
                        ],
                        params=["actual_output"],

                    )
                ],
                chance=.3
            )

Schema Evaluation

    client.maybe_text_eval_background(
                component="ocr",
                data=[
                    TextInputItem(

                        id="runtime evaluation", # identifiar (that can be unique). use same id in case you want to follow performance over time
                        input="...", # input given to ai component
                        output="...", # output of the ai component
                        # optional args/ condtional based on metrics passed
                        expected_output=["..."],
                        retrieval_context=[
                            ["..."]
                        ]

                    )
                ],
                metrics=[
                    ToxicityConfig(
                        # model="gpt-4.1",
                        threshold=0.2,
                        include_reason=False
                    ),
                    CorrectnessConfig(
                        steps=[
                            "You are evaluating text extracted from resumes and job descriptions using OCR.",
                            "1. Verify that the extracted text is coherent and free of major corruption (e.g., broken words, random characters).",
                            "2. Check whether key resume/job-related fields are preserved correctly (e.g., name, job title, skills, education, experience, company name, job requirements).",
                            "3. Ensure that important details are not missing or replaced with irrelevant content.",
                            "4. Ignore minor formatting issues (line breaks, spacing) as long as the information is readable and accurate.",
                            "5. Consider the output correct if it faithfully represents the resume or job description’s main information."
                        ],
                        params=["actual_output"],

                    )
                ],
                chance=.3
            )

Configuration

The client can be configured in multiple ways:

Directly via constructor arguments

DeepEvalClient(base_url="http://api", api_token="123")

Environment variables

export EVALFRAMWORK_URL=http://api
export EVALFRAMWORK_API_KEY=123

Settings module

import settings # it can be django settings e.g.: from django.conf import settings
client = DeepEvalClient(settings_module=settings)

Project details

Release history Release notifications | RSS feed

0.2.4

Feb 2, 2026

0.2.4rc10 pre-release

Feb 10, 2026

0.2.4rc9 pre-release

Feb 10, 2026

0.2.4rc8 pre-release

Feb 5, 2026

0.2.4rc7 pre-release

Feb 4, 2026

0.2.4rc6 pre-release

Feb 4, 2026

0.2.4rc5 pre-release

Feb 4, 2026

0.2.4rc4 pre-release

Feb 4, 2026

0.2.4rc3 pre-release

Feb 4, 2026

0.2.4rc2 pre-release

Feb 4, 2026

0.2.4rc1 pre-release

Feb 4, 2026

0.2.3

Jan 29, 2026

0.2.2

Jan 29, 2026

0.2.1

Jan 29, 2026

0.2.0

Jan 29, 2026

This version

0.2.0rc2 pre-release

Jan 29, 2026

0.2.0rc1 pre-release

Jan 29, 2026

0.1.16

Jan 29, 2026

0.1.16rc1 pre-release

Jan 29, 2026

0.1.15

Jan 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rakam_eval_sdk-0.2.0rc2.tar.gz (9.5 kB view details)

Uploaded Jan 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

rakam_eval_sdk-0.2.0rc2-py3-none-any.whl (11.8 kB view details)

Uploaded Jan 29, 2026 Python 3

File details

Details for the file rakam_eval_sdk-0.2.0rc2.tar.gz.

File metadata

Download URL: rakam_eval_sdk-0.2.0rc2.tar.gz
Upload date: Jan 29, 2026
Size: 9.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.6

File hashes

Hashes for rakam_eval_sdk-0.2.0rc2.tar.gz
Algorithm	Hash digest
SHA256	`5c9279d354e444aec22ddbb6e8952b62aa66a948627a45f2bae36b04ef0e50b7`
MD5	`a35b065cef0b6b071692650e719a2cc7`
BLAKE2b-256	`75ff1dc1f0c1b3c83aefab307a5e01a1b6f8dabd0efcf5725785f482dcdf978c`

See more details on using hashes here.

File details

Details for the file rakam_eval_sdk-0.2.0rc2-py3-none-any.whl.

File metadata

Download URL: rakam_eval_sdk-0.2.0rc2-py3-none-any.whl
Upload date: Jan 29, 2026
Size: 11.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.6

File hashes

Hashes for rakam_eval_sdk-0.2.0rc2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1583afbf7d897dca8ee53cae3a799e97262634d40350b04bec3f5d9a2f9c0e62`
MD5	`9d4190079f8e7ef587c39aa77d22de24`
BLAKE2b-256	`60a9a763f6ce7e6a80ad1e9deb9ec4d8685da1decf4f3011382cac7840c2f2a6`

See more details on using hashes here.

rakam-eval-sdk 0.2.0rc2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

DeepEvalClient

Features

Installation

Configuration

Directly via constructor arguments

Environment variables

Settings module

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes