HB-Eval SDK: operational reliability evaluation for agentic AI — fault-injection battery, runtime monitoring, MCP server, and LangChain/LangGraph/CrewAI adapters

These details have not been verified by PyPI

Project links

Project description

HB-Eval SDK

The official Python SDK for HB-Eval OS — the reliability operating system for agentic AI. Evaluate any agent trajectory against five reliability metrics and receive a tier certification, in a few lines of code.

Install

pip install hb-eval-sdk

For the LangChain integration:

pip install hb-eval-sdk[langchain]

Quick start

from hb_eval_sdk import HBEvalClient

client = HBEvalClient(
    api_key="...",          # identifies your project
    aes_key="...",          # encrypts your payload (base64, 32 bytes)
    signing_secret="...",   # signs your request (base64; never transmitted)
)

result = client.evaluate({
    "trajectory": [
        {"step": 1, "action": "chain_start"},
        {"step": 2, "action": "tool_call", "tool": "search"},
        {"step": 3, "action": "chain_end"},
    ],
    "sub_tasks": 3,
    "constraint_violations": 0,
    "recovery_episodes": [],
    "agent_id": "my-agent",
})

print(result.verdict, result.tier)
print(result.metrics)   # pei, irs, frr, ti, csi

The five metrics

Every evaluation returns five reliability metrics. Any of them may be None when it is genuinely undefined for a given run, and None always means "not measured" — never "scored zero".

PEI — Planning Efficiency Index
IRS — Intentional Recovery Score (None when the run had no faults)
FRR — Failure Resilience Rate
TI — Traceability Index (None when no judge evaluation was made)
CSI — Consistency Stability Index (None without enough history)

LangChain

from hb_eval_sdk import HBEvalCallback

callback = HBEvalCallback(api_key="...", aes_key="...", signing_secret="...")
agent.run(task, callbacks=[callback])
print(callback.last_result.verdict)

The callback observes the real run — counting genuine tool errors and detecting actual fault-and-recovery patterns — rather than assuming a clean execution.

Credentials

Your project has three credentials, issued together when the project is created. The API key is sent on each request to identify you. The AES key encrypts your payload locally. The signing secret signs your request and is never transmitted — it proves the request genuinely came from you, even to an observer who has seen your API key.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

2.5.0

Jul 23, 2026

2.4.0

Jul 23, 2026

2.3.1

Jul 22, 2026

2.3.0

Jul 22, 2026

2.2.0

Jun 21, 2026

2.1.0

Jun 7, 2026

2.0.0

May 30, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hb_eval_sdk-2.5.0.tar.gz (39.0 kB view details)

Uploaded Jul 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

hb_eval_sdk-2.5.0-py3-none-any.whl (41.4 kB view details)

Uploaded Jul 23, 2026 Python 3

File details

Details for the file hb_eval_sdk-2.5.0.tar.gz.

File metadata

Download URL: hb_eval_sdk-2.5.0.tar.gz
Upload date: Jul 23, 2026
Size: 39.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for hb_eval_sdk-2.5.0.tar.gz
Algorithm	Hash digest
SHA256	`51a8c40040ae795ae1cb682cc2c7fce9779b1b7a6bb4535e4e68bc053c873f5f`
MD5	`8e29fd03459efe79cfb0531c9efc818c`
BLAKE2b-256	`40aab6467500f82f39aad252d03f8651d4cb6e13d1ecb07de9d67a7f8a9bd854`

See more details on using hashes here.

File details

Details for the file hb_eval_sdk-2.5.0-py3-none-any.whl.

File metadata

Download URL: hb_eval_sdk-2.5.0-py3-none-any.whl
Upload date: Jul 23, 2026
Size: 41.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for hb_eval_sdk-2.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`fbfdd8323158f7d0b1a0c80192cc39fdf301f46ff8b752c3cb6456ec9f2c01a4`
MD5	`ae22c56b4c1a419dcc029a1ddbb974d6`
BLAKE2b-256	`7193fef8451a83f847a72c1e027a3e0429705e862522e372022cbef356dc0b7a`

See more details on using hashes here.

hb-eval-sdk 2.5.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

HB-Eval SDK

Install

Quick start

The five metrics

LangChain

Credentials

Links

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes