Skip to main content

Framework-agnostic LLM agent evaluation harness

Project description

EvalForge Python SDK

pip install evalforge

Quick Start

import evalforge result = evalforge.run("trace.json", metrics=["faithfulness"]) print(result.passed) print(result.metrics[0].score)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

evalforge-0.2.0-py3-none-macosx_11_0_arm64.whl (1.6 MB view details)

Uploaded Python 3macOS 11.0+ ARM64

File details

Details for the file evalforge-0.2.0-py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for evalforge-0.2.0-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 a1a65a85cad7a9f890895aafc7787052b64c4c01ce9001fda2bd63e43ad81ee4
MD5 5b8998211719363f0a6afc2f866cc088
BLAKE2b-256 895bf9f4376a12f7694212b3118e5dd71ff5684e5948053506abc1fdf53e46f8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page