Skip to main content

Framework-agnostic LLM agent evaluation harness

Project description

EvalForge Python SDK

pip install evalforge

Quick Start

import evalforge result = evalforge.run("trace.json", metrics=["faithfulness"]) print(result.passed) print(result.metrics[0].score)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

evalforge-1.0.0-py3-none-win_amd64.whl (1.8 MB view details)

Uploaded Python 3Windows x86-64

evalforge-1.0.0-py3-none-manylinux_2_38_x86_64.whl (4.1 MB view details)

Uploaded Python 3manylinux: glibc 2.38+ x86-64

evalforge-1.0.0-py3-none-macosx_11_0_arm64.whl (1.8 MB view details)

Uploaded Python 3macOS 11.0+ ARM64

evalforge-1.0.0-py3-none-any.whl (10.7 kB view details)

Uploaded Python 3

File details

Details for the file evalforge-1.0.0-py3-none-win_amd64.whl.

File metadata

  • Download URL: evalforge-1.0.0-py3-none-win_amd64.whl
  • Upload date:
  • Size: 1.8 MB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for evalforge-1.0.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 78f0a0959062b2805c01e5432be99dd4c3b684d3ad5880a308b45de0cd3db279
MD5 27fb9ccba50fbec118086a380dec8508
BLAKE2b-256 04f5ba7272c90b9ff13129226b22d9b6f3527c138fbc545dce075f1c85c90b99

See more details on using hashes here.

Provenance

The following attestation bundles were made for evalforge-1.0.0-py3-none-win_amd64.whl:

Publisher: release.yml on heManKuMAR6/evalforge

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file evalforge-1.0.0-py3-none-manylinux_2_38_x86_64.whl.

File metadata

File hashes

Hashes for evalforge-1.0.0-py3-none-manylinux_2_38_x86_64.whl
Algorithm Hash digest
SHA256 be725cfade7cae34608660845dce45e016236986de27e18f0b6b38e290f3cf34
MD5 60a61ef39b14a15868ea290f062aac45
BLAKE2b-256 6f4d8f68e1f99e43fa2a41399497379d8c3a48c98de847694b3d020c61bcf332

See more details on using hashes here.

Provenance

The following attestation bundles were made for evalforge-1.0.0-py3-none-manylinux_2_38_x86_64.whl:

Publisher: release.yml on heManKuMAR6/evalforge

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file evalforge-1.0.0-py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for evalforge-1.0.0-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 cd9c6f927e9fad5321a4e4db88d7c129de30017a83ca9ebabafd5d555da79530
MD5 8b82582cd7192f0fadc374bab0c71804
BLAKE2b-256 8a0c17b1db618ebcdd95fee32abcdb1b7a34d896e9979f9e23303ea41581c3fc

See more details on using hashes here.

Provenance

The following attestation bundles were made for evalforge-1.0.0-py3-none-macosx_11_0_arm64.whl:

Publisher: release.yml on heManKuMAR6/evalforge

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file evalforge-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: evalforge-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 10.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for evalforge-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f783dfe17434a8584e095bd9bef88f9cb25f41342770ac24298383f53f085a9a
MD5 34a87161becd6da81c383f366ad95b33
BLAKE2b-256 349a58f021619f3ae14af8967277c8ea6ba8677441d60fb110d5ce8993350670

See more details on using hashes here.

Provenance

The following attestation bundles were made for evalforge-1.0.0-py3-none-any.whl:

Publisher: release.yml on heManKuMAR6/evalforge

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page