Skip to main content

High-level, zero-code interface for evaluating LLMs using Inspect-AI

Project description

Easy Inspect

A high-level, zero-code interface for evaluating LLMs built on top of Inspect-AI.

Overview

Easy Inspect provides a simple way to evaluate language models using YAML configuration files. It handles all the complexity of setting up evaluation tasks, running models, and analyzing results.

Features

  • Define evaluation questions using simple YAML files
  • Support for multiple question types:
    • Free-form text responses
    • Numerical ratings (0-100)
    • Model-graded evaluations
  • Built-in support for multiple LLM providers (OpenAI, Anthropic)
  • Automatic result caching and logging
  • Easy results analysis and visualization

Installation

git clone https://github.com/dtch1997/easy-eval.git
cd easy-eval
pip install -e .

Usage

See the examples directory for usage examples.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

easy_inspect-0.1.0.tar.gz (8.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

easy_inspect-0.1.0-py3-none-any.whl (9.6 kB view details)

Uploaded Python 3

File details

Details for the file easy_inspect-0.1.0.tar.gz.

File metadata

  • Download URL: easy_inspect-0.1.0.tar.gz
  • Upload date:
  • Size: 8.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for easy_inspect-0.1.0.tar.gz
Algorithm Hash digest
SHA256 4ecbffc7fddda9c4a2529085ba1bd9bcb0ceb6dc36f3454b7eccd72b72a9e597
MD5 a488bfe3028bc14370ccddad5755add7
BLAKE2b-256 4851c007ff262f63d4164e0dd90e85ccd0605b3054fe7ff8a6a8f540b4209ff0

See more details on using hashes here.

Provenance

The following attestation bundles were made for easy_inspect-0.1.0.tar.gz:

Publisher: manual_publish.yaml on dtch1997/easy-inspect

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file easy_inspect-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: easy_inspect-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 9.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for easy_inspect-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f2e7f21f16694fc5258570a06fd359bcc35298c635980c3a97b4b9b7c3a92dcb
MD5 5430cfdc38323747150a0c0e34197709
BLAKE2b-256 c878769ca668d6b9756f2f41d782f3cc766d14df4c6d582e656856e9bbfa39a9

See more details on using hashes here.

Provenance

The following attestation bundles were made for easy_inspect-0.1.0-py3-none-any.whl:

Publisher: manual_publish.yaml on dtch1997/easy-inspect

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page