
Framework for large language model evaluations

Project description

Welcome to Inspect, a framework for large language model evaluations created by the UK AI Security Institute.

Inspect provides many built-in components, including facilities for prompt engineering, tool usage, multi-turn dialog, and model graded evaluations. Extensions to Inspect (e.g. to support new elicitation and scoring techniques) can be provided by other Python packages.

To get started with Inspect, please see the documentation at https://inspect.aisi.org.uk/.

Inspect also includes a collection of over 100 pre-built evaluations ready to run on any model (learn more at Inspect Evals).


To work on development of Inspect, clone the repository and install with the -e flag and [dev] optional dependencies:

git clone https://github.com/UKGovernmentBEIS/inspect_ai.git
cd inspect_ai
pip install -e ".[dev]"
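
To confirm that the editable install took effect, one option is to inspect the installed distribution's metadata. The sketch below is not part of Inspect. It assumes a pip version that records a PEP 610 direct_url.json file for editable installs, and the helper names (installed_version, is_editable_record, is_editable_install) are hypothetical.

```python
import json
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of dist_name, or None if it is not installed."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def is_editable_record(direct_url_text: Optional[str]) -> bool:
    """Interpret the text of a PEP 610 direct_url.json file.

    Editable installs are expected to carry {"dir_info": {"editable": true}}.
    """
    if not direct_url_text:
        return False
    record = json.loads(direct_url_text)
    return bool(record.get("dir_info", {}).get("editable", False))


def is_editable_install(dist_name: str) -> bool:
    """Return True if dist_name is installed and its metadata marks it editable."""
    try:
        dist = metadata.distribution(dist_name)
    except metadata.PackageNotFoundError:
        return False
    # read_text returns None when the file is absent from the metadata directory.
    return is_editable_record(dist.read_text("direct_url.json"))


if __name__ == "__main__":
    name = "inspect_ai"
    print(name, installed_version(name), "editable:", is_editable_install(name))
```

If the script reports a version but editable is False, the package was probably installed from an index or a wheel rather than from the cloned repository.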

Optionally, install pre-commit hooks via:

make hooks

Run linting, formatting, and tests via:

make check
make test

If you use VS Code, make sure the recommended extensions (Python, Ruff, and MyPy) are installed. VS Code prompts you to install them when you open the project.

Frontend development (TypeScript)

The web UI lives in a git submodule at src/inspect_ai/_view/ts-mono/. These steps are only needed if you plan to work on the TypeScript/React frontend — Python-only contributors can skip this entirely.

To initialize the submodule and install dependencies, see the one-time setup guide.

Documentation

To work on the Inspect documentation, install the optional [doc] dependencies with the -e flag and build the docs:

pip install -e ".[doc]"
cd docs
quarto render # or 'quarto preview'

If you intend to work on the docs iteratively, you'll want to install the Quarto extension in VS Code.


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inspect_ai-0.3.206.tar.gz (45.3 MB)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

inspect_ai-0.3.206-py3-none-any.whl (36.0 MB)
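
The wheel file name encodes compatibility information as dash-separated fields: distribution, version, an optional build tag, then the Python, ABI, and platform tags. As a rough illustration, the following sketch splits a wheel file name into those fields. The WheelName type and parse_wheel_filename function are hypothetical helpers written for this example, not part of Inspect or pip.

```python
from typing import NamedTuple, Optional


class WheelName(NamedTuple):
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated components."""
    suffix = ".whl"
    if not filename.endswith(suffix):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(suffix)].split("-")
    if len(parts) == 5:
        # No build tag present.
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelName(dist, version, build, py, abi, plat)


if __name__ == "__main__":
    print(parse_wheel_filename("inspect_ai-0.3.206-py3-none-any.whl"))
```

For the file listed here, the tags py3, none, and any indicate a pure-Python wheel: any Python 3 interpreter, no ABI requirement, and any platform.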

Uploaded Python 3

File details

Details for the file inspect_ai-0.3.206.tar.gz.

File metadata

  • Download URL: inspect_ai-0.3.206.tar.gz
  • Upload date:
  • Size: 45.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for inspect_ai-0.3.206.tar.gz
Algorithm    Hash digest
SHA256       e58d509f8d462b9f410f6386943417611462bb84151c3a18bce768fa1756b714
MD5          632e4f057a344628cf3655911f16d65d
BLAKE2b-256  c4edf54e27d351c224c44d9c363e58476e8a281cc9228f1a1f74a1507c7b6e08
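
A downloaded file can be checked against the published SHA256 digest before it is installed. The sketch below uses only the Python standard library. The function names (sha256_of, matches_published) and the assumed download location in the current directory are illustrative, not part of Inspect.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size: int = 1024 * 1024) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest (case-insensitive)."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed location: the sdist saved in the current working directory.
    sdist = Path("inspect_ai-0.3.206.tar.gz")
    expected = "e58d509f8d462b9f410f6386943417611462bb84151c3a18bce768fa1756b714"
    if sdist.exists():
        print("digest matches" if matches_published(sdist, expected) else "DIGEST MISMATCH")
    else:
        print(f"{sdist} not found; download it first")
```

The same functions apply to the wheel by substituting its file name and the SHA256 value listed for it.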


File details

Details for the file inspect_ai-0.3.206-py3-none-any.whl.

File metadata

  • Download URL: inspect_ai-0.3.206-py3-none-any.whl
  • Upload date:
  • Size: 36.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for inspect_ai-0.3.206-py3-none-any.whl
Algorithm    Hash digest
SHA256       6c15feb7c94acacd12e02d8244c6584ada533c25e38046f5832ed6fa34a1f917
MD5          c3310ee207e4fa456c3b4d7cff893d83
BLAKE2b-256  b71898c8bc05cc67720e78f3b44eb7ec7c1cf7860a8997001ea0eb2b590cca87

