Framework for large language model evaluations
Project description
Welcome to Inspect, a framework for large language model evaluations created by the UK AI Security Institute.
Inspect provides many built-in components, including facilities for prompt engineering, tool usage, multi-turn dialog, and model graded evaluations. Extensions to Inspect (e.g. to support new elicitation and scoring techniques) can be provided by other Python packages.
To get started with Inspect, please see the documentation at https://inspect.aisi.org.uk/.
Inspect also includes a collection of over 100 pre-built evaluations ready to run on any model (learn more at Inspect Evals)
To work on development of Inspect, clone the repository and install with the -e flag and [dev] optional dependencies:
git clone https://github.com/UKGovernmentBEIS/inspect_ai.git
cd inspect_ai
pip install -e ".[dev]"
Optionally install pre-commit hooks via
make hooks
Run linting, formatting, and tests via
make check
make test
If you use VS Code, you should be sure to have installed the recommended extensions (Python, Ruff, and MyPy). Note that you'll be prompted to install these when you open the project in VS Code.
To work on the Inspect documentation, install the optional [doc] dependencies with the -e flag and build the docs:
pip install -e ".[doc]"
cd docs
quarto render # or 'quarto preview'
If you intend to work on the docs iteratively, you'll want to install the Quarto extension in VS Code.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file inspect_ai-0.3.172.tar.gz.
File metadata
- Download URL: inspect_ai-0.3.172.tar.gz
- Upload date:
- Size: 43.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
34a033a1682b678b63ba924b56a2f5a80fbb0eaf211f201c0852638e8c3198c5
|
|
| MD5 |
ce6ce725b7a57c2091c233da3f27d638
|
|
| BLAKE2b-256 |
dca025793628f7cd30d4e2661807e5dd3a660c4f8239ceadb32ecfd6e6cd687e
|
File details
Details for the file inspect_ai-0.3.172-py3-none-any.whl.
File metadata
- Download URL: inspect_ai-0.3.172-py3-none-any.whl
- Upload date:
- Size: 34.6 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0cb67f87d6097208c54bd32d945dae32af194c0f47ba3ebbce20540002b54d72
|
|
| MD5 |
a190352513ecf9b6da353c6611a0cc2e
|
|
| BLAKE2b-256 |
38a014e4c3333db90610eefd970a6cb7dea4fa4c97bdb7227df5c5adf007a770
|