Automatically and uniformly measure the behavior of many AI Systems.

These details have been verified by PyPI

Maintainers

auxy bkorycki bollacker dhosterman mlcommons wpietri

These details have not been verified by PyPI

Project links

Project description

ModelGauge

Goal: Make it easy to automatically and uniformly measure the behavior of many AI Systems.

[!WARNING] This repo is still in beta with a planned full release in Fall 2024. Until then we reserve the right to make backward incompatible changes as needed.

ModelGauge is an evolution of crfm-helm, intended to meet their existing use cases as well as those needed by the MLCommons AI Safety project.

Summary

ModelGauge is a library that provides a set of interfaces for Tests and Systems Under Test (SUTs) such that:

Each Test can be applied to all SUTs with the required underlying capabilities (e.g. does it take text input?)
Adding new Tests or SUTs can be done without modifications to the core libraries or support from ModelGauge authors.

Currently ModelGauge is targeted at LLMs and single turn prompt response Tests, with Tests scored by automated Annotators (e.g. LlamaGuard). However, we expect to extend the library to cover more Test, SUT, and Annotation types as we move toward full release.

Docs

Developer Quick Start
Tutorial for how to create a Test
Tutorial for how to create a System Under Test (SUT)
How we use plugins to connect it all together.

Project details

These details have been verified by PyPI

Maintainers

auxy bkorycki bollacker dhosterman mlcommons wpietri

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.6.3

Sep 13, 2024

0.6.2

Sep 5, 2024

0.6.1

Sep 5, 2024

0.6.0

Aug 6, 2024

0.5.1

Apr 27, 2024

0.5.0

Apr 15, 2024

0.3.3

Apr 12, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

modelgauge-0.6.3.tar.gz (55.5 kB view details)

Uploaded Sep 13, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

modelgauge-0.6.3-py3-none-any.whl (72.5 kB view details)

Uploaded Sep 13, 2024 Python 3

File details

Details for the file modelgauge-0.6.3.tar.gz.

File metadata

Download URL: modelgauge-0.6.3.tar.gz
Upload date: Sep 13, 2024
Size: 55.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.3 CPython/3.10.10 Darwin/22.3.0

File hashes

Hashes for modelgauge-0.6.3.tar.gz
Algorithm	Hash digest
SHA256	`181ad1f691e5d3bdd3b1de519919ec48da9618cdd3eaebd38d4b655af9391e8b`
MD5	`1d556c642d2e0630335cf1459108f079`
BLAKE2b-256	`51796892bea160dda36c74bbe8c4275db4351df7f3d3469e98e1abdecbbbf9fe`

See more details on using hashes here.

File details

Details for the file modelgauge-0.6.3-py3-none-any.whl.

File metadata

Download URL: modelgauge-0.6.3-py3-none-any.whl
Upload date: Sep 13, 2024
Size: 72.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.3 CPython/3.10.10 Darwin/22.3.0

File hashes

Hashes for modelgauge-0.6.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a7317b1a8d39221b1ea8455cdb49c895959e57890a0254a26cc1e0ad03ad4344`
MD5	`d8787fd74768ff78060ffa6c1e302a94`
BLAKE2b-256	`61d2dccef44f5399c0ade89ecf319e25ef6f4e9dbec5c71bf84e6d1eae214d84`

See more details on using hashes here.

modelgauge 0.6.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ModelGauge

Summary

Docs

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes