Skip to main content

PyTorch-native measurement science toolkit: IRT, CAT, psychometric metrics, and factor models for AI evaluation

Project description

torch_measure

License: MIT Python 3.10+ PyTorch Discord

PyTorch-native toolkit for predictive evaluation of AI systems.

Benchmark scores increasingly gate deployment decisions but rarely predict how a model will behave in production. torch_measure treats evaluation itself as a predictive modeling problem: latent-variable models infer a system's capability directly from sparse benchmark observations and predict its performance on unseen tasks. Built on PyTorch, with GPU-accelerated IRT, factor models, amortized inference, adaptive testing, and tabular baselines.

Installation

With pip:

pip install torch_measure

With uv (faster; drop-in replacement for pip):

uv pip install torch_measure        # into the active environment
uv add torch_measure                # into a uv-managed project

Contributing

We welcome contributions! Please see our contributing guidelines for details, or drop by our Discord to chat.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torch_measure-0.1.0.tar.gz (4.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

torch_measure-0.1.0-py3-none-any.whl (90.2 kB view details)

Uploaded Python 3

File details

Details for the file torch_measure-0.1.0.tar.gz.

File metadata

  • Download URL: torch_measure-0.1.0.tar.gz
  • Upload date:
  • Size: 4.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for torch_measure-0.1.0.tar.gz
Algorithm Hash digest
SHA256 f9904bd196149a0ba5f087854c9a1efd536d0fd678fdcfc9691ba48b39e11662
MD5 40a207dbb6723d1c27ab3959922abb30
BLAKE2b-256 0e4055e1cf431364ab10f2f3b0ae557eed02054835c6acb1b3929af0ccc938ac

See more details on using hashes here.

Provenance

The following attestation bundles were made for torch_measure-0.1.0.tar.gz:

Publisher: publish.yml on aims-foundations/torch_measure

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file torch_measure-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: torch_measure-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 90.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for torch_measure-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2708b0895bcb0fcbb4bda644b268725539db87cfb36938b93136c4cecf38b4c7
MD5 f04f74d4a728c05c8eeb8cd24e06c049
BLAKE2b-256 c554b9079f1624513e663949350eced4a925b4cfd9770ece1e602a74a25d8fb6

See more details on using hashes here.

Provenance

The following attestation bundles were made for torch_measure-0.1.0-py3-none-any.whl:

Publisher: publish.yml on aims-foundations/torch_measure

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page