Inspect AI `Scorer` adapter for whatifd.

These details have been verified by PyPI

Maintainers

voseghale

Project description

whatifd-inspect-ai

Inspect AI Scorer adapter for whatifd. Phase 4B.2 of the v0.1 plan.

Install

pip install whatifd-inspect-ai

This pulls in whatifd and inspect-ai>=0.3.216,<0.4. The pin follows common library practice: a lower bound plus a minor-version cap, because Inspect AI is pre-1.0 and can ship breaking changes in a minor bump.
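To make the pin concrete, here is a minimal sketch of what `>=0.3.216,<0.4` accepts and rejects. It is an illustration only: the helper names are hypothetical, it handles plain dotted numeric releases, and it does not implement full PEP 440 matching (pre-release and post-release suffixes are out of scope).

```python
def parse_version(text: str) -> tuple[int, ...]:
    """Turn a plain dotted release such as '0.3.216' into (0, 3, 216)."""
    return tuple(int(part) for part in text.split("."))


LOWER_BOUND = parse_version("0.3.216")  # inclusive lower bound from the pin
UPPER_CAP = parse_version("0.4")        # exclusive minor-version cap


def satisfies_pin(version: str) -> bool:
    """True when LOWER_BOUND <= version < UPPER_CAP (numeric releases only)."""
    parsed = parse_version(version)
    return LOWER_BOUND <= parsed < UPPER_CAP


if __name__ == "__main__":
    for candidate in ("0.3.215", "0.3.216", "0.3.999", "0.4.0"):
        print(candidate, satisfies_pin(candidate))
```

Any 0.3.x release at or above 0.3.216 passes, while 0.4.0 is held back until the adapter is migrated.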

Usage

from inspect_ai.scorer import Score, Target
from inspect_ai.solver import TaskState
from whatifd_inspect_ai import InspectAIScorer
from whatifd.contract import ScoreCase


def score_fn(case: ScoreCase) -> Score:
    """Adapt a whatifd ScoreCase to the user's Inspect AI scorer.

    The adapter expects a (ScoreCase) -> Score callable. Typical pattern:
    build a TaskState from the case, run the Inspect AI scorer, and
    return its Score.
    """
    state = TaskState(
        model="anthropic/claude-opus-4-7",
        sample_id=case.trace_id,
        epoch=0,
        input=case.input.user_message,
        messages=[],
        output=...,  # placeholder: build a ModelOutput from case.replayed_output.text
    )
    # The original output serves as the scoring target.
    target = Target(case.original_output.text)
    # my_inspect_scorer is a placeholder for your own Inspect AI scorer.
    return my_inspect_scorer(state, target)


scorer = InspectAIScorer(
    score_fn=score_fn,
    judge_provider="anthropic",
    judge_model_id="claude-opus-4-7",
    rubric_id="faithfulness-v1",
    rubric_text="Score 0-1 by faithfulness to the original output...",
    scoring_parameters={"temperature": 0.0, "max_tokens": 256},
)

# Plug into the whatifd pipeline alongside a TraceSource.
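The constructor arguments above (judge provider, model id, rubric, scoring parameters) together describe the judging methodology. As a rough sketch of how such fields could be folded into a stable cache key, the snippet below hashes a canonical JSON form of them. The function names and the key layout are assumptions made for illustration; this is not whatifd's actual cache_key_components implementation.

```python
import hashlib
import json


def rubric_hash(rubric_text: str) -> str:
    """SHA-256 of the rubric text, so any wording change yields a new hash."""
    return hashlib.sha256(rubric_text.encode("utf-8")).hexdigest()


def methodology_cache_key(
    judge_provider: str,
    judge_model_id: str,
    rubric_id: str,
    rubric_text: str,
    scoring_parameters: dict,
) -> str:
    """Hypothetical key: hash of the methodology fields in canonical JSON."""
    components = {
        "judge_provider": judge_provider,
        "judge_model_id": judge_model_id,
        "rubric_id": rubric_id,
        "rubric_hash": rubric_hash(rubric_text),
        "scoring_parameters": scoring_parameters,
    }
    # sort_keys makes the key independent of dict insertion order.
    canonical = json.dumps(components, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


if __name__ == "__main__":
    print(
        methodology_cache_key(
            "anthropic",
            "claude-opus-4-7",
            "faithfulness-v1",
            "Score 0-1 by faithfulness to the original output...",
            {"temperature": 0.0, "max_tokens": 256},
        )
    )
```

Under this scheme, changing the rubric text or any scoring parameter produces a different key, so cached scores from a different methodology would not be reused.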

Cardinal alignment

#5 Sensitive at the boundary: JudgeResult.rationale is wrapped as Sensitive[str] at _project_score. Inspect AI's Score.explanation carries free text from the judge model, so it MUST be wrapped before any whatifd-core code sees it.
#1 failures-as-data: when the wrapped score_fn returns None or raises, the adapter surfaces a JudgeResult(score=None) with a structured rationale, and the pipeline converts that into a FailureRecord. A non-numeric Score.value (e.g., a categorical label) projects to score=None instead of crashing on float().
#10 statistical claims: the adapter is metric-agnostic; choosing the metric is the user's responsibility when defining the Inspect AI scorer. Methodology (judge model, rubric hash, scoring parameters) flows through cache_key_components.
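The first two points can be sketched in a few lines. Everything below is a stand-in written for illustration: `Sensitive`, `FakeScore`, `FakeJudgeResult`, and `project_score` are hypothetical and are not the real whatifd or Inspect AI types. Treating booleans as non-numeric is also an assumption of this sketch, not documented adapter behavior.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional


class Sensitive:
    """Hypothetical wrapper that keeps its payload out of repr() and str()."""

    def __init__(self, value: str) -> None:
        self._value = value

    def reveal(self) -> str:
        return self._value

    def __repr__(self) -> str:
        return "Sensitive(<redacted>)"


@dataclass
class FakeScore:
    """Stand-in for an Inspect AI Score: a value plus free-text explanation."""
    value: Any
    explanation: Optional[str] = None


@dataclass
class FakeJudgeResult:
    """Stand-in for JudgeResult: score is None when judging failed."""
    score: Optional[float]
    rationale: Sensitive


def project_score(score_fn: Callable[[Any], Optional[FakeScore]], case: Any) -> FakeJudgeResult:
    """Run score_fn and turn every failure mode into data instead of an exception."""
    try:
        raw = score_fn(case)
    except Exception as exc:
        return FakeJudgeResult(None, Sensitive(f"score_fn raised {type(exc).__name__}"))
    if raw is None:
        return FakeJudgeResult(None, Sensitive("score_fn returned None"))
    rationale = Sensitive(raw.explanation or "")  # wrap judge text at the boundary
    # Assumption: bools and labels such as "C" count as non-numeric here.
    if isinstance(raw.value, bool) or not isinstance(raw.value, (int, float)):
        return FakeJudgeResult(None, rationale)
    return FakeJudgeResult(float(raw.value), rationale)
```

A caller only ever sees a result object: a numeric score, or score=None with a wrapped rationale that a later stage can record as a failure.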

Why no recorded-smoke test in this package

Unlike Langfuse, which has a hosted ingestion API that can be replayed via pytest-recording cassettes, Inspect AI is a local evaluation framework: its scorers run in-process against a model provider (Anthropic, OpenAI, etc.). There is no "Inspect AI host" to record HTTP cassettes against. The real-network surface is the model provider behind Inspect, which Phase 9B's real-adapter smoke test covers via the integration suite. This package therefore ships mocked-only conformance tests; cardinal #5 still applies (Sensitive[str] at the boundary), and the conformance harness pins it.

Contributor setup

This package lives in the parent whatifd monorepo as a uv workspace member. From the repo root:

uv sync --all-extras --dev --group workspace

The --group workspace flag pulls the in-tree whatifd-inspect-ai editable install via PEP 735 dependency groups (uv-native). Without it, uv sync --all-extras --dev installs the rest of the dev environment but leaves this package out, and pytest packages/whatifd-inspect-ai/tests/ fails with ModuleNotFoundError: whatifd_inspect_ai.
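A quick way to check whether the workspace member made it into the environment, before running the test suite, is to ask the import system directly. This is a small diagnostic sketch; the helper name is hypothetical and the check only covers top-level module names.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """True when the import system can locate a top-level module."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    if is_installed("whatifd_inspect_ai"):
        print("whatifd_inspect_ai is importable")
    else:
        print(
            "whatifd_inspect_ai is missing; re-run: "
            "uv sync --all-extras --dev --group workspace"
        )
```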

Plain pip install ".[dev]" will NOT work for the workspace package: pip ignores PEP 735 groups (deliberate; the workspace dep can't be resolved from PyPI because it isn't published yet). Use uv for development setup; pip-only consumers install the published whatifd-inspect-ai from PyPI once it lands.

Stability

Pre-1.0; the adapter follows whatifd's v0.1 stability contract. The Inspect AI minor-version cap (<0.4) reserves the next minor for a coordinated migration if Inspect AI changes the Scorer / Score shape.

Release history

Version	Release date
0.3.0 (this version)	Jun 4, 2026
0.2.1	May 30, 2026
0.2.0	May 10, 2026
0.1.0	May 9, 2026

Download files

Source Distribution

whatifd_inspect_ai-0.3.0.tar.gz (11.6 kB)

Uploaded Jun 4, 2026 Source

Built Distribution

whatifd_inspect_ai-0.3.0-py3-none-any.whl (8.8 kB)

Uploaded Jun 4, 2026 Python 3

File details

Details for the file whatifd_inspect_ai-0.3.0.tar.gz.

File metadata

Download URL: whatifd_inspect_ai-0.3.0.tar.gz
Upload date: Jun 4, 2026
Size: 11.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for whatifd_inspect_ai-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`7ec8c806eadb083b7e819062488dcd6fd769b1c3f808196e3ad8ff42bf86fe91`
MD5	`f52c968c70ab0d634759fb289926846e`
BLAKE2b-256	`303fcc0d13ea6985a985e796b234a2c1d5c198dce06716fa1db9584989c29161`
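The published digest can be checked against a downloaded file with the standard library alone. The sketch below assumes the sdist was saved to the current directory under its published file name; the helper names are hypothetical.

```python
import hashlib
from pathlib import Path

# SHA256 digest published for whatifd_inspect_ai-0.3.0.tar.gz.
EXPECTED_SHA256 = "7ec8c806eadb083b7e819062488dcd6fd769b1c3f808196e3ad8ff42bf86fe91"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Stream the file in chunks and return its SHA-256 hex digest."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: Path, expected: str = EXPECTED_SHA256) -> bool:
    """Compare the file's digest with the expected hex string."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    sdist = Path("whatifd_inspect_ai-0.3.0.tar.gz")  # assumed download location
    if sdist.exists():
        print("hash OK" if matches_published_hash(sdist) else "hash MISMATCH")
    else:
        print(f"{sdist} not found; download it first")
```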

Provenance

The following attestation bundles were made for whatifd_inspect_ai-0.3.0.tar.gz:

Publisher: release.yml on victoralfred/whatifd

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: whatifd_inspect_ai-0.3.0.tar.gz
- Subject digest: 7ec8c806eadb083b7e819062488dcd6fd769b1c3f808196e3ad8ff42bf86fe91
- Sigstore transparency entry: 1725309041
- Sigstore integration time: Jun 4, 2026
Source repository:
- Permalink: victoralfred/whatifd@47869c1d0653ebe9d95106ca9e5d263ff58ee5e0
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/victoralfred
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@47869c1d0653ebe9d95106ca9e5d263ff58ee5e0
- Trigger Event: push

File details

Details for the file whatifd_inspect_ai-0.3.0-py3-none-any.whl.

File metadata

Download URL: whatifd_inspect_ai-0.3.0-py3-none-any.whl
Upload date: Jun 4, 2026
Size: 8.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for whatifd_inspect_ai-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7da6cf74bf6cb8c4e01927958fbe7e2ef3b4d0b7534f7f1ba515616e3e7b4ff4`
MD5	`9c561fe8e98d26049cd8aea857d5c457`
BLAKE2b-256	`7dcc773194a3166324a3938b6e95d65358a9e74a4b988431c2a47238d8a20641`

Provenance

The following attestation bundles were made for whatifd_inspect_ai-0.3.0-py3-none-any.whl:

Publisher: release.yml on victoralfred/whatifd

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: whatifd_inspect_ai-0.3.0-py3-none-any.whl
- Subject digest: 7da6cf74bf6cb8c4e01927958fbe7e2ef3b4d0b7534f7f1ba515616e3e7b4ff4
- Sigstore transparency entry: 1725309380
- Sigstore integration time: Jun 4, 2026
Source repository:
- Permalink: victoralfred/whatifd@47869c1d0653ebe9d95106ca9e5d263ff58ee5e0
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/victoralfred
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@47869c1d0653ebe9d95106ca9e5d263ff58ee5e0
- Trigger Event: push
