A library for evaluating ML model performance across subgroups with stratified metrics and bootstrap confidence intervals

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

bea-bm

These details have not been verified by PyPI

Project description

Model Auditor

A Python library for evaluating machine learning model performance across subgroups with support for stratified metrics, bootstrap confidence intervals, and hierarchical visualizations.

Installation

pip install model-auditor

Features

Stratified Evaluation: Evaluate model metrics across different subgroups (e.g., by age, gender, region)
Bootstrap Confidence Intervals: Calculate 95% confidence intervals for all supported metrics
Comprehensive Metrics: Built-in support for classification metrics including:
- Sensitivity, Specificity, Precision, Recall, F1 Score
- AUROC, AUPRC
- Matthews Correlation Coefficient (MCC)
- F-beta Score (configurable beta)
- TPR, TNR, FPR, FNR
- Count metrics (N, TP, TN, FP, FN, Positive, Negative)
Threshold Optimization: Automatic threshold selection using the Youden index
Hierarchical Visualization: Generate data structures for sunburst/treemap plots
Extensible Design: Protocol-based architecture for custom metrics

Quick Start

from model_auditor import Auditor
from model_auditor.metrics import Sensitivity, Specificity, AUROC, F1Score

# Initialize the auditor
auditor = Auditor()

# Add your data
auditor.add_data(df)

# Define stratification features
auditor.add_feature(name="age_group", label="Age Group")
auditor.add_feature(name="gender", label="Gender")

# Define the score column and threshold
auditor.add_score(name="risk_score", label="Risk Score", threshold=0.5)

# Define the outcome column
auditor.add_outcome(name="diagnosis", mapping={"positive": 1, "negative": 0})

# Set metrics to evaluate
auditor.set_metrics([
    Sensitivity(),
    Specificity(),
    AUROC(),
    F1Score()
])

# Run evaluation with bootstrap confidence intervals
results = auditor.evaluate(score_name="risk_score", n_bootstraps=1000)

# Convert results to a DataFrame
results_df = results.to_dataframe()
print(results_df)

Threshold Optimization

Find the optimal decision threshold using the Youden index:

auditor = Auditor()
auditor.add_data(df)
auditor.add_score(name="risk_score")
auditor.add_outcome(name="label")

# Find optimal threshold
optimal_threshold = auditor.optimize_score_threshold(score_name="risk_score")
# Output: Optimal threshold for 'risk_score' found at: 0.423

Available Metrics

Classification Metrics

Metric	Class	Description
Sensitivity	`Sensitivity()`	TP / (TP + FN)
Specificity	`Specificity()`	TN / (TN + FP)
Precision	`Precision()`	TP / (TP + FP)
Recall	`Recall()`	TP / (TP + FN)
F1 Score	`F1Score()`	Harmonic mean of precision and recall
F-beta	`FBetaScore(beta=2.0)`	Weighted harmonic mean
MCC	`MatthewsCorrelationCoefficient()`	Matthews Correlation Coefficient

Ranking Metrics

Metric	Class	Description
AUROC	`AUROC()`	Area Under ROC Curve
AUPRC	`AUPRC()`	Area Under Precision-Recall Curve

Rate Metrics

Metric	Class	Description
TPR	`TPR()`	True Positive Rate
TNR	`TNR()`	True Negative Rate
FPR	`FPR()`	False Positive Rate
FNR	`FNR()`	False Negative Rate

Count Metrics

Metric	Class	Description
N	`nData()`	Sample size
TP	`nTP()`	True positive count
TN	`nTN()`	True negative count
FP	`nFP()`	False positive count
FN	`nFN()`	False negative count
Positive	`nPositive()`	Positive class count
Negative	`nNegative()`	Negative class count

Custom Metrics

Create custom metrics by implementing the AuditorMetric protocol:

from model_auditor.metrics import AuditorMetric
import pandas as pd

class AccuracyMetric(AuditorMetric):
    name = "accuracy"
    label = "Accuracy"
    inputs = ["tp", "tn", "fp", "fn"]
    ci_eligible = True

    def data_call(self, data: pd.DataFrame) -> float:
        tp = data["tp"].sum()
        tn = data["tn"].sum()
        fp = data["fp"].sum()
        fn = data["fn"].sum()
        return (tp + tn) / (tp + tn + fp + fn)

# Use with the auditor
auditor.set_metrics([AccuracyMetric(), Sensitivity()])

Hierarchical Visualization

Generate data for hierarchical plots (sunburst, treemap):

from model_auditor.plotting import HierarchyPlotter

plotter = HierarchyPlotter()
plotter.set_data(df)
plotter.set_features(["region", "age_group", "gender"])
plotter.set_score(name="risk_score")
plotter.set_aggregator("median")  # or "mean", or a custom function

# Compile plot data
plot_data = plotter.compile(container="All Patients")

# Use with Plotly
import plotly.graph_objects as go

fig = go.Figure(go.Sunburst(
    labels=plot_data.labels,
    ids=plot_data.ids,
    parents=plot_data.parents,
    values=plot_data.values,
    marker=dict(colors=plot_data.colors)
))
fig.show()

Custom Hierarchies

Define complex hierarchies with conditional features:

from model_auditor.plotting.schemas import Hierarchy, HLevel, HItem

hierarchy = Hierarchy(levels=[
    HLevel([HItem(name="region")]),
    HLevel([
        HItem(name="urban_category", query="region == 'Urban'"),
        HItem(name="rural_category", query="region == 'Rural'")
    ]),
    HLevel([HItem(name="age_group")])
])

plotter.set_features(hierarchy)

Disabling Confidence Intervals

For faster evaluation without confidence intervals:

results = auditor.evaluate(score_name="risk_score", n_bootstraps=None)

Output Format

Results are returned as nested dataclass objects that can be converted to DataFrames:

# Get results as DataFrame
df = results.to_dataframe(n_decimals=3, metric_labels=True)

# Access specific feature results
gender_results = results.features["gender"].to_dataframe()

# Access specific level results
male_results = results.features["gender"].levels["Male"].to_dataframe()

License

MIT License

Author

Beatrice BM

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

bea-bm

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.15

Apr 14, 2026

0.1.14

Apr 14, 2026

0.1.13

Apr 13, 2026

0.1.12

Mar 18, 2026

0.1.11

Mar 17, 2026

0.1.10

Mar 17, 2026

0.1.9

Mar 17, 2026

0.1.8

Mar 17, 2026

0.1.7

Mar 16, 2026

0.1.6

Mar 16, 2026

0.1.5

Mar 16, 2026

This version

0.1.4

Jan 12, 2026

0.1.2

May 7, 2025

0.1.1

May 5, 2025

0.1.0

May 5, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

model_auditor-0.1.4.tar.gz (24.5 kB view details)

Uploaded Jan 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

model_auditor-0.1.4-py3-none-any.whl (21.2 kB view details)

Uploaded Jan 12, 2026 Python 3

File details

Details for the file model_auditor-0.1.4.tar.gz.

File metadata

Download URL: model_auditor-0.1.4.tar.gz
Upload date: Jan 12, 2026
Size: 24.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for model_auditor-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`3169d8f338679287650b2185d3bd88182fbced56db67b2b5394451404c24d27a`
MD5	`c929667835be5356c9c847acc21ff4f9`
BLAKE2b-256	`84af59ddf4393ac30d760d723b15a601b25961b20e023285f7ed3e597716ecdb`

See more details on using hashes here.

Provenance

The following attestation bundles were made for model_auditor-0.1.4.tar.gz:

Publisher: publish.yml on beatrice-b-m/model-auditor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: model_auditor-0.1.4.tar.gz
- Subject digest: 3169d8f338679287650b2185d3bd88182fbced56db67b2b5394451404c24d27a
- Sigstore transparency entry: 814257958
- Sigstore integration time: Jan 12, 2026
Source repository:
- Permalink: beatrice-b-m/model-auditor@222a79b40c6c75a95d616bb625f87ec21b7514b8
- Branch / Tag: refs/tags/v0.1.4
- Owner: https://github.com/beatrice-b-m
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@222a79b40c6c75a95d616bb625f87ec21b7514b8
- Trigger Event: release

File details

Details for the file model_auditor-0.1.4-py3-none-any.whl.

File metadata

Download URL: model_auditor-0.1.4-py3-none-any.whl
Upload date: Jan 12, 2026
Size: 21.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for model_auditor-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ab4a30f3401abab56f7ebb99365d00f200aaad53e70c9bad54a1dd9c8037cf97`
MD5	`2485d19052dc4514fcd4e5a82b7e3b4b`
BLAKE2b-256	`f04a80afc73032f16ea64836f0c2bf283435305e596f8abced2fa77f94b6e0e5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for model_auditor-0.1.4-py3-none-any.whl:

Publisher: publish.yml on beatrice-b-m/model-auditor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: model_auditor-0.1.4-py3-none-any.whl
- Subject digest: ab4a30f3401abab56f7ebb99365d00f200aaad53e70c9bad54a1dd9c8037cf97
- Sigstore transparency entry: 814257960
- Sigstore integration time: Jan 12, 2026
Source repository:
- Permalink: beatrice-b-m/model-auditor@222a79b40c6c75a95d616bb625f87ec21b7514b8
- Branch / Tag: refs/tags/v0.1.4
- Owner: https://github.com/beatrice-b-m
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@222a79b40c6c75a95d616bb625f87ec21b7514b8
- Trigger Event: release

model-auditor 0.1.4

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Model Auditor

Installation

Features

Quick Start

Threshold Optimization

Available Metrics

Classification Metrics

Ranking Metrics

Rate Metrics

Count Metrics

Custom Metrics

Hierarchical Visualization

Custom Hierarchies

Disabling Confidence Intervals

Output Format

License

Author

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance