rank-preserving-calibration

Rank-preserving calibration of multiclass probabilities via Dykstra's projections and ADMM.

These details have not been verified by PyPI

Project links

Project description

Rank Preserving Calibration of Multiclass Probabilities

Survey statisticians and machine learning practitioners often need to adjust the predicted class probabilities from a classifier so that they match known population totals (column marginals). Simple post-hoc methods that apply separate logit shifts or raking to each class can scramble the ranking of individuals within a class when there are three or more classes. This package implements a rank-preserving calibration procedure that projects probabilities onto the intersection of two convex sets:

Row-simplex: each row sums to one and all entries are non-negative.
Isotonic column marginals: within each class, values are non-decreasing when instances are sorted by their original scores for that class, and the sum of each column equals a user-supplied target.

The algorithm uses Dykstra's alternating projection method in Euclidean geometry. When the specified column totals are feasible, the procedure returns a matrix that preserves cross-person discrimination within each class, matches the desired totals, and remains a valid probability distribution for each instance. If no such matrix exists, the algorithm converges to the closest point (in L2 sense) satisfying both sets of constraints.

New: Nearly Isotonic Calibration

This package now supports nearly isotonic constraints that allow small violations of strict monotonicity when appropriate:

Epsilon-slack constraints: Allow z[i+1] ≥ z[i] - ε instead of strict z[i+1] ≥ z[i]
Lambda-penalty approach: Penalize isotonicity violations with a tunable parameter

These relaxed constraints can provide better balance between rank preservation and probability calibration when strict isotonic constraints are too restrictive.

An ADMM optimization implementation is also provided as an alternative solver that minimizes ||Q - P||² subject to the same constraints.

Installation

pip install rank_preserving_calibration

The only runtime dependency is numpy. Optional dependencies include scipy (for enhanced test case generation) and matplotlib (for examples).

Usage

Basic Usage

import numpy as np
from rank_preserving_calibration import calibrate_dykstra

P = np.array([
    [0.6, 0.3, 0.1],
    [0.2, 0.5, 0.3],
    [0.1, 0.2, 0.7],
])

# Target column sums, e.g. population class frequencies. Must sum to the
# number of rows (3 in this example) for perfect feasibility.
M = np.array([1.0, 1.0, 1.0])

result = calibrate_dykstra(P, M)

print("Adjusted probabilities:\n", result.Q)
print("Converged:", result.converged)
print("Iterations:", result.iterations)
print("Max row error:", result.max_row_error)
print("Max column error:", result.max_col_error)
print("Rank violations:", result.max_rank_violation)

Nearly Isotonic Usage

# Epsilon-slack: Allow small rank violations (recommended)
nearly_params = {"mode": "epsilon", "eps": 0.05}
result = calibrate_dykstra(P, M, nearly=nearly_params)

# Lambda-penalty: Soft isotonic constraint (experimental)
nearly_params = {"mode": "lambda", "lam": 1.0}
result = calibrate_admm(P, M, nearly=nearly_params)

The returned CalibrationResult contains the calibrated matrix Q with the same shape as P. Each row of Q sums to one, the column sums match M, and within each column the entries are sorted in non-decreasing order according to the order implied by the original P.

Evaluation and Metrics

After calibration, it's important to validate that the constraints are satisfied and understand the impact on prediction quality. This package provides comprehensive metrics for evaluation:

Constraint Validation

from rank_preserving_calibration import feasibility_metrics, isotonic_metrics

# Check constraint satisfaction
feasibility = feasibility_metrics(result.Q, M)
print(f"Max row error: {feasibility['row']['max_abs_error']}")
print(f"Max column error: {feasibility['col']['max_abs_error']}")

# Check rank preservation  
isotonic = isotonic_metrics(result.Q, P)
print(f"Max rank violation: {isotonic['max_rank_violation']}")
print(f"Violation mass: {isotonic['total_violation_mass']}")

Calibration Quality Assessment

from rank_preserving_calibration import distance_metrics, nll, brier

# Measure calibration changes
distances = distance_metrics(result.Q, P)
print(f"Frobenius distance: {distances['frobenius']}")
print(f"Max change: {distances['max_abs']}")

# Evaluate with labeled data (if available)
if y_true is not None:
    original_nll = nll(y_true, P)
    calibrated_nll = nll(y_true, result.Q) 
    print(f"NLL improvement: {original_nll - calibrated_nll}")

Available Metrics

Function	Purpose
`feasibility_metrics(Q, M)`	Validate row (simplex) and column (marginal) constraints
`isotonic_metrics(Q, P)`	Check rank preservation and measure violations
`distance_metrics(Q, P)`	Quantify changes between original and calibrated probabilities
`tie_group_variance(Q, P)`	Assess handling of tied predictions (useful for `ties='group'`)
`nll(y, probs)`	Negative log-likelihood (requires true labels)
`brier(y, probs)`	Brier score (requires true labels)
`top_label_ece(y, probs)`	Expected calibration error for top predictions
`classwise_ece(y, probs)`	Per-class calibration error analysis
`sharpness_metrics(probs)`	Prediction confidence and entropy analysis
`auc_deltas(y, P, Q)`	One-vs-rest AUC changes after calibration

Complete Evaluation Workflow

import numpy as np
from rank_preserving_calibration import (
    calibrate_dykstra, feasibility_metrics, isotonic_metrics, 
    distance_metrics, nll, top_label_ece
)

# Calibrate
result = calibrate_dykstra(P, M)

# 1. Validate constraints
feasibility = feasibility_metrics(result.Q, M)
isotonic = isotonic_metrics(result.Q, P) 
print(f"Converged: {result.converged}")
print(f"Row constraint satisfied: {feasibility['row']['max_abs_error'] < 1e-6}")
print(f"Rank preserved: {isotonic['max_rank_violation'] < 1e-6}")

# 2. Assess calibration impact
distances = distance_metrics(result.Q, P)
print(f"Average change per probability: {distances['mean_abs']:.4f}")

# 3. Evaluate predictive quality (if labels available)
if y_true is not None:
    ece_before = top_label_ece(y_true, P)
    ece_after = top_label_ece(y_true, result.Q)
    print(f"Calibration error before: {ece_before['ece']:.3f}")
    print(f"Calibration error after: {ece_after['ece']:.3f}")

Functions

`calibrate_dykstra(P, M, **kwargs)`

Calibrate using Dykstra's alternating projections (recommended). Supports both strict and nearly isotonic constraints.

`calibrate_admm(P, M, **kwargs)`

Calibrate using ADMM optimization with penalty parameter rho. Supports lambda-penalty nearly isotonic constraints.

`create_test_case(case_type, N, J, **kwargs)` (in `examples.data_helpers`)

Generate synthetic test data for various scenarios used in examples and tests.

Arguments

Parameter	Type	Description
`P`	`ndarray` of shape `[N, J]`	Base multiclass probabilities or non-negative scores. Rows will be projected to the simplex.
`M`	`ndarray` of shape `[J]`	Target column totals (e.g. population class frequencies). The sum of `M` should equal the number of rows `N` for exact feasibility.
`max_iters`	`int`	Maximum number of projection iterations (default `3000` for Dykstra, `1000` for ADMM).
`tol`	`float`	Relative convergence tolerance (default `1e-7` for Dykstra, `1e-6` for ADMM).
`verbose`	`bool`	If `True`, prints convergence diagnostics.
`nearly`	`dict`	Nearly isotonic parameters: `{"mode": "epsilon", "eps": 0.05}` or `{"mode": "lambda", "lam": 1.0}`.
`rho`	`float`	ADMM penalty parameter (default `1.0`, ADMM only).

Returns

CalibrationResult

Both functions return a CalibrationResult object with the following attributes:

Q: NumPy array of shape [N, J] containing the calibrated probabilities. Each row sums to one, each column approximately sums to the corresponding entry of M, and within each column the values are non-decreasing according to the ordering induced by P.
converged: boolean indicating whether the solver met the tolerance criteria.
iterations: number of iterations performed.
max_row_error: maximum absolute deviation of row sums from 1.
max_col_error: maximum absolute deviation of column sums from M.
max_rank_violation: maximum violation of monotonicity (should be 0 up to numerical tolerance).
final_change: final relative change between iterations.

ADMMResult

The ADMM function returns an ADMMResult object with additional convergence history:

All CalibrationResult attributes plus:
objective_values: objective function values over iterations.
primal_residuals: primal residual norms over iterations.
dual_residuals: dual residual norms over iterations.

Algorithm Notes

Dykstra's Method: Uses alternating projections with memory terms to ensure convergence to the intersection of constraint sets. Rows are projected onto the simplex via the algorithm of Duchi et al., and columns are projected via the pool-adjacent-violators algorithm followed by an additive shift to match column totals. This is the recommended method for most applications.
Nearly Isotonic Extensions:
- Epsilon-slack (Dykstra): Projects onto the convex set {z : z[i+1] ≥ z[i] - ε} using coordinate transformation. Maintains theoretical convergence guarantees.
- Lambda-penalty (ADMM): Uses proximal operator to minimize ||Q - P||² + λ∑max(0, z[i] - z[i+1]). More experimental but provides soft constraints.
ADMM: Solves the constrained optimization problem using the Alternating Direction Method of Multipliers. May converge faster for some problems but requires tuning the penalty parameter rho. The algorithm minimizes the sum of squared differences 0.5 * ||Q - P||²_F subject to the calibration constraints.

Examples

See examples/ directory for comprehensive examples including:

example.ipynb: Basic usage and visualization
focused_nearly_isotonic_example.py: When to use nearly isotonic calibration
Real-world classifier calibration scenarios
Survey reweighting applications
Algorithm comparison and performance analysis

When to Use Nearly Isotonic Calibration

Use Nearly Isotonic When:

Model predictions have good discrimination but need marginal calibration
Some predictions are already well-calibrated
Small rank violations are acceptable in your domain
You want to preserve model confidence where possible

Use Strict Isotonic When:

Rank order is critical (regulatory, safety applications)
Model predictions have clear monotonic relationship
Conservative approach is preferred

Testing

python -m pytest tests/ -v

License

This software is released under the terms of the MIT license.

Author

Gaurav Sood <gsood07@gmail.com>

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.8.0

Mar 27, 2026

0.7.1

Dec 28, 2025

This version

0.6.0

Oct 30, 2025

0.4.1

Aug 29, 2025

0.4.0

Aug 25, 2025

0.3.0

Aug 17, 2025

0.2.0

Aug 16, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rank_preserving_calibration-0.6.0.tar.gz (38.1 kB view details)

Uploaded Oct 30, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

rank_preserving_calibration-0.6.0-py3-none-any.whl (21.5 kB view details)

Uploaded Oct 30, 2025 Python 3

File details

Details for the file rank_preserving_calibration-0.6.0.tar.gz.

File metadata

Download URL: rank_preserving_calibration-0.6.0.tar.gz
Upload date: Oct 30, 2025
Size: 38.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rank_preserving_calibration-0.6.0.tar.gz
Algorithm	Hash digest
SHA256	`e70ef655d97f6a831332a39503ade988111f1361dd48e11cadfbf53efb5d7640`
MD5	`ddeb593bd8fd424f036a42a70f82a223`
BLAKE2b-256	`294bc373afc80880dea2adaa6110e6eaf8725506894e43fb51f7c15c13674ea6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for rank_preserving_calibration-0.6.0.tar.gz:

Publisher: python-publish.yml on finite-sample/rank_preserving_calibration

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rank_preserving_calibration-0.6.0.tar.gz
- Subject digest: e70ef655d97f6a831332a39503ade988111f1361dd48e11cadfbf53efb5d7640
- Sigstore transparency entry: 653547915
- Sigstore integration time: Oct 30, 2025
Source repository:
- Permalink: finite-sample/rank_preserving_calibration@72ed82788c52ce49e447a0c0b9bded524a2d629b
- Branch / Tag: refs/heads/main
- Owner: https://github.com/finite-sample
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@72ed82788c52ce49e447a0c0b9bded524a2d629b
- Trigger Event: workflow_dispatch

File details

Details for the file rank_preserving_calibration-0.6.0-py3-none-any.whl.

File metadata

Download URL: rank_preserving_calibration-0.6.0-py3-none-any.whl
Upload date: Oct 30, 2025
Size: 21.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rank_preserving_calibration-0.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6472e82911d4bc0a830d8fd0540c4d1608adf4f36f97335b15f6b294611b81b9`
MD5	`3ef67011674e4c5ed62c57d6358a9fd0`
BLAKE2b-256	`502f803be20a391bfdb27d2e93728583a8cd3aa9e7238a6ab935e92c6435bcca`

See more details on using hashes here.

Provenance

The following attestation bundles were made for rank_preserving_calibration-0.6.0-py3-none-any.whl:

Publisher: python-publish.yml on finite-sample/rank_preserving_calibration

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rank_preserving_calibration-0.6.0-py3-none-any.whl
- Subject digest: 6472e82911d4bc0a830d8fd0540c4d1608adf4f36f97335b15f6b294611b81b9
- Sigstore transparency entry: 653547916
- Sigstore integration time: Oct 30, 2025
Source repository:
- Permalink: finite-sample/rank_preserving_calibration@72ed82788c52ce49e447a0c0b9bded524a2d629b
- Branch / Tag: refs/heads/main
- Owner: https://github.com/finite-sample
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@72ed82788c52ce49e447a0c0b9bded524a2d629b
- Trigger Event: workflow_dispatch

rank-preserving-calibration 0.6.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Rank Preserving Calibration of Multiclass Probabilities

New: Nearly Isotonic Calibration

Installation

Usage

Basic Usage

Nearly Isotonic Usage

Evaluation and Metrics

Constraint Validation

Calibration Quality Assessment

Available Metrics

Complete Evaluation Workflow

Functions

calibrate_dykstra(P, M, **kwargs)

calibrate_admm(P, M, **kwargs)

create_test_case(case_type, N, J, **kwargs) (in examples.data_helpers)

Arguments

Returns

CalibrationResult

ADMMResult

Algorithm Notes

Examples

When to Use Nearly Isotonic Calibration

Testing

License

Author

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`calibrate_dykstra(P, M, **kwargs)`

`calibrate_admm(P, M, **kwargs)`

`create_test_case(case_type, N, J, **kwargs)` (in `examples.data_helpers`)