Skip to main content

Achieve error-rate parity between protected groups for any predictor

Project description

error-parity

Tests status PyPI status PyPI version OSI license Python compatibility

Fast postprocessing of any score-based predictor to meet fairness criteria.

The error-parity package can achieve strict or relaxed fairness constraint fulfillment, which can be useful to compare ML models at equal fairness levels.

Installing

Install package from PyPI:

pip install error-parity

Or, for development, you can clone the repo and install from local sources:

git clone https://github.com/socialfoundations/error-parity.git
pip install ./error-parity

Getting started

See detailed example notebooks under the examples folder.

from error_parity import RelaxedThresholdOptimizer

# Given any trained model that outputs real-valued scores
fair_clf = RelaxedThresholdOptimizer(
    predictor=lambda X: model.predict_proba(X)[:, -1],   # for sklearn API
    # predictor=model,  # use this for a callable model
    constraint="equalized_odds",
    tolerance=0.05,     # fairness constraint tolerance
)

# Fit the fairness adjustment on some data
# This will find the optimal _fair classifier_
fair_clf.fit(X=X, y=y, group=group)

# Now you can use `fair_clf` as any other classifier
# You have to provide group information to compute fair predictions
y_pred_test = fair_clf(X=X_test, group=group_test)

How it works

Given a callable score-based predictor (i.e., y_pred = predictor(X)), and some (X, Y, S) data to fit, RelaxedThresholdOptimizer will:

  1. Compute group-specific ROC curves and their convex hulls;
  2. Compute the r-relaxed optimal solution for the chosen fairness criterion (using cvxpy);
  3. Find the set of group-specific binary classifiers that match the optimal solution found.
    • each group-specific classifier is made up of (possibly randomized) group-specific thresholds over the given predictor;
    • if a group's ROC point is in the interior of its ROC curve, partial randomization of its predictions may be necessary.

Features and implementation road-map

We welcome community contributions for cvxpy implementations of other fairness constraints.

Currently implemented fairness constraints:

  • equality of odds (Hardt et al., 2016);
    • i.e., equal group-specific TPR and FPR;
    • use constraint="equalized_odds";
  • equal opportunity;
    • i.e., equal group-specific TPR;
    • use constraint="true_positive_rate_parity";
  • predictive equality;
    • i.e., equal group-specific FPR;
    • use constraint="false_positive_rate_parity";

Road-map:

  • demographic parity;
    • i.e., equal group-specific predicted prevalence;

Citing

This repository contains code and supplementary materials for the following preprint:

André F. Cruz and Moritz Hardt. "Unprocessing Seven Years of Algorithmic Fairness." arXiv preprint, 2023.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

error-parity-0.3.3.tar.gz (33.3 kB view details)

Uploaded Source

Built Distribution

error_parity-0.3.3-py3-none-any.whl (37.1 kB view details)

Uploaded Python 3

File details

Details for the file error-parity-0.3.3.tar.gz.

File metadata

  • Download URL: error-parity-0.3.3.tar.gz
  • Upload date:
  • Size: 33.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for error-parity-0.3.3.tar.gz
Algorithm Hash digest
SHA256 996807d7e2e0d803fe3fb8740f1e4c8358731d94031fc7a9141f3207bbdead9f
MD5 ea545d1c8687d57a4c40a7f72ce538b0
BLAKE2b-256 87bc3ea77064cd5c4ed8f1a2f0667f7569aeda81d14fb0212f8b4679dedc335b

See more details on using hashes here.

File details

Details for the file error_parity-0.3.3-py3-none-any.whl.

File metadata

File hashes

Hashes for error_parity-0.3.3-py3-none-any.whl
Algorithm Hash digest
SHA256 d93ccd9e41365096920994f44d19dbb5cf736cd791823ca9e546beb43ba3b6c4
MD5 f854b9f68bf37fd4126e8ae233499b36
BLAKE2b-256 93ef1516b421dc9c2979775f75102b3bbe58ff2aed581b794b2ac3157cb9681b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page