Achieve error-rate parity between protected groups for any predictor

error-parity

Fast postprocessing of any score-based predictor to meet fairness criteria.

The error-parity package supports both strict and relaxed fulfillment of fairness constraints, which makes it possible to compare ML models at equal levels of fairness.

Installing

Install package from PyPI:

pip install error-parity

Or, for development, you can clone the repo and install from local sources:

git clone https://github.com/socialfoundations/error-parity.git
pip install ./error-parity

Getting started

See detailed example notebooks under the examples folder.

from error_parity import RelaxedThresholdOptimizer

# Given any trained model that outputs real-valued scores
fair_clf = RelaxedThresholdOptimizer(
    predictor=lambda X: model.predict_proba(X)[:, -1],   # for sklearn API
    # predictor=model,  # use this for a callable model
    constraint="equalized_odds",
    tolerance=0.05,     # fairness constraint tolerance
)

# Fit the fairness adjustment on some data
# This will find the optimal _fair classifier_
fair_clf.fit(X=X, y=y, group=group)

# Now you can use `fair_clf` as any other classifier
# You have to provide group information to compute fair predictions
y_pred_test = fair_clf(X=X_test, group=group_test)

How it works

Given a callable score-based predictor (i.e., y_pred = predictor(X)) and some (X, Y, S) data to fit on (features X, labels Y, and group membership S), RelaxedThresholdOptimizer will:

  1. Compute group-specific ROC curves and their convex hulls;
  2. Compute the r-relaxed optimal solution for the chosen fairness criterion (using cvxpy), where r is the constraint tolerance;
  3. Find the set of group-specific binary classifiers that match the optimal solution found.
    • each group-specific classifier is made up of (possibly randomized) group-specific thresholds over the given predictor;
    • if a group's target ROC point lies in the interior of its ROC convex hull (i.e., strictly below the curve), partial randomization of its predictions may be necessary.
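
To make steps 1 and 3 more concrete, here is a minimal pure-Python sketch of group-specific ROC points, their upper convex hull, and the effect of randomizing between two thresholds. It is an illustration only, not the package's implementation: the helper names (roc_points, upper_hull, mix) and the toy data are hypothetical, and the cvxpy optimization of step 2 is not shown.

```python
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, float]  # (false positive rate, true positive rate)


def roc_points(scores: Sequence[float], labels: Sequence[int]) -> List[Point]:
    """ROC points from thresholding at every distinct score (score >= t is positive)."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for thr in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= thr and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= thr and y == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points


def _cross(o: Point, a: Point, b: Point) -> float:
    """Cross product of vectors o->a and o->b (positive for a counter-clockwise turn)."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def upper_hull(points: Sequence[Point]) -> List[Point]:
    """Upper convex hull of ROC points, scanned from left to right."""
    hull: List[Point] = []
    for p in sorted(set(points)):
        # Drop the last vertex while it lies on or below the segment hull[-2] -> p.
        while len(hull) >= 2 and _cross(hull[-2], hull[-1], p) >= 0:
            hull.pop()
        hull.append(p)
    return hull


def mix(left: Point, right: Point, p_right: float) -> Point:
    """Expected ROC point when using right's threshold with probability p_right,
    and left's threshold otherwise."""
    return (
        (1 - p_right) * left[0] + p_right * right[0],
        (1 - p_right) * left[1] + p_right * right[1],
    )


# Toy data (hypothetical): group name -> (scores, binary labels)
data: Dict[str, Tuple[List[float], List[int]]] = {
    "a": ([0.9, 0.8, 0.7, 0.6, 0.4, 0.3], [1, 1, 0, 1, 0, 0]),
    "b": ([0.9, 0.7, 0.5, 0.2], [1, 0, 1, 0]),
}

hulls = {g: upper_hull(roc_points(s, y)) for g, (s, y) in data.items()}

# A point halfway along a hull segment of group "b", reachable only by randomizing
# between the two thresholds at the segment's endpoints.
midpoint_b = mix(hulls["b"][1], hulls["b"][2], 0.5)
```

Any point on a hull segment is a convex combination of the two adjacent vertices, which is why a randomized choice between two thresholds is enough to reach it in expectation.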

Features and implementation road-map

We welcome community contributions for cvxpy implementations of other fairness constraints.

Currently implemented fairness constraints:

  • equality of odds (Hardt et al., 2016);
    • i.e., equal group-specific TPR and FPR;
    • use constraint="equalized_odds";
  • equal opportunity;
    • i.e., equal group-specific TPR;
    • use constraint="true_positive_rate_parity";
  • predictive equality;
    • i.e., equal group-specific FPR;
    • use constraint="false_positive_rate_parity";
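
As a rough illustration of what each constraint compares, the sketch below computes group-specific TPR and FPR from binary predictions and reports the between-group gap for each of the three constraint names listed above. The helper names (group_rates, constraint_violation) are hypothetical, and measuring the violation as the largest between-group gap is an assumption of this sketch; the package may quantify it differently.

```python
from collections import defaultdict
from typing import Dict, Hashable, Sequence


def group_rates(
    y_true: Sequence[int], y_pred: Sequence[int], group: Sequence[Hashable]
) -> Dict[Hashable, Dict[str, float]]:
    """Group-specific true and false positive rates from binary labels and predictions."""
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "pos": 0, "neg": 0})
    for y, y_hat, g in zip(y_true, y_pred, group):
        c = counts[g]
        if y == 1:
            c["pos"] += 1
            c["tp"] += y_hat
        else:
            c["neg"] += 1
            c["fp"] += y_hat
    return {
        g: {"tpr": c["tp"] / c["pos"], "fpr": c["fp"] / c["neg"]}
        for g, c in counts.items()
    }


def constraint_violation(rates: Dict[Hashable, Dict[str, float]], constraint: str) -> float:
    """Largest between-group gap in the rate(s) the constraint equalizes.

    Assumption of this sketch: the gap is measured as max minus min across groups.
    """
    tprs = [r["tpr"] for r in rates.values()]
    fprs = [r["fpr"] for r in rates.values()]
    tpr_gap = max(tprs) - min(tprs)
    fpr_gap = max(fprs) - min(fprs)
    if constraint == "equalized_odds":
        return max(tpr_gap, fpr_gap)
    if constraint == "true_positive_rate_parity":
        return tpr_gap
    if constraint == "false_positive_rate_parity":
        return fpr_gap
    raise ValueError(f"Unknown constraint: {constraint!r}")


# Toy data (hypothetical): two groups with four samples each
y_true = [1, 1, 0, 0, 1, 1, 0, 0]
y_pred = [1, 0, 0, 0, 1, 1, 0, 0]
group = ["a", "a", "a", "a", "b", "b", "b", "b"]

rates = group_rates(y_true, y_pred, group)
violations = {
    name: constraint_violation(rates, name)
    for name in (
        "equalized_odds",
        "true_positive_rate_parity",
        "false_positive_rate_parity",
    )
}
# With tolerance=0.05, only false positive rate parity would hold on this toy data.
satisfied = {name: gap <= 0.05 for name, gap in violations.items()}
```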

Road-map:

  • demographic parity;
    • i.e., equal group-specific predicted prevalence;
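
For reference, the quantity that demographic parity equalizes can be computed as below. This is only a sketch of the metric with a hypothetical helper name (predicted_prevalence) and toy data; it does not reflect any existing or planned API of the package.

```python
from typing import Dict, Hashable, Sequence


def predicted_prevalence(
    y_pred: Sequence[int], group: Sequence[Hashable]
) -> Dict[Hashable, float]:
    """Fraction of positive predictions within each group."""
    totals: Dict[Hashable, int] = {}
    positives: Dict[Hashable, int] = {}
    for y_hat, g in zip(y_pred, group):
        totals[g] = totals.get(g, 0) + 1
        positives[g] = positives.get(g, 0) + y_hat
    return {g: positives[g] / totals[g] for g in totals}


# Toy data (hypothetical)
y_pred = [1, 0, 0, 0, 1, 1, 0, 0]
group = ["a", "a", "a", "a", "b", "b", "b", "b"]

prevalence = predicted_prevalence(y_pred, group)
# Gap between the highest and lowest group-specific predicted prevalence
parity_gap = max(prevalence.values()) - min(prevalence.values())
```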

Citing

This repository contains code and supplementary materials for the following preprint:

André F. Cruz and Moritz Hardt. "Unprocessing Seven Years of Algorithmic Fairness." arXiv preprint, 2023.
