A multi-objective multi-fairness boosting classifier
Project description
MMM-Fair is a multi-objective, fairness-aware boosting classifier originally inspired by the paper: "Multi-fairness Under Class-Imbalance"
https://link.springer.com/chapter/10.1007/978-3-031-18840-4_21
The original algorithm targeted Equalized Odds (a.k.a. Disparate Mistreatment). This MMM-Fair implementation generalizes to multiple fairness objectives:
• Demographic Parity (DP)
• Equal Opportunity (EP)
• Equalized Odds (EO)
We further improve the approach by:
- Flexible Base Learners: Any scikit-learn estimator (e.g. DecisionTreeClassifier, LogisticRegression, ExtraTreeClassifier, etc.) can be used as the base learner.
- Fairness-Weighted Alpha: The boosting weight (alpha) accounts for fairness metrics alongside classification error.
- Dynamic Handling of Over-Boosted Samples: Reduces excessive emphasis on specific samples once fairness goals are partially met.
- Gradient Boosted Tree version
Two Approaches: AdaBoost-Style vs. Gradient-Boosted Trees
We provide two main classifiers:
- MMM_Fair (Original Adaptive Boosting version)
- MMM_Fair_GradientBoostedClassifier (Histogram-based Gradient Boosting approach)
Both handle multi-objective, multi-attribute, and multi-type fairness constraints (DP, EP, EO) but differ in how they perform the boosting internally. You can choose via the command line argument --classifier MMM_Fair or --classifier MMM_Fair_GBT.
Installation
pip install mmm-fair
Requires Python 3.11+.
Dependencies: numpy, scikit-learn, tqdm, pymoo, pandas, ucimlrepo, skl2onnx, etc.
Usage
The mmm-fair package provides two different usage possibilities. One is a chat based on a web-based UI (specially tailored new user, with even non-technical abckground), and the other is command line based (for ML scientist, engineers, etc.)
Usage Overview (mmm-chat)
Right now its still terminal dependent (soon will release a destop app). So after installing one needs to bash
mmm-fair-chat
and then in any browser copy paste:
http://127.0.0.1:5000
Then start chating with the interactive web app to get your MMM-Fair AI model.
Usage Overview (AdaBoost-Style)
You can import and use MMM-Fair (original version):
from mmm_fair import MMM_Fair
from sklearn.tree import DecisionTreeClassifier
Suppose you have X (features), y (labels)
mmm = MMM_Fair(
estimator=DecisionTreeClassifier(max_depth=5),
constraints="EO", # or "DP", "EP"
n_estimators=1000,
saIndex=..., # shape (n_samples, n_protected)
saValue=..., # dictionary {'prot_att_column_name': prot value}
random_state=42,
# other parameters, e.g. gamma, saIndex, saValue...
)
mmm.fit(X, y)
preds = mmm.predict(X_test)
Fairness Constraints
• constraints="DP" → Demographic Parity
• constraints="EP" → Equal Opportunity
• constraints="EO" → Equalized Odds
In all cases, pass the relevant saIndex (sensitive attribute array) and saValue (dictionary of protected group mappings) to MMM_Fair if you want it to track fairness for different protected attributes.
Usage Overview (Gradient-Boosted Trees)
We also provide MMM_Fair_GradientBoostedClassifier. This uses a histogram-based gradient boosting approach (similar to HistGradientBoostingClassifier) but includes a custom fairness loss to train and then multi-objective post-processing step to select the best pareto-optimal ensemble round. Example:
from mmm_fair import MMM_Fair_GradientBoostedClassifier
clf = MMM_Fair_GradientBoostedClassifier(
constraint="EO", # or "DP", "EP"
alpha=0.1, # fairness weight
saIndex=..., # shape (n_samples, n_protected)
saValue=..., # dictionary or None
max_iter=100,
random_state=42,
## any other arguments that the HistGradientBoostingClassifier from sklearn can handle
)
clf.fit(X, y)
preds = clf.predict(X_test)
Train & Deploy Script
This package provides a train_and_deploy.py script. It:
- Loads data (from a known UCI dataset or a local CSV).
- Specifies fairness constraints, protected attributes, and base learner.
- Selects either the original MMM_Fair or the new MMM_Fair_GradientBoostedClassifier via --classifier MMM_Fair or --classifier MMM_Fair_GBT.
- Trains with your chosen hyperparameters.
- Optionally deploys the model in ONNX or pickle format.
Key Arguments
• --classifier: MMM_Fair (original boosting) or MMM_Fair_GBT (gradient-based).
• --constraint: e.g., DP, EP, EO.
• --n_learners: Number of estimators (for either version).
• --pos_Class: Specify the positive class label if needed.
• --early_stop: True or False, relevant for the GBT approach to enable scikit-learn’s early stopping.
• --base_learner: E.g. tree, lr, logistic, etc. (for the original MMM_Fair).
• --deploy: 'onnx' or 'pickle'.
• --moo_vis True: Optionally visualize multi-objective (3D) plots (accuracy, class-imbalance, multi-fairness) after training, opening a local HTML page with interactive charts.
Example command:
1. Original AdaBoost MMM_Fair:
python -m mmm_fair.train_and_deploy \
--dataset Adult \
--prots race sex \
--nprotgs White Male \
--constraint EO \
--base_learner Logistic \
--deploy onnx \
--moo_vis True
python -m mmm_fair.train_and_deploy \
--dataset mydata.csv \
--target label_col \
--prots prot_1 prot_2 prot_3 \
--nprotgs npg1 npg2 npg3 \
--constraint EO \
--base_learner tree \
--deploy onnx
2. Gradient-Boosted MMM_Fair_GBT:
python -m mmm_fair.train_and_deploy \
--classifier MMM_Fair_GBT \
--dataset mydata.csv \
--target label_col \
--prots prot_1 prot_2 \
--nprotgs npg1 npg2 \
--constraint DP \
--alpha 0.5 \
--early_stop True \
--n_learners 100 \
--deploy pickle \
--moo_vis True
Note:
- Setting --moo_vis True triggers an interactive local HTML page for exploring the multi-objective trade-offs in 3D plots (accuracy vs. class-imbalance vs. fairness, etc.).
- Currently the fairness intervention only implemented for categorical groups. So if protected attribute is numerical e.g. "age" then for non-protected value i.e. --nprotgs provide a range like 30_60 as argument.
Additional options
If you want to select the best theta from only the Pareto optimal ensembles set (default is False and selects applies the post-processing to all set of solutions):
--pareto True
If you want to provide test data:
--test 'your_test_file.csv'
Or just test split:
--test 0.3
If you want change style (default is table, choose from {table, console}) of report displayed (Check FairBench Library for more details):
--report_type Console
When deploying with 'onnx', we change the models to ONNX file(s), and store additional parameters in a model_params.npy. This gets zipped into a .zip archive for distribution/analysis.
MAMMOth Toolkit Integration
For the bias exploration using MAMMOth pipeline it is really important to select 'onnx' as the '--deploy' argument. The ONNX model accelerator and model_params.npy are used to integrate with the MAMMOth-toolkit or the demonstrator app from the mammoth-commons project.
By providing the .zip archive, you can:
• Upload it to MAMMOth,
• Examine bias and performance metrics across subgroups,
• Compare fairness trade-offs with a user-friendly interface.
Example Workflow
- Choose Fairness Constraint: e.g., DP, EO, or EP.
- Define sensitive attributes in saIndex and the protected-group condition in saValue.
- Pick base learner (e.g., DecisionTreeClassifier(max_depth=5)) or gradient-based approach.
- Train with a large number of estimators (n_estimators=300 or max_iter=300).
- Optionally do partial ensemble selection with update_theta(criteria="all") or update_theta(criteria="fairness") .
- Export to ONNX or pickle for downstream usage.
- Use --moo_vis True to open local multi-objective 3D plots for deeper analysis.
- Upload the .zip file (if exported to onnx) to MAMMOth for bias exploration.
References
“Multi-Fairness Under Class-Imbalance,” Roy, Arjun, Vasileios Iosifidis, and Eirini Ntoutsi. International Conference on Discovery Science. Cham: Springer Nature Switzerland, 2022.
Maintainer: Arjun Roy (arjunroyihrpa@gmail.com)
Contributors: Swati Swati (swati17293@gmail.com)
License & Contributing
This project is released under [Apache License Version 2.0]. Contributions are welcome—please open an issue or pull request on GitHub.
Contact
For questions or collaborations, please contact arjun.roy@unibw.de Check out the source code at: GITHUB.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mmm_fair-1.0.1.tar.gz.
File metadata
- Download URL: mmm_fair-1.0.1.tar.gz
- Upload date:
- Size: 4.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f1f296dc614b901fee82422bd07ae6a6e1d103e7c4f0a98891234b4a00db5b77
|
|
| MD5 |
f3b01f1644b65b8adfc03a7fbac9f86b
|
|
| BLAKE2b-256 |
5409e5047a44af276f552727cfbe473851ce6c788b9dcc2404d62042bb43b72c
|
Provenance
The following attestation bundles were made for mmm_fair-1.0.1.tar.gz:
Publisher:
publish.yml on arjunroyihrpa/MMM_fair
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
mmm_fair-1.0.1.tar.gz -
Subject digest:
f1f296dc614b901fee82422bd07ae6a6e1d103e7c4f0a98891234b4a00db5b77 - Sigstore transparency entry: 185603264
- Sigstore integration time:
-
Permalink:
arjunroyihrpa/MMM_fair@5e7f81836281650eb10fdf1357154b7199848674 -
Branch / Tag:
refs/tags/v1.0.1 - Owner: https://github.com/arjunroyihrpa
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@5e7f81836281650eb10fdf1357154b7199848674 -
Trigger Event:
push
-
Statement type:
File details
Details for the file mmm_fair-1.0.1-py3-none-any.whl.
File metadata
- Download URL: mmm_fair-1.0.1-py3-none-any.whl
- Upload date:
- Size: 4.6 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5ac387852f7559048dc2b39b37d370cb24119d682e4f638101d8477e6f3dde32
|
|
| MD5 |
1476771cc2dbe6ed448890e7aab5205a
|
|
| BLAKE2b-256 |
538df3cc5e6988c6b74a802bbfeab8de9f097978296157802fd73d944354cb43
|
Provenance
The following attestation bundles were made for mmm_fair-1.0.1-py3-none-any.whl:
Publisher:
publish.yml on arjunroyihrpa/MMM_fair
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
mmm_fair-1.0.1-py3-none-any.whl -
Subject digest:
5ac387852f7559048dc2b39b37d370cb24119d682e4f638101d8477e6f3dde32 - Sigstore transparency entry: 185603275
- Sigstore integration time:
-
Permalink:
arjunroyihrpa/MMM_fair@5e7f81836281650eb10fdf1357154b7199848674 -
Branch / Tag:
refs/tags/v1.0.1 - Owner: https://github.com/arjunroyihrpa
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@5e7f81836281650eb10fdf1357154b7199848674 -
Trigger Event:
push
-
Statement type: