A simple Python package Optimal Counterfactual Explanations in Tree Ensembles
Project description
Optimal Counterfactual Explanations in Tree Ensembles
ocean is a full package dedicated to counterfactual explanations for tree ensembles.
It builds on the paper Optimal Counterfactual Explanations in Tree Ensemble by Axel Parmentier and Thibaut Vidal in the Proceedings of the thirty-eighth International Conference on Machine Learning, 2021, in press. The article is available here.
Beyond the original MIP approach, ocean also includes constraint programming (CP) and weighted MaxSAT backends for exact counterfactual search on the same parsed tree ensembles.
Installation
You can install the package with the following command:
pip install oceanpy
Note : The MIP method requires the gurobi solver access. You can request for a free academic license here. Once you have installed gurobi, you can install the package with the command above. However, you can also use the CP method without gurobi.
Usage
The package provides multiple classes and functions to wrap the tree ensemble models from the scikit-learn library. A minimal example is provided below:
from sklearn.ensemble import RandomForestClassifier
from ocean import (
ConstraintProgrammingExplainer,
MaxSATExplainer,
MixedIntegerProgramExplainer,
)
from ocean.datasets import load_adult
# Load the adult dataset
(data, target), mapper = load_adult()
# Select an instance to explain from the dataset
x = data.iloc[0].to_frame().T
# Train a random forest classifier
rf = RandomForestClassifier(n_estimators=10, max_depth=3, random_state=42)
rf.fit(data, target)
# Predict the class of the random instance
y = int(rf.predict(x).item())
x = x.to_numpy().flatten()
# Explain the prediction using the MIP backend
mip_model = MixedIntegerProgramExplainer(rf, mapper=mapper)
mip_explanation = mip_model.explain(x, y=1 - y, norm=1)
# Explain the prediction using the CP backend
cp_model = ConstraintProgrammingExplainer(rf, mapper=mapper)
cp_explanation = cp_model.explain(x, y=1 - y, norm=1)
# Explain the prediction using the MaxSAT backend
maxsat_model = MaxSATExplainer(rf, mapper=mapper)
maxsat_explanation = maxsat_model.explain(x, y=1 - y, norm=1)
# Show the explanations and their objective values
print("MIP objective value:", mip_model.get_objective_value())
print("MIP", mip_explanation, "\n")
print("CP objective value:", cp_model.get_objective_value())
print("CP", cp_explanation, "\n")
print("MaxSAT objective value:", maxsat_model.get_objective_value())
print("MaxSAT", maxsat_explanation, "\n")
Expected output:
MIP objective value: 3.0
MIP Explanation:
Age : 39.0
CapitalGain : 2174.0
CapitalLoss : 0
EducationNumber : 13.0
HoursPerWeek : 40.0
MaritalStatus : 3
NativeCountry : 0
Occupation : 10
Relationship : 0
Sex : 0
WorkClass : 6
CP objective value: 3.0
CP Explanation:
Age : 39.0
CapitalGain : 2174.0
CapitalLoss : 0.0
EducationNumber : 13.0
HoursPerWeek : 40.0
MaritalStatus : 3
NativeCountry : 0
Occupation : 1
Relationship : 0
Sex : 0
WorkClass : 4
MaxSAT objective value: 3.0
MaxSAT Explanation:
Age : 39.0
CapitalGain : 2174.0
CapitalLoss : 0.0
EducationNumber : 13.0
HoursPerWeek : 40.0
MaritalStatus : 3
NativeCountry : 0
Occupation : 1
Relationship : 0
Sex : 0
WorkClass : 4
See the examples folder or the documentation for more usage examples.
Feature Preview & Roadmap
| Area | Status | Notes / References |
|---|---|---|
| MIP formulation | ✅ Done | Based on Parmentier & Vidal (2020/2021). |
| Constraint Programming (CP) | ✅ Done | Based on an upcoming paper. |
| MaxSAT formulation | ✅ Done | Based on Raevskaya & Lehtonen (2025). |
| Heuristics | ⏳ Upcoming | Fast approximate methods. |
| Other methods | ⏳ Upcoming | Additional formulations under exploration. |
| AdaBoost support | ✅ Ready | Fully supported in ocean. |
| Random Forest support | ✅ Ready | Fully supported in ocean. |
| XGBoost support | ✅ Ready | Fully supported in ocean. |
Legend: ✅ available · ⏳ upcoming
Stargazers over time
References
- Axel Parmentier and Thibaut Vidal. 2021. Optimal Counterfactual Explanations in Tree Ensembles. In Proceedings of the thirty-eighth International Conference on Machine Learning. PMLR, 8276–8286. Available here.
- Raevskaya, Alesya & Lehtonen, Tuomo. (2025). Optimal Counterfactual Explanations for Random Forests with MaxSAT. 10.3233/FAIA250895. Available here.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file oceanpy-2.0.7.tar.gz.
File metadata
- Download URL: oceanpy-2.0.7.tar.gz
- Upload date:
- Size: 349.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
219ee7f84e7ea449b284a9f6b77769664c54eb559fd8872f0c9ccaba85e5a465
|
|
| MD5 |
a691dffbf9efe5ed25b67d8d37e41361
|
|
| BLAKE2b-256 |
8badb873e573ce65a4d6a93e86a21cc2ec0fe66324d018bb83c0034715dc4e76
|
Provenance
The following attestation bundles were made for oceanpy-2.0.7.tar.gz:
Publisher:
publish.yml on vidalt/OCEAN
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
oceanpy-2.0.7.tar.gz -
Subject digest:
219ee7f84e7ea449b284a9f6b77769664c54eb559fd8872f0c9ccaba85e5a465 - Sigstore transparency entry: 1343478952
- Sigstore integration time:
-
Permalink:
vidalt/OCEAN@b36f30630123bb547e65a0fabcd33c5a335ec64c -
Branch / Tag:
refs/tags/v2.0.7 - Owner: https://github.com/vidalt
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@b36f30630123bb547e65a0fabcd33c5a335ec64c -
Trigger Event:
push
-
Statement type:
File details
Details for the file oceanpy-2.0.7-py3-none-any.whl.
File metadata
- Download URL: oceanpy-2.0.7-py3-none-any.whl
- Upload date:
- Size: 84.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d77a420d2d062ba2b3e0cb3fa611f3f54e2d3f8afda0e2a58ae12079a9ad29e5
|
|
| MD5 |
6ef59c75597a2243635054ede4b8dd79
|
|
| BLAKE2b-256 |
fdabc2a0db16a1e6eab74b228186b5d29e6473ab77e3e35a873098c449c5318b
|
Provenance
The following attestation bundles were made for oceanpy-2.0.7-py3-none-any.whl:
Publisher:
publish.yml on vidalt/OCEAN
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
oceanpy-2.0.7-py3-none-any.whl -
Subject digest:
d77a420d2d062ba2b3e0cb3fa611f3f54e2d3f8afda0e2a58ae12079a9ad29e5 - Sigstore transparency entry: 1343478955
- Sigstore integration time:
-
Permalink:
vidalt/OCEAN@b36f30630123bb547e65a0fabcd33c5a335ec64c -
Branch / Tag:
refs/tags/v2.0.7 - Owner: https://github.com/vidalt
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@b36f30630123bb547e65a0fabcd33c5a335ec64c -
Trigger Event:
push
-
Statement type: