A simple Python package for Optimal Counterfactual Explanations in Tree Ensembles

Project description

Optimal Counterfactual Explanations in Tree Ensembles

ocean is a Python package dedicated to counterfactual explanations for tree ensembles.
It builds on the paper Optimal Counterfactual Explanations in Tree Ensembles by Axel Parmentier and Thibaut Vidal, published in the Proceedings of the Thirty-eighth International Conference on Machine Learning, 2021. The article is available here.
Beyond the original mixed-integer programming (MIP) approach, ocean includes a new constraint programming (CP) method and will grow to cover additional formulations and heuristics.
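
To make the notion of an optimal counterfactual concrete, here is a toy sketch that does not use ocean at all. It builds a hypothetical ensemble of three decision stumps and finds, by exhaustive grid search, the closest point (in L1 distance) that flips the majority vote. The stumps, the grid, and the function names are all assumptions made for illustration; brute force stands in for the MIP and CP formulations, which solve the same kind of problem without enumerating candidates.

```python
from itertools import product

# Three hypothetical decision stumps: (feature index, threshold).
# Each stump votes for class 1 when the feature value exceeds its threshold.
STUMPS = [(0, 30), (1, 40), (0, 50)]


def predict(point):
    """Majority vote of the stumps; returns 0 or 1."""
    votes = sum(1 for feature, threshold in STUMPS if point[feature] > threshold)
    return 1 if votes * 2 > len(STUMPS) else 0


def nearest_counterfactual(x, target, grid):
    """Search every point of `grid` (one iterable of candidate values per
    feature) and return the one closest to x in L1 distance among those
    predicted as `target`, together with that distance."""
    best, best_dist = None, float("inf")
    for candidate in product(*grid):
        if predict(candidate) != target:
            continue
        dist = sum(abs(a - b) for a, b in zip(candidate, x))
        if dist < best_dist:
            best, best_dist = candidate, dist
    return best, best_dist


x = (25, 38)  # no stump fires, so the ensemble predicts class 0
grid = [range(0, 101), range(0, 101)]  # integer values 0..100 per feature
cf, dist = nearest_counterfactual(x, target=1, grid=grid)
print(cf, dist)  # (31, 41) 9
```

Moving feature 0 past 30 and feature 1 past 40 wins two of the three votes at a total cost of 9, which is cheaper than pushing feature 0 alone past 50 (cost 26). Exhaustive search only works on tiny grids like this one.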

Installation

You can install the package with the following command:

pip install oceanpy

Note: The MIP method requires access to the Gurobi solver. You can request a free academic license here. Once Gurobi is installed, you can install the package with the command above. The CP method works without Gurobi.
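
As a rough sketch of how a script might choose between the two methods, the snippet below checks whether the gurobipy module is importable and picks an explainer class name accordingly. It assumes the MIP method relies on the gurobipy Python bindings. The helper names gurobi_installed and pick_explainer_name are hypothetical and not part of ocean. Finding the module does not guarantee that a valid license is present.

```python
import importlib.util


def gurobi_installed():
    """Return True if the gurobipy module can be found in this environment.

    This only checks that the module is installed, not that a license works.
    """
    return importlib.util.find_spec("gurobipy") is not None


def pick_explainer_name(has_gurobi):
    """Return the name of the ocean explainer class to use (hypothetical helper)."""
    if has_gurobi:
        return "MixedIntegerProgramExplainer"
    return "ConstraintProgrammingExplainer"


print(pick_explainer_name(gurobi_installed()))
```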

Usage

The package provides multiple classes and functions to wrap the tree ensemble models from the scikit-learn library. A minimal example is provided below:

from sklearn.ensemble import RandomForestClassifier

from ocean import MixedIntegerProgramExplainer, ConstraintProgrammingExplainer
from ocean.datasets import load_adult

# Load the adult dataset
(data, target), mapper = load_adult()

# Select an instance to explain from the dataset
x = data.iloc[0].to_frame().T

# Train a random forest classifier
rf = RandomForestClassifier(n_estimators=10, max_depth=3, random_state=42)
rf.fit(data, target)

# Predict the class of the selected instance
y = int(rf.predict(x).item())

# Flatten the instance to a 1-D NumPy array once; both explainers use it
x = x.to_numpy().flatten()

# Explain the prediction using MixedIntegerProgramExplainer
# (y=1 - y requests the opposite class of the binary prediction)
mip_model = MixedIntegerProgramExplainer(rf, mapper=mapper)
mip_explanation = mip_model.explain(x, y=1 - y, norm=1)

# Explain the prediction using ConstraintProgrammingExplainer
cp_model = ConstraintProgrammingExplainer(rf, mapper=mapper)
cp_explanation = cp_model.explain(x, y=1 - y, norm=1)

# Show the explanations
print("MIP: ", mip_explanation)
print("CP : ", cp_explanation)

Expected output:

MIP Explanation:
Age              : 39.0
CapitalGain      : 2174.0
CapitalLoss      : 0
EducationNumber  : 13.0
HoursPerWeek     : 41.0
MaritalStatus    : 3
NativeCountry    : 0
Occupation       : 1
Relationship     : 0
Sex              : 0
WorkClass        : 6
CP Explanation:
Age              : 39.0
CapitalGain      : 2174.0
CapitalLoss      : 0.0
EducationNumber  : 13.0
HoursPerWeek     : 41.0
MaritalStatus    : 3
NativeCountry    : 0
Occupation       : 1
Relationship     : 0
Sex              : 0
WorkClass        : 6
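
When reading an explanation like the ones above, it often helps to see which features differ from the original instance and by how much. The sketch below is a plain-Python helper for that comparison, assuming you already have the original instance and the counterfactual as two equal-length sequences in the same feature order. The function names and the sample values are hypothetical, and extracting the vector from an ocean explanation object is not covered here.

```python
def l1_distance(x, x_cf):
    """Sum of absolute per-feature differences between two equal-length vectors."""
    if len(x) != len(x_cf):
        raise ValueError("x and x_cf must have the same length")
    return sum(abs(float(b) - float(a)) for a, b in zip(x, x_cf))


def changed_features(x, x_cf, names, tol=1e-9):
    """Map each feature whose value moved by more than `tol` to (old, new)."""
    return {
        name: (float(a), float(b))
        for name, a, b in zip(names, x, x_cf)
        if abs(float(b) - float(a)) > tol
    }


# Illustrative values only, not output produced by ocean.
names = ["Age", "EducationNumber", "HoursPerWeek"]
x = [39.0, 13.0, 40.0]
x_cf = [39.0, 13.0, 41.0]

print(l1_distance(x, x_cf))              # 1.0
print(changed_features(x, x_cf, names))  # {'HoursPerWeek': (40.0, 41.0)}
```

The L1 distance computed here corresponds to the norm=1 argument used in the example, under the assumption that it selects the L1 norm.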

Feature Preview & Roadmap

| Area | Status | Notes / References |
|---|---|---|
| MIP formulation | ✅ Done | Based on Parmentier & Vidal (2020/2021). |
| Constraint Programming (CP) | ✅ Done | Based on an upcoming paper. |
| MaxSAT formulation | ⏳ Upcoming | Planned addition to the toolbox. |
| Heuristics | ⏳ Upcoming | Fast approximate methods. |
| Other methods | ⏳ Upcoming | Additional formulations under exploration. |
| Random Forest support | ✅ Ready | Fully supported in ocean. |
| XGBoost support | ⏳ Upcoming | Implementation planned. |

Legend: ✅ available · ⏳ upcoming
