Skip to main content

explainy is a library for generating explanations for machine learning models in Python. It uses methods from Machine Learning Explainability and provides a standardized API to create feature importance explanations for samples. The explanations are generated in the form of plots and text.

Project description

explainy - black-box model explanations for humans

pypi version codecov docs Supported versions Code style: black Imports: isort Downloads

explainy is a library for generating machine learning models explanations in Python. It uses methods from Machine Learning Explainability and provides a standardized API to create feature importance explanations for samples.

The API is inspired by scikit-learn and has three core methods explain(), plot() and, importance(). The explanations are generated in the form of texts and plots.

explainy comes with four different algorithms to create either global or local and contrastive or non-contrastive model explanations.

Method Type Explanations Classification Regression
Permutation Feature Importance non-contrastive global :star: :star:
Shap Values non-contrastive local :star: :star:
Surrogate Model contrastive global :star: :star:
Counterfactual Example contrastive local :star: :star:

Description:

  • global: explanation of system functionality (all samples have the same explanation)
  • local: explanation of decision rationale (each sample has its own explanation)
  • contrastive: tracing of decision path (differences to other outcomes are described)
  • non-contrastive: parameter weighting (the feature importance is reported)

Documentation

https://explainy.readthedocs.io

Install explainy

pip install explainy

Further, install graphviz (version 9.0.0 or later) for plotting tree surrogate models:

Windows

choco install graphviz

Mac

brew install graphviz

Linux: Ubuntu packages

sudo apt install graphviz

Further details on how to install graphviz can be found in the official graphviz docs.

Also, make sure that the folder with the dot executable is added to your systems PATH. You can find further details here.

Usage

📚 A comprehensive example of the explainy API can be found in this Jupyter Notebook

📖 Or in the example section of the documentation

Initialize and train a scikit-learn model:

import pandas as pd
from sklearn.datasets import load_diabetes
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

diabetes = load_diabetes()
X_train, X_test, y_train, y_test = train_test_split(
    diabetes.data, diabetes.target, random_state=0
)
X_test = pd.DataFrame(X_test, columns=diabetes.feature_names)
y_test = pd.DataFrame(y_test)

model = RandomForestRegressor(random_state=0)
model.fit(X_train, y_train)

Initialize the PermutationExplanation (or any other explanation) object and pass in the trained model and the to be explained dataset.

Addtionally, define the number of features used in the explanation. This allows you to configure the verbosity of your exaplanation.

Set the index of the sample that should be explained.

from explainy.explanations import PermutationExplanation

number_of_features = 4

explainer = PermutationExplanation(
    X_test, y_test, model, number_of_features
)

Call the explain() method and print the explanation for the sample (in case of a local explanation every sample has a different explanation).

explanation = explainer.explain(sample_index=1)
print(explanation)

The RandomForestRegressor used 10 features to produce the predictions. The prediction of this sample was 251.8.

The feature importance was calculated using the Permutation Feature Importance method.

The four features which were most important for the predictions were (from highest to lowest): 'bmi' (0.15), 's5' (0.12), 'bp' (0.03), and 'age' (0.02).

Use the plot() method to create a feature importance plot of that sample.

explainer.plot()

Permutation Feature Importance

If your prefer, you can also create another type of plot, as for example a boxplot.

explainer.plot(kind='box')

Permutation Feature Importance BoxPlot

Finally, you can also look at the importance values of the features (in form of a pd.DataFrame).

feature_importance = explainer.importance()
print(feature_importance)
  Feature  Importance
0     bmi        0.15
1      s5        0.12
2      bp        0.03
3     age        0.02
4      s2       -0.00
5     sex       -0.00
6      s3       -0.00
7      s1       -0.01
8      s6       -0.01
9      s4       -0.01

Features

  • Algorithms for inspecting black-box machine learning models
  • Support for the machine learning frameworks scikit-learn and xgboost
  • explainy offers a standardized API with three core methods explain(), plot(), importance()

Other Machine Learning Explainability libraries to watch

  • shap: A game theoretic approach to explain the output of any machine learning model
  • eli5: A library for debugging/inspecting machine learning classifiers and explaining their predictions
  • alibi: Algorithms for explaining machine learning models
  • interpret: Fit interpretable models. Explain blackbox machine learning

Source

Molnar, Christoph. "Interpretable machine learning. A Guide for Making Black Box Models Explainable", 2019. https://christophm.github.io/interpretable-ml-book/

Author

Mauro Luzzatto - Maurol

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

explainy-0.2.12.tar.gz (654.9 kB view details)

Uploaded Source

Built Distribution

explainy-0.2.12-py2.py3-none-any.whl (28.3 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file explainy-0.2.12.tar.gz.

File metadata

  • Download URL: explainy-0.2.12.tar.gz
  • Upload date:
  • Size: 654.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.9.6 requests/2.31.0 setuptools/65.5.0 requests-toolbelt/1.0.0 tqdm/4.66.1 CPython/3.11.3

File hashes

Hashes for explainy-0.2.12.tar.gz
Algorithm Hash digest
SHA256 49ceaf243ebe0cd8502088b657937ff520464d9797dcab39123df377b69a2176
MD5 0dfa129a8e21dd10fd4eda9d6b95513e
BLAKE2b-256 ffaa10800a79c5cb00d9927862566e37ed467bbbe1179976f36d0a73c204af1f

See more details on using hashes here.

File details

Details for the file explainy-0.2.12-py2.py3-none-any.whl.

File metadata

  • Download URL: explainy-0.2.12-py2.py3-none-any.whl
  • Upload date:
  • Size: 28.3 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.9.6 requests/2.31.0 setuptools/65.5.0 requests-toolbelt/1.0.0 tqdm/4.66.1 CPython/3.11.3

File hashes

Hashes for explainy-0.2.12-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 04357526c810330f3f215fb5f530cd86898e1d9f6fd26400fdbcc7fef5bd3f44
MD5 42c919374db6c2531d15a66b1907b799
BLAKE2b-256 e84fea46fa92bbdc4e3b66b4f99af7b6de122e48a9bc1ea07539906c81e8fea5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page