Benchmarking tool for Counterfactual Explanations
Universal Counterfactual Benchmark Framework
The fastest way to test your tabular counterfactuals, evaluating them on 22 different datasets/models. All models are Keras/TensorFlow neural networks.
Installation
pip install cfbench
Usage
import numpy as np
from cfbench.cfbench import BenchmarkCF

# A simple CF generator: when the factual class is 1,
# return an all-zeros array; otherwise, return an all-ones array
def my_cf_generator(factual_array, model):
    if model.predict(np.array([factual_array]))[0][0] > 0.5:
        return [0] * len(factual_array)
    else:
        return [1] * len(factual_array)

# Create the Benchmark Generator
benchmark_generator = BenchmarkCF(framework_name='my_framework').create_generator()

# The Benchmark loop
for benchmark_data in benchmark_generator:
    # Get the factual array
    factual_array = benchmark_data['factual_oh']
    # Get the Keras/TensorFlow model
    model = benchmark_data['model']
    # Create the CF
    cf = my_cf_generator(factual_array, model)
    # Get the evaluator
    evaluator = benchmark_data['cf_evaluator']
    # Evaluate the CF
    evaluator(cf, verbose=True)
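The evaluator also reports whether the candidate is actually a counterfactual (see the table in the next section). The following is a minimal sketch, not part of the package's documented examples, of how that return value could be used to count valid CFs across the whole benchmark; it assumes the [True, cf_array]/[False, nan_array] return format described below:

# Minimal sketch (assumption): count valid CFs over the whole benchmark,
# relying on the evaluator returning [True, cf_array] for a valid CF and
# [False, nan_array] otherwise, as described in the table below.
valid, total = 0, 0
for benchmark_data in BenchmarkCF(framework_name='my_framework').create_generator():
    factual_array = benchmark_data['factual_oh']
    model = benchmark_data['model']
    cf = my_cf_generator(factual_array, model)
    is_cf, cf_array = benchmark_data['cf_evaluator'](cf, verbose=True)
    total += 1
    valid += int(is_cf)
print(f'Valid counterfactuals: {valid}/{total}')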
Further information
We understand that different counterfactual generators need different data, so our generator provides the data described in the following table:
The BenchmarkCF().create_generator() method returns a generator that provides the following data:
Key | Type | Description |
---|---|---|
factual_oh | list | Factual data, one-hot encoded (if there are categorical features) |
model | tf.keras.Model | Model to be explained |
factual | list | Factual data (WITHOUT one-hot encoding) |
num_feats | list | Indexes of the numerical continuous features |
cat_feats | list | Indexes of the categorical features |
cf_evaluator | BenchmarkGenerator.cf_evaluator | Evaluates whether the CF is indeed a CF. Returns [True, cf_array] if it is a CF and [False, nan_array] otherwise |
oh_converter | cfbench.cfg.OHConverter.Converter | Converts to one-hot encoding (.convert_to_oh) or from one-hot encoding (.convert) |
df_train | pandas.DataFrame | DataFrame of the model's training data (WITHOUT one-hot encoding) |
df_oh_train | pandas.DataFrame | DataFrame of the model's training data (WITH one-hot encoding) |
df_test | pandas.DataFrame | DataFrame of the model's test data (WITHOUT one-hot encoding) |
df_oh_test | pandas.DataFrame | DataFrame of the model's test data (WITH one-hot encoding) |
df_factual | pandas.DataFrame | DataFrame of the factual data (WITHOUT one-hot encoding) |
tf_session | tf.Session | TensorFlow session |
factual_idx | int | Index of the factual data point in the factual dataset |
factual_class | int | Model's prediction (0 or 1) for the factual data point |
dsname | str | Name of the dataset |
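As an illustration of how these fields fit together, a generator that works in the original (non-encoded) feature space can use factual and cat_feats, then convert its output with oh_converter before evaluation. The sketch below is only an assumption based on the descriptions above: my_plain_cf_generator is a hypothetical generator, and the exact signature of convert_to_oh is inferred from the table.

# Hedged sketch: the convert_to_oh call and my_plain_cf_generator are
# assumptions for illustration; only the dictionary keys come from the table above.
for benchmark_data in BenchmarkCF(framework_name='my_framework').create_generator():
    factual = benchmark_data['factual']        # factual WITHOUT one-hot encoding
    cat_feats = benchmark_data['cat_feats']    # indexes of categorical features
    oh_converter = benchmark_data['oh_converter']

    # Generate a CF in the original feature space (hypothetical generator)
    cf_plain = my_plain_cf_generator(factual, cat_feats)

    # Convert it to the one-hot encoded space expected by the model and evaluator
    cf_oh = oh_converter.convert_to_oh(cf_plain)

    benchmark_data['cf_evaluator'](cf_oh, verbose=True)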
TensorFlow Version compatibility
This framework is intended to be compatible with both TensorFlow 1 and TensorFlow 2; however, problems can arise. If you encounter any problem, please open an issue.
Reference
If you used this package in your experiments, here is the reference paper:
@Article{app11167274,
AUTHOR = {de Oliveira, Raphael Mazzine Barbosa and Martens, David},
TITLE = {A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data},
JOURNAL = {Applied Sciences},
VOLUME = {11},
YEAR = {2021},
NUMBER = {16},
ARTICLE-NUMBER = {7274},
URL = {https://www.mdpi.com/2076-3417/11/16/7274},
ISSN = {2076-3417},
DOI = {10.3390/app11167274}
}
0.0.3 / 2022-08-29
==================
- Fix data files
0.0.2 / 2022-08-29
==================
- Simplified interface to run the benchmark with module cfbench
- Updated README.md