PertCF: Perturbation-based counterfactual explanations with SHAP-weighted feature attribution
PertCF is a perturbation-based counterfactual explanation method that combines SHAP feature attribution with nearest-neighbour search to generate high-quality, stable counterfactuals for tabular classification models.
What is a counterfactual explanation?
Given a model's prediction for an instance x, a counterfactual x' is the minimal change to x that would flip the prediction. For example: "If Leo earned $500 more per month, his loan application would be accepted."
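The idea can be shown with a toy sketch (illustrative only, unrelated to the pertcf API): a stand-in "model" that approves loans at a monthly income of 3000+, and a search for the smallest income increase that flips its decision. All names here are hypothetical.

```python
def predict(income):
    """Stand-in 'model': approve the loan at a monthly income of 3000+."""
    return "accepted" if income >= 3000 else "rejected"

def counterfactual_income(income, step=100):
    """Raise income in small steps until the model's decision flips."""
    cf = income
    while predict(cf) == predict(income):
        cf += step
    return cf

print(predict(2500))                # rejected
print(counterfactual_income(2500))  # 3000 -> "if Leo earned $500 more..."
```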
Why PertCF?
| Feature | PertCF | DiCE | CF-SHAP |
|---|---|---|---|
| Multi-class support | ✅ | ❌ | ❌ |
| SHAP-weighted distance | ✅ | ❌ | Partial |
| Custom domain knowledge | ✅ | ❌ | ❌ |
| Works with sklearn, PyTorch, Keras | ✅ | Partial | ❌ |
| No external server needed | ✅ | ✅ | ✅ |
PertCF outperforms DiCE and CF-SHAP on dissimilarity and instability across both benchmark datasets (South German Credit, User Knowledge Modeling). See the paper for full results.
Installation
```bash
pip install pertcf
```
For PyTorch or Keras model support:
```bash
pip install pertcf[torch]       # + PyTorch adapter
pip install pertcf[tensorflow]  # + Keras/TF adapter
pip install pertcf[viz]         # + matplotlib/seaborn for plots
```
Requirements: Python ≥ 3.9, numpy, pandas, scikit-learn, shap.
No Java. No REST server. No external frameworks.
Quick Start (30 seconds)
```python
import pandas as pd
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

from pertcf import PertCFExplainer

# 1. Load data and train a model (example: German Credit dataset)
df = pd.read_csv("german_credit.csv")
X = df.drop(columns=["credit_risk"])
y = df["credit_risk"]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
clf = GradientBoostingClassifier(random_state=42).fit(X_train, y_train)

# 2. Create and fit the explainer
explainer = PertCFExplainer(
    model=clf,
    X_train=X_train,
    y_train=y_train,
    categorical_features=["purpose", "personal_status", "housing"],
    label="credit_risk",
    num_iter=5,
    coef=5,
)
explainer.fit()

# 3. Explain a prediction
instance = X_test.iloc[0].copy()
instance["credit_risk"] = clf.predict(X_test.iloc[[0]])[0]
counterfactual = explainer.explain(instance)

print("Original:       ", instance.to_dict())
print("Counterfactual: ", counterfactual.to_dict())
```
Feature Highlights
Works with any classifier
```python
# scikit-learn (auto-detected)
from sklearn.ensemble import RandomForestClassifier

explainer = PertCFExplainer(model=RandomForestClassifier().fit(X, y), ...)

# PyTorch
from pertcf import PertCFExplainer

explainer = PertCFExplainer(
    model=my_torch_model,
    class_names=["bad", "good"],
    ...
)

# Keras / TensorFlow
explainer = PertCFExplainer(
    model=my_keras_model,
    class_names=["bad", "good"],
    ...
)

# Any callable
explainer = PertCFExplainer(
    model=my_model,
    predict_fn=lambda X: my_model.predict(X),
    predict_proba_fn=lambda X: my_model.predict_proba(X),
    class_names=["bad", "good"],
    ...
)
```
Domain knowledge via custom similarity matrices
```python
# Model the relationship between credit purposes
explainer = PertCFExplainer(
    model=clf,
    ...
    similarity_matrices={
        "purpose": {
            ("car", "furniture"): 0.7,  # similar purposes
            ("car", "education"): 0.2,  # less similar
        }
    },
)
```
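One plausible convention for consuming such a table, sketched below with a hypothetical `lookup` helper (the library's actual lookup rules may differ): treat pairs symmetrically, score identical values as 1.0, and fall back to a default for unlisted pairs.

```python
# Hypothetical helper (not part of the pertcf API): consult a pairwise
# similarity table symmetrically, with a fallback for unlisted pairs.
purpose_sim = {
    ("car", "furniture"): 0.7,
    ("car", "education"): 0.2,
}

def lookup(sim, a, b, default=0.0):
    if a == b:
        return 1.0  # identical categories are maximally similar
    return sim.get((a, b), sim.get((b, a), default))

print(lookup(purpose_sim, "furniture", "car"))  # 0.7 (order-independent)
print(lookup(purpose_sim, "car", "vacation"))   # 0.0 (fallback)
```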
Pre-computed SHAP values (for large datasets)
```python
import shap

# Compute SHAP once, reuse across experiments
shap_exp = shap.TreeExplainer(clf)
shap_vals = shap_exp.shap_values(X_train)
# … build shap_df with shape (n_classes, n_features) …

explainer = PertCFExplainer(
    model=clf, shap_values=shap_df, ...
)
explainer.fit()  # skips SHAP computation
```
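One plausible way to fill in the elided aggregation step, shown here with stand-in arrays instead of real SHAP output (the exact format PertCF expects is an assumption; check the library's docs): for a multi-class model, per-class SHAP arrays of shape `(n_samples, n_features)` can be reduced to one importance row per class by averaging absolute values.

```python
import numpy as np

# Stand-in for per-class SHAP output: one (n_samples, n_features) array per
# class (2 classes, 50 samples, 3 features). Real values would come from
# shap.TreeExplainer(clf).shap_values(X_train).
rng = np.random.default_rng(0)
shap_vals = [rng.normal(size=(50, 3)) for _ in range(2)]

# Mean absolute SHAP value per class -> (n_classes, n_features)
shap_matrix = np.stack([np.abs(v).mean(axis=0) for v in shap_vals])
print(shap_matrix.shape)  # (2, 3): one row per class, one column per feature
```

Wrapping `shap_matrix` in a `pandas.DataFrame` with the feature names as columns would give a `shap_df` of the shape described in the comment above.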
Built-in benchmark
```python
# Reproduce the paper's Table 2 results
results = explainer.benchmark(X_test, n=100, coef=5, verbose=True)
# Results (n=100/100):
#   dissimilarity : 0.0517
#   sparsity      : 0.7983
#   runtime_mean  : 0.4069
```
Evaluation metrics
```python
from pertcf import metrics

print(metrics.dissimilarity(query, cf, explainer.sim_fn, cf_class))
print(metrics.sparsity(query, cf))
print(metrics.instability(query, cf, explainer))

# All at once:
results = metrics.evaluate(queries, counterfactuals, explainer)
```
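To make the metrics concrete, here is a sketch of one common definition of sparsity: the fraction of features the counterfactual leaves unchanged (PertCF's exact definition may differ; this toy `sparsity` function and the feature names are illustrative).

```python
# Illustrative sparsity: fraction of features left unchanged by the
# counterfactual. Higher means a sparser (less invasive) explanation.
def sparsity(query, cf):
    unchanged = sum(1 for f in query if query[f] == cf[f])
    return unchanged / len(query)

q  = {"amount": 5000, "duration": 24, "purpose": "car"}
cf = {"amount": 4200, "duration": 24, "purpose": "car"}
print(sparsity(q, cf))  # only "amount" changed -> 2/3 unchanged
```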
How PertCF Works
1. Compute SHAP values per class → class-specific feature importance weights
2. For query `x`:
   a. Find the Nearest Unlike Neighbour (NUN) using SHAP-weighted similarity
   b. Perturb `x` toward the NUN using SHAP weights:
      - Numeric: `p_f = x_f + shap_target_f * (nun_f - x_f)`
      - Categorical: `p_f = nun_f if sim(x_f, nun_f) < 0.5 else x_f`
   c. If the perturbed instance flips class → refine (approach source)
   d. If not → push harder (approach target)
   e. Terminate when the step size falls below a threshold or max iterations is reached
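The perturbation step (2b) can be sketched as follows. This is an illustration of the two update rules above, not the library's code; the function and variable names are hypothetical.

```python
# Sketch of step 2b: numeric features move toward the NUN, scaled by the
# target class's SHAP weight; categorical features jump to the NUN value
# only when the current value is dissimilar to it (similarity < 0.5).
def perturb(x, nun, shap_target, sim, numeric_features):
    p = dict(x)
    for f in x:
        if f in numeric_features:
            p[f] = x[f] + shap_target[f] * (nun[f] - x[f])
        elif sim(x[f], nun[f]) < 0.5:
            p[f] = nun[f]
    return p

x       = {"amount": 5000.0, "purpose": "car"}
nun     = {"amount": 3000.0, "purpose": "education"}
shap_w  = {"amount": 0.4, "purpose": 0.1}
p = perturb(x, nun, shap_w, lambda a, b: 0.2, {"amount"})
print(p)  # amount moves 40% of the way to the NUN; purpose flips
```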
See the paper for full algorithmic details.
Examples
| Notebook | Dataset | Description |
|---|---|---|
| quickstart_german_credit.ipynb | South German Credit | Basic usage, benchmark, comparison to DiCE |
| quickstart_knowledge.ipynb | User Knowledge Modeling | Multi-class classification |
| custom_similarity.ipynb | German Credit | Domain knowledge with custom similarity matrices |
Parameter Guide
| Parameter | Default | Description |
|---|---|---|
| `num_iter` | 10 | Max perturbation iterations. Higher → better quality, slower. |
| `coef` | 5 | Step-size threshold coefficient. Higher → finer convergence. |
Recommended settings from the paper:
| Dataset | `num_iter` | `coef` | Notes |
|---|---|---|---|
| South German Credit | 5 | 5 | Mostly categorical features |
| User Knowledge Modeling | 5 | 3 | All numeric features |
Citation
If you use PertCF in your research, please cite:
```bibtex
@inproceedings{bayrak2023pertcf,
  title     = {PertCF: A Perturbation-Based Counterfactual Generation Approach},
  author    = {Bayrak, Bet{\"u}l and Bach, Kerstin},
  booktitle = {Artificial Intelligence XXXVIII},
  series    = {Lecture Notes in Computer Science},
  volume    = {14381},
  pages     = {174--187},
  year      = {2023},
  publisher = {Springer, Cham},
  doi       = {10.1007/978-3-031-47994-6_13}
}
```
License
MIT © Betül Bayrak, NTNU
This work was supported by the Research Council of Norway through the EXAIGON project (ID 304843).