A package for efficient Shapley performance attribution for least-squares problems

These details have not been verified by PyPI

Project description

Least-Squares Shapley Performance Attribution (LS-SPA)

Installation - Usage - Hello world - Example notebook - Optional arguments - Citing

Library companion to the paper Efficient Shapley Performance Attribution for Least-Squares Regression by Logan Bell, Nikhil Devanathan, and Stephen Boyd.

The results provided in the reference paper were generated using a more performant, but harder to use implementation of the same algorithm. This benchmark code and the numerical experiments from the reference paper can be found at cvxgrp/ls-spa-benchmark. We recommend caution in trying to use the benchmark code.

Installation

To install this package, execute

pip install ls_spa

Import ls_spa by adding

from ls_spa import ls_spa

to the top of your Python file.

ls_spa has the following dependencies:

numpy
scipy
pandas
joblib

Optional dependencies are

marimo for using the demo notebook
matplotlib for plotting in the demo notebook

Usage

We assume that you have imported ls_spa and you have a $N\times p$ matrix of training data X_train, a $M\times p$ matrix of testing data X_test, a $N$ vector of training labels y_train, and a $M$ vector of testing labels y_test for positive integers $p, N, M$ with $N,M\geq p$. In this case, you can find the Shapley attribution of the out-of-sample $R^2$ on your data by executing

attrs = ls_spa(X_train, X_test, y_train, y_test).attribution

attrs will be a NumPy array containing the Shapley values of your features. The ls_spa function computes Shapley values for the given data using the LS-SPA method described in the companion paper. It takes arguments:

X_train: Training feature matrix (NumPy array or pandas DataFrame).
X_test: Testing feature matrix (NumPy array or pandas DataFrame).
y_train: Training response vector (NumPy array or pandas Series).
y_test: Testing response vector (NumPy array or pandas Series).

Hello world

We present a complete Python script that utilizes LS-SPA to compute the Shapley attribution on the data from the toy example described in the companion paper.

# Imports
import numpy as np
from ls_spa import ls_spa

# Data loading
X_train, X_test, y_train, y_test = [np.load("./data/toy_data.npz")[key] for key in ["X_train","X_test","y_train","y_test"]]

# Compute Shapley attribution with LS-SPA
results = ls_spa(X_train, X_test, y_train, y_test)

# Print attribution
print(results)

This example uses data from the data directory of this repository.

The line print(results) prints a dashboard of information generated while computing the Shapley attribution such as the attribution, the $R^2$ of the model fitted with all of the features, the feature cofficients of the fitted model, and an error estimate on the attribution (since LS-SPA is a method of estimation).

To extract just the vector of Shapley values, use results.attribution. For more info, see optional arguments.

Example notebook

In this demo, we walk through the process of computing Shapley values on the data for the toy example in the companion paper. We then use ls_spa to compute the Shapley attribution on the same data.

Optional arguments

ls_spa takes the optional arguments:

reg: Ridge regularization parameter (Default 0.0).
max_samples: Maximum number of feature permutations to sample (Default 8192).
batch_size: Number of permutations to process per batch (Default 256).
tolerance: Stopping criterion for estimation error (Default 0.01).
seed: Seed for random number generation (Default 42).
perms: Permutation sampling method (Default None). Options include:
- None: Auto-select "exact" for p < 9 features, otherwise "random"
- "exact": Enumerate all permutations (only feasible for p < 9)
- "random": Uniformly random permutations
- "argsort": Quasi-Monte Carlo permutations using argsort
- "permutohedron": Quasi-Monte Carlo permutations from permutohedron lattice
- Custom array or tuple of permutations
antithetical: Use antithetical (paired) sampling for variance reduction (Default True).
return_attribution_history: Return convergence history of attributions (Default False).
n_jobs: Number of parallel jobs; use -1 for all CPU cores (Default 1).

ls_spa returns a ShapleyResults object. The ShapleyResults object has the fields:

attribution: Array of Shapley values for each feature.
theta: Array of regression coefficients with all features.
r_squared: Out-of-sample R² with all features.
overall_error: Estimated error (95th percentile L2 norm) in Shapley attribution vector.
attribution_errors: Array of estimated errors for each feature's attribution.
error_history: Array of error estimates after each batch. None if using exact computation.
attribution_history: Array of attribution estimates over time. None if return_attribution_history=False.

Citing

If you use this code for research, please cite the associated paper.

@article{Bell2024,
  title = {Efficient Shapley performance attribution for least-squares regression},
  volume = {34},
  ISSN = {1573-1375},
  url = {http://dx.doi.org/10.1007/s11222-024-10459-9},
  DOI = {10.1007/s11222-024-10459-9},
  number = {5},
  journal = {Statistics and Computing},
  publisher = {Springer Science and Business Media LLC},
  author = {Bell,  Logan and Devanathan,  Nikhil and Boyd,  Stephen},
  year = {2024},
  month = jul
}

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

2.0.0

Jan 16, 2026

1.1.2

Sep 10, 2024

1.1.1

Sep 9, 2024

0.0.0

Sep 9, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ls_spa-2.0.0.tar.gz (136.3 kB view details)

Uploaded Jan 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ls_spa-2.0.0-py3-none-any.whl (18.3 kB view details)

Uploaded Jan 16, 2026 Python 3

File details

Details for the file ls_spa-2.0.0.tar.gz.

File metadata

Download URL: ls_spa-2.0.0.tar.gz
Upload date: Jan 16, 2026
Size: 136.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.26 {"installer":{"name":"uv","version":"0.9.26","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for ls_spa-2.0.0.tar.gz
Algorithm	Hash digest
SHA256	`b68529e271006b8322a696ac3cd12a063a044543059f7e28487687b52fc80ec3`
MD5	`683abff5d8b7fa4ec19a7a9e075bd479`
BLAKE2b-256	`b2cdbc164f20dfc84e7588ca71af9191723eaff5daf360cd4082e60d30a107bc`

See more details on using hashes here.

File details

Details for the file ls_spa-2.0.0-py3-none-any.whl.

File metadata

Download URL: ls_spa-2.0.0-py3-none-any.whl
Upload date: Jan 16, 2026
Size: 18.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.26 {"installer":{"name":"uv","version":"0.9.26","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for ls_spa-2.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`894d63fafce4c48427fa6ce6d5959efb38c2c70ec6880fb4003226274415bca5`
MD5	`966ab4c4ce1907897964e8e1664ac9fc`
BLAKE2b-256	`f4f075431d50d66013e7866c20ce54a941930f2fa0f982fd9840d41217883174`

See more details on using hashes here.

ls_spa 2.0.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Least-Squares Shapley Performance Attribution (LS-SPA)

Installation - Usage - Hello world - Example notebook - Optional arguments - Citing

Installation

Usage

Hello world

Example notebook

Optional arguments

Citing

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes