A Data Valuation Package for Machine Learning
Project description
Valda
Introduction
Valda is a Python package for data valuation in machine learning. If you are interested in
- analyzing the contribution of individual training examples to the final classification performance, or
- identifying some noisy examples in the training set,
you may be interested in the functions provided by this package.
The current version supports five different data valuation methods. It supports all the classifiers from Sklearn for valuation, and also user-defined classifier using PyTorch.
- Leave-one-out (LOO),
- Data Shapley with the TMC algorithm (TMC-Shapley) from Ghorbani and Zou (2019),
- Beta Shapley from Kwon and Zou (2022)
- Class-wise Shapley (CS-Shapley) from Schoch et al. (2022)
- Influence Function (IF) from Koh and Liang (2017)
- IF only works with the classifiers built with PyTorch, because it requires gradient computation.
- v0.1.8 only support the first-order gradient computation, and we will add the second-order computation soon.
Tutorial
Please checkout a simple tutorial on Google Colab, for how to use this package.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
valda-0.1.10.tar.gz
(11.4 kB
view details)
Built Distribution
valda-0.1.10-py3-none-any.whl
(14.3 kB
view details)
File details
Details for the file valda-0.1.10.tar.gz
.
File metadata
- Download URL: valda-0.1.10.tar.gz
- Upload date:
- Size: 11.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 67e42c8446ea2e0baa42a1da769bf9d2758705205107d772f140ef4731a8e585 |
|
MD5 | 5b96727a828357185716a0423eaebddd |
|
BLAKE2b-256 | 21787bdf61979f801314e997771b2def6ecf90bffcab9ce8933a091dd47a04d4 |
File details
Details for the file valda-0.1.10-py3-none-any.whl
.
File metadata
- Download URL: valda-0.1.10-py3-none-any.whl
- Upload date:
- Size: 14.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | aa8cdddccd296cabe097aee7cee45dbc23b97f52622da50b6167b5ecb9845792 |
|
MD5 | e206c4630105d34f40d1687516f35181 |
|
BLAKE2b-256 | 0c3e3ebbb4f310898ca87bd9e29a31e85155b838a017a5738e59666efb5deff6 |