Skip to main content

A Data Valuation Package for Machine Learning

Project description

Valda

Introduction

Valda is a Python package for data valuation in machine learning. If you are interested in

  • analyzing the contribution of individual training examples to the final classification performance, or
  • identifying some noisy examples in the training set,

you may be interested in the functions provided by this package.

The current version supports five different data valuation methods. It supports all the classifiers from Sklearn for valuation, and also user-defined classifier using PyTorch.

  • Leave-one-out (LOO),
  • Data Shapley with the TMC algorithm (TMC-Shapley) from Ghorbani and Zou (2019),
  • Beta Shapley from Kwon and Zou (2022)
  • Class-wise Shapley (CS-Shapley) from Schoch et al. (2022)
  • Influence Function (IF) from Koh and Liang (2017)
    • IF only works with the classifiers built with PyTorch, because it requires gradient computation.
    • v0.1.8 only support the first-order gradient computation, and we will add the second-order computation soon.

Tutorial

Please checkout a simple tutorial on Google Colab, for how to use this package.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

valda-0.1.10.tar.gz (11.4 kB view details)

Uploaded Source

Built Distribution

valda-0.1.10-py3-none-any.whl (14.3 kB view details)

Uploaded Python 3

File details

Details for the file valda-0.1.10.tar.gz.

File metadata

  • Download URL: valda-0.1.10.tar.gz
  • Upload date:
  • Size: 11.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.7

File hashes

Hashes for valda-0.1.10.tar.gz
Algorithm Hash digest
SHA256 67e42c8446ea2e0baa42a1da769bf9d2758705205107d772f140ef4731a8e585
MD5 5b96727a828357185716a0423eaebddd
BLAKE2b-256 21787bdf61979f801314e997771b2def6ecf90bffcab9ce8933a091dd47a04d4

See more details on using hashes here.

File details

Details for the file valda-0.1.10-py3-none-any.whl.

File metadata

  • Download URL: valda-0.1.10-py3-none-any.whl
  • Upload date:
  • Size: 14.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.7

File hashes

Hashes for valda-0.1.10-py3-none-any.whl
Algorithm Hash digest
SHA256 aa8cdddccd296cabe097aee7cee45dbc23b97f52622da50b6167b5ecb9845792
MD5 e206c4630105d34f40d1687516f35181
BLAKE2b-256 0c3e3ebbb4f310898ca87bd9e29a31e85155b838a017a5738e59666efb5deff6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page