Skip to main content

A Data Valuation Package for Machine Learning

Project description

Valda

Introduction

Valda is a Python package for data valuation in machine learning. If you are interested in

  • analyzing the contribution of individual training examples to the final classification performance, or
  • identifying some noisy examples in the training set,

you may be interested in the functions provided by this package.

The current version supports five different data valuation methods. It supports all the classifiers from Sklearn for valuation, and also user-defined classifier using PyTorch.

  • Leave-one-out (LOO),
  • Data Shapley with the TMC algorithm (TMC-Shapley) from Ghorbani and Zou (2019),
  • Beta Shapley from Kwon and Zou (2022)
  • Class-wise Shapley (CS-Shapley) from Schoch et al. (2022)
  • Influence Function (IF) from Koh and Liang (2017)
    • IF only works with the classifiers built with PyTorch, because it requires gradient computation.
    • v0.1.8 only support the first-order gradient computation, and we will add the second-order computation soon.

Tutorial

Please checkout a simple tutorial on Google Colab, for how to use this package.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

valda-0.1.9.tar.gz (11.3 kB view details)

Uploaded Source

Built Distribution

valda-0.1.9-py3-none-any.whl (14.1 kB view details)

Uploaded Python 3

File details

Details for the file valda-0.1.9.tar.gz.

File metadata

  • Download URL: valda-0.1.9.tar.gz
  • Upload date:
  • Size: 11.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.7

File hashes

Hashes for valda-0.1.9.tar.gz
Algorithm Hash digest
SHA256 14366e74236725c9f76119031afc1c41b67670ef02ed2282a6e8f2773b3c0479
MD5 f94641b56f6200abc98db014e1a305ca
BLAKE2b-256 26082db73c8a66a48ea712aa7e65a762846a02621cdf86d5ec9b655f7f03bf0c

See more details on using hashes here.

File details

Details for the file valda-0.1.9-py3-none-any.whl.

File metadata

  • Download URL: valda-0.1.9-py3-none-any.whl
  • Upload date:
  • Size: 14.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.7

File hashes

Hashes for valda-0.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 a0b0ea343e4f85ed758f67214922b73b67e041bc4f6d2780f3c0afbe48306d03
MD5 4d9552ec1fce79ac0975e3ac4227bc9c
BLAKE2b-256 11733def21caaced5efdec27ffe1ba3c83fec939a2860c42a13780de06ba563f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page