
Sklearn wrapper for classification systems that computes the prediction uncertainty


Uncertainty assessment for Pre-trained models

Jose Mena

Motivation

Imagine one of the following situations:

  • You have trained a model for medical diagnosis to enable the early detection of an illness from images produced by a hospital's X-ray machine. You train the model on labelled data to predict whether a patient has the illness based on their image. Now a new hospital is interested in your model and wants to start using it, but then you realize that they use a different X-ray machine. To what extent is your model good at predicting from the new images? How can you trust these predictions?
  • You find an API that offers a Sentiment Analysis classifier. The documentation states that the model was trained on reviews of Amazon products. However, you want to apply it to restaurant reviews, which you plan to use as implicit feedback for a tourism recommender system. How will the API perform in the new domain? A horror movie can be terrifying and thus get a good review, but it is hard to see how a terrifying restaurant could be good.

Of course, there are situations where obtaining some labelled data for the target domain makes it possible to fine-tune the original model, or even train it from scratch, but sometimes this is not an option, as we may not have access to the original model or to the resources needed to train it.

What if we had a way to tell whether the prediction for the new dataset is confident or not? Picture a system that, together with the model's prediction, gives you a metric of its uncertainty. In probabilistic classifiers, a common way of measuring this uncertainty is to look at the output probabilities: a probability of 0.9 for the predicted class would indicate a very confident prediction. The problem with this approach is that there is no guarantee that these probabilities are well calibrated (that they are real probabilities) and, moreover, a prediction can be very confident, like this 0.9, and still be wrong.
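As a toy illustration of this point (not taken from the library; all data here is synthetic), a linear classifier trained on one domain can be confidently wrong once the inputs shift:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Source domain: two well-separated Gaussian blobs.
X_src = np.vstack([rng.normal(-2, 0.5, (100, 2)),
                   rng.normal(+2, 0.5, (100, 2))])
y_src = np.array([0] * 100 + [1] * 100)

clf = LogisticRegression().fit(X_src, y_src)

# Target domain: class-1 points that have drifted into the class-0 region.
X_tgt = rng.normal(-2, 0.5, (50, 2))
y_tgt = np.ones(50, dtype=int)

probs = clf.predict_proba(X_tgt)
# The classifier is confidently wrong: it assigns high probability to
# class 0 even though every target point is labelled class 1.
print("mean confidence in (wrong) class 0:", probs[:, 0].mean())
print("accuracy on target:", (clf.predict(X_tgt) == y_tgt).mean())
```

The softmax-style probabilities look like near-certainty, yet the accuracy on the shifted data collapses: exactly the failure mode the wrapper is meant to flag.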

Take, for example, a linear classifier trained with the data in the top-left figure. When we apply it to the target data (bottom-left figure), we observe that there is a mass of points that will be wrongly classified: they fall in the class 1 region, but they are labelled as class 2.

Imagine that this classifier was trained to predict whether an image shows a cat, a dog or a bear, and that the images used for training included white cats, but only black or brown bears. At prediction time, we get pictures of polar bears. Our classifier will do its best and may predict them as cats, as the only white animals it has seen belonged to the cat category.

Uncertainty wrapper

The component presented here tackles the scenario of having a pre-trained classifier, from now on the black box, and trying to apply it to a new target scenario.

What we propose in this component is a wrapper that takes the input and the output of the black box and enriches the latter with a measure of uncertainty. Based on this uncertainty, you are free to trust the prediction or discard it. If the overall results of applying the black box to your problem are poor, you may have to reconsider labelling more data and training a specific classifier for the target problem.

Further theoretical details can be found in the paper https://ieeexplore.ieee.org/document/9097854, but the main idea is that we have a black-box classification system that takes as input a feature vector x (or an embedding, in the case of images and texts) and outputs a probability distribution y* over the predicted categories or classes. The wrapper produces a new prediction y' together with an uncertainty score u for this prediction, as illustrated in the following diagram:

In the instantiation of the wrapper, the following parameters can be set:

| Name | Description | Mandatory |
|---|---|---|
| black_box | An object that observes the sklearn BaseEstimator interface. This will be used as the black box to retrieve the original predictions | Optional; if not set, a list of predictions on X must be passed on fit and predict |
| lambda_reg | Regularization factor used for adjusting the uncertainties | No, default 1e-2 |
| num_samples | Number of samples to include for the MC sampling of the Dirichlet distribution | No, default 1000 |
| learning_rate | Learning rate used when training the uncertainties | No, default 1e-3 |
| num_hidden_units | Number of units to use in the uncertainty neural network model | No, default 20 |
| batch_size | Batch size used when training the uncertainties | No, default 256 |
| epochs | Number of epochs when training the uncertainties | No, default 50 |
| verbose | Verbosity level of the training (same as in Keras) | No, default 0 |

Usually these parameters produce good estimates for the uncertainty but, depending on the case, they may have to be tuned to avoid overfitting.

The wrapper offers two methods (following the sklearn standard):

fit

This method is meant for training the wrapper. It needs some examples of the target domain (x, y) and will call the black box for the corresponding y*.

| Name | Description | Mandatory |
|---|---|---|
| X | Vector of input vectors | Yes |
| y | Vector of labels | Yes |
| pred_y | Vector of black-box predictions on X | Yes, when there is no black box defined |

Returns

A wrapper model trained for the target dataset

predict

This method applies the wrapper to obtain the prediction and the associated uncertainty. It takes the input x and calls the black box for a list of the corresponding y*.

| Name | Description | Mandatory |
|---|---|---|
| X | Vector of input vectors | Yes |
| pred_y | Vector of black-box predictions on X | Yes, when there is no black box defined |

Returns

A list of pairs (y*, u) corresponding to the prediction and the degree of uncertainty.

IMPORTANT: Note that the output of the black box, the original prediction, can be either a probability distribution over the categories of the classifier, as produced by the usual softmax layer, or a categorical output, consisting of the identifier of the predicted class. In both cases, the wrapper will be able to add the corresponding uncertainty.
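To make the fit/predict interface concrete, here is a minimal, self-contained sketch of the data flow described above. The class name `ToyUncertaintyWrapper` and the entropy-based uncertainty score are stand-ins invented for this illustration; the actual package instead trains a neural model and Monte-Carlo samples a Dirichlet distribution to compute u:

```python
import numpy as np
from sklearn.base import BaseEstimator
from sklearn.linear_model import LogisticRegression

class ToyUncertaintyWrapper:
    """Illustrative stand-in for the wrapper interface, NOT the real API.

    The real library trains an uncertainty model on target-domain data;
    here we simply use predictive entropy as a placeholder score, just
    to show the (X, black_box) -> [(y*, u), ...] data flow.
    """

    def __init__(self, black_box: BaseEstimator):
        self.black_box = black_box

    def fit(self, X, y):
        # The real wrapper trains its uncertainty model here using
        # (X, y) and the black-box predictions; entropy needs no fit.
        return self

    def predict(self, X):
        probs = self.black_box.predict_proba(X)                    # y*
        u = -np.sum(probs * np.log(probs + 1e-12), axis=1)         # u
        return list(zip(probs.argmax(axis=1), u))                  # (y*, u)

# Usage with an sklearn estimator as the black box:
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = np.array([0, 0, 1, 1])
wrapper = ToyUncertaintyWrapper(LogisticRegression().fit(X, y))
wrapper.fit(X, y)
for pred, unc in wrapper.predict(np.array([[1.5], [-5.0]])):
    print(pred, round(float(unc), 3))
```

The point far from the training data on the decision boundary (x = 1.5) gets a higher uncertainty score than the point deep inside the class-0 region (x = -5.0).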

Example

In the examples folder you will find an example (script or notebook) of a potential use case for the library. The example shows how to apply the uncertainty wrapper to enrich the predictions of a trained classifier with the corresponding uncertainty when applied to a new domain of application. The goal is to show how to use a classifier already trained for one task on another task, and how we can measure how well the black box adapts to the new task. We will see how we can identify the cases where the black box is less confident in its predictions, and how we can increase the accuracy of the resulting system by rejecting the most uncertain examples.
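The rejection idea can be sketched with synthetic numbers (the predictions and scores below are fabricated for this illustration; no real wrapper output is used): if the uncertainty score tends to be higher for misclassified examples, discarding the most uncertain ones raises accuracy on the rest:

```python
import numpy as np

rng = np.random.default_rng(1)

# Fabricated wrapper output: predictions, true labels, and an
# uncertainty score that is (by construction) higher for errors.
n = 1000
y_true = rng.integers(0, 2, n)
correct = rng.random(n) < 0.8             # ~80% of predictions are right
y_pred = np.where(correct, y_true, 1 - y_true)
u = rng.random(n) + 0.5 * (~correct)      # errors tend to score higher

# Reject the 20% most uncertain examples and re-measure accuracy.
keep = u < np.quantile(u, 0.8)
acc_all = (y_pred == y_true).mean()
acc_kept = (y_pred[keep] == y_true[keep]).mean()
print(f"accuracy on all: {acc_all:.3f}, after rejection: {acc_kept:.3f}")
```

The trade-off is coverage: the retained predictions are more accurate, but a fraction of inputs goes unanswered and must be handled some other way (e.g. human review).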

Setup

Requirements

  • Python 3.6+
  • scipy
  • pandas
  • numpy
  • scikit-learn
  • tensorflow==1.15.2
  • tensorflow_probability==0.7.0

Installation

Install it directly into an activated virtual environment:

$ pip3 install uncwrap

Usage

After installation, the package can be imported:

$ python
>>> import uncwrap
>>> uncwrap.__version__
