Simple, compact, and hackable post-hoc deep OOD detection for alreadytrained tensorflow or pytorch image classifiers.

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Oodeel is a library that performs post-hoc deep OOD detection on already trained neural network image classifiers. The philosophy of the library is to favor quality over quantity and to foster easy adoption. As a result, we provide a simple, compact and easily customizable API and carefully integrate and test each proposed baseline into a coherent framework that is designed to enable their use in tensorflow and pytorch. You can find the documentation here.

from oodeel.methods import MLS

mls = MLS()
mls.fit(model) # A tensorflow or torch model
scores, info = mls.score(ds) # ds is a tf.data.Dataset or a torch.DataLoader

Installation
Quick Start
Tutorials
What's Included
Development roadmap
Contributing
See Also
Acknowledgments
Creator
License

Installation

Installation can be done using:

pip install oodeel

oodeel requires either tensorflow or pytorch to be already installed (it will not install them automatically not to mess-up with existing installations). It is regularly tested with:

Python version	Pytorch version	Tensorflow version
`3.8`	`1.11`	`2.5`
`3.9`	`1.13`	`2.8`
`3.10`	`2.00`	`2.11`

Quick Start

Now that oodeel is installed, here are some basic examples of what you can do with the available modules. See also the notebooks directory for more advanced examples.

For benchmarking with one dataset as in-distribution and another as out-of-distribution

Load in-distribution and out-of-distribution datasets.

from oodeel.datasets import OODDataset

ds_in = OODDataset(
  'mnist', load_kwargs={"split":"test"},
  backend="tensorflow").prepare(batch_size) # use backend="torch" if you prefer torch.DataLoader
ds_out = OODDataset(
  'fashion_mnist', load_kwargs={"split":"test"},
  backend="tensorflow").prepare(batch_size)

For benchmarking with a classes subset as in-distribution and another classes subset as out-of-distribution

Load a dataset and split it into an in-distribution dataset and ou-of-distribution dataset depending on its label values (a common practice of anomaly detection and open set recognition).

from oodeel.datasets import OODDataset

in_labels = [0, 1, 2, 3, 4]
oods_in, oods_out = oods_test.split_by_class(in_labels=in_labels)
ds_in = oods_in.prepare(batch_size=batch_size)
ds_out = oods_out.prepare(batch_size=batch_size)

Run an OOD method

Load an OOD method and use it on an already-trained model

from oodeel.methods import MLS

mls = MLS()
mls.fit(model)
# info contains model predictions and labels if avail
scores_in, info_in = mls.score(ds_in)
scores_out, info_out = mls.score(ds_out)

Evaluate the method

from oodeel.eval.metrics import bench_metrics

metrics = bench_metrics(
    (scores_in, scores_out),
    metrics = ["auroc", "fpr95tpr"],
    )

And visualize the results!

2D t-SNE (3D is also available).

plot_2D_features(
    model=model,
    in_dataset=ds_in,
    out_dataset=ds_out,
    output_layer_id=-2,
)

TSNE

Classical histograms and AUROC curve.

plot_ood_scores(scores_in, scores_out, log_scale=False)
plot_roc_curve(scores_in, scores_out)

AUROC

Tutorials

We propose some tutorials to get familiar with the library and its API. See the Tutorial section of the doc

What's Included

The library is based on a class, OODBaseDetector, that fits a model and then scores new samples. Some baselines use extra data, so OODBaseDetector can also fit additional data if needed. The library uses OODDataset to properly load data from different sources and prepare it for OOD detection. It can perform OOD-specific operations like adding extra OOD data for tuning with Outlier Exposure or filters according to label values for anomaly detection or open set recognition benchmarks.

Currently, oodeel includes the following baselines:

Name	Link	Venue	Status
MLS	Open-Set Recognition: a Good Closed-Set Classifier is All You Need?	ICLR 2022	avail tensorflow & torch
MSP	A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks	ICLR 2017	avail tensorflow & torch
Mahalanobis	A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks	NeurIPS 2018	avail tensorflow or torch
Energy	Energy-based Out-of-distribution Detection	NeurIPS 2020	avail tensorflow or torch
Odin	Enhancing The Reliability of Out-of-distribution Image Detection in Neural Networks	ICLR 2018	avail tensorflow or torch
DKNN	Out-of-Distribution Detection with Deep Nearest Neighbors	ICML 2022	avail tensorflow or torch
VIM	ViM: Out-Of-Distribution with Virtual-logit Matching	CVPR 2022	avail tensorflow or torch
Entropy	Likelihood Ratios for Out-of-Distribution Detection	NeurIPS 2019	avail tensorflow or torch
GODIN	Generalized ODIN: Detecting Out-of-Distribution Image Without Learning From Out-of-Distribution Data	CVPR 2020	planned
ReAct	ReAct: Out-of-distribution Detection With Rectified Activations	NeurIPS 2021	avail tensorflow or torch
NMD	Neural Mean Discrepancy for Efficient Out-of-Distribution Detection	CVPR 2022	planned
Gram	Detecting Out-of-Distribution Examples with Gram Matrices	ICML 2020	avail tensorflow or torch

Oodeel also includes standard training functions with data augmentation and learning rate scheduler for toy convnet models or models from keras.applications in tf_training_tools.py and torchvision.models in torch_training_tools.py files. These functions come in handy for benchmarks like leave-k-classes-out that requires retraining models on a subset of dataset classes.

Development Roadmap

More baselines!
A module for thorough visualizations (result plots and feature space visualizations)
Integrate model loading and uploading with hugginface's transformers library for pretraining
Extend the library to more diverse tasks like object detection, segmentation, NLP ...
Towards OOD Generalization?

Contributing

Feel free to propose your ideas or come and contribute with us on the oodeel toolbox! We have a specific document where we describe in a simple way how to make your first pull request: just here.

Acknowledgments

This project received funding from the French ”Investing for the Future – PIA3” program within the Artificial and Natural Intelligence Toulouse Institute (ANITI). The authors gratefully acknowledge the support of the DEEL , a research project jointly conducted in France and Quebec.

Creators

The library was created by Paul Novello to streamline DEEL research on post-hoc deep OOD methods and foster their adoption by DEEL industrial partners. He was soon joined by Yann Pequignot, Yannick Prudent, Corentin Friedrich and Matthieu Le Goff.

License

The package is released under MIT license.

Project details

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.2.0

Sep 15, 2023

0.1.1

Sep 4, 2023

0.1.0

Sep 4, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oodeel-0.2.0.tar.gz (60.1 kB view hashes)

Uploaded Sep 15, 2023 Source

Built Distribution

oodeel-0.2.0-py3-none-any.whl (109.8 kB view hashes)

Uploaded Sep 15, 2023 Python 3

Hashes for oodeel-0.2.0.tar.gz

Hashes for oodeel-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`60b560ea57b86bddf6f02507de3f93a7589e18b84626b2c1bebc2cbd91f05d97`
MD5	`b3edcf358a28a60b76398c0505151eaa`
BLAKE2b-256	`67efc849da5679c4d6b28a5322bcbe88a3ac0b11f661c9368773576a13b621dc`

Hashes for oodeel-0.2.0-py3-none-any.whl

Hashes for oodeel-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3158289313fe29544ae2861e8ec8f74c1955c28fa61e62f004058475714d6db0`
MD5	`a6b7f48a0f73de079966997357df1bab`
BLAKE2b-256	`92e5c8f3016e5f5524a45af3ca397526ff7ca6a12dc117fd4e4cb50f788de908`