Skip to main content

Predictive UNcertainty Calibration and Conformalization Library

Project description

Puncc


Puncc (short for Predictive uncertainty calibration and conformalization) is an open-source Python library. It seamlessly integrates a collection of state-of-the-art conformal prediction algorithms and associated techniques for diverse machine learning tasks, including regression, classification, object detection and anomaly detection. Puncc can be used with any predictive model to provide rigorous uncertainty estimations. Under data exchangeability (or i.i.d), the generated prediction sets are guaranteed to cover the true outputs within a user-defined error $\alpha$.

Documentation is available online.

📚 Table of contents

🐾 Installation

puncc requires a version of python higher than 3.9 and several libraries including Scikit-learn. It is recommended to install puncc in a virtual environment to not mess with your system's dependencies.

You can directly install the library using pip:

pip install -U puncc

The package is also available on conda-forge:

conda install -c conda-forge puncc

📖 Documentation

For comprehensive documentation, we encourage you to visit the official documentation page.

👨‍🎓 Tutorials

We highly recommend following the introductory tutorials to get familiar with the library and its API.

Tutorial Description Link
Introduction Tutorial Get started with the basics of puncc. Open In Github Open In Colab
API Tutorial Learn about puncc's API. Open In Github Open In Colab
Tutorial on CP with PyTorch Learn how to use puncc with PyTorch. Open In Github Open In Colab
Adaptive Conformal Regression Learn how conformalize regression models to obtain adaptive prediction intervals using methods such as LACP, LWCP and CQR. Open In Github Open In Colab
Conformal Object Detection Learn to conformalize an object detector. Open In Github Open In Colab
Conformal Text Classification Learn to conformalize pretrained HuggingFace NLP model. Open In Github Open In Colab
Conformal Classwise Classification Learn how to conformalize classification models for class-conditional coverage. Open In Github Open In Colab
Conformal Time Series Forecasting Learn to conformalize time series models. Open In Github Open In Colab
Architecture Overview Detailed overview of puncc's architecture. Open In Github Open In Colab

🚀 Quickstart

Conformal prediction enables to transform point predictions into interval predictions with high probability of coverage. The figure below shows the result of applying the split conformal algorithm on a linear regressor.

Many conformal prediction algorithms can easily be applied using puncc. The code snippet below shows the example of split conformal prediction with a pretrained linear model:

from deel.puncc.api.prediction import BasePredictor
from deel.puncc.regression import SplitCP

# Load calibration and test data
# ...

# Pretrained regression model
# trained_linear_model = ...

# Wrap the model to enable interoperability with different ML libraries
trained_predictor =  BasePredictor(trained_linear_model)

# Instanciate the split conformal wrapper for the linear model.
# Train argument is set to False because we do not want to retrain the model
split_cp = SplitCP(trained_predictor, train=False)

# With a calibration dataset, compute (and store) nonconformity scores
split_cp.fit(X_calib=X_calib, y_calib=y_calib)

# Obtain the model's point prediction y_pred and prediction interval
# PI = [y_pred_lower, y_pred_upper] for a target coverage of 90% (1-alpha).
y_pred, y_pred_lower, y_pred_upper = split_cp.predict(X_test, alpha=0.1)

The library provides several metrics (deel.puncc.metrics) and plotting capabilities (deel.puncc.plotting) to evaluate and visualize the results of a conformal procedure. For a target error rate of $\alpha = 0.1$, the marginal coverage reached in this example on the test set is higher than $90$% (see Introduction tutorial):

90% Prediction Interval with the Split Conformal Prediction Method
90% Prediction Interval with Split Conformal Prediction.

More flexibility with the API

Puncc provides two ways of defining and using conformal prediction wrappers:

  • A direct approach to run state-of-the-art conformal prediction procedures. This is what we used in the previous conformal regression example.
  • Low-level API: a more flexible approach based of full customization of the prediction model, the choice of nonconformity scores and the split between fit and calibration datasets.

A quick comparison of both approaches is provided in the API tutorial for a regression problem.

🖥️ Implemented Algorithms

Overview of Implemented Methods from the Literature:
Procedure Type Procedure Name Description (more details in Theory overview)
Conformal Regression deel.puncc.regression.SplitCP Split Conformal Regression
Conformal Regression deel.puncc.regression.LocallyAdaptiveCP Locally Adaptive Conformal Regression
Conformal Regression deel.puncc.regression.LeverageWeightedCP Leverage-Weighted Conformal Prediction
Conformal Regression deel.puncc.regression.CQR Conformalized Quantile Regression
Conformal Regression deel.puncc.regression.CvPlus CV + (cross-validation)
Conformal Regression deel.puncc.regression.EnbPI Ensemble Batch Prediction Intervals method
Conformal Regression deel.puncc.regression.aEnbPI Locally adaptive Ensemble Batch Prediction Intervals method
Conformal Classification deel.puncc.classification.LAC Least Ambiguous Set-Valued Classifiers
Conformal Classification deel.puncc.classification.ClasswiseLAC Classwise Conformal Classification
Conformal Classification deel.puncc.classification.APS Adaptive Prediction Sets
Conformal Classification deel.puncc.classification.RAPS Regularized Adaptive Prediction Sets (APS is a special case where $\lambda = 0$)
Conformal Anomaly Detection deel.puncc.anomaly_detection.SplitCAD Split Conformal Anomaly detection (used to control the maximum false positive rate)
Conformal Object Detection deel.puncc.object_detection.SplitBoxWise Box-wise split conformal object detection

📚 Citation

If you use our library for your work, please cite our paper:

@inproceedings{mendil2023puncc,
  title={PUNCC: a Python Library for Predictive Uncertainty Calibration and Conformalization},
  author={Mendil, Mouhcine and Mossina, Luca and Vigouroux, David},
  booktitle={Conformal and Probabilistic Prediction with Applications},
  pages={582--601},
  year={2023},
  organization={PMLR}
}

Puncc has been used to support the work presented in our COPA 2022 paper on conformal prediction for time series.

@inproceedings{mendil2022robust,
  title={Robust Gas Demand Forecasting With Conformal Prediction},
  author={Mendil, Mouhcine and Mossina, Luca and Nabhan, Marc and Pasini, Kevin},
  booktitle={Conformal and Probabilistic Prediction with Applications},
  pages={169--187},
  year={2022},
  organization={PMLR}
}

🙏 Acknowledgments

This project received funding from the French ”Investing for the Future – PIA3” program within the Artificial and Natural Intelligence Toulouse Institute (ANITI). The authors gratefully acknowledge the support of the DEEL project.

👨‍💻 About the Developers

Puncc's development team is a group of passionate scientists and engineers who are committed to developing a dependable and user-friendly open-source software. We are always looking for new contributors to this initiative. If you are interested in helping us develop puncc, please feel free to get involved.

💻 Contributing

Contributions are welcome! Feel free to report an issue or open a pull request. Take a look at our guidelines here.

🔑 License

The package is released under MIT license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

puncc-0.9.1.tar.gz (66.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

puncc-0.9.1-py3-none-any.whl (82.3 kB view details)

Uploaded Python 3

File details

Details for the file puncc-0.9.1.tar.gz.

File metadata

  • Download URL: puncc-0.9.1.tar.gz
  • Upload date:
  • Size: 66.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.18

File hashes

Hashes for puncc-0.9.1.tar.gz
Algorithm Hash digest
SHA256 15da66451c70e2f91c06a3176c42fe7d9a971957d2f68bb982185f2badd0eece
MD5 92cae0d6f2f346117b68ea74135eecb9
BLAKE2b-256 349d1d2975f2da7a0fd82cdd11f6fc748577582749ab74d30dcf20aa2a57dd9a

See more details on using hashes here.

File details

Details for the file puncc-0.9.1-py3-none-any.whl.

File metadata

  • Download URL: puncc-0.9.1-py3-none-any.whl
  • Upload date:
  • Size: 82.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.18

File hashes

Hashes for puncc-0.9.1-py3-none-any.whl
Algorithm Hash digest
SHA256 d4ec8d91f30e6103972f33de5d52a8107d1810982a726e0f23b8427248c14638
MD5 03416e2029496d17c0780efd78fd2209
BLAKE2b-256 7d03c6fd0c955188003f6d1a518c18688cc455c53fbca0210868fba9d72f634d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page