pycaleva

A framework for calibration evaluation of binary classification models

These details have not been verified by PyPI

Project links

Project description

A framework for calibration evaluation of binary classification models.

When performing classification tasks you sometimes want to obtain the probability of a class label instead of the class label itself. For example, it might be interesting to determine the risk of cancer for a patient. It is desireable to have a calibrated model which delivers predicted probabilities very close to the actual class membership probabilities. For this reason, this framework was developed allowing users to measure the calibration of binary classification models.

Evaluate the calibration of binary classification models with probabilistic output (LogisticRegression, SVM, NeuronalNets ...).
Apply your model to testdata and use true class labels and predicted probabilities as input for the framework.
Various statistical tests, metrics and plots are available.
Supports creating a calibration report in pdf-format for your model.

Image Design

See the documentation for detailed information about classes and methods.

Installation

$ pip install pycaleva

or build on your own

$ git clone https://github.com/MartinWeigl/pycaleva.git
$ cd pycaleva
$ python setup.py install

Requirements

numpy>=1.17
scipy>=1.3
matplotlib>=3.1
tqdm>=4.40
pandas>=1.3.0
statsmodels>=0.13.1
fpdf2>=2.5.0
ipython>=7.30.1

Usage

Import and initialize

from pycaleva import CalibrationEvaluator
ce = CalibrationEvaluator(y_test, pred_prob, outsample=True, n_groups='auto')

Apply statistical tests

ce.hosmerlemeshow()     # Hosmer Lemeshow Test
ce.pigeonheyse()        # Pigeon Heyse Test
ce.z_test()             # Spiegelhalter z-Test
ce.calbelt(plot=False)  # Calibrationi Belt (Test only)

Show calibration plot
```
ce.calibration_plot()
```
Show calibration belt
```
ce.calbelt(plot=True)
```
Get various metrics
```
ce.metrics()
```

Create pdf calibration report

ce.calibration_report('report.pdf', 'my_model')

See the documentation of single methods for detailed usage examples.

Example Results

Well calibrated model	Poorly calibrated model


hltest_result(statistic=4.982635477424991, pvalue=0.8358193332183672, dof=9)	hltest_result(statistic=26.32792475118742, pvalue=0.0018051545107069522, dof=9)
ztest_result(statistic=-0.21590257919669287, pvalue=0.829063686607032)	ztest_result(statistic=-3.196125145498827, pvalue=0.0013928668407116645)

Features

Statistical tests for binary model calibration
- Hosmer Lemeshow Test
- Pigeon Heyse Test
- Spiegelhalter z-test
- Calibration belt
Graphical represantions showing calibration of binary models
- Calibration plot
- Calibration belt
Various Metrics
- Brier Score
- Adaptive Calibration Error
- Maximum Calibration Error
- Area within LOWESS Curve
- (AUROC)

The above features are explained in more detail in PyCalEva's documentation

References

Statistical tests and metrics:

[1] Hosmer Jr, David W., Stanley Lemeshow, and Rodney X. Sturdivant. Applied logistic regression. Vol. 398. John Wiley & Sons, 2013.

[2] Pigeon, Joseph G., and Joseph F. Heyse. An improved goodness of fit statistic for probability prediction models. Biometrical Journal: Journal of Mathematical Methods in Biosciences 41.1 (1999): 71-82.

[3] Spiegelhalter, D. J. (1986). Probabilistic prediction in patient management and clinical trials. Statistics in medicine, 5(5), 421-433.

[4] Huang, Y., Li, W., Macheret, F., Gabriel, R. A., & Ohno-Machado, L. (2020). A tutorial on calibration measurements and calibration models for clinical prediction models. Journal of the American Medical Informatics Association, 27(4), 621-633.
Calibration plot:

[5] Jr, F. E. H. (2021). rms: Regression modeling strategies (R package version 6.2-0) [Computer software]. The Comprehensive R Archive Network. Available from https://CRAN.R-project.org/package=rms
Calibration belt:

[6] Nattino, G., Finazzi, S., & Bertolini, G. (2014). A new calibration test and a reappraisal of the calibration belt for the assessment of prediction models based on dichotomous outcomes. Statistics in medicine, 33(14), 2390-2407.

[7] Bulgarelli, L. (2021). calibrattion-belt: Assessment of calibration in binomial prediction models [Computer software]. Available from https://github.com/fabiankueppers/calibration-framework

[8] Nattino, G., Finazzi, S., Bertolini, G., Rossi, C., & Carrara, G. (2017). givitiR: The giviti calibration test and belt (R package version 1.3) [Computer software]. The Comprehensive R Archive Network. Available from https://CRAN.R-project.org/package=givitiR
Others:

[9] Sturges, H. A. (1926). The choice of a class interval. Journal of the american statistical association, 21(153), 65-66.

For most of the implemented methods in this software you can find references in the documentation as well.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.8.2

May 2, 2024

0.8.0

May 1, 2024

This version

0.7.0

Dec 10, 2022

0.6.1

Sep 4, 2022

0.6.0

May 21, 2022

0.5.0

May 21, 2022

0.3.0

May 21, 2022

0.2.0

Mar 12, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycaleva-0.7.0.tar.gz (23.2 kB view details)

Uploaded Dec 10, 2022 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pycaleva-0.7.0-py3-none-any.whl (24.6 kB view details)

Uploaded Dec 10, 2022 Python 3

File details

Details for the file pycaleva-0.7.0.tar.gz.

File metadata

Download URL: pycaleva-0.7.0.tar.gz
Upload date: Dec 10, 2022
Size: 23.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.28.1 requests-toolbelt/0.9.1 urllib3/1.26.13 tqdm/4.64.1 importlib-metadata/5.1.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.6 CPython/3.10.8

File hashes

Hashes for pycaleva-0.7.0.tar.gz
Algorithm	Hash digest
SHA256	`dccca81494b218fb3bb550e881aa67196cda3fe3c7b198474b051f4d08913423`
MD5	`ad4086ab6aeaec9aa03d16c635b7ade0`
BLAKE2b-256	`41a05c2780bb75ba12bebc9908a6eeea7cafb77cd3ed3305187767c5a1951b7b`

See more details on using hashes here.

File details

Details for the file pycaleva-0.7.0-py3-none-any.whl.

File metadata

Download URL: pycaleva-0.7.0-py3-none-any.whl
Upload date: Dec 10, 2022
Size: 24.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.28.1 requests-toolbelt/0.9.1 urllib3/1.26.13 tqdm/4.64.1 importlib-metadata/5.1.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.6 CPython/3.10.8

File hashes

Hashes for pycaleva-0.7.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3e88cef4f0916c34566b3adcacdd9b2702c1d233b9cce16c7cb3b3f546c7af87`
MD5	`a802a08642a4b2b494f261392d26593e`
BLAKE2b-256	`c91d0df174a34a99699b8bf04475e39d21885963492b4a81c499992c108fa05c`

See more details on using hashes here.

pycaleva 0.7.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

A framework for calibration evaluation of binary classification models.

Installation

Requirements

Usage

Example Results

Features

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes