scripts for calculating likelihood ratios
Project description
LIR Python Likelihood Ratio Library
This library provides a collection of scripts to aid calibration, and calculation and evaluation of Likelihood Ratios.
A simple score-based LR system
A score-based LR system needs a scorer and a calibrator. The most basic setup uses a training set and a test set. Both the scorer and the calibrator are fitted on the training set.
import lir
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
# generate some data randomly from a normal distribution
X = np.concatenate([np.random.normal(loc=0, size=(100, 1)),
np.random.normal(loc=1, size=(100, 1))])
y = np.concatenate([np.zeros(100), np.ones(100)])
# split the data into train and test
X_train, X_test, y_train, y_test = train_test_split(X, y)
# initialize a scorer and a calibrator
scorer = LogisticRegression(solver='lbfgs') # choose any sklearn style classifier
calibrator = lir.KDECalibrator() # use plain KDE for calibration
calibrated_scorer = lir.CalibratedScorer(scorer, calibrator)
# fit and predict
calibrated_scorer.fit(X_train, y_train)
lrs_test = calibrated_scorer.predict_lr(X_test)
# print the quality of the system as log likelihood ratio cost (lower is better)
print('The log likelihood ratio cost is', lir.metrics.cllr(lrs_test, y_test), '(lower is better)')
print('The discriminative power is', lir.metrics.cllr_min(lrs_test, y_test), '(lower is better)')
# plot calibration
import lir.plotting
with lir.plotting.show() as ax:
ax.pav(lrs_test, y_test)
The log likelihood ratio cost (Cllr) may be used as a metric of performance. In this case it should yield a value of around .8, but highly variable due to the small number of samples. Increase the sample size to get more stable results.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file lir-0.1.28.tar.gz
.
File metadata
- Download URL: lir-0.1.28.tar.gz
- Upload date:
- Size: 50.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | af24b834b2604d634174532060d59dfbc93dd985dbeb09d7ce5faad0eba1a734 |
|
MD5 | 4a207222dc4a665fe1a165f890bc793e |
|
BLAKE2b-256 | 9867c47941ba8fa6c4b5ecf89823ef6774faed01d19ea3cdcc745ee9507e8e9a |
File details
Details for the file lir-0.1.28-py3-none-any.whl
.
File metadata
- Download URL: lir-0.1.28-py3-none-any.whl
- Upload date:
- Size: 61.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f9376b89553972c174cf07dd83dfd6204c7238885fcc6c7db4f193ef3c01c76c |
|
MD5 | ba777283cb4136e997ff178023204e36 |
|
BLAKE2b-256 | 0eee79ccc309aab04f2393978fdd456bfbc51f138b29288a8dc2795d8ba078fd |