Skip to main content

scripts for calculating likelihood ratios

Project description

LIR Python Likelihood Ratio Library

This library provides a collection of scripts to aid calibration, and calculation and evaluation of Likelihood Ratios.

A simple score-based LR system

A score-based LR system needs a scorer and a calibrator. The most basic setup uses a training set and a test set. Both the scorer and the calibrator are fitted on the training set.

import lir
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# generate some data randomly from a normal distribution
X = np.concatenate([np.random.normal(loc=0, size=(100, 1)),
              np.random.normal(loc=1, size=(100, 1))])
y = np.concatenate([np.zeros(100), np.ones(100)])

# split the data into train and test
X_train, X_test, y_train, y_test = train_test_split(X, y)

# initialize a scorer and a calibrator
scorer = LogisticRegression(solver='lbfgs')  # choose any sklearn style classifier
calibrator = lir.KDECalibrator()  # use plain KDE for calibration
calibrated_scorer = lir.CalibratedScorer(scorer, calibrator)

# fit and predict
calibrated_scorer.fit(X_train, y_train)
lrs_test = calibrated_scorer.predict_lr(X_test)

# print the quality of the system as log likelihood ratio cost (lower is better)
print('The log likelihood ratio cost is', lir.metrics.cllr(lrs_test, y_test), '(lower is better)')
print('The discriminative power is', lir.metrics.cllr_min(lrs_test, y_test), '(lower is better)')

# plot calibration
import lir.plotting
with lir.plotting.show() as ax:
    ax.pav(lrs_test, y_test)

The log likelihood ratio cost (Cllr) may be used as a metric of performance. In this case it should yield a value of around .8, but highly variable due to the small number of samples. Increase the sample size to get more stable results.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lir-0.1.28.tar.gz (50.6 kB view details)

Uploaded Source

Built Distribution

lir-0.1.28-py3-none-any.whl (61.4 kB view details)

Uploaded Python 3

File details

Details for the file lir-0.1.28.tar.gz.

File metadata

  • Download URL: lir-0.1.28.tar.gz
  • Upload date:
  • Size: 50.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.15

File hashes

Hashes for lir-0.1.28.tar.gz
Algorithm Hash digest
SHA256 af24b834b2604d634174532060d59dfbc93dd985dbeb09d7ce5faad0eba1a734
MD5 4a207222dc4a665fe1a165f890bc793e
BLAKE2b-256 9867c47941ba8fa6c4b5ecf89823ef6774faed01d19ea3cdcc745ee9507e8e9a

See more details on using hashes here.

File details

Details for the file lir-0.1.28-py3-none-any.whl.

File metadata

  • Download URL: lir-0.1.28-py3-none-any.whl
  • Upload date:
  • Size: 61.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.15

File hashes

Hashes for lir-0.1.28-py3-none-any.whl
Algorithm Hash digest
SHA256 f9376b89553972c174cf07dd83dfd6204c7238885fcc6c7db4f193ef3c01c76c
MD5 ba777283cb4136e997ff178023204e36
BLAKE2b-256 0eee79ccc309aab04f2393978fdd456bfbc51f138b29288a8dc2795d8ba078fd

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page