insurance-quantile
Actuarial tail risk quantile and expectile regression for UK personal lines pricing.
Blog post: Quantile GBMs for Insurance: TVaR, ILFs, and Large Loss Loadings
Wraps CatBoost's native MultiQuantile loss with the vocabulary actuaries actually use: TVaR, large loss loading, ILFs, OEP curves. Polars in, Polars out.
The problem
Tweedie GBMs estimate E[Y | X] well. But pricing teams routinely need:
- Large loss loading: how much extra to add for claims that blow past the mean
- Increased Limits Factors: what to charge for higher policy limits
- TVaR per risk: expected loss given it exceeds its VaR threshold
- OEP curves: exceedance probability for reinsurance attachment points
None of this comes out of a Tweedie model. You need the full conditional distribution, or at least a set of quantiles.
Why CatBoost quantile regression, not quantile forests or GLMs
CatBoost's MultiQuantile loss trains a single model for all quantile levels simultaneously — shared feature representations, one training pass. In practice this is competitive with training a separate model per quantile level and is faster to fit than quantile random forests on structured tabular data. The downside is quantile crossing at prediction time (CatBoost issue #2317), which we fix with per-row isotonic regression.
For heavy-tailed lines (motor BI, liability), expectile mode is available. Expectile regression is both elicitable (has a proper scoring rule, making it backtestable) and satisfies subadditivity under elliptical distributions. For general heavy-tailed non-elliptical distributions, subadditivity of expectiles is not guaranteed — Expected Shortfall (TVaR) remains the standard coherent risk measure for capital purposes. Expectile mode is appropriate when elicitability and backtest-friendliness are the priority, not when Solvency II coherence is the requirement.
Installation
```shell
uv add insurance-quantile
```
💬 Questions or feedback? Start a Discussion. Found it useful? A ⭐ helps others find it.
Quick start
```python
import numpy as np
import polars as pl
from sklearn.model_selection import train_test_split

from insurance_quantile import QuantileGBM, per_risk_tvar, large_loss_loading

# Synthetic motor severity portfolio — 1,000 non-zero claims
rng = np.random.default_rng(42)
n = 1_000
vehicle_age = rng.integers(1, 15, n).astype(float)
driver_age = rng.integers(21, 75, n).astype(float)
ncd_years = rng.integers(0, 9, n).astype(float)
vehicle_group = rng.choice([1.0, 2.0, 3.0, 4.0], size=n)  # encoded as float
exposure = rng.uniform(0.3, 1.0, n)

# Heteroskedastic lognormal severity: tail weight increases with vehicle group
log_mu = 6.5 + 0.03 * vehicle_age - 0.01 * ncd_years + 0.1 * vehicle_group
log_sigma = 0.5 + 0.05 * vehicle_group  # tail weight varies by segment
claim_amount = np.exp(rng.normal(log_mu, log_sigma, n))

# Feature matrix — QuantileGBM requires Polars input
feature_names = ["vehicle_age", "driver_age", "ncd_years", "vehicle_group"]
X = pl.DataFrame({
    "vehicle_age": vehicle_age,
    "driver_age": driver_age,
    "ncd_years": ncd_years,
    "vehicle_group": vehicle_group,
})
y = pl.Series("claim_amount", claim_amount)

idx_train, idx_val = train_test_split(np.arange(n), test_size=0.2, random_state=42)
X_train, X_val = X[idx_train], X[idx_val]
y_train, y_val = y[idx_train], y[idx_val]
exposure_train = pl.Series("exposure", exposure[idx_train])

# Fit quantile GBM
model = QuantileGBM(
    quantiles=[0.5, 0.75, 0.9, 0.95, 0.99],
    fix_crossing=True,
    iterations=500,
)
model.fit(X_train, y_train, exposure=exposure_train)

# Predict quantiles — Polars DataFrame, columns: q_0.5, q_0.75, q_0.9, q_0.95, q_0.99
preds = model.predict(X_val)

# TVaR per risk
tvar = per_risk_tvar(model, X_val, alpha=0.95)

# Large loss loading: requires a fitted mean model for comparison.
# large_loss_loading handles Polars DataFrames and numpy arrays transparently —
# raw CatBoostRegressor or sklearn models that do not accept Polars are converted
# to numpy automatically.
from catboost import CatBoostRegressor

tweedie_model = CatBoostRegressor(
    loss_function="Tweedie:variance_power=1.5", iterations=200, verbose=0
)
tweedie_model.fit(X_train.to_numpy(), y_train.to_numpy())
loading = large_loss_loading(tweedie_model, model, X_val, alpha=0.95)
```
TwoPartQuantilePremium
New in v0.3.0. Two-part (frequency × severity) quantile premium at an explicit aggregate confidence level, implementing the framework of Heras et al. (2018) / Laporta et al. (2024) with the ML extension from NAAJ 2025.
The key insight: running QuantileGBM directly on zero-inflated aggregate losses gives trivial (zero) quantiles for low-frequency risks when the confidence level is below the no-claim probability. The two-part approach maps the desired aggregate quantile τ to a risk-specific adjusted severity quantile:
τ_i = (τ - p_i) / (1 - p_i)
where p_i = P(N_i = 0 | x_i) is the no-claim probability. For UK motor OD with p_i ≈ 0.80 and τ = 0.90, this gives τ_i ≈ 0.50 — the severity median. The severity model is asked for a well-estimated interior quantile rather than an extreme percentile from a distribution with 80% mass at zero.
The loaded premium is:
P_i = γ · Q̃_{τ_i}(x_i) + (1 − γ) · E[S_i | x_i]
where γ ∈ [0, 1] is the safety loading factor. The safety loading γ · (Q̃_{τ_i} − E[S_i]) is formal and risk-specific — derived from the explicit confidence level τ and the risk's own no-claim probability.
```python
import numpy as np
import polars as pl
from catboost import CatBoostRegressor
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

from insurance_quantile import QuantileGBM, TwoPartQuantilePremium

rng = np.random.default_rng(42)
n = 8_000
vehicle_age = rng.integers(1, 15, n).astype(float)
driver_age = rng.integers(21, 75, n).astype(float)
ncd_years = rng.integers(0, 9, n).astype(float)
vehicle_group = rng.choice([1.0, 2.0, 3.0, 4.0], size=n)

feature_names = ["vehicle_age", "driver_age", "ncd_years", "vehicle_group"]
X_np = np.column_stack([vehicle_age, driver_age, ncd_years, vehicle_group])
X = pl.DataFrame(dict(zip(feature_names, X_np.T)))

# Low-frequency: 14% claim rate
claim_prob = np.clip(
    0.06 + 0.008 * vehicle_age - 0.004 * ncd_years + 0.02 * vehicle_group, 0.02, 0.45
)
has_claim = rng.random(n) < claim_prob
log_mu = 7.2 + 0.04 * vehicle_age + 0.12 * vehicle_group
y_sev = np.where(has_claim, np.exp(rng.normal(log_mu, 0.55 + 0.05 * vehicle_group)), 0.0)

idx_tr, idx_val = train_test_split(np.arange(n), test_size=0.2, random_state=0)
X_train, X_val = X[idx_tr], X[idx_val]

# Step 1: fit frequency model (binary classifier, target = claim indicator)
freq_model = LogisticRegression(max_iter=500)
freq_model.fit(X_np[idx_tr], has_claim[idx_tr].astype(int))

# Step 2: fit severity QuantileGBM on non-zero claims only.
# Include quantile levels that will cover the expected range of tau_i values.
claim_rows = idx_tr[y_sev[idx_tr] > 0]
sev_model = QuantileGBM(
    quantiles=[0.40, 0.50, 0.60, 0.70, 0.80, 0.90, 0.95, 0.99],
    fix_crossing=True,
    iterations=300,
)
sev_model.fit(X[claim_rows], pl.Series("y", y_sev[claim_rows]))

# Step 3 (optional): fit mean severity model for a more accurate pure premium
mean_sev_model = CatBoostRegressor(loss_function="Gamma", iterations=200, verbose=0)
mean_sev_model.fit(X_np[claim_rows], y_sev[claim_rows])

# Compute two-part quantile premium
tpqp = TwoPartQuantilePremium(
    freq_model=freq_model,
    sev_model=sev_model,
    mean_sev_model=mean_sev_model,  # optional: improves pure premium accuracy
)
result = tpqp.predict_premium(
    X_val,
    tau=0.95,   # target aggregate confidence level
    gamma=0.5,  # safety loading factor (0 = no loading, 1 = severity quantile only)
)
print(result.summary())
# TwoPartResult (tau=0.95, gamma=0.50)
# Mean loaded premium: £ 612.34
# Mean pure premium: £ 572.18
# Mean safety loading: £ 40.16
# Loading as % premium: 6.6%
# Policies using fallback: 0 of 1,600 (0.0%)

# Per-policy output as a Polars DataFrame
df_out = result.to_polars()
# columns: premium, pure_premium, safety_loading, no_claim_prob,
#          adjusted_tau, severity_quantile
```
The adjusted τ_i: result.adjusted_tau gives the per-risk severity quantile level actually used. For a policy with no-claim probability 0.85 and τ = 0.95, this is (0.95 − 0.85) / 0.15 = 0.67 — the 67th percentile of severity. Policies with p_i ≥ τ fall back to the pure premium with zero safety loading (flagged in result.n_fallback).
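The mapping is easy to sanity-check by hand. A minimal numpy sketch (the `adjusted_tau` helper here is illustrative, not part of the library's API):

```python
import numpy as np

def adjusted_tau(tau: float, p_no_claim: np.ndarray) -> np.ndarray:
    """Map the aggregate confidence level tau to per-risk severity levels
    tau_i = (tau - p_i) / (1 - p_i); risks with p_i >= tau get NaN to
    signal the pure-premium fallback."""
    p = np.asarray(p_no_claim, dtype=float)
    return np.where(p >= tau, np.nan, (tau - p) / (1.0 - p))

p = np.array([0.80, 0.85, 0.96])
tau_i = adjusted_tau(0.95, p)
# 0.80 → 0.75, 0.85 → 0.667, 0.96 → NaN (fallback)
```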
Setting γ: γ = 0.5 is a reasonable starting point for UK personal lines. It blends equally between the quantile premium and the pure premium. For Solvency II SCR pricing use τ = 0.995 with γ = 1. Document your choice in the model sign-off.
Module overview
insurance_quantile/
- QuantileGBM — core class: fit/predict, quantile or expectile mode
- per_risk_tvar — TVaR per risk at confidence level alpha
- portfolio_tvar — aggregated portfolio TVaR
- large_loss_loading — additive loading: TVaR minus mean model prediction
- ilf — Increased Limits Factor: E[min(Y, L2)] / E[min(Y, L1)]
- exceedance_curve — P(Y > x) averaged across the portfolio
- oep_curve — occurrence exceedance probability (OEP)
- coverage_check — calibration: observed vs expected coverage per quantile
- pinball_loss — standard scoring rule for quantile regression

insurance_quantile.eqrn/
- EQRNModel — extreme quantile regression neural network (Pasche & Engelke 2024)
- EQRNDiagnostics — GPD QQ, calibration, and threshold stability plots
- GPDNet — feedforward network for covariate-dependent GPD parameters
- IntermediateQuantileEstimator — K-fold OOF intermediate quantile estimation
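For reference, the check (pinball) loss that `pinball_loss` reports is the standard scoring rule for a quantile forecast — a minimal sketch of the formula (illustrative, not the library's implementation):

```python
import numpy as np

def pinball(y_true: np.ndarray, y_pred: np.ndarray, alpha: float) -> float:
    """Mean check loss at level alpha: under-predictions are penalised
    with weight alpha, over-predictions with weight (1 - alpha)."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.mean(np.where(diff >= 0, alpha * diff, (alpha - 1.0) * diff)))

y = np.array([100.0, 200.0, 300.0])
loss = pinball(y, np.full(3, 200.0), alpha=0.9)
# the under-prediction of the 300 claim dominates at alpha = 0.9
```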
Expectile mode
```python
# For motor bodily injury or other heavy-tailed lines.
# X_train and y_train are pl.DataFrame and pl.Series as in the quick start above.
model = QuantileGBM(
    quantiles=[0.5, 0.75, 0.9, 0.95],
    use_expectile=True,  # fits a separate CatBoost model per alpha
)
model.fit(X_train, y_train)
```
Expectiles are not the same as quantiles. The e_0.9 expectile is generally different from Q(0.9). Use expectile mode when you need an elicitable, backtestable tail risk measure — not when you need P(Y > x) directly. For capital purposes requiring coherence (e.g. Solvency II), use TVaR (Expected Shortfall) instead.
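The gap is easy to see numerically. This sketch computes a sample expectile by the standard asymmetric-least-squares fixed point and compares it with the empirical quantile (illustrative code, not part of the library):

```python
import numpy as np

def expectile(y: np.ndarray, alpha: float, tol: float = 1e-10) -> float:
    """Sample expectile via the asymmetric-least-squares fixed point:
    the expectile is the weighted mean with weight alpha on observations
    above it and (1 - alpha) on those below."""
    m = float(y.mean())
    for _ in range(200):
        w = np.where(y > m, alpha, 1.0 - alpha)
        m_new = float(np.average(y, weights=w))
        if abs(m_new - m) < tol:
            break
        m = m_new
    return m

rng = np.random.default_rng(0)
y = rng.lognormal(mean=7.0, sigma=0.6, size=100_000)
q90 = float(np.quantile(y, 0.9))
e90 = expectile(y, 0.9)
# For this right-skewed severity sample, e_0.9 and Q(0.9) differ materially.
```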
Zero-inflated data
Most personal lines portfolios have a large mass of zero claims. There are two ways to handle this:
- Model the full distribution (including zeros): correct but quantiles below the zero-fraction level will all be zero, which is less useful for large loss loading.
- Separate frequency and severity (recommended): fit QuantileGBM only on non-zero claims (severity), with a separate frequency model. Large loss loading then applies to severity only. Document which approach you've used in your model sign-off.
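The zero-mass effect behind the first approach is worth seeing concretely (numpy sketch with a synthetic 80% claim-free portfolio):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 50_000
has_claim = rng.random(n) < 0.20            # ~80% of policies are claim-free
severity = np.exp(rng.normal(7.0, 0.6, n))
agg = np.where(has_claim, severity, 0.0)    # zero-inflated aggregate loss

q75 = float(np.quantile(agg, 0.75))         # below the zero mass: exactly 0
q95 = float(np.quantile(agg, 0.95))         # above it: a genuine severity level
```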
Integration with insurance-conformal
QuantileGBM output feeds directly into insurance-conformal for Conformalized Quantile Regression (CQR):
```python
import numpy as np
import polars as pl
from sklearn.model_selection import train_test_split

from insurance_quantile import QuantileGBM
from insurance_conformal import ConformalQuantileRegressor

# Assumes X_train/X_val and y_train/y_val are defined as in the quick start above.
# Split into train / calibration sets for the conformal coverage guarantee.
X_all = pl.concat([X_train, X_val])
y_all = pl.concat([y_train, y_val])
idx_tr, idx_cal = train_test_split(np.arange(len(X_all)), test_size=0.25, random_state=0)
X_tr2, X_cal2 = X_all[idx_tr], X_all[idx_cal]
y_tr2, y_cal2 = y_all[idx_tr], y_all[idx_cal]

model = QuantileGBM(quantiles=[0.05, 0.95]).fit(X_tr2, y_tr2)
preds_cal = model.predict(X_cal2)

cqr = ConformalQuantileRegressor(alpha=0.1)
cqr.fit(y_cal2, preds_cal["q_0.05"], preds_cal["q_0.95"])
# Guaranteed 90% coverage, distribution-free
```
Design decisions
Quantile crossing fix: isotonic regression per row at predict time. Nothing in CatBoost's MultiQuantile loss constrains the predicted quantiles for a given row to be monotone in the level, so individual risks can show crossing. The fix is O(n_rows × n_quantiles) and adds negligible overhead.
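The per-row fix can be sketched with scikit-learn's IsotonicRegression, which is the least-squares projection onto non-decreasing sequences (illustrative; the library's internal implementation may differ):

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression

def fix_crossing(pred: np.ndarray) -> np.ndarray:
    """Project each row of quantile predictions (n_rows, n_quantiles) onto
    the set of non-decreasing sequences: adjacent violators are pooled to
    their mean (PAVA), so already-ordered rows pass through unchanged."""
    iso = IsotonicRegression(increasing=True)
    positions = np.arange(pred.shape[1])
    return np.vstack([iso.fit_transform(positions, row) for row in pred])

# One risk whose q90 prediction crosses below its q75 prediction
pred = np.array([[100.0, 140.0, 130.0, 200.0]])
fixed = fix_crossing(pred)
# → [[100., 135., 135., 200.]]: the violating pair is pooled to its mean
```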
Exposure as sample_weight: exposure is passed to CatBoost as sample_weight, not as an offset. This weights each row's loss contribution, which is appropriate when the target is aggregate cost. If your target is severity (cost per claim), do not pass exposure here.
TVaR approximation: we estimate TVaR using trapezoidal integration over the quantile function at the stored levels above alpha. This correctly weights each quantile estimate by its interval width, unlike a naive mean. Accuracy improves with the number of high quantile levels in the model — include 0.95, 0.99 at minimum for TVaR at alpha=0.9.
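The integration scheme can be sketched as follows (illustrative standalone code, not the library's internals — `tvar_from_quantiles` is a hypothetical helper name):

```python
import numpy as np

def tvar_from_quantiles(levels: np.ndarray, quants: np.ndarray, alpha: float) -> float:
    """TVaR_alpha = (1 / (1 - alpha)) * integral_alpha^1 Q(u) du, by the
    trapezoidal rule on a fine grid over the stored quantile levels.
    Beyond the highest stored level the quantile curve is held flat,
    so the estimate under-weights the far tail."""
    grid = np.linspace(alpha, 1.0 - 1e-6, 200)  # avoid the u = 1 endpoint
    q = np.interp(grid, levels, quants)         # flat extrapolation at the ends
    integral = float(np.sum((q[1:] + q[:-1]) * np.diff(grid)) / 2.0)
    return integral / (grid[-1] - grid[0])

levels = np.array([0.5, 0.75, 0.9, 0.95, 0.99])
quants = np.array([1000.0, 1500.0, 2200.0, 2800.0, 4500.0])
tvar_90 = tvar_from_quantiles(levels, quants, alpha=0.9)
# lands between Q(0.9) = 2200 and the top stored quantile 4500
```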
ILF integration: E[min(Y, L)] = integral_0^L P(Y > x) dx, integrated numerically using the trapezoidal rule over the interpolated survival function from quantile predictions. 200 integration points is sufficient for smooth severity distributions.
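The same idea in standalone form, with the survival function obtained by inverse interpolation of the quantile grid (illustrative sketch; `limited_expected_value` is a hypothetical helper, not the library's `ilf` API):

```python
import numpy as np

def limited_expected_value(levels, quants, limit, n_points=200):
    """E[min(Y, L)] = integral_0^L P(Y > x) dx. The CDF is read off the
    stored (level, quantile) pairs by inverse interpolation; survival is
    held at 1 - max(level) beyond the top stored quantile."""
    xs = np.linspace(0.0, limit, n_points)
    cdf = np.interp(xs, quants, levels, left=0.0)  # F(x) from the quantile grid
    surv = 1.0 - cdf
    return float(np.sum((surv[1:] + surv[:-1]) * np.diff(xs)) / 2.0)

levels = np.array([0.1, 0.25, 0.5, 0.75, 0.9, 0.95, 0.99])
quants = np.array([400.0, 700.0, 1000.0, 1500.0, 2200.0, 2800.0, 4500.0])
ilf = limited_expected_value(levels, quants, 10_000) / limited_expected_value(levels, quants, 5_000)
# slightly above 1: little survival mass remains past the £5,000 base limit
```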
Performance
Benchmarked against parametric lognormal quantiles (OLS on log(Y) with global sigma) on synthetic severity data — 5,000 claims from a heteroskedastic lognormal DGP where tail weight (log_sigma) varies by vehicle group from 0.46 to 0.64. 4,000 train / 1,000 test split. Numbers from the post-P0-fix benchmark run:
TABLE 1: Quantile calibration (coverage) and pinball loss
| Quantile | Coverage — Lognormal | Coverage — QuantileGBM | Pinball — Lognormal | Pinball — QuantileGBM |
|---|---|---|---|---|
| Q90 | 0.8970 | 0.8590 | 185.8 | 197.9 |
| Q95 | 0.9470 | 0.9150 | 122.3 | 136.7 |
| Q99 | 0.9890 | 0.9560 | 38.4 | 54.8 |
TABLE 2: TVaR accuracy vs DGP analytical truth (TVaR_90)
| Metric | Lognormal baseline | QuantileGBM |
|---|---|---|
| MAE vs DGP truth | 315.1 | 477.0 |
| RMSE vs DGP truth | 370.9 | 733.9 |
| Bias (mean over/under-estimate) | −70.9 | −39.8 |
TABLE 3: ILF accuracy vs DGP truth (base limit £5,000)
| Limit | ILF (DGP) | ILF (Lognormal) | ILF (QuantileGBM) | Error — Lognormal | Error — QuantileGBM |
|---|---|---|---|---|---|
| £10,000 | 1.0070 | 1.0042 | 1.0029 | −0.0028 | −0.0041 |
| £25,000 | 1.0074 | 1.0043 | 1.0026 | −0.0031 | −0.0049 |
| £50,000 | 1.0074 | 1.0043 | 1.0031 | −0.0031 | −0.0043 |
| £100,000 | 1.0074 | 1.0043 | 1.0017 | −0.0031 | −0.0057 |
TABLE 4: Q95 coverage by vehicle group (heteroskedastic test)
| Group | True log_sigma | Coverage — Lognormal | Coverage — QuantileGBM |
|---|---|---|---|
| 1 | 0.460 | 0.9878 | 0.9143 |
| 2 | 0.520 | 0.9421 | 0.8843 |
| 3 | 0.580 | 0.9434 | 0.9283 |
| 4 | 0.640 | 0.9153 | 0.9315 |
Honest interpretation: At this sample size (5,000 claims, 1,000 test), the lognormal baseline outperforms QuantileGBM on pinball loss, TVaR MAE/RMSE, and ILF accuracy for all limits tested. The GBM has lower TVaR bias (−39.8 vs −70.9), which matters when you care about directional accuracy rather than absolute error. The heteroskedastic Q95 coverage test shows QuantileGBM adapts better for group 4 (heaviest tail, 0.9315 vs 0.9153) but worse for groups 1–2.
The OLS lognormal benefits from the fact that the DGP is lognormal — a correctly specified parametric model will win when the distribution family is right. The GBM advantage appears at larger sample sizes and when the true distribution diverges from lognormal: e.g. bimodal severity (small repair + large BI), threshold effects by region, or mixing across lines. Run the full Databricks notebook for larger-scale validation.
When to use: Where severity genuinely departs from the parametric family you'd otherwise assume, and you have more than ~10,000 non-zero claims. For TVaR and large loss loading where the tail shape varies across risk segments in a way that a single-parameter family cannot capture.
When NOT to use: When the portfolio has fewer than ~5,000 large claims — parametric regularisation wins at small n. When the actuarial deliverable requires a smooth, monotone ILF curve — quantile regression is not constrained to be monotone in the limit dimension without additional work.
Databricks Notebook
A ready-to-run Databricks notebook benchmarking this library against standard approaches is available in burning-cost-examples.
Related Libraries
| Library | Description |
|---|---|
| insurance-severity | Spliced severity models and EVT — parametric complement to quantile regression for heavy-tailed lines |
| insurance-conformal | Conformalized Quantile Regression — wraps this library's output to give distribution-free coverage guarantees |
| insurance-distributional | Parametric severity distributions (Pareto, Gamma, LogNormal) — alternative approach when you need closed-form tail quantities |
| shap-relativities | Extract what's driving the tail — SHAP values on the QuantileGBM output |
| insurance-cv | Walk-forward cross-validation for time-structured insurance data |
References
- Fissler, T. and Ziegel, J.F. (2025). Two-part quantile premium principles with formal safety loading. North American Actuarial Journal.
- Pasche, O. C. and Engelke, S. (2024). Neural networks for extreme quantile regression with an application to forecasting of flood risk. Annals of Applied Statistics, 18(4).
Read more
Your Burning Cost Has a Tail Risk Problem — why Tweedie models systematically misprice tail risk and how quantile regression fills the gap.
Source repos
This package consolidates two previously separate libraries:
- insurance-quantile — core CatBoost quantile/expectile GBM (v0.1.x)
- insurance-eqrn — archived, merged into insurance_quantile.eqrn
Requirements
- Python 3.10+
- catboost >= 1.2
- polars >= 1.0
- scikit-learn >= 1.3
- numpy >= 1.24
Licence
MIT