Model risk management and MRM governance packs for UK GI pricing — Consumer Duty, TR24/2, and SS1/23 best-practice aligned

These details have not been verified by PyPI

Project links

Project description

insurance-governance

Automated statistical validation and MRM governance pack generation for UK pricing models — so your next PRA supervisory visit has consistent, auditable evidence across every model in production, not a folder of bespoke analyst notebooks.

Blog post: Automated MRM Governance for UK Insurance Pricing Models

Features

Five-test validation suite — Gini with bootstrap CI, A/E with Poisson CI, Hosmer-Lemeshow, lift chart, PSI; same structure for every model
MRM governance packs — self-contained HTML with risk tier rationale, assumptions register, approval history; print-to-PDF in under 1 second
Risk tier scoring — 6-dimension composite (GWP materiality, complexity, external data, validation recency, drift history, regulatory exposure); 0–100 score with documented rationale
Model inventory — JSON file checked into git; tracks validation history, overdue reviews, and approval chains
Fairness integration — accepts fairness audit results from insurance-fairness as a governance pack section
RAG status — green/amber/red status per test and overall; monitoring triggers configurable per model
Regulatory mapping — PS12/22, Consumer Duty (PRIN 2A), TR24/2, PRA SoP3/24, and SS1/23 best-practice cross-references baked into the HTML output

Why this?

UK GI pricing teams face three overlapping governance obligations: FCA Consumer Duty (PRIN 2A) and TR24/2 require documented evidence that pricing models produce fair outcomes; PRA SoP3/24 expects an annual attestation (IMOR) that model governance is sound; and most internal MRM frameworks cite SS1/23 best practice by analogy, even though that supervisory statement is technically directed at banks. In practice, pricing teams with 10+ production models end up with bespoke validation notebooks that vary by analyst, Word-document MRM packs rebuilt by hand each committee cycle, and no machine-readable inventory of overdue reviews.

This library runs a five-test validation suite (Gini with bootstrap CI, A/E with Poisson CI, Hosmer-Lemeshow, lift chart, PSI) and produces MRM governance packs as self-contained HTML — the same structure for every model, every release.

Regulatory basis for GI pricing models: The mandatory hooks are Consumer Duty (PRIN 2A) + TR24/2 (FCA side) and PRA SoP3/24 annual attestation via IMOR (PRA side). SS1/23 is a banking supervisory statement and does not apply directly to Solvency II insurers — but it describes good model governance practice, and many UK insurer MRM frameworks reference it by analogy. This library is aligned with SS1/23 best practice where relevant; your compliance obligation is PRIN 2A, TR24/2, and SoP3/24.

Manual governance vs this library

Task	Manual approach	insurance-governance
Statistical validation	Bespoke notebook per model — different tests, incomparable output	`ModelValidationReport` — five fixed tests, same HTML structure, every model
A/E miscalibration	One aggregate ratio — misses segment-level bias	A/E with Poisson CI + Hosmer-Lemeshow; catches age-band bias averaging out in global A/E
Score distribution drift	PSI value pasted into Word	PSI in JSON sidecar with pass/fail flag and threshold detail
Risk tier assignment	Subjective judgement in MRC pre-read	`RiskTierScorer` — 6 dimensions, 0–100 composite, documented rationale per dimension
Governance pack	Word document rebuilt each cycle	`GovernanceReport.save_html()` — self-contained HTML; print-to-PDF in under 1 second
Model inventory	Spreadsheet or SharePoint list	`ModelInventory` — JSON file, check into git; tracks validation history and overdue reviews
Consumer Duty evidence	Narrative in committee paper	Structured fairness section + renewal cohort A/E test in every pack

Installation

pip install insurance-governance

Quick start: statistical validation

Run the five-test validation suite and produce an HTML report for a motor frequency model.

import numpy as np
from insurance_governance import ModelValidationReport, ValidationModelCard

rng = np.random.default_rng(42)
n = 5_000
y_val      = rng.poisson(0.08, n).astype(float)
y_pred_val = np.clip(rng.normal(0.08, 0.02, n), 0.001, None)
exposure   = rng.uniform(0.5, 1.0, n)

card = ValidationModelCard(
    name="Motor Frequency v3.2", version="3.2.0",
    purpose="Predict claim frequency for UK private motor portfolio",
    methodology="CatBoost gradient boosting with Poisson objective",
    target="claim_count", features=["age", "vehicle_age", "area", "vehicle_group"],
    limitations=["No telematics data"], owner="Pricing Team",
)
report = ModelValidationReport(
    model_card=card, y_val=y_val, y_pred_val=y_pred_val, exposure_val=exposure,
    monitoring_owner="Head of Pricing", monitoring_triggers={"ae_ratio": 1.10, "psi": 0.25},
)
print(report.get_rag_status())   # "green", "amber", or "red"
report.generate("validation_report.html")

Quick start: MRM governance pack

Score a model's risk tier and produce a governance HTML for the Model Risk Committee.

from insurance_governance import MRMModelCard, RiskTierScorer, GovernanceReport, Assumption

card = MRMModelCard(
    model_id="motor-freq-v3", model_name="Motor TPPD Frequency",
    version="3.2.0", model_class="pricing",
    intended_use="Frequency pricing for UK private motor. Not for commercial fleet.",
    assumptions=[Assumption("Claim frequency stationarity since 2022",
                            risk="MEDIUM", mitigation="Quarterly A/E monitoring")],
)
tier = RiskTierScorer().score(
    gwp_impacted=125_000_000, model_complexity="high",
    deployment_status="champion", regulatory_use=False,
    external_data=False, customer_facing=True,
)
GovernanceReport(card=card, tier=tier).save_html("mrm_pack.html")

Regulatory framework

Obligation	Who it applies to	What it requires
Consumer Duty (PRIN 2A) + TR24/2	All GI pricing teams	Documented evidence that pricing models produce fair outcomes; proxy discrimination testing; renewal pricing fairness
PRA SoP3/24 (IMOR annual attestation)	PRA-regulated insurers	Annual sign-off that model governance, validation, and monitoring are in place
SS1/23 best practice	Banks (directly); insurers (by analogy)	SS1/23 is a banking supervisory statement — not mandatory for Solvency II insurers, but widely referenced in insurer MRM frameworks as good practice

This library structures its validation suite and governance packs to meet the Consumer Duty and IMOR evidence requirements, while following SS1/23 best practice where it provides useful structure.

What the validation suite catches

Benchmarked on Databricks (2026-03-16), three synthetic UK motor scenarios: well-specified (A), miscalibrated (B, A/E=1.18 with age-band bias), drifted (C, trained on shifted population).

Scenario	Manual 4-check checklist	Automated 5-test suite	Key diagnostic
Model A (well-specified)	4/4 pass	5/5 pass	Gini CI and A/E CI both tight
Model B (miscalibrated)	Flags A/E	Flags A/E + HL	HL p<0.0001 — age-band bias averages out in global A/E
Model C (drifted)	Passes PSI	Flags A/E CI	PSI=0.189 (below 0.25 threshold); A/E CI excludes 1.0

The 1-second overhead over a manual checklist is entirely the 500-resample Gini bootstrap. PSI alone is not sufficient to detect population drift of this type.

freMTPL2 real-data benchmark

notebooks/benchmark_fremtpl2.py — Databricks notebook running the full validation suite on freMTPL2 (OpenML 41214), 677,991 French MTPL policies.

This is the benchmark to look at if you want to understand what validation outputs look like on real data — not synthetic. It runs a Poisson GLM and a CatBoost GBM on the same dataset and produces side-by-side MRM governance pack reports aligned with Consumer Duty and SS1/23 best practice.

Key findings from the real-data benchmark:

Gini is lower than synthetic data suggests. Real-world motor frequency models achieve Gini of 0.15–0.30 (GLM) and 0.25–0.40 (GBM) on heterogeneous populations. Synthetic benchmarks with clean DGPs produce inflated Gini values. Calibrate your Green/Amber/Red thresholds to your actual portfolio.
Hosmer-Lemeshow catches what global A/E misses. On 677K rows, H-L has power to detect systematic miscalibration that averages out in a single A/E ratio. The GLM's exclusion of categorical features leaves residuals in urban/young-driver segments — invisible in global A/E, visible in H-L.
The governance API is model-agnostic. ModelValidationReport takes a numpy array. It does not care whether that array came from statsmodels, CatBoost, or anything else. The same validation structure applies to both models without modification.

Key classes

Validation

ModelValidationReport — facade: pass model card and validation arrays, get RAGStatus and HTML. Optionally add incumbent_pred_val for a double-lift chart, or fairness_group_col for a disparate impact section.
ValidationModelCard — Pydantic schema: model name, version, features, methodology, monitoring plan.
All tests return TestResult(passed, severity, detail) — extend with your own via extra_results.

MRM

RiskTierScorer — stateless, deterministic; 6 dimensions (GWP materiality, complexity, external data, validation recency, drift history, regulatory exposure); 0–100 composite; verbose rationale per dimension. Tier 1 (≥60): annual review, MRC sign-off. Tier 2 (30–59): 18-month, Chief Actuary. Tier 3 (<30): 24-month, Head of Pricing.
MRMModelCard — assumptions register (risk ratings LOW/MEDIUM/HIGH, mitigations), approval history, last validation run linkage.
ModelInventory — JSON file registry checked into git; register(), list_overdue(), get_history().
GovernanceReport — executive HTML pack: model purpose, tier rationale, last RAG, assumptions, outstanding issues, approval conditions, next review date.

Part of the Burning Cost stack

Takes validation outputs from insurance-monitoring to surface overdue reviews. Accepts fairness audit results from insurance-fairness as a governance pack input. Feeds into pricing committee sign-off workflows. → See the full stack

Databricks notebooks

Synthetic data demo — burning-cost-examples: full end-to-end workflow on 50K synthetic UK motor policies.
Real-data benchmark — notebooks/benchmark_fremtpl2.py: Poisson GLM vs CatBoost GBM on freMTPL2 (677K French MTPL rows, OpenML 41214). Shows what validation outputs look like in practice.

Library	What it does
insurance-monitoring	PSI, A/E ratios, Gini drift test — the ongoing monitoring that triggers governance reviews
insurance-fairness	FCA Consumer Duty proxy discrimination audit — fairness results feed directly into governance packs
insurance-conformal	Distribution-free prediction intervals with MRM-aligned model uncertainty documentation
insurance-gam	Interpretable GAM models whose shape functions are directly auditable by a pricing committee

Licence

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.1

Apr 4, 2026

0.3.0

Apr 4, 2026

This version

0.2.0

Apr 1, 2026

0.1.10

Apr 1, 2026

0.1.5

Mar 27, 2026

0.1.4

Mar 22, 2026

0.1.3

Mar 17, 2026

0.1.2

Mar 15, 2026

0.1.1

Mar 14, 2026

0.1.0

Mar 14, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

insurance_governance-0.2.0.tar.gz (189.6 kB view details)

Uploaded Apr 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

insurance_governance-0.2.0-py3-none-any.whl (110.8 kB view details)

Uploaded Apr 1, 2026 Python 3

File details

Details for the file insurance_governance-0.2.0.tar.gz.

File metadata

Download URL: insurance_governance-0.2.0.tar.gz
Upload date: Apr 1, 2026
Size: 189.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.8 {"installer":{"name":"uv","version":"0.10.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for insurance_governance-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`a5bded1aff0185b1b72c3a04cc672eb8494f601b281bb248b1cd5a7eec6dedf8`
MD5	`d8614872f3266abaca5564f1d5e8b00d`
BLAKE2b-256	`b4b212c06164f56538850d454d30bc979903474732b265c745a553eafddefa30`

See more details on using hashes here.

File details

Details for the file insurance_governance-0.2.0-py3-none-any.whl.

File metadata

Download URL: insurance_governance-0.2.0-py3-none-any.whl
Upload date: Apr 1, 2026
Size: 110.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.8 {"installer":{"name":"uv","version":"0.10.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for insurance_governance-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3a4879c5b083aa5550fbeb784ed2e60f700a1dfc02345d6fc8b173156a955d66`
MD5	`65afa0f3cc2a0288f7d3c81dc4daf036`
BLAKE2b-256	`2f8014545c51978fa6768edd105ab4aa4279fa38185d538ab1fd9aef55ed4dc0`

See more details on using hashes here.

insurance-governance 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

insurance-governance

Features

Why this?

Manual governance vs this library

Installation

Quick start: statistical validation

Quick start: MRM governance pack

Regulatory framework

What the validation suite catches

freMTPL2 real-data benchmark

Key classes

Part of the Burning Cost stack

Databricks notebooks

See Also

Licence

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes