Loss ratio analytics for long-term health insurance.

Project description

lossratio (Python)

Python sibling of the R lossratio package. It provides loss ratio analytics for long-term health insurance: cohort development analysis, stage-adaptive projection, regime detection, and backtest validation, all on long-format experience data. Stage-adaptive (SA) projection uses an exposure-driven (ED) model before the maturity point and chain ladder (CL) after it.

This Python implementation is in active development; the releases on PyPI so far belong to the 0.0.1.devN pre-release line.
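
As a rough illustration of the stage-adaptive idea, the sketch below projects one cohort's cumulative loss forward with an additive exposure-driven step before the maturity point and a multiplicative chain-ladder step from it onward. It is not the package's implementation: the function name, the dictionary layout, the choice of exposure, and the handling of the boundary at k_star are all assumptions made for this example, and the numbers are invented.

```python
def project_stage_adaptive(loss_latest, latest_dev, exposure, intensity, ata, k_star):
    """Project cumulative loss from latest_dev to the last development period.

    exposure, intensity and ata are dicts keyed by the source dev k of the
    step k -> k + 1 (hypothetical layout, chosen for this sketch).
    Steps with k < k_star use the additive ED form, the rest use CL.
    """
    last_dev = max(ata) + 1
    loss = loss_latest
    path = {latest_dev: loss}
    for k in range(latest_dev, last_dev):
        if k < k_star:
            # Exposure-driven step: add intensity times exposure.
            loss = loss + intensity[k] * exposure[k]
        else:
            # Chain-ladder step: multiply by the age-to-age factor.
            loss = loss * ata[k]
        path[k + 1] = loss
    return path


# Invented inputs for a cohort observed up to dev 2.
exposure = {k: 80.0 for k in range(1, 6)}
intensity = {1: 0.5, 2: 0.25, 3: 0.125, 4: 0.0625, 5: 0.03125}
ata = {1: 4.0, 2: 1.5, 3: 1.25, 4: 1.25, 5: 1.125}

path = project_stage_adaptive(
    loss_latest=40.0, latest_dev=2,
    exposure=exposure, intensity=intensity, ata=ata, k_star=4,
)
for dev, value in path.items():
    print(dev, value)
```

In this toy run the steps from dev 2 and dev 3 are additive and the steps from dev 4 and dev 5 are multiplicative, which is the switch the SA description refers to.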

Install

pip install lossratio              # polars only
pip install "lossratio[pandas]"    # add pandas / pyarrow support (quotes keep shells such as zsh from expanding the brackets)

Current status

Working components:

Triangle: cohort × dev aggregation. Accepts a long-format experience frame (uy_m, cy_m, dev_m, loss_incr, premium_incr), validates the schema, and adds derived period columns inline. dev_m is derived from uy_m and cy_m if absent. Cumulative values are the unmarked default (loss, premium, lr); per-period values carry an _incr suffix.
CL, ED, LR: sklearn-style estimators for chain ladder, exposure-driven, and stage-adaptive loss-ratio projection. fit(triangle) returns a CLFit / EDFit / LRFit that exposes summary(), a df projection frame, and per-cohort SE / CV.
Triangle.link(): builds the long-format Link table (one row per cohort × adjacent dev pair). The method chains tri.link().ata() and tri.link().intensity() return paired factor-level diagnostics (multiplicative ATA factors and additive ED intensities). Add .maturity(...) after .ata() to detect the development period at which age-to-age (ATA) factors stabilise.
Triangle.detect_regime(): detects structural shifts across the cohort sequence via E-Divisive or Ward hierarchical clustering and returns a Regime result.
Backtest: calendar-diagonal hold-out backtest of any of the above estimators. Returns a BacktestFit with per-cell, by-dev, and by-diagonal A/E error summaries, where ae_err = actual / predicted - 1.

Not yet ported from the R sibling: Calendar / Total aggregations and the Convergence diagnostic.
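
To make the ATA and maturity diagnostics above more concrete, here is a small standalone sketch. It computes volume-weighted age-to-age factors in the usual chain-ladder way and then looks for the first development period that opens a run of consecutive periods passing both a CV and an RSE threshold. Both helpers (ata_factors, first_stable_dev) are hypothetical names, the stability rule is an assumption about what a maturity check might look like rather than the package's documented behaviour, and all numbers are invented.

```python
def ata_factors(cumulative_rows):
    """Volume-weighted age-to-age factors.

    cumulative_rows[i] lists one cohort's cumulative loss for dev 1, 2, ...
    The factor for dev k is sum(C[k + 1]) / sum(C[k]) over cohorts that
    have both periods observed.
    """
    n_dev = max(len(row) for row in cumulative_rows)
    factors = {}
    for k in range(1, n_dev):
        pairs = [(row[k - 1], row[k]) for row in cumulative_rows if len(row) > k]
        if pairs:
            factors[k] = sum(b for _, b in pairs) / sum(a for a, _ in pairs)
    return factors


def first_stable_dev(cv, rse, max_cv, max_rse, min_run):
    """First dev that starts min_run consecutive devs within both thresholds.

    cv and rse are dicts keyed by dev; returns None if no such run exists.
    """
    run_start, run_len = None, 0
    for dev in sorted(cv):
        if cv[dev] <= max_cv and rse[dev] <= max_rse:
            if run_len == 0:
                run_start = dev
            run_len += 1
            if run_len >= min_run:
                return run_start
        else:
            run_len = 0
    return None


# Invented three-cohort triangle of cumulative losses.
rows = [[100.0, 150.0, 180.0], [110.0, 165.0], [120.0]]
factors = ata_factors(rows)
print(factors)

# Invented per-dev dispersion measures.
cv = {1: 0.40, 2: 0.20, 3: 0.16, 4: 0.12, 5: 0.10}
rse = {1: 0.06, 2: 0.03, 3: 0.02, 4: 0.02, 5: 0.01}
k = first_stable_dev(cv, rse, max_cv=0.15, max_rse=0.05, min_run=2)
print(k)
```

With these inputs the run of passing periods begins at dev 4, so that is the period the sketch reports.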

Quick Start

import polars as pl
import lossratio as lr

# Built-in synthetic experience: four coverages (CI / CAN / HOS / SUR),
# 36 monthly cohorts each, up to 36 dev months. SUR carries one
# regime shift at 2025-07. We focus on SUR for this walk-through.
df = lr.load_experience()
df.head(3)
#> shape: (3, 6)
#> ┌──────────┬────────────┬────────────┬───────┬───────────────┬──────────────┐
#> │ coverage ┆ uy_m       ┆ cy_m       ┆ dev_m ┆ loss_incr     ┆ premium_incr │
#> ╞══════════╪════════════╪════════════╪═══════╪═══════════════╪══════════════╡
#> │ CI       ┆ 2024-01-01 ┆ 2024-01-01 ┆ 1     ┆ 1.2562e7      ┆ 1.8836e7     │
#> │ CI       ┆ 2024-01-01 ┆ 2024-02-01 ┆ 2     ┆ 651602.511522 ┆ 1.7699e7     │
#> │ CI       ┆ 2024-01-01 ┆ 2024-03-01 ┆ 3     ┆ 3.7191e6      ┆ 1.9232e7     │
#> └──────────┴────────────┴────────────┴───────┴───────────────┴──────────────┘

# 1. Subset to SUR (the coverage with the planted regime shift), then
#    build the cohort x dev triangle. Triangle's constructor validates
#    schema and adds derived period columns inline.
df_sur = df.filter(pl.col("coverage") == "SUR")
tri = lr.Triangle(df_sur, group_var="coverage")

# 2. Factor-level diagnostics via the link chain. Build the link table
#    once, derive both ATA factors and ED intensities from it.
link = tri.link()
link
#> <Link: 1 groups, 630 total links, dual-mode>

ata = link.ata()
ata.df.head(3)
#> shape: (3, 7)
#> ┌──────────┬─────┬──────────┬──────────┬──────────┬──────────┬───────┐
#> │ coverage ┆ dev ┆ f        ┆ sigma2   ┆ cv       ┆ rse      ┆ n_obs │
#> ╞══════════╪═════╪══════════╪══════════╪══════════╪══════════╪═══════╡
#> │ SUR      ┆ 1   ┆ 6.244365 ┆ 4.5188e7 ┆ 0.371041 ┆ 0.059758 ┆ 35    │
#> │ SUR      ┆ 2   ┆ 1.748928 ┆ 4.1419e6 ┆ 0.157399 ┆ 0.026069 ┆ 34    │
#> │ SUR      ┆ 3   ┆ 1.433963 ┆ 2.3321e6 ┆ 0.160402 ┆ 0.0181   ┆ 33    │
#> └──────────┴─────┴──────────┴──────────┴──────────┴──────────┴───────┘

ata.maturity(max_cv=0.15, max_rse=0.05, min_run=2).k_star
#> {'SUR': 4}

# 3. Project loss ratios with the stage-adaptive method (default).
fit = lr.LR().fit(tri)
fit.summary().select(["coverage", "cohort", "lr_ult", "se_lr", "cv_lr"]).head(3)
#> shape: (3, 5)
#> ┌──────────┬────────────┬──────────┬──────────┬──────────┐
#> │ coverage ┆ cohort     ┆ lr_ult   ┆ se_lr    ┆ cv_lr    │
#> ╞══════════╪════════════╪══════════╪══════════╪══════════╡
#> │ SUR      ┆ 2024-01-01 ┆ 1.648832 ┆ null     ┆ null     │
#> │ SUR      ┆ 2024-02-01 ┆ 1.527993 ┆ 0.006237 ┆ 0.004082 │
#> │ SUR      ┆ 2024-03-01 ┆ 1.605468 ┆ 0.037199 ┆ 0.02317  │
#> └──────────┴────────────┴──────────┴──────────┴──────────┘

# 4. Detect cohort regime shifts.
reg = tri.detect_regime(loss_var="lr", K=12)
reg.breakpoints
#> [datetime.date(2025, 7, 1)]

# 5. Calendar-diagonal hold-out backtest. The last 6 diagonals are
#    masked, the estimator is refitted on the remaining cells, and
#    the projection is compared with actual loss.
#    ae_err = actual / predicted - 1 (signed relative error).
bt = lr.Backtest(estimator=lr.LR(), holdout=6).fit(tri)
bt.diag_summary.head(3)
#> shape: (3, 6)
#> ┌──────────┬──────────────┬─────┬─────────────┬────────────┬───────────┐
#> │ coverage ┆ calendar_idx ┆ n   ┆ ae_err_mean ┆ ae_err_med ┆ ae_err_wt │
#> ╞══════════╪══════════════╪═════╪═════════════╪════════════╪═══════════╡
#> │ SUR      ┆ 30           ┆ 30  ┆ -0.030373   ┆ -0.007741  ┆ -0.010898 │
#> │ SUR      ┆ 31           ┆ 30  ┆ -0.033368   ┆ -0.01888   ┆ -0.005902 │
#> │ SUR      ┆ 32           ┆ 30  ┆ -0.033076   ┆ -0.018453  ┆ 0.004588  │
#> └──────────┴──────────────┴─────┴─────────────┴────────────┴───────────┘
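
The hold-out step in the walk-through can be pictured with a few lines of plain Python. The sketch below splits triangle cells into a training set and a held-out set by calendar diagonal and evaluates ae_err = actual / predicted - 1 as defined above. The cell layout, the helper names, and the 1-based diagonal index (cohort index + dev - 1) are assumptions for illustration and need not match the package's calendar_idx.

```python
def split_by_diagonal(cells, holdout):
    """Split cells into (train, held_out) using the last `holdout` diagonals."""
    def diagonal(cell):
        return cell["cohort_idx"] + cell["dev"] - 1

    cutoff = max(diagonal(c) for c in cells) - holdout
    train = [c for c in cells if diagonal(c) <= cutoff]
    held_out = [c for c in cells if diagonal(c) > cutoff]
    return train, held_out


def ae_err(actual, predicted):
    """Signed relative error: negative when the prediction is too high."""
    return actual / predicted - 1.0


# A 4-cohort triangle: cohort i is observed for dev 1 .. 5 - i.
cells = [
    {"cohort_idx": i, "dev": d, "loss": 100.0 * d}
    for i in range(1, 5)
    for d in range(1, 6 - i)
]

train, held_out = split_by_diagonal(cells, holdout=1)
print(len(train), len(held_out))

# A prediction of 100 against an actual of 75 overshoots by 25%.
err = ae_err(actual=75.0, predicted=100.0)
print(err)
```

An estimator would be refitted on the training cells only, and its projections for the held-out cells would then be fed to ae_err.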

To analyse multiple coverages jointly, drop the upfront filter; every estimator and detector then fits per group, with coverage already labelling each output row.

To plug in your own data, build a long-format frame with these columns and pass it to lr.Triangle(df, group_var=...):

uy_m (date) — underwriting year-month (cohort)
cy_m (date) — calendar year-month
dev_m (int, optional) — development month; auto-derived from uy_m and cy_m if absent
loss_incr (numeric) — per-period claim amount
premium_incr (numeric) — per-period premium
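
The sketch below shows what rows in this layout look like and how the derived quantities relate to them. The dev_month helper is a hypothetical stand-in that reproduces the 1-based month difference visible in the sample output earlier (uy_m 2024-01 with cy_m 2024-02 gives dev_m 2); the package derives dev_m itself when the column is absent. The amounts are invented, and the running totals mirror the convention that unmarked names (loss, premium, lr) are cumulative.

```python
from datetime import date
from itertools import accumulate


def dev_month(uy_m, cy_m):
    """1-based number of months from the underwriting month to the calendar month."""
    return (cy_m.year - uy_m.year) * 12 + (cy_m.month - uy_m.month) + 1


# One cohort, three calendar months, invented amounts.
rows = [
    {"uy_m": date(2024, 1, 1), "cy_m": date(2024, 1, 1), "loss_incr": 50.0, "premium_incr": 100.0},
    {"uy_m": date(2024, 1, 1), "cy_m": date(2024, 2, 1), "loss_incr": 10.0, "premium_incr": 100.0},
    {"uy_m": date(2024, 1, 1), "cy_m": date(2024, 3, 1), "loss_incr": 20.0, "premium_incr": 100.0},
]

for row in rows:
    row["dev_m"] = dev_month(row["uy_m"], row["cy_m"])

# Cumulative loss, premium and loss ratio along the development axis.
loss = list(accumulate(row["loss_incr"] for row in rows))
premium = list(accumulate(row["premium_incr"] for row in rows))
lr_cum = [l / p for l, p in zip(loss, premium)]

for row, ratio in zip(rows, lr_cum):
    print(row["dev_m"], ratio)
```

Rows shaped like these can be loaded into a polars or pandas frame and handed to lr.Triangle as described above.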

The shipped lr.load_experience() dataset already includes dev_m for convenience. Coarser granularities (dev_q, dev_s, dev_a — quarterly, semi-annual, annual) can be derived via add_experience_period(df), which produces the full 12-column enrichment (uy_a/uy_s/uy_q/uy_m, cy_a/cy_s/cy_q/cy_m, dev_a/dev_s/dev_q/dev_m). Pass grain="Q" / "S" / "A" to Triangle() to aggregate at a coarser grain (default "auto" detects from data spacing).

Triangle also accepts an optional group_var (coverage, product, age band, ...) — each estimator and detector then fits per group.

Pandas inputs are accepted too; outputs mirror the input type (pandas in → pandas out, polars in → polars out). Use the [pandas] install extra (see above) to pull in pandas and pyarrow.

R package

Source: https://github.com/seokhoonj/lossratio
Documentation: https://seokhoonj.github.io/lossratio/
Korean documentation: https://seokhoonj.github.io/lossratio/ko/

remotes::install_github("seokhoonj/lossratio")
library(lossratio)

Author

Seokhoon Joo (@seokhoonj, seokhoonj@gmail.com) — also maintains the R lossratio package.

License

MPL-2.0 (Mozilla Public License 2.0).

Release history

This version

0.0.1.dev9 pre-release

May 10, 2026

0.0.1.dev7 pre-release

May 10, 2026

0.0.1.dev6 pre-release

May 9, 2026

0.0.1.dev5 pre-release

May 9, 2026

0.0.1.dev4 pre-release

May 7, 2026

0.0.1.dev3 pre-release

May 7, 2026

0.0.1.dev2 pre-release

May 7, 2026

0.0.1.dev1 pre-release

May 7, 2026

0.0.1.dev0 pre-release

May 6, 2026

Download files

Source Distribution

lossratio-0.0.1.dev9.tar.gz (94.5 kB)

Uploaded May 10, 2026 Source

Built Distribution

lossratio-0.0.1.dev9-py3-none-any.whl (99.4 kB)

Uploaded May 10, 2026 Python 3

File details

Details for the file lossratio-0.0.1.dev9.tar.gz.

File metadata

Download URL: lossratio-0.0.1.dev9.tar.gz
Upload date: May 10, 2026
Size: 94.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for lossratio-0.0.1.dev9.tar.gz
Algorithm	Hash digest
SHA256	`e24affee96ba457557f06b30723e240a3e6ee89bc3d0c4543c7be71953323822`
MD5	`82b9499e70458c6798799f22041a1d80`
BLAKE2b-256	`b936116a6b67f288bf888102a77a0f5535a87acff06fe61442daa8b87ff1f425`

Provenance

The following attestation bundles were made for lossratio-0.0.1.dev9.tar.gz:

Publisher: publish.yml on seokhoonj/lossratio-py

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lossratio-0.0.1.dev9.tar.gz
- Subject digest: e24affee96ba457557f06b30723e240a3e6ee89bc3d0c4543c7be71953323822
- Sigstore transparency entry: 1496242489
- Sigstore integration time: May 10, 2026
Source repository:
- Permalink: seokhoonj/lossratio-py@a49c6ce0eaf3c4f985f03b32483d2535fa4fcd2f
- Branch / Tag: refs/tags/v0.0.1.dev9
- Owner: https://github.com/seokhoonj
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@a49c6ce0eaf3c4f985f03b32483d2535fa4fcd2f
- Trigger Event: push

File details

Details for the file lossratio-0.0.1.dev9-py3-none-any.whl.

File metadata

Download URL: lossratio-0.0.1.dev9-py3-none-any.whl
Upload date: May 10, 2026
Size: 99.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for lossratio-0.0.1.dev9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7cfef2892a8309521d0a367f700702f26a7d386524964d42b8f0205dcbbe1381`
MD5	`d7fcc73c0c384dc5b19b417bd2cb629a`
BLAKE2b-256	`cb2a6e526f5d36f407b1f39ff27bd5d7ece8e4afc9b46e8b24be4efd9c316b59`

Provenance

The following attestation bundles were made for lossratio-0.0.1.dev9-py3-none-any.whl:

Publisher: publish.yml on seokhoonj/lossratio-py

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lossratio-0.0.1.dev9-py3-none-any.whl
- Subject digest: 7cfef2892a8309521d0a367f700702f26a7d386524964d42b8f0205dcbbe1381
- Sigstore transparency entry: 1496242625
- Sigstore integration time: May 10, 2026
Source repository:
- Permalink: seokhoonj/lossratio-py@a49c6ce0eaf3c4f985f03b32483d2535fa4fcd2f
- Branch / Tag: refs/tags/v0.0.1.dev9
- Owner: https://github.com/seokhoonj
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@a49c6ce0eaf3c4f985f03b32483d2535fa4fcd2f
- Trigger Event: push
