DataFrame-native metric movement decomposition for business metrics.

These details have not been verified by PyPI

Project links

Homepage

Project description

MetricLens

DataFrame-native metric movement decomposition for business analytics.

MetricLens answers the most common question in product and business analytics: a metric moved — where did it come from, and was it driven by mix shift, rate shift, or both?

It is deterministic, DataFrame-native, dependency-light, and honest about what it cannot do.

Why MetricLens exists

Every data team eventually builds the same one-off analysis: revenue dropped 14% — which segments drove it, and did they change in size or in performance? The answer requires decomposing the metric delta into segment contributions, mix effects, and rate effects. The math is standard. The implementation is not.

MetricLens packages that math as a pip-installable library with structured JSON, Markdown, and HTML output — so you stop writing the same decomposition from scratch and start getting consistent, auditable outputs.

Install

pip install metriclens

For local development:

git clone https://github.com/sidharthkriplani/metriclens
cd metriclens
pip install -e ".[dev]"
pytest          # 19 tests, all pass
python examples/ecommerce_demo.py

Quick start

from metriclens import MetricLens, RatioMetric, SumMetric

lens = MetricLens(
    data=df,
    date_col="date",
    baseline_period=("2026-04-01", "2026-04-07"),
    current_period=("2026-04-08", "2026-04-14"),
    dimensions=["channel", "device", "city", "category"],
)

# Conversion rate: orders / sessions
result = lens.analyze(RatioMetric("orders", "sessions", name="cvr"))
result.to_json("outputs/cvr_rca.json")
result.to_markdown("outputs/cvr_rca.md")
result.to_html("outputs/cvr_rca.html")

# Revenue (additive)
result = lens.analyze(SumMetric("revenue"))
result.to_json("outputs/revenue_rca.json")

Real output

The following is actual output from examples/ecommerce_demo.py on synthetic daily e-commerce data (756 rows, seed=42).

Executive summary:

CVR baseline : 0.0717  (7.17%)
CVR current  : 0.0635  (6.35%)
CVR delta    : -0.0082  (-11.4%)
Direction    : DOWN

Segment contributions — channel dimension:

segment	baseline_rate	current_rate	mix_effect	rate_effect	cross_term	total_effect	contribution_pct
paid_search	0.0730	0.0580	0.00375	-0.00724	-0.000771	-0.00427	52.2%
organic	0.0677	0.0668	-0.00276	-0.000325	0.0000358	-0.00305	37.3%
email	0.0774	0.0771	-0.000819	-0.0000386	0.00000275	-0.000855	10.5%

Investigation areas (auto-generated):

1. Investigate channel=paid_search first — largest absolute total_effect (-0.00427)
2. Investigate device=mobile first    — largest absolute total_effect (-0.00519)
3. Investigate city=Bengaluru first   — largest absolute total_effect (-0.00389)
4. Investigate category=skincare first — largest absolute total_effect (-0.00341)

Interpretation note (always included in every output):

MetricLens reports deterministic metric movement decomposition. It identifies segment contributors, mix effects, rate effects, and cross terms. It does not claim causality, statistical significance, anomaly detection, or root cause proof. Use these outputs as investigation signals, not automatic decisions.

How it works

Additive decomposition (SumMetric, CountMetric)

For additive metrics like revenue or order count, each segment's contribution is:

segment_delta_s     = current_value_s − baseline_value_s
contribution_pct_s  = segment_delta_s / total_delta

All segment deltas sum exactly to total_delta. No residual.

Mix / rate / cross decomposition (RatioMetric, AverageMetric)

For ratio metrics like conversion rate (orders / sessions), the population rate is:

R = Σ_s  w_s × r_s

where w_s is the denominator share of segment s and r_s is the per-segment rate.

The movement between baseline and current decomposes exactly:

w_c_s × r_c_s − w_b_s × r_b_s
  = (w_c_s − w_b_s) × r_b_s          ← mix_effect   (segment grew/shrank in volume)
  + w_b_s × (r_c_s − r_b_s)          ← rate_effect  (segment's own rate changed)
  + (w_c_s − w_b_s) × (r_c_s − r_b_s) ← cross_term  (interaction)

Summing mix_effect + rate_effect + cross_term across all segments equals R_c − R_b exactly (up to floating-point). The cross term is always reported — discarding it breaks the identity.

Disappeared and new segments

MetricLens uses zero-fill convention: segments present in only one period get w=0, r=0 in the other. For a disappeared segment the cross term is (0 − w_b) × (0 − r_b) = +w_b × r_b — positive, not zero. This is required for the identity to hold.

Null dimension handling

Null dimension values are never dropped. MetricLens creates an internal working copy and replaces nulls with "(null)" so missing segment labels remain visible in the output. The original DataFrame is never modified.

Edge cases handled

Situation	Behaviour
`abs(total_delta) < 1e-9` (flat metric)	`direction = "FLAT"`, `contribution_pct = None` for all segments
`baseline_value == 0`	`relative_delta_pct = None` (percentage growth from zero is undefined)
Segment present in current only	`segment_status = "new"`
Segment present in baseline only	`segment_status = "disappeared"`
Null dimension values	Filled with `"(null)"` in working copy; original untouched

Metric types

Type	Decomposition	Typical use
`SumMetric(column)`	Additive segment delta	Revenue, GMV, cost
`CountMetric(column=None)`	Additive row or non-null count	Orders, events, sessions
`RatioMetric(numerator, denominator)`	Mix / rate / cross	CVR, CTR, AOV via `revenue/orders`
`AverageMetric(value, weight=None)`	Mix / rate / cross (ratio-style)	Mean order value, weighted averages

API reference

`MetricLens(data, date_col, baseline_period, current_period, dimensions)`

Parameter	Type	Description
`data`	`pd.DataFrame`	Input DataFrame at daily segment-level grain
`date_col`	`str`	Name of the date column
`baseline_period`	`tuple[str, str]`	Inclusive start/end dates for baseline, e.g. `("2026-04-01", "2026-04-07")`
`current_period`	`tuple[str, str]`	Inclusive start/end dates for current period
`dimensions`	`list[str]`	Column names to decompose by (e.g. `["channel", "device"]`)

`lens.analyze(metric) → AnalysisResult`

Returns an AnalysisResult with:

Method	Returns
`.to_json(path=None)`	JSON string; writes file if `path` given
`.to_markdown(path=None)`	Markdown string; writes file if `path` given
`.to_html(path=None)`	HTML string; writes file if `path` given
`.summary()`	`dict` with `baseline_value`, `current_value`, `absolute_delta`, `relative_delta_pct`, `direction`
`.segment_contributions()`	`pd.DataFrame` of all segment rows across all dimensions
`.to_dict()`	Full payload dict matching the JSON schema

Output schema (v0.1)

{
  "schema_version": "0.1",
  "metadata": { "metric_name": "cvr", "decomposition_type": "ratio", ... },
  "executive_summary": {
    "baseline_value": 0.0717,
    "current_value": 0.0635,
    "absolute_delta": -0.0082,
    "relative_delta_pct": -11.4,
    "direction": "DOWN"
  },
  "quality_checks": [ { "check": "row_count_baseline", "status": "PASS", "detail": "..." } ],
  "dimensions": [
    {
      "dimension": "channel",
      "segment_contributions": [
        {
          "segment": "paid_search",
          "segment_status": "existing",
          "mix_effect": 0.00375,
          "rate_effect": -0.00724,
          "cross_term": -0.000771,
          "total_effect": -0.00427,
          "contribution_pct": 52.2
        }
      ]
    }
  ],
  "investigation_areas": [ "Investigate channel=paid_search first ..." ],
  "interpretation_note": "MetricLens reports deterministic metric movement decomposition ..."
}

Data quality checks (auto-run)

Every analyze() call runs these checks automatically and includes results in the output:

Check	What it detects
`row_count_baseline` / `row_count_current`	Empty periods
`date_coverage_baseline` / `date_coverage_current`	Missing dates within a period
`period_length_match`	Unequal baseline and current period lengths
`null_rate_{dimension}`	Null rates per dimension column
`duplicate_grain`	Duplicate rows at the full dimensional grain

What MetricLens is not

Not causal inference. A segment with a large contribution is an investigation priority, not a proven cause. Paid search driving 52% of a CVR drop means paid search is where you look next — not that paid search caused the drop.

Not anomaly detection. MetricLens does not compute z-scores, flag outliers, or compare against expected distributions.

Not statistical significance testing. There are no p-values, confidence intervals, or hypothesis tests. v1 will add bootstrap confidence intervals optionally.

Not a BI dashboard. MetricLens produces structured file outputs — JSON, Markdown, HTML. It does not run a server or render interactive charts.

Not experiment infrastructure. MetricLens does not run A/B tests, compute treatment effects, or replace an experimentation platform.

Not an ML model. There is no model, no training, no prediction. Decomposition is a deterministic algebraic identity.

Roadmap

Version	Scope
v0.1.0	Movement Mode: SumMetric, CountMetric, RatioMetric, AverageMetric, JSON/MD/HTML output ✅
v0.2	CLI (`metriclens analyze`), additional demo datasets
v1.0	Shapley attribution across dimensions, bootstrap confidence intervals, optional LLM narrator
v2.0	LiftMap Mode: segment opportunity ranking by benchmark-gap × volume

Contributing

See CONTRIBUTING.md. Issues and PRs welcome.

License

MIT © Sidharth Kriplani

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.1.0

May 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

metriclens-0.1.0.tar.gz (19.3 kB view details)

Uploaded May 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

metriclens-0.1.0-py3-none-any.whl (15.9 kB view details)

Uploaded May 5, 2026 Python 3

File details

Details for the file metriclens-0.1.0.tar.gz.

File metadata

Download URL: metriclens-0.1.0.tar.gz
Upload date: May 5, 2026
Size: 19.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for metriclens-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`671a749f82abe241ea70d6c80f6558d1b71249db606834489add1caba43ac6a3`
MD5	`fac026b873f57a8b274a30f4c524138d`
BLAKE2b-256	`2d00e1a5bad20e76855257c1fb5dc858bca8f878a1b3b42d1ad615fb32f15b9c`

See more details on using hashes here.

File details

Details for the file metriclens-0.1.0-py3-none-any.whl.

File metadata

Download URL: metriclens-0.1.0-py3-none-any.whl
Upload date: May 5, 2026
Size: 15.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for metriclens-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`26377ed80aa8c492f7bdff4454ba7aa6b0b24dc6c80f121e8e024006320be3d5`
MD5	`c5de6fa6c378294056bcbfade56e0b28`
BLAKE2b-256	`1aab7cfa176f1e98ea108c6211dff40ea01e0450148121b95204ddb4881c74ff`

See more details on using hashes here.

metriclens 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

MetricLens

Why MetricLens exists

Install

Quick start

Real output

How it works

Additive decomposition (SumMetric, CountMetric)

Mix / rate / cross decomposition (RatioMetric, AverageMetric)

Disappeared and new segments

Null dimension handling

Edge cases handled

Metric types

API reference

MetricLens(data, date_col, baseline_period, current_period, dimensions)

lens.analyze(metric) → AnalysisResult

Output schema (v0.1)

Data quality checks (auto-run)

What MetricLens is not

Roadmap

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`MetricLens(data, date_col, baseline_period, current_period, dimensions)`

`lens.analyze(metric) → AnalysisResult`