Piecewise-linear trend detection and plain-English narrative generation for time-series data.
Project description
trend-narrative
A standalone Python package that combines piecewise-linear trend detection, relationship analysis, and plain-English narrative generation for time-series data.
Installation
uv add trend-narrative
For development (editable install with test dependencies):
git clone https://github.com/yukinko-iwasaki/trend-narrative.git
cd trend-narrative
uv sync --extra dev
Dependencies: numpy, scipy, pwlf
Two calling paths
Path 1 — from raw data
Create an InsightExtractor with your chosen detector, then pass it to the
narrative function. Keeping the two steps separate means you can swap in any
custom detector without touching the narrative layer:
import numpy as np
from trend_narrative import InsightExtractor, TrendDetector, get_segment_narrative
x = np.arange(2010, 2022, dtype=float)
y = np.array([100, 105, 112, 108, 115, 130, 125, 120, 118, 122, 135, 148], dtype=float)
extractor = InsightExtractor(x, y, detector=TrendDetector(max_segments=2))
narrative = get_segment_narrative(extractor=extractor, metric="health spending")
print(narrative)
# → "From 2010 to 2015, the health spending showed an upward trend.
# Trend then shifted, reaching a peak in 2015 before reversing into a decline."
You can also call the extraction step separately if you need the raw numbers:
suite = extractor.extract_full_suite()
# {"cv_value": 14.2, "segments": [...], "n_points": 12}
Path 2 — from precomputed data
If you already have segments and a CV value stored (e.g. from a database or a previous extraction run), pass them directly — no re-fitting required:
from trend_narrative import get_segment_narrative
narrative = get_segment_narrative(
segments=row["segments"],
cv_value=row["cv_value"],
metric="health spending",
)
print(narrative)
Relationship narratives
Analyze the relationship between two time series (e.g., spending vs outcomes).
Path 1 — from raw data
Compute the analysis on the fly from raw time series data:
import numpy as np
from trend_narrative import get_relationship_narrative
result = get_relationship_narrative(
reference_years=np.array([2010, 2012, 2014, 2016, 2018]),
reference_values=np.array([100, 120, 140, 160, 180]),
comparison_years=np.array([2010, 2012, 2014, 2016, 2018]),
comparison_values=np.array([50, 55, 62, 70, 78]),
reference_name="spending",
comparison_name="outcome",
)
print(result["narrative"])
# → "When spending increases, outcome tends to increase in the same year..."
print(result["method"]) # "lagged_correlation", "comovement", or "insufficient_data"
Path 2 — from precomputed insights
If you already have relationship insights stored (e.g. from a database or a previous analysis run), pass them directly — no re-analysis required:
from trend_narrative import get_relationship_narrative
narrative = get_relationship_narrative(
insights=row["relationship_insights"],
reference_name="spending",
comparison_name="outcome",
)
print(narrative["narrative"])
Separate analysis and narrative generation
Use analyze_relationship() when you want to inspect or store the analysis
results separately from narrative generation:
from trend_narrative import analyze_relationship, get_relationship_narrative
insights = analyze_relationship(
reference_years=years1,
reference_values=values1,
comparison_years=years2,
comparison_values=values2,
)
# Store insights in database, inspect programmatically, etc.
print(insights["method"]) # "lagged_correlation", "comovement", or "insufficient_data"
print(insights["best_lag"]) # lag details for correlation path
# Generate narrative later from stored insights
result = get_relationship_narrative(
insights=insights,
reference_name="spending",
comparison_name="outcome",
)
The function automatically chooses the analysis method based on data availability:
- Lagged correlation: >= 5 points, tests correlations at various lags
- Comovement: 3-4 points, describes directional movement within segments
- Insufficient data: < 3 points
API reference
get_segment_narrative(segments, cv_value, metric="expenditure")
get_segment_narrative(extractor, metric="expenditure")
Generates a plain-English narrative for a single time series. Accepts either
precomputed data (Path 1) or an InsightExtractor instance (Path 2).
- No segments + low CV → "remained highly stable"
- No segments + high CV → "exhibited significant volatility"
- Single segment → direction + % change sentence
- Multi-segment → transition phrases (peak / trough / continuation)
analyze_relationship(...)
Analyzes the relationship between two time series and returns structured insights without generating narrative text.
analyze_relationship(
reference_years, # array-like, the "driver" series years
reference_values, # array-like, the "driver" series values
comparison_years, # array-like, the "outcome" series years
comparison_values, # array-like, the "outcome" series values
reference_segments=None, # optional pre-computed segments
correlation_threshold=5, # min points for correlation analysis
max_lag_cap=5, # max lag to test in years
)
Returns a dict with:
method: "lagged_correlation", "comovement", or "insufficient_data"n_points: int, number of points in sparser seriessegment_details: list[dict], per-segment analysis (comovement only)best_lag: dict with lag, correlation, p_value, n_pairs (correlation only)all_lags: list of all tested lags (correlation only)max_lag_tested: int, maximum lag tested (correlation only)reference_leads: bool, whether reference series leads comparison
get_relationship_narrative(...)
Generates a narrative from relationship analysis. Accepts either precomputed insights (Path 1) or raw data arrays (Path 2).
get_relationship_narrative(
# Path 2: raw data (optional if insights provided)
reference_years=None, # array-like, the "driver" series years
reference_values=None, # array-like, the "driver" series values
comparison_years=None, # array-like, the "outcome" series years
comparison_values=None, # array-like, the "outcome" series values
# Required for narrative
reference_name="", # str, display name for reference
comparison_name="", # str, display name for comparison
# Optional parameters
reference_segments=None, # optional pre-computed segments
correlation_threshold=5, # min points for correlation analysis
max_lag_cap=5, # max lag to test in years
reference_format=".2f", # format spec or callable for reference values
comparison_format=".2f", # format spec or callable for comparison values
time_unit="year", # "year", "month", "quarter" for narratives
reference_leads=None, # True/False to override, None to infer
# Path 1: precomputed insights
insights=None, # dict from analyze_relationship()
)
Returns a dict with:
narrative: str, human-readable descriptionmethod: "lagged_correlation", "comovement", or "insufficient_data"n_points: int, number of points in sparser seriessegment_details: list[dict], per-segment analysis (comovement only)best_lag: dict with lag details (correlation path only)all_lags: list of all tested lags (correlation path only)max_lag_tested: int, maximum lag tested (correlation only)
TrendDetector(max_segments=3, threshold=0.05)
Fits a piecewise-linear model using BIC-optimised segment count, snapping breakpoints to integer years and local extrema.
| Method | Returns | Description |
|---|---|---|
extract_trend(x, y) |
list[dict] |
Fit model; return per-segment stats |
fit_best_model(x, y) |
pwlf model | None |
Run both fitting passes |
calculate_bic(ssr, n, k) |
float |
Static BIC helper |
Each segment dict contains: start_year, end_year, start_value,
end_value, slope, p_value.
InsightExtractor(x, y, detector=None)
Combines volatility measurement with trend detection. Pass a custom detector to control the fitting logic.
| Method | Returns | Description |
|---|---|---|
get_volatility() |
float |
Coefficient of Variation (%) |
get_structural_segments() |
list[dict] |
Delegates to the detector |
extract_full_suite() |
dict |
{cv_value, segments, n_points} |
consolidate_segments(segments)
Merges consecutive segments that share the same slope direction. Applied
automatically inside get_segment_narrative.
millify(n)
Formats large numbers with a human-readable suffix: 1_500_000 → "1.50 M".
Running tests
uv run pytest
# or with coverage:
uv run pytest --cov=trend_narrative --cov-report=term-missing
Project structure
trend-narrative/
├── trend_narrative/
│ ├── __init__.py # Public API
│ ├── detector.py # TrendDetector – piecewise-linear fitting
│ ├── extractor.py # InsightExtractor – volatility + trend facade
│ ├── narrative.py # Narrative generation + millify helper
│ ├── relationship_analysis.py # Relationship analysis between two series
│ └── relationship_narrative.py # Relationship narrative generation
├── tests/
│ ├── test_detector.py
│ ├── test_extractor.py
│ ├── test_narrative.py
│ ├── test_relationship_analysis.py
│ └── test_relationship_narrative.py
├── pyproject.toml
└── README.md
Contact
For questions, feedback, or collaboration enquiries, please reach out to ysuzuki2@worldbank.org.
License
This project is licensed under the MIT License together with the World Bank IGO Rider. The Rider is purely procedural: it reserves all privileges and immunities enjoyed by the World Bank, without adding restrictions to the MIT permissions. Please review both files before using, distributing or contributing.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file trend_narrative-0.3.0.tar.gz.
File metadata
- Download URL: trend_narrative-0.3.0.tar.gz
- Upload date:
- Size: 28.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
24b073fd85308231920ac0b7765c59c4dd9a9ac45f14c1b92d2827c51cd20040
|
|
| MD5 |
c14abb05ba6a2a14e29b29323ba56f4e
|
|
| BLAKE2b-256 |
cd49ba763dee1c762ffaf49779a75565489d4296072b031c7ec33f3ba320acbc
|
File details
Details for the file trend_narrative-0.3.0-py3-none-any.whl.
File metadata
- Download URL: trend_narrative-0.3.0-py3-none-any.whl
- Upload date:
- Size: 21.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2a2431ba5c3274039e023d76a39dd6be469f531f64238ad12d4cdd5f5da59002
|
|
| MD5 |
a8fa7b9d906ddaaeb26ddb7793b1f7cd
|
|
| BLAKE2b-256 |
42c38fcf1527319f4706ca897706b699aa5a3d75c205eb085159572c3809cbbc
|