A reproducible benchmark framework for evaluating long-range dependence estimators on canonical, contaminated, and observational time series.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

davianc

These details have not been verified by PyPI

Project links

Documentation

Project description

lrdbench

A reproducible benchmark framework for evaluating long-range dependence estimators on canonical, contaminated, and observational time series.

Documentation: lrdbench.readthedocs.io (built with MkDocs and Read the Docs).

Current public release: 1.0.3. No DOI is attached yet; cite the software using CITATION.cff and the GitHub release.

lrdbench is a research-oriented benchmarking framework for studying the behaviour of long-range dependence (LRD) estimators across three distinct settings:

ground-truth mode for canonical synthetic time series with declared target truth;
stress-test mode for synthetic time series under controlled contamination;
observational mode for biomedical or user-provided time series without benchmark truth.

The framework is designed to support:

rigorous comparison of classical and new LRD estimators;
bundled temporal, spectral, geometric, wavelet, aggregation, and data-driven estimator families;
uncertainty-aware benchmarking, including empirical interval coverage where applicable;
robustness analysis under the bundled contamination operators: heavy-tailed noise, level shifts, outliers, and polynomial trends;
experimental data-driven baselines, including Random Forest, SVR, CNN, and LSTM estimators;
transparent failure analysis and validity-rate reporting;
manifest-driven, provenance-complete, reproducible benchmark execution.

Why this project exists

There is currently no widely adopted, comprehensive, reproducible benchmark specifically designed for long-range dependence estimation that simultaneously addresses:

canonical synthetic processes with known targets;
contamination-induced estimator instability;
uncertainty quantification and interval coverage;
observational biomedical time series with no benchmark truth;
extensible enrolment of new estimators under a common interface.

lrdbench aims to fill that gap.

It is especially intended to support the careful evaluation of the hypothesis that many classical second-order LRD estimators behave well in their intended stationary finite-variance regime, but become unstable, miscalibrated, or non-identifiable under nonstationarity, heavy-tailed fluctuations, artefacts, and other out-of-regime conditions.

Scope

lrdbench is a research benchmark framework. It is not:

a clinical decision system;
a diagnostic tool;
a guarantee of “true LRD” in arbitrary empirical signals;
a universal ranking oracle for all estimators in all regimes.

Benchmark results must always be interpreted in light of:

the declared benchmark mode;
the target estimand;
the source specification;
the contamination design;
the metric definitions;
the aggregation and leaderboard rules.

See RESEARCH_USAGE.md for the full policy.

Core features

Benchmark modes

Ground-truth mode
- bias, MAE, RMSE
- empirical coverage
- interval width
- validity rate
- runtime and efficiency
Stress-test mode
- estimate drift
- degradation ratios
- validity collapse
- coverage collapse
- robustness leaderboards
Observational mode
- instability across windows
- preprocessing sensitivity
- resampling variability
- failure analysis
- stability leaderboards

Supported data sources

implemented synthetic generators:
- fGn
- fBm
- ARFIMA(0,d,0)
- MRW
- fOU
contaminated synthetic pipelines
custom CSV datasets
future observational/API-based datasets

Reporting

HTML reports
Markdown reports
CSV exports
Parquet result stores
JSON metadata exports
LaTeX tables for publication workflows

Extensibility

pluggable estimator interface
manifest-driven benchmark runs
explicit estimator metadata and estimand declarations
run-local supervised training for built-in ML/NN baseline estimators
registry-based component enrolment

Bundled estimators

temporal Hurst-proxy methods: RS, DFA, DMA, AbsoluteMoment, Variance, VarianceResidual
spectral long-memory methods: GPH, Periodogram, WhittleMLE, ModifiedLocalWhittle
geometric and wavelet Hurst-proxy comparators
experimental data-driven baselines: MLRandomForest, MLSVR, MLCNN, MLLSTM

See docs/bundled_estimators.md and docs/estimator_status.md for names, targets, and interpretation status.

Design principles

lrdbench is built around a few non-negotiable principles:

Explicit estimands
Every estimator must declare the quantity it is intended to estimate.
Mode-aware evaluation
Truth-based metrics are not used where truth does not exist.
Failure transparency
Invalid outputs, crashes, and missing uncertainty are recorded explicitly.
Provenance preservation
Every benchmark result is traceable to a manifest, source, estimator configuration, and software version.
Reproducibility first
A benchmark run should be reproducible from a single manifest plus the relevant package version and data sources.

Installation

Core installation

pip install lrdbench

First run

After installing with reporting support, run the smallest packaged benchmark:

pip install "lrdbench[reports]"
lrdbench run smoke_ground_truth

From a repository checkout, the same quickstart is available as:

pip install -e ".[reports]"
python examples/quickstart_pure.py

The command prints the run identifier, result store, HTML report path, and output validation command. See the quickstart tutorial for the full walkthrough.

Preview before running

Use --dry-run when you want to inspect the benchmark grid before fitting estimators or writing outputs:

lrdbench run smoke_ground_truth --dry-run

The preview prints the benchmark mode, materialised record count, enrolled estimator count, total fit jobs, clean/contaminated split for stress tests, and global seed. This is the recommended first step before launching public-medium, custom, or data-driven suites.

Data-driven smoke benchmark

The RF/SVR data-driven smoke suite uses optional scikit-learn dependencies:

pip install "lrdbench[ml,reports]"
lrdbench run smoke_data_driven

From a repository checkout:

pip install -e ".[ml,reports]"
lrdbench run configs/suites/smoke_data_driven.yaml
python examples/data_driven_baseline_benchmark.py

Data-driven estimators are trained from the manifest-declared ml_training block and should be interpreted relative to that synthetic training distribution. See docs/data_driven_estimators.md.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

davianc

These details have not been verified by PyPI

Project links

Documentation

Release history Release notifications | RSS feed

1.2.1

Jun 26, 2026

1.1.0rc1 pre-release

Jun 26, 2026

This version

1.0.3

Jun 8, 2026

1.0.2

Apr 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lrdbench-1.0.3.tar.gz (653.4 kB view details)

Uploaded Jun 8, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lrdbench-1.0.3-py3-none-any.whl (114.5 kB view details)

Uploaded Jun 8, 2026 Python 3

File details

Details for the file lrdbench-1.0.3.tar.gz.

File metadata

Download URL: lrdbench-1.0.3.tar.gz
Upload date: Jun 8, 2026
Size: 653.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for lrdbench-1.0.3.tar.gz
Algorithm	Hash digest
SHA256	`5476879c9210866a79714758222a49b92531bebf6635c8c74954335412bebf58`
MD5	`75291a7283ea79bcb1df46bc444ba7b7`
BLAKE2b-256	`36691feb83c96593d71ac56cf26e1c5c50ad95f4453df31d68405661857cb80b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for lrdbench-1.0.3.tar.gz:

Publisher: release.yml on dave2k77/lrdbench

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lrdbench-1.0.3.tar.gz
- Subject digest: 5476879c9210866a79714758222a49b92531bebf6635c8c74954335412bebf58
- Sigstore transparency entry: 1758809328
- Sigstore integration time: Jun 8, 2026
Source repository:
- Permalink: dave2k77/lrdbench@479483f87da06a23ed66bb948065bc7f00b128e3
- Branch / Tag: refs/tags/v1.0.3
- Owner: https://github.com/dave2k77
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@479483f87da06a23ed66bb948065bc7f00b128e3
- Trigger Event: push

File details

Details for the file lrdbench-1.0.3-py3-none-any.whl.

File metadata

Download URL: lrdbench-1.0.3-py3-none-any.whl
Upload date: Jun 8, 2026
Size: 114.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for lrdbench-1.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6eb8f6e99fe7906c8024b5d812282333e6a67fbde915729ef3c3515f57136ee1`
MD5	`57128c641ed1faebff067fbfa559ff9b`
BLAKE2b-256	`36945b9f82edcca64b9e387b7b3dfe87043e21ab32b002af5a61ec48f8d80469`

See more details on using hashes here.

Provenance

The following attestation bundles were made for lrdbench-1.0.3-py3-none-any.whl:

Publisher: release.yml on dave2k77/lrdbench

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lrdbench-1.0.3-py3-none-any.whl
- Subject digest: 6eb8f6e99fe7906c8024b5d812282333e6a67fbde915729ef3c3515f57136ee1
- Sigstore transparency entry: 1758809485
- Sigstore integration time: Jun 8, 2026
Source repository:
- Permalink: dave2k77/lrdbench@479483f87da06a23ed66bb948065bc7f00b128e3
- Branch / Tag: refs/tags/v1.0.3
- Owner: https://github.com/dave2k77
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@479483f87da06a23ed66bb948065bc7f00b128e3
- Trigger Event: push

lrdbench 1.0.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

lrdbench

Why this project exists

Scope

Core features

Benchmark modes

Supported data sources

Reporting

Extensibility

Bundled estimators

Design principles

Installation

Core installation

First run

Preview before running

Data-driven smoke benchmark

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance