Quantile Error Metrics for Wind Climates

These details have not been verified by PyPI

Project links

Project description

`climate-error`: Quantile Error Metrics for Wind Climates

Error Metrics for Wind Climates based on Statistical Quantiles and Wasserstein Distances.

Description

Climate error metrics quantify the agreement between wind speed climates by comparing the statistical distributions of predictions and observations.

For two time series of predictions $$x_p(t)$$ and observations $$x_o(t)$$, conventional time-dependent error metrics are defined from the difference of paired time records:

$$\epsilon_n = x_p(t_n) - x_o(t_n), \quad n = 1, \ldots, N,$$

where $t_n$ is a time index and $\epsilon$ is the error from which metrics such as the BIAS, the root of the mean squared errors (RMSE) and the standard deviation of the error (STDE) are computed:

$$\text{BIAS} = \mathbb{E}[\bf{\epsilon}], \qquad \text{RMSE} = \sqrt{\mathbb{E}[\bf{\epsilon}^2]}, \qquad \text{STDE}^2 = \text{RMSE}^2 - \text{BIAS}^2.$$

While BIAS is independent of time alignment, RMSE and STDE are affected by time‑lag (phase) errors, which can dominate these metrics even when predictions and observations share the same climate statistics.

The climate-error module enables the computation of time-independent error metrics by comparing the probability distributions that govern the datasets. This is achieved by computing the differences between the quantiles of the predicted and observed wind‑speed distributions, given by the quantile functions $Q_p(u)$ and $Q_o(u)$, respectively. The quantile-based metrics provided by climate-error are defined as:

$$\text{BIAS} = \int_0^1 \left( Q_p(u) - Q_o(u) \right) du \equiv \mathbb{E}[\mathbf{p}] - \mathbb{E}[\mathbf{o}],$$

$$\text{RMSE}^2 \approx \int_0^1 \left[ Q_p(u) - Q_o(u) \right]^2 du,$$

$$\text{STDE}^2 = \text{RMSE}^2 - \text{BIAS}^2.$$

These expressions correspond to signed first-order and second-order Wasserstein distances between distributions, yielding error metrics that:

are independent of time alignment,
separate systematic (accuracy) and random (precision) error,
retain the physical units of wind speed,
are directly comparable to conventional BIAS, RMSE, and STDE.

A related measure is the area metric of Ferson et al. (2008). Yet, its value cannot be split into systematic and random error as its formulation is an absolute first-order Wasserstein distance (also known as the earth mover's distance).

These metrics are applicable to empirical distributions derived from samples, analytical distributions (e.g. Weibull), and comparisons between analytical and empirical distributions (e.g. goodness-of-fit).

For further details on the mathematical formulation and the statistical arguments favoring their application, please refer to Veiga Rodrigues & Odderskov (2025).

What's in this repository

Code: reference implementation of the climate-error metrics (BSD 3‑Clause).
Tests: software test suite in tests/ (BSD 3‑Clause).
Data (derived): small, curated subsets produced from publicly available sources (DTU Data datasets and the NEWA API), stored under example_wind_data/ or generated by scripts in that folder. For Attribution & licensing please refer to ATTRIBUTION.md for per-source details. Datasets: see La Ventosa dataset (2021) and Capel-Cynon dataset (2021); NEWA API details: NEWA Application.
Examples: python scripts and results in examples/ (BSD 3‑Clause) with application of the climate-error metrics focusing the datasets in example_wind_data/.

Other methods provided in the source code

The methods used to fit Weibull distributions to a data sample are described in EWA (1989) and EMD (2014). Other implementations of this method may be found in Climatic and ANEMOI software repositories.

License

This project is released under the BSD 3‑Clause License. See the LICENSE file for details.

Citation

If you use this repository or the underlying methodology, please cite:

Veiga Rodrigues C & Odderskov I (2025). Climate error metrics based on Wasserstein distances. Applied Energy, Volume 398, 126392. DOI: 10.1016/j.apenergy.2025.126392

Also consider citing this repository (see CITATION.cff).

How to use the `climate-error` metrics

Walkthrough

After installation, the metrics can be used by importing the appropriate module:

>>> import climate_error as climerr
>>> climerr.__all__
['__version__', '__citation__', 'get_concurrent_records', 'estimate_bins', 'weibull_moment', 'weibull_cdf', 'weibull_ppf', 'weibull_pdf', 'ewa_weibull_fit_sample', 'ewa_weibull_fit_hist', 'numerical_wasserstein', 'error_metrics_wasserstein_weibull', 'error_metrics_wasserstein_2sample', 'error_metrics_wasserstein_weibull_vs_sample', 'error_metrics_timeseries', 'normalise_error_metrics']

Consider two time series with observations and predictions:

>>> import numpy as np

>>> N = 365*24*6

>>> Ao, Ko = 6., 1.8
>>> wso = np.random.weibull(Ko, N) * Ao  # observations

>>> Ap, Kp = 8., 2.5
>>> wsp = np.random.weibull(Kp, N) * Ap  # predictions

where for demonstration purposes these were taken from random samples of Weibull distributions. As the samples should be completely uncorrelated, index-based errors are expected to be high:

>>> t_bias, t_stde, t_rmse = climerr.error_metrics_timeseries(wsp, wso)
>>> t_bias_n, t_stde_n, t_rmse_n = climerr.normalise_error_metrics(t_bias, t_stde, t_rmse, wso.mean())

>>> print(f"Time dependent error (m/s): {t_bias=:.1f}  {t_stde=:.1f}  {t_rmse=:.1f}")
Time dependent error (m/s): t_bias=1.8  t_stde=4.3  t_rmse=4.7

>>> print(f"Time dependent error (%): {t_bias_n=:.0f}%  {t_stde_n=:.0f}%  {t_rmse_n=:.0f}%")
Time dependent error (%): t_bias_n=34%  t_stde_n=81%  t_rmse_n=88%

where in this context STDE is the biased standard deviation of the error and whose definition matches $STDE^2 = RMSE^2 - BIAS^2$.

The error between the two Weibull distributions can be computed analytically via the error_metrics_wasserstein_weibull method:

>>> w_wso_mu = climerr.weibull_moment(Ao, Ko, n=1)
>>> print(f"Weibull dist. observations mean = {w_wso_mu:.2f} m/s")
Weibull dist. observations mean = 5.34 m/s

>>> w_bias, w_stde, w_rmse = climerr.error_metrics_wasserstein_weibull(Ap, Kp, Ao, Ko)
>>> w_bias_n, w_stde_n, w_rmse_n = climerr.normalise_error_metrics(w_bias, w_stde, w_rmse, w_wso_mu)
>>> print(f"Weibull dist. error (m/s): {w_bias=:.1f}  {w_stde=:.1f}  {w_rmse=:.1f}")
Weibull dist. error (m/s): w_bias=1.8  w_stde=0.3  w_rmse=1.8

>>> print(f"Weibull dist. error (%): {w_bias_n=:.0f}%  {w_stde_n=:.0f}%  {w_rmse_n=:.0f}%")
Weibull dist. error (%): w_bias_n=33%  w_stde_n=6%  w_rmse_n=34%

Whereas the index-based errors were showing high STDE, the error metrics focused on the agreement between the Weibull distributions have much less STDE, thus RMSE is mainly due to the BIAS.

The climate-error metrics are applicable to samples, where for each dataset the respective empirical distribution is estimated and their agreement is computed through Wasserstein distances, following the framework in Veiga Rodrigues & Odderskov (2025). This is achieved by error_metrics_wasserstein_2sample:

>>> q_bias, q_stde, q_rmse = climerr.error_metrics_wasserstein_2sample(wsp, wso)
>>> q_bias_n, q_stde_n, q_rmse_n = climerr.normalise_error_metrics(q_bias, q_stde, q_rmse, wso.mean())
>>> print(f"Empirical dist. error (m/s): {q_bias=:.1f}  {q_stde=:.1f}  {q_rmse=:.1f}")
Empirical dist. error (m/s): q_bias=1.8  q_stde=0.3  q_rmse=1.8

>>> print(f"Empirical dist. error (%): {q_bias_n=:.0f}%  {q_stde_n=:.0f}%  {q_rmse_n=:.0f}%")
Empirical dist. error (%): q_bias_n=34%  q_stde_n=6%  q_rmse_n=34%

where the values are fully inline with the analytical error associated with the Weibull distributions.

Finally, a sample can also be compared against an analytical Weibull distribution through error_metrics_wasserstein_weibull_vs_sample. For a Weibull distribution whose parameters were fitted to a data sample, this method may be used to infer the goodness-of-fit:

>>> climerr.error_metrics_wasserstein_weibull_vs_sample(Ao, Ko, wso)

>>> Afit, Kfit = climerr.ewa_weibull_fit_sample(wso)
>>> print(f"{Afit=:.2f} m/s,  {Kfit=:.3f}")
Afit=5.98 m/s,  Kfit=1.795

>>> g_bias, g_stde, g_rmse = climerr.error_metrics_wasserstein_weibull_vs_sample(Afit, Kfit, wso)
>>> g_bias_n, g_stde_n, g_rmse_n = climerr.normalise_error_metrics(g_bias, g_stde, g_rmse, wso.mean())
>>> print(f"Weibull goodness-of-fit (m/s): {g_bias=:.2f}  {g_stde=:.2f}  {g_rmse=:.2f}")
Weibull goodness-of-fit (m/s): g_bias=0.00  g_stde=0.03  g_rmse=0.03

>>> print(f"Weibull goodness-of-fit (%): {g_bias_n=:.0f}%  {g_stde_n=:.0f}%  {g_rmse_n=:.0f}%")
Weibull goodness-of-fit (%): g_bias_n=0%  g_stde_n=1%  g_rmse_n=1%

where the fitting method in ewa_weibull_fit_sample follows the European Wind Atlas approach EWA (1989).

Examples

Four examples are provided in the examples/ where one demonstrates the concept behind the use of climate-error metrics:

examples/run_periodic_lag_error.py,

and three other show applications of the climate-error metrics to the wind data in example_wind_data/:

To run any of the examples, the run_docker.sh bash script can be executed to initiate an interactive docker container that will be destroyed after exiting it. The code executed while initializing the container install the code and ensures software tests are run. This allows to quickly test climate-error and its application examples.

In case there are issues with attaining a graphical environment, the run_docker.sh bash script can be executed instead and figures will be saved as PNG files, together with results TXT tables. Instruction on how to copy figures and other files to the host system are available in the examples documentation.

Taking the run_experiment_realcase.py example:

~/climate-error $ ./run_docker.sh 
non-network local connections being added to access control list
channels:
  - conda-forge
...
===================== test session starts ==================
...
================= 7 passed, 8 warnings in 2.95s ============ 

(base) root@ceebc070f35f:/opt/repo#

after the interactive shell is ready, the code in the examples can be run:

(base) root@ceebc070f35f:/opt/repo#  python examples/run_experiment_realcase.py
REPO_DIR=PosixPath('/opt/repo/examples')
DATA_DIR=PosixPath('/opt/repo/example_wind_data')
Sampling interval determined from TimeStamps as 30.0 min
Sampling interval determined from TimeStamps as 30.0 min
Sampling interval determined from TimeStamps as 30.0 min
Sampling interval determined from TimeStamps as 30.0 min
Sampling interval determined from TimeStamps as 30.0 min
Sampling interval determined from TimeStamps as 30.0 min
Sampling interval determined from TimeStamps as 30.0 min
Sampling interval determined from TimeStamps as 30.0 min
Sampling interval determined from TimeStamps as 30.0 min
        Time-dependent error  Climate error (empirical)  Time-dependent error for Z-scaled TS  Climate error for Z-scaled TS (empirical)
BIAS     -0.31 m/s   (-3.7%)        -0.31 m/s   (-3.7%)                   -0.00 m/s   (-0.0%)                        -0.00 m/s   (-0.0%)
STDE      1.96 m/s   (23.0%)         0.21 m/s   ( 2.4%)                    1.99 m/s   (23.4%)                         0.15 m/s   ( 1.7%)
RMSE      1.98 m/s   (23.3%)         0.37 m/s   ( 4.4%)                    1.99 m/s   (23.4%)                         0.15 m/s   ( 1.7%)
 MAE      1.50 m/s   (17.6%)                        nan                    1.50 m/s   (17.5%)                                        nan
Area                     nan         0.31 m/s   ( 3.7%)                                   nan                         0.12 m/s   ( 1.4%)

and the following plots should be produced:

Example plot of the time series. Distribution plots of the datasets with the climate error metrics. Taylor diagram of the climate error metrics.

References

Veiga Rodrigues C, Odderskov I (2025). Climate error metrics based on Wasserstein distances. Appli Energ, 398:126392. DOI: 10.1016/j.apenergy.2025.126392
Hansen KS, Vasiljevic N, Sørensen SA (2021). Resource data from the La Ventosa mast. DTU Data. doi:10.11583/DTU.14135609
Hansen KS, Vasiljevic N, Sørensen SA (2021). Resource data from the Capel Cynon masts. DTU Data. doi:10.11583/DTU.14135627
New European Wind Atlas (NEWA) — About/Terms & data access. Link.
Troen I, & Lundtang Petersen E (1989). European Wind Atlas. Risø National Laboratory. DTU Orbit URL
EMD International A/S (2014). Fitting Weibull Parameters for Wind Energy Applications. PDF Document URL.
Climatic: Wind Data Visualization (GitHub). GitHub repository, file climatic/weibull_est.py
ANEMOI — EDF's pre‑alpha Python package for wind data analysis. GitHub repository, file analysis/weibull.py
Ferson S, Oberkampf WL, Ginzburg L (2008). Model validation and predictive capability for the thermal challenge problem. Comput Methods Appl Mech Eng, 197:2408-30. DOI: 10.1016/j.cma.2007.07.030

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.0.1

Apr 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

climate_error-1.0.1.tar.gz (168.6 kB view details)

Uploaded Apr 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

climate_error-1.0.1-py3-none-any.whl (20.6 kB view details)

Uploaded Apr 23, 2026 Python 3

File details

Details for the file climate_error-1.0.1.tar.gz.

File metadata

Download URL: climate_error-1.0.1.tar.gz
Upload date: Apr 23, 2026
Size: 168.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for climate_error-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`4da212aecbb36ee77de1a9cf90792843adf64354482b96e5bb60205b8253eb9a`
MD5	`4e8af9cea1366796daf6cccf06a5a288`
BLAKE2b-256	`d799750d4f060fb6afaca9da931447cbb161f1d4f38d262722a057ab7601e1b6`

See more details on using hashes here.

File details

Details for the file climate_error-1.0.1-py3-none-any.whl.

File metadata

Download URL: climate_error-1.0.1-py3-none-any.whl
Upload date: Apr 23, 2026
Size: 20.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for climate_error-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e754174a3f64b59f668c705d64dbd83152c87e949c348d5b05f04bb936fdb0ad`
MD5	`27a3c79706e147c7e747b202ee78007e`
BLAKE2b-256	`342236a34a940981f27dc21931a4737227fbb3ad89dc133680bc5b49ff7a098b`

See more details on using hashes here.

climate-error 1.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

`climate-error`: Quantile Error Metrics for Wind Climates

Description

What's in this repository

Other methods provided in the source code

License

Citation

How to use the `climate-error` metrics

Walkthrough

Examples

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

climate-error 1.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

climate-error: Quantile Error Metrics for Wind Climates

Description

What's in this repository

Other methods provided in the source code

License

Citation

How to use the climate-error metrics

Walkthrough

Examples

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`climate-error`: Quantile Error Metrics for Wind Climates

How to use the `climate-error` metrics