A benchmarking tool for weather forecasts based on weather stations

These details have not been verified by PyPI

Project description

StationBench

StationBench is a Python library for benchmarking weather forecasts against weather station data. It provides tools to calculate metrics, visualize results, and compare different forecast models.

Features

Pre-processed ground truth data from 10,000+ weather stations around the world included in the package
Calculate RMSE and other metrics between forecasts and ground truth data
Support for multiple weather variables (temperature, wind speed, solar radiation)
Regional analysis capabilities (Europe, North America, Global, etc.)
Integration with Weights & Biases for experiment tracking

Installation

Using poetry:

poetry install

Documentation

Full documentation is available in the docs/ directory:

Setup - How to setup StationBench
Tutorial - Basic usage of StationBench

Quick Start

Data Format Requirements

Forecast Data

Must include dimensions: latitude, longitude, time
Variables should include:
- 10m_wind_speed (or custom name)
- 2m_temperature (or custom name)

Ground Truth Data

Stationbench comes with ready-to-use weather stations from around the world. The benchmarking data is a subset of the Meteostat dataset. It contains weather data from 2018-2024 for 10m wind speed and 2m temperature. The data is provided by the following organizations:

Deutscher Wetterdienst
NOAA
Government of Canada
MET Norway
European Data Portal
Offene Daten Österreich

Source: Meteostat (CC BY-NC 4.0)

The benchmarking data can be accessed from https://opendata.jua.sh/stationbench/meteostat_benchmark.zarr.

Map of weather stations used for benchmarking

Number of stations reporting over time

Besides the provided benchmarking data, you can also use your own ground truth data. The ground truth data must be in zarr format and must include the following dimensions and coordinates:

Must include dimensions: station_id, time
Must include coordinates: latitude, longitude

Calculate Metrics

This script computes metrics (RMSE only for now) by comparing forecast data against ground truth data for specified time periods and regions. Output are RMSE benchmarks for different variables and lead times in the format of the ground truth data.

Options

--forecast_loc: Location of the forecast data (required)
--ground_truth_loc: Location of the ground truth data (defaults to https://opendata.jua.sh/stationbench/meteostat_benchmark.zarr)
--start_date: Start date for benchmarking (required)
--end_date: End date for benchmarking (required)
--output_loc: Output path for benchmarks (required)
--region: Region to benchmark (see regions.py for available regions)
--name_10m_wind_speed: Name of 10m wind speed variable (optional)
--name_2m_temperature: Name of 2m temperature variable (optional)

If variable name is not provided, no metrics will be computed for that variable.

Example usage

poetry run python stationbench/calculate_metrics.py \
    --forecast_loc forecast.zarr \
    --start_date 2023-01-01 --end_date 2023-12-31 --output_loc forecast-rmse.zarr \
    --region europe --name_10m_wind_speed "10si" --name_2m_temperature "2t"

Compare forecasts

After generating the metrics, you can use the compare_forecasts.py script to compute metrics, create visualizations, and log the results to Weights & Biases (W&B).

What it does

The compare_forecasts.py script:

Computes RMSE (Root Mean Square Error) and skill scores for different variables and lead time ranges.
Generates geographical scatter plots showing the spatial distribution of errors.
Creates line plots showing the temporal evolution of errors.
Logs all visualizations and metrics to a W&B run.

Options

--evaluation_benchmarks_loc: Path to the evaluation benchmarks (required)
--reference_benchmark_locs: Dictionary of reference benchmark locations, the first one is used for skill score (required)
--run_name: W&B run name (required)
--regions: Comma-separated list of regions, see regions.py for available regions (required)

Example

poetry run python stationbench/compare_forecasts.py \
    --evaluation_benchmarks_loc forecast-rmse.zarr \
    --reference_benchmark_locs '{"HRES": "hres-rmse.zarr"}' \
    --regions europe \
    --run_name wandb-run-name

Contributing

We welcome contributions! Please see our CONTRIBUTING.md for details.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.2

Feb 26, 2025

0.1.1

Jan 16, 2025

This version

0.1.0

Jan 15, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stationbench-0.1.0.tar.gz (11.0 kB view details)

Uploaded Jan 15, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

stationbench-0.1.0-py3-none-any.whl (11.2 kB view details)

Uploaded Jan 15, 2025 Python 3

File details

Details for the file stationbench-0.1.0.tar.gz.

File metadata

Download URL: stationbench-0.1.0.tar.gz
Upload date: Jan 15, 2025
Size: 11.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.5 CPython/3.11.9 Linux/5.15.0-113-generic

File hashes

Hashes for stationbench-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`9bb64a8a2f021dc21040bd8998ffcaadc74737e3c4908a44f3a706ce2bd6853a`
MD5	`ee2845f4d326086d864963d1a4757b8b`
BLAKE2b-256	`7c00d6bc1f344c99e2b9baabe2e635b16bf883e8dfc3cfb13f19c06f81f15db6`

See more details on using hashes here.

File details

Details for the file stationbench-0.1.0-py3-none-any.whl.

File metadata

Download URL: stationbench-0.1.0-py3-none-any.whl
Upload date: Jan 15, 2025
Size: 11.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.5 CPython/3.11.9 Linux/5.15.0-113-generic

File hashes

Hashes for stationbench-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5602b0d7368325db8a637e7fa5e54292eab940fe3e94189317e7ebdbeeecc587`
MD5	`0ec71998c78625f360be103c2d9124ec`
BLAKE2b-256	`40f480328f36b9774d33f6463a2a1a864d4ebff090b348cce2d15377009fd3bd`

See more details on using hashes here.

stationbench 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

StationBench

Features

Installation

Documentation

Quick Start

Data Format Requirements

Forecast Data

Ground Truth Data

Calculate Metrics

Options

Example usage

Compare forecasts

What it does

Options

Example

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes