Skip to main content

Benchmarking weather and weather AI models using extreme events

Project description

Extreme Weather Bench (EWB)

Documentation Status

Read our blog post here | Documentation

As AI weather models are growing in popularity, we need a standardized set of community driven tests that evaluate the models across a wide variety of high-impact hazards. Extreme Weather Bench (EWB) builds on the successful work of WeatherBench and introduces a set of high-impact weather events, spanning across multiple spatial and temporal scales and different parts of the weather spectrum. We provide data to use for testing, standard metrics for evaluation by forecasters worldwide for each of the phenomena, as well as impact-based metrics. EWB is a community system and will be adding additional phenomena, test cases and metrics in collaboration with the worldwide weather and forecast verification community.

Events

EWB has cases broken down by multiple event types within src/extremeweatherbench/data/events.yaml between 2020 and 2024. EWB case studies are documented here.

Available:

Event Type Number of Cases
🌇 Heat Waves 46
🧊 Freezes 14
🌀 Tropical Cyclones 106
☔️ Atmospheric Rivers 56
🌪️ Severe Convection 115
Total Cases 337

EWB paper and talks

  • AMS 2025 talk: 1
  • AMS 2026 talks: 1, 2
  • EWB paper is in preparation to be released soon

How do I suggest new data, metrics, or otherwise get involved?

We welcome your involvement! The success of a benchmark suite rests on community involvement and feedback. There are several ways to get involved:

  • Get involved in community discussion using the discussion board
  • Submit new code requests using the issues
  • Send us email at hello@brightband.com

Installing EWB

Currently, the easiest way to install EWB is using pip or uv:

$ pip install extremeweatherbench

# Or, add to an existing uv virtual environment
$ uv add extremeweatherbench

If you'd like to install the most recent updates to EWB:

$ pip install git+https://github.com/brightbandtech/ExtremeWeatherBench.git 

For extra installation options:

# For running the data prep modules:
$ pip install "extremeweatherbench[data-prep]"
$ uv add "extremeweatherbench[data-prep]"

How to Run EWB

Running EWB on sample data (included) is straightforward.

Using Jupyter Notebook or a Script:

import extremeweatherbench as ewb

# Load in a forecast; here, we load in GFS initialized FCNv2 from the CIRA MLWP archive with a default variable built-in for convenience
fcnv2_heatwave_forecast = ewb.defaults.cira_fcnv2_heatwave_forecast

# Load in ERA5 with another default convenience variable 
era5_heatwave_target = ewb.defaults.era5_heatwave_target

# EvaluationObjects are used to evaluate a single forecast source against a single target source with a defined event type. Event types are declared with each case. One or more metrics can be evaluated with each EvaluationObject.
heatwave_evaluation_list = [
    ewb.inputs.EvaluationObject(
        event_type="heat_wave",
        metric_list=[
            ewb.metrics.MaximumMeanAbsoluteError(),
            ewb.metrics.RootMeanSquaredError(),
            ewb.metrics.MaximumLowestMeanAbsoluteError(),
        ],
        target=era5_heatwave_target,
        forecast=fcnv2_heatwave_forecast,
    ),
]
# Load in the EWB default list of event cases
case_metadata = ewb.cases.load_ewb_events_yaml_into_case_list()

# Create the evaluation class, with cases and evaluation objects declared
ewb_instance = ewb.evaluation(
    case_metadata=case_metadata,
    evaluation_objects=heatwave_evaluation_list,
)

# Execute a parallel run and return the evaluation results as a pandas DataFrame
heatwave_outputs = ewb_instance.run_evaluation(
    parallel_config={'n_jobs':16} # Uses 16 jobs with the loky backend as default
)

# Save the results
heatwave_outputs.to_csv('heatwave_evaluation_results.csv')

Using command line initialization:

$ ewb --default

Note: this will run every event type, case, target source, and metric for the individual event type as they become available for GFS initialized FourCastNetv2. It is expected a full evaluation will take some time, even on a large VM.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

extremeweatherbench-1.0.1.tar.gz (107.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

extremeweatherbench-1.0.1-py3-none-any.whl (121.0 kB view details)

Uploaded Python 3

File details

Details for the file extremeweatherbench-1.0.1.tar.gz.

File metadata

  • Download URL: extremeweatherbench-1.0.1.tar.gz
  • Upload date:
  • Size: 107.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for extremeweatherbench-1.0.1.tar.gz
Algorithm Hash digest
SHA256 36bab5f7eef310f468c5820bb54154e9698c6bdb6d8d0d1033b3583fbd7da6ea
MD5 3af8d37707f8ffa070c413ba81a4fd83
BLAKE2b-256 510b999659355c6bfafb2d844c7b2721b926d93b433446447a8fb67f8c1e956e

See more details on using hashes here.

Provenance

The following attestation bundles were made for extremeweatherbench-1.0.1.tar.gz:

Publisher: publish.yaml on brightbandtech/ExtremeWeatherBench

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file extremeweatherbench-1.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for extremeweatherbench-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 cd02b639af8f10761e7c18421bd5b63846a4d487b143dafedb8b9c46ea8ad1d4
MD5 a3342d13d361166175338dd497bb22c7
BLAKE2b-256 8a12791c57b56e68668a837b16f26635faf7a4792c193fa571bc4e522c2f36a3

See more details on using hashes here.

Provenance

The following attestation bundles were made for extremeweatherbench-1.0.1-py3-none-any.whl:

Publisher: publish.yaml on brightbandtech/ExtremeWeatherBench

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page