fev: Forecast evaluation library

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

fev

fev (Forecast EValuation library) is a lightweight package that makes it easy to benchmark time series forecasting models.

Extensible: Easy to define your own forecasting tasks and benchmarks.
Reproducible: Ensures that the results obtained by different users are comparable.
Easy to use: Compatible with most popular forecasting libraries.
Minimal dependencies: Just a thin wrapper on top of 🤗datasets.

How is `fev` different from other benchmarking tools?

Existing forecasting benchmarks usually fall into one of two categories:

Standalone datasets without any supporting infrastructure. These provide no guarantees that the results obtained by different users are comparable. For example, changing the start date or duration of the forecast horizon totally changes the meaning of the scores.
Bespoke end-to-end systems that combine models, datasets and forecasting tasks. Such packages usually come with lots of dependencies and assumptions, which makes extending or integrating these libraries into existing systems difficult.

fev aims for the middle ground - it provides the core benchmarking functionality without introducing unnecessary constraints or bloated dependencies. The library supports point & probabilistic forecasting, different types of covariates, as well as all popular forecasting metrics.

📝 Updates

2025-09-16: The new version 0.6.0 contains major new functionality, updated documentation, as well as some breaking changes to the Task API. Please check the release notes for more details.

⚙️ Installation

pip install fev

🚀 Quickstart

Create a task from a dataset stored on Hugging Face Hub

import fev

task = fev.Task(
    dataset_path="autogluon/chronos_datasets",
    dataset_config="m4_hourly",
    horizon=24,
)

Iterate over the rolling evaluation windows:

for window in task.iter_windows():
    past_data, future_data = window.get_input_data()

past_data contains the past data before the forecast horizon (item ID, past timestamps, target, all covariates).
future_data contains future data that is known at prediction time (item ID, future timestamps, and known covariates)

Make predictions

def naive_forecast(y: list, horizon: int) -> dict[str, list[float]]:
    # Make predictions for a single time series
    return {"predictions": [y[-1] for _ in range(horizon)]}

predictions_per_window = []
for window in task.iter_windows():
    past_data, future_data = window.get_input_data()
    predictions = [
        naive_forecast(ts[task.target_column], task.horizon) for ts in past_data
    ]
    predictions_per_window.append(predictions)

Get an evaluation summary

task.evaluation_summary(predictions_per_window, model_name="naive")
# {'model_name': 'naive',
#  'dataset_path': 'autogluon/chronos_datasets',
#  'dataset_config': 'm4_hourly',
#  'horizon': 24,
#  'num_windows': 1,
#  'initial_cutoff': -24,
#  'window_step_size': 24,
#  'min_context_length': 1,
#  'max_context_length': None,
#  'seasonality': 1,
#  'eval_metric': 'MASE',
#  'extra_metrics': [],
#  'quantile_levels': None,
#  'id_column': 'id',
#  'timestamp_column': 'timestamp',
#  'target_column': 'target',
#  'generate_univariate_targets_from': None,
#  'past_dynamic_columns': [],
#  'excluded_columns': [],
#  'task_name': 'm4_hourly',
#  'test_error': 3.815112047601983,
#  'training_time_s': None,
#  'inference_time_s': None,
#  'dataset_fingerprint': '19e36bb78b718d8d',
#  'trained_on_this_dataset': False,
#  'fev_version': '0.6.0',
#  'MASE': 3.815112047601983}

The evaluation summary contains all information necessary to uniquely identify the forecasting task.

Multiple evaluation summaries produced by different models on different tasks can be aggregated into a single table.

# Dataframes, dicts, JSON or CSV files supported
summaries = "https://raw.githubusercontent.com/autogluon/fev/refs/heads/main/benchmarks/example/results/results.csv"
fev.leaderboard(summaries)
# | model_name     |   skill_score |   win_rate | ... |
# |:---------------|--------------:|-----------:| ... |
# | auto_theta     |         0.126 |      0.667 | ... |
# | auto_arima     |         0.113 |      0.667 | ... |
# | auto_ets       |         0.049 |      0.444 | ... |
# | seasonal_naive |         0     |      0.222 | ... |

📚 Documentation

Tutorials
- Quickstart: Define a task and evaluate a model.
- Datasets: Use fev with your own datasets.
- Tasks & benchmarks: Advanced features for defining tasks and benchmarks.
- Adapters: Easily convert data into formats expected by popular time series libraries like AutoGluon, Nixtlaverse, GluonTS, Darts and more.
- Models: Evaluate your models and submit results to the leaderboard.
API reference

Examples of model implementations compatible with fev are available in examples/.

🏅 Leaderboards

We host leaderboards obtained using fev under https://huggingface.co/spaces/autogluon/fev-bench. This leaderboard includes results for the benchmark from fev-bench: A Realistic Benchmark for Time Series Forecasting. Previous results for Chronos Benchmark II are available in benchmarks/chronos_zeroshot/.

📈 Datasets

Repositories with datasets in format compatible with fev:

Citation

If you find this package useful for your research, please consider citing the associated paper(s):

@article{shchur2025fev,
  title={{fev-bench}: A Realistic Benchmark for Time Series Forecasting},
  author={Shchur, Oleksandr and Ansari, Abdul Fatir and Turkmen, Caner and Stella, Lorenzo and Erickson, Nick and Guerron, Pablo and Bohlke-Schneider, Michael and Wang, Yuyang},
  year={2025},
  eprint={2509.26468},
  archivePrefix={arXiv},
  primaryClass={cs.LG}
}

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

shchur

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.8.0

May 19, 2026

This version

0.7.0

Jan 15, 2026

0.6.1

Oct 1, 2025

0.6.0

Sep 16, 2025

0.6.0rc4 pre-release

Sep 16, 2025

0.6.0rc3 pre-release

Sep 2, 2025

0.6.0rc2 pre-release

Aug 12, 2025

0.6.0rc1 pre-release

Aug 4, 2025

0.5.0

Jul 3, 2025

0.5.0rc3 pre-release

Jun 4, 2025

0.5.0rc2 pre-release

Jun 2, 2025

0.5.0rc1 pre-release

May 27, 2025

0.4.1

Apr 9, 2025

0.4.0

Apr 8, 2025

0.3.0

Dec 17, 2024

0.2.1

Dec 13, 2024

0.2.0

Dec 12, 2024

0.1.0

Dec 9, 2024

0.0.1a2 pre-release

Dec 9, 2024

0.0.1a1 pre-release

Dec 9, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fev-0.7.0.tar.gz (71.9 kB view details)

Uploaded Jan 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fev-0.7.0-py3-none-any.whl (42.7 kB view details)

Uploaded Jan 15, 2026 Python 3

File details

Details for the file fev-0.7.0.tar.gz.

File metadata

Download URL: fev-0.7.0.tar.gz
Upload date: Jan 15, 2026
Size: 71.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for fev-0.7.0.tar.gz
Algorithm	Hash digest
SHA256	`d1bb684da28db29887211d78e386f9ef3114a312ebb7d33ffcdbed9019eb9672`
MD5	`c1a3691a9e7dc7eb4f7ec9ba7140f2e0`
BLAKE2b-256	`610df6959e92a96a5602ddadc2f1cd7cdd4eb3fe24ebc947fda5677d0ed7671e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fev-0.7.0.tar.gz:

Publisher: publish-to-pypi.yml on autogluon/fev

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fev-0.7.0.tar.gz
- Subject digest: d1bb684da28db29887211d78e386f9ef3114a312ebb7d33ffcdbed9019eb9672
- Sigstore transparency entry: 828421796
- Sigstore integration time: Jan 15, 2026
Source repository:
- Permalink: autogluon/fev@5b813ab8ab3c579d4daa3e298c6efc141113d7f8
- Branch / Tag: refs/tags/v0.7.0rc1
- Owner: https://github.com/autogluon
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yml@5b813ab8ab3c579d4daa3e298c6efc141113d7f8
- Trigger Event: release

File details

Details for the file fev-0.7.0-py3-none-any.whl.

File metadata

Download URL: fev-0.7.0-py3-none-any.whl
Upload date: Jan 15, 2026
Size: 42.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for fev-0.7.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a71754b89f3fe36cfc8e53eecc3f26b722cfff6c8d1fe01b9ba099f8a0aec840`
MD5	`822ce4e8830380e560eecc412fb23e9b`
BLAKE2b-256	`baaae3e808a7f14393ab5a9eec807ac39cf8fde54dc7a513950084b4557ac534`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fev-0.7.0-py3-none-any.whl:

Publisher: publish-to-pypi.yml on autogluon/fev

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fev-0.7.0-py3-none-any.whl
- Subject digest: a71754b89f3fe36cfc8e53eecc3f26b722cfff6c8d1fe01b9ba099f8a0aec840
- Sigstore transparency entry: 828421799
- Sigstore integration time: Jan 15, 2026
Source repository:
- Permalink: autogluon/fev@5b813ab8ab3c579d4daa3e298c6efc141113d7f8
- Branch / Tag: refs/tags/v0.7.0rc1
- Owner: https://github.com/autogluon
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yml@5b813ab8ab3c579d4daa3e298c6efc141113d7f8
- Trigger Event: release

fev 0.7.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

fev

How is `fev` different from other benchmarking tools?

📝 Updates

⚙️ Installation

🚀 Quickstart

📚 Documentation

🏅 Leaderboards

📈 Datasets

Citation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

fev 0.7.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

fev

How is fev different from other benchmarking tools?

📝 Updates

⚙️ Installation

🚀 Quickstart

📚 Documentation

🏅 Leaderboards

📈 Datasets

Citation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

How is `fev` different from other benchmarking tools?