The robust European language model benchmark.
(formerly known as ScandEval)
Maintainer
- Dan Saattrup Smart (@saattrupdan, dan.smart@alexandra.dk)
Installation and usage
See the documentation for more information.
Reproducing the evaluation datasets
All datasets used in this project are generated using the scripts located in the src/scripts folder. To reproduce a dataset, run the corresponding script with the following command:
uv run src/scripts/<name-of-script>.py
Replace <name-of-script> with the name of the script you wish to execute, e.g.,
uv run src/scripts/create_allocine.py
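If you want to regenerate several datasets in one go, the command pattern above can be wrapped in a small helper. The following is a minimal sketch, not part of EuroEval itself: the names `build_command` and `run_script` are hypothetical, and it assumes you are in the repository root with `uv` on your PATH.

```python
import subprocess


def build_command(script_name: str, scripts_dir: str = "src/scripts") -> list[str]:
    """Build the `uv run` command for a single dataset script.

    `script_name` may be given with or without the `.py` suffix.
    (Hypothetical helper for illustration only.)
    """
    if not script_name.endswith(".py"):
        script_name = f"{script_name}.py"
    return ["uv", "run", f"{scripts_dir}/{script_name}"]


def run_script(script_name: str) -> None:
    """Run one dataset script and raise if it exits with an error."""
    subprocess.run(build_command(script_name), check=True)


if __name__ == "__main__":
    # Only print the command here; call run_script(...) to actually execute it.
    print(" ".join(build_command("create_allocine")))
```

Calling `run_script` in a loop over the script names you need would regenerate each dataset in turn, stopping at the first script that fails.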
Contributors :pray:
A huge thank you to all the contributors who have helped make this project a success!
Contribute to EuroEval
We welcome contributions to EuroEval! Whether you're fixing bugs, adding features, or contributing new datasets, your help makes this project better for everyone.
- General contributions: Check out our contribution guidelines for information on how to get started.
- Adding datasets: If you're interested in adding a new dataset to EuroEval, we have a dedicated guide with step-by-step instructions.
Special thanks
- Thanks to Google for sponsoring Gemini credits as part of their Google Cloud for Researchers Program.
- Thanks to @Mikeriess for evaluating many of the larger models on the leaderboards.
- Thanks to OpenAI for sponsoring OpenAI credits as part of their Researcher Access Program.
- Thanks to UWV and KU Leuven for sponsoring the Azure OpenAI credits used to evaluate GPT-4-turbo in Dutch.
- Thanks to Miðeind for sponsoring the OpenAI credits used to evaluate GPT-4-turbo in Icelandic and Faroese.
- Thanks to CHC for sponsoring the OpenAI credits used to evaluate GPT-4-turbo in German.
Citing EuroEval
To cite the framework, feel free to use the following:
@article{smart2024encoder,
  title={Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks},
  author={Smart, Dan Saattrup and Enevoldsen, Kenneth and Schneider-Kamp, Peter},
  journal={arXiv preprint arXiv:2406.13469},
  year={2024}
}
@inproceedings{smart2023scandeval,
  author = {Smart, Dan Saattrup},
  booktitle = {Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)},
  month = may,
  pages = {185--201},
  title = {{ScandEval: A Benchmark for Scandinavian Natural Language Processing}},
  year = {2023}
}
File details
Details for the file euroeval-16.16.1.tar.gz.
File metadata
- Download URL: euroeval-16.16.1.tar.gz
- Size: 1.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | fd2b88f7514a0e4d64b0fd11b11a25b154c75c4bfb6e45bca1ce9c1b34a3b81b |
| MD5 | 21d14826085564d7f4fcfc73d4a458b5 |
| BLAKE2b-256 | 25717b1ea0edf1e3b78746763796c09aebf2dd40574b67d990132875e88b5362 |
File details
Details for the file euroeval-16.16.1-py3-none-any.whl.
File metadata
- Download URL: euroeval-16.16.1-py3-none-any.whl
- Size: 245.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | b3dfb19b04bc332fa441e5c6b3ec226434be83d49f17b594230c2d2633af06ce |
| MD5 | 8c034e0b4c360b4ca1933b8a10d90892 |
| BLAKE2b-256 | e342a282e8fc1e72db1d859bef84af710fc622eb44fc76cf0b61e4dd6c9ad8e7 |
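The digests listed for each file can be checked locally after downloading. The following is a minimal sketch using only Python's standard `hashlib` module. The helper names `file_digest` and `verify_file` are hypothetical, and the sketch assumes that "BLAKE2b-256" denotes BLAKE2b with a 32-byte digest.

```python
import hashlib


def file_digest(path: str, algorithm: str = "sha256", chunk_size: int = 1 << 20) -> str:
    """Return the hex digest of a file, reading it in chunks.

    `algorithm` is "sha256", "md5", or "blake2b-256" (assumed here to be
    BLAKE2b with a 32-byte digest).
    """
    if algorithm == "blake2b-256":
        hasher = hashlib.blake2b(digest_size=32)
    else:
        hasher = hashlib.new(algorithm)
    with open(path, "rb") as f:
        # Read fixed-size chunks so large archives need not fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def verify_file(path: str, expected_hex: str, algorithm: str = "sha256") -> bool:
    """Compare a file's digest against an expected hex string."""
    return file_digest(path, algorithm) == expected_hex.strip().lower()
```

For example, `verify_file("euroeval-16.16.1.tar.gz", "fd2b88f7514a0e4d64b0fd11b11a25b154c75c4bfb6e45bca1ce9c1b34a3b81b")` should return `True` for an intact download, assuming the archive sits in the current working directory.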