Installation
To install the package, simply run the following command in your favorite terminal:
$ pip install scandeval[all]
This will install all the model frameworks currently supported (pytorch, tensorflow, jax and spacy). If you know you only need one of these, you can install a slimmer package like so:
$ pip install scandeval[pytorch]
Lastly, if you are not interested in benchmarking models, but just want to use the package to download datasets, then the following command will do the trick:
$ pip install scandeval
Quickstart
Benchmarking from the Command Line
The easiest way to benchmark models is via the command line interface. After having installed the package, you can benchmark your favorite model like so:
$ scandeval --model_id <model_id>
Here model_id is the HuggingFace model ID, which can be found on the HuggingFace Hub. By default this will benchmark the model on all eligible datasets. If you want to benchmark on a specific dataset, this can be done via the --dataset flag. This will, for instance, evaluate the model on the AngryTweets dataset:
$ scandeval --model_id <model_id> --dataset angry-tweets
We can also filter by language. To benchmark all Danish models, say, this can be done using the --language flag, like so:
$ scandeval --language da
Multiple models, datasets and/or languages can be specified simply by repeating the relevant flags. Here is an example with two models:
$ scandeval --model_id <model_id1> --model_id <model_id2> --dataset angry-tweets
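The same goes for languages; assuming the --language flag can likewise be repeated (the Swedish language code below is purely illustrative), benchmarking both Danish and Swedish models could look like this:
$ scandeval --language da --language sv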
See all the arguments and options available for the scandeval command by typing
$ scandeval --help
Benchmarking from a Script
In a script, the syntax is similar to the command line interface. You simply initialise an object of the Benchmark class and call this benchmark object with your favorite models and/or datasets:
>>> from scandeval import Benchmark
>>> benchmark = Benchmark()
>>> benchmark('<model_id>')
To benchmark on a specific dataset, you simply specify the second argument, shown here with the AngryTweets dataset again:
>>> benchmark('<model_id>', 'angry-tweets')
This would benchmark all Danish models:
>>> benchmark(language='da')
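Since the benchmark object is just a callable, looping over several models is plain Python. A minimal sketch, with placeholder model IDs:
>>> for model_id in ['<model_id1>', '<model_id2>']:
...     benchmark(model_id, 'angry-tweets')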
See the documentation for a more in-depth description.
Downloading Datasets
If you are just interested in downloading a dataset rather than benchmarking, this can be done as follows:
>>> from scandeval.datasets import load_angry_tweets
>>> X_train, X_test, y_train, y_test = load_angry_tweets()
Here X_train and X_test will be lists containing the relevant texts, and y_train and y_test will be lists containing the associated labels.
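As the splits are plain Python lists, they can be dropped into any standard text-classification pipeline. Below is a minimal sketch of a bag-of-words baseline using scikit-learn (not part of ScandEval, purely for illustration):
>>> from sklearn.feature_extraction.text import TfidfVectorizer
>>> from sklearn.linear_model import LogisticRegression
>>> from sklearn.pipeline import make_pipeline
>>> from sklearn.metrics import accuracy_score
>>> # Fit a TF-IDF + logistic regression baseline on the training split
>>> baseline = make_pipeline(TfidfVectorizer(), LogisticRegression(max_iter=1000))
>>> baseline.fit(X_train, y_train)
>>> # Evaluate the baseline on the held-out test split
>>> accuracy_score(y_test, baseline.predict(X_test))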
See the documentation for a list of all the datasets that can be loaded.
Documentation
The full documentation can be found on ReadTheDocs.
Citing ScandEval
If you want to cite the framework then feel free to use this:
@article{nielsen2021scandeval,
  title={ScandEval: Evaluation of language models on mono- or multilingual Scandinavian language tasks.},
  author={Nielsen, Dan Saattrup},
  journal={GitHub. Note: https://github.com/saattrupdan/ScandEval},
  year={2021}
}
Remarks
The image used in the logo has been created by the amazing Scandinavia and the World team. Go check them out!