ScandEval

No project description provided

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Evaluation of language models on mono- or multilingual Scandinavian language tasks.

Installation

To install the package simply write the following command in your favorite terminal:

$ pip install scandeval[all]

This will install all the model frameworks currently supported (pytorch, tensorflow, jax and spacy). If you know you only need one of these, you can install a slimmer package like so:

$ pip install scandeval[pytorch]

Lastly, if you are not interesting in benchmarking models, but just want to use the package to download datasets, then the following command will do the trick:

$ pip install scandeval

Quickstart

Benchmarking from the Command Line

The easiest way to benchmark models is via the command line interface. After having installed the package, you can benchmark your favorite model like so:

$ scandeval --model_id <model_id>

Here model_id is the HuggingFace model ID, which can be found on the HuggingFace Hub. By default this will benchmark the model on all the datasets eligible. If you want to benchmark on a specific dataset, this can be done via the --dataset flag. This will for instance evaluate the model on the AngryTweets dataset:

$ scandeval --model_id <model_id> --dataset angry-tweets

If you want to benchmark all Danish models, this can be done using the language tag, like so:

$ scandeval --language da

See all the arguments and options available for the scandeval command by typing

$ scandeval --help

Benchmarking from a Script

In a script, the syntax is similar to the command line interface. You simply initialise an object of the Benchmark class, and call this benchmark object with your favorite models and/or datasets:

>>> from scandeval import Benchmark
>>> benchmark = Benchmark()
>>> benchmark('<model_id>')

To benchmark on a specific dataset, you simply specify the second argument, shown here with the AngryTweets dataset again:

>>> benchmark('<model_id>', 'angry-tweets')

To benchmark all Danish models, this is given at initialisation:

>>> benchmark = Benchmark(language='da')
>>> benchmark()

Downloading Datasets

If you are just interested in downloading a dataset rather than benchmarking, this can be done as follows:

>>> from scandeval.datasets import load_angry_tweets
>>> X_train, X_test, y_train, y_test = load_angry_tweets()

Here X_train and X_test will be lists containing the relevant texts, and y_train and y_test will be lists containing the associated labels.

Documentation

The full documentation can be found on ReadTheDocs.

Citing ScandEval

If you want to cite the framework then feel free to use this:

@article{nielsen2021scandeval,
  title={ScandEval: Evaluation of language models on mono- or multilingual Scandinavian language tasks.},
  author={Nielsen, Dan Saattrup},
  journal={GitHub. Note: https://github.com/saattrupdan/ScandEval},
  year={2021}
}

Remarks

The image used in the logo has been created by the amazing Scandinavia and the World team. Go check them out!

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

12.10.0

May 8, 2024

12.9.1

Apr 30, 2024

12.9.0

Apr 26, 2024

12.8.0

Apr 23, 2024

12.7.0

Apr 19, 2024

12.6.1

Apr 11, 2024

12.6.0

Apr 10, 2024

12.5.3

Apr 5, 2024

12.5.2

Apr 4, 2024

12.5.1

Apr 3, 2024

12.5.0

Apr 2, 2024

12.4.0

Mar 27, 2024

12.3.2

Mar 19, 2024

12.3.1

Mar 13, 2024

12.3.0

Mar 13, 2024

12.2.1

Mar 12, 2024

12.2.0

Mar 11, 2024

12.1.0

Feb 29, 2024

12.0.0

Feb 26, 2024

11.0.0

Feb 16, 2024

10.0.1

Feb 12, 2024

10.0.0

Feb 12, 2024

9.3.2

Feb 5, 2024

9.3.1

Jan 31, 2024

9.3.0

Jan 29, 2024

9.2.0

Jan 24, 2024

9.1.2

Jan 16, 2024

9.1.1

Jan 15, 2024

9.1.0

Jan 14, 2024

9.0.0

Jan 12, 2024

8.2.1

Dec 20, 2023

8.2.0

Dec 20, 2023

8.1.0

Dec 4, 2023

8.0.0

Nov 29, 2023

7.1.1

Jul 1, 2023

7.1.0

May 15, 2023

7.0.0

May 13, 2023

6.3.0

Apr 12, 2023

6.2.4

Mar 10, 2023

6.2.3

Feb 27, 2023

6.2.2

Feb 25, 2023

6.2.1

Feb 22, 2023

6.2.0

Jan 9, 2023

6.1.1

Jan 2, 2023

6.1.0

Dec 29, 2022

6.0.1

Dec 28, 2022

6.0.0

Dec 24, 2022

5.0.0

Nov 3, 2022

4.0.2

Jul 22, 2022

4.0.1

Jul 14, 2022

4.0.0

Jul 14, 2022

3.0.0

Apr 19, 2022

2.3.2

Feb 11, 2022

2.3.1

Feb 11, 2022

2.3.0

Jan 20, 2022

2.2.0

Jan 18, 2022

2.1.0

Jan 17, 2022

2.0.0

Jan 7, 2022

1.5.9

Dec 14, 2021

1.5.8

Dec 13, 2021

1.5.7

Dec 10, 2021

1.5.6

Dec 10, 2021

1.5.5

Dec 8, 2021

1.5.4

Dec 8, 2021

1.5.3

Dec 8, 2021

1.5.2

Dec 8, 2021

1.5.1

Nov 27, 2021

1.5.0

Nov 26, 2021

1.4.0

Nov 25, 2021

1.3.8

Nov 25, 2021

1.3.7

Nov 25, 2021

1.3.6

Nov 25, 2021

1.3.5

Nov 23, 2021

1.3.4

Nov 11, 2021

1.3.3

Nov 11, 2021

1.3.2

Nov 11, 2021

1.3.1

Nov 11, 2021

1.3.0

Nov 11, 2021

1.2.1

Nov 11, 2021

1.2.0

Oct 15, 2021

1.1.3

Oct 4, 2021

1.1.2

Sep 26, 2021

1.1.1

Sep 26, 2021

1.1.0

Sep 13, 2021

1.0.2

Sep 9, 2021

1.0.1

Sep 9, 2021

1.0.0

Sep 9, 2021

0.17.0

Sep 9, 2021

0.16.0

Sep 7, 2021

0.15.1

Sep 3, 2021

0.15.0

Sep 2, 2021

0.14.1

Sep 2, 2021

0.14.0

Aug 31, 2021

0.13.0

Aug 30, 2021

0.12.0

Aug 26, 2021

0.11.2

Aug 25, 2021

0.11.1

Aug 24, 2021

0.11.0

Aug 23, 2021

0.10.1

Aug 20, 2021

0.10.0

Aug 20, 2021

0.9.0

Aug 19, 2021

0.8.0

Aug 18, 2021

0.7.0

Aug 17, 2021

0.6.0

Aug 15, 2021

0.5.2

Aug 13, 2021

0.5.1

Aug 13, 2021

0.5.0

Aug 12, 2021

0.4.3

Aug 12, 2021

0.4.2

Aug 12, 2021

0.4.1

Aug 12, 2021

0.4.0

Aug 11, 2021

0.3.1

Aug 10, 2021

0.3.0

Aug 10, 2021

This version

0.2.0

Aug 9, 2021

0.1.0

Aug 5, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scandeval-0.2.0.tar.gz (22.2 kB view hashes)

Uploaded Aug 9, 2021 Source

Built Distribution

scandeval-0.2.0-py3-none-any.whl (38.3 kB view hashes)

Uploaded Aug 9, 2021 Python 3

Hashes for scandeval-0.2.0.tar.gz

Hashes for scandeval-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`5ddee88a1581099b57cba267d018e971d34e1d466b7c4fe44b1e350b141cb97a`
MD5	`c4da47c9f6c749c72bea98afc29731dc`
BLAKE2b-256	`a59d7b748dc411ef31ec2890503af0aceb4b0560ac3b4eedc384cf83ed664409`

Hashes for scandeval-0.2.0-py3-none-any.whl

Hashes for scandeval-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`dd6c588c4c1a6a3a7cfa806a831b7e409fc52d0b1420ff8f4a7fd87c12255a8d`
MD5	`fe7cee61575913e20fc891c1870a023d`
BLAKE2b-256	`409b7f2c3a558c386468c6995ac7d30cb54ca3490d44ccd72f1cd89eaeb6baf6`