Benchmark of Generative Large Language Models in Danish
Project description
Are LLMs Danoliterate?
A benchmark for Generative Large Language Models in Danish. To see results and and get more details, check out the leaderboard site:
The project is maintained by Søren Vejlgaard Holm at DTU Compute, supported by the Danish Pioneer Centre for AI and with most of the work done as part of the Master's thesis ''Are GLLMs Danoliterate? Benchmarking Generative NLP in Danish'' supervised by Lars Kai Hansen from DTU Compute and Martin Carsten Nielsen from Alvenir.
Installation
The package has been developed and used with Python 3.11. To install the package in a base version, enabling model execution, install
pip install danoliterate
Note: Some features need a full install to run:
pip install danoliterate[full]
Usage
See options with
python -m danoliterate do=evaluate
A typical use would be to run your own model hosted on the Huggingface Hub on a scenario, for example the Citizenship Test Scenario (see the frontend for scenario descriptions).
Skip the line scenarios=
to make it run on all scenarios instead.
python -m danoliterate do=evaluate\
scenarios="citizenship-test"\
model.name="MyLittleGPT"\
model.path="hf-internal-testing/tiny-random-gpt2"\
evaluation.local_results="./my-result-db"
Now, you could share the resulting JSON placed in my-result-db
to get it included in the Danoliterate benchmark, or you can satisfy your curiosity and score it yourself
# Calculates scoring metrics
python -m danoliterate do=score\
evaluation.local_results="./my-result-db"
# Prints them for you
python -m danoliterate do=report\
evaluation.local_results="./my-result-db"
Contact
Please reach here using GitHub issues or on mail to Søren Vejlgaard Holm either at swiho@dtu.dk or swh@alvenir.ai.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file danoliterate-0.0.4.tar.gz
.
File metadata
- Download URL: danoliterate-0.0.4.tar.gz
- Upload date:
- Size: 84.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 62e5a6ee42e2cca24b779a1e98bc9e214eec948766829bb63a1d59a669c9ec1d |
|
MD5 | 4ba4f79af1e5df269272cace8e5d9462 |
|
BLAKE2b-256 | e019b7dd771a6d5f132f4e344a9a6023d853cc5d4f76cd29f2e1d988da1d990d |
File details
Details for the file danoliterate-0.0.4-py3-none-any.whl
.
File metadata
- Download URL: danoliterate-0.0.4-py3-none-any.whl
- Upload date:
- Size: 94.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d59b5cd15863f37c4da3ffa9e895f5940bb8c34da9b1f9b4c4662d06e0f94d50 |
|
MD5 | 0a88e02d3b03f3a87da8486170e28079 |
|
BLAKE2b-256 | 3000ed1371f8b31c453d87c10fd8d9a6729532a6a9157ba6f98daffaa7d420d6 |