Skip to main content

Benchmark of Generative Large Language Models in Danish

Project description

Are LLMs Danoliterate?

A benchmark for Generative Large Language Models in Danish. To see results and and get more details, check out the leaderboard site:

danoliterate.compute.dtu.dk

The project is maintained by Søren Vejlgaard Holm at DTU Compute, supported by the Danish Pioneer Centre for AI and with most of the work done as part of the Master's thesis ''Are GLLMs Danoliterate? Benchmarking Generative NLP in Danish'' supervised by Lars Kai Hansen from DTU Compute and Martin Carsten Nielsen from Alvenir.

Installation

The package has been developed and used with Python 3.11. To install the package in a base version, enabling model execution, install

pip install danoliterate

Note: Some features need a full install to run:

pip install danoliterate[full]

Usage

See options with

python -m danoliterate do=evaluate --help

A typical use would be to run your own model hosted on the Huggingface Hub on a scenario, for example the Citizenship Test Scenario (see the frontend for scenario descriptions). Skip the line scenarios= to make it run on all scenarios instead.

python -m danoliterate do=evaluate\
    scenarios="citizenship-test"\
    model.name="MyLittleGPT"\
    model.path="hf-internal-testing/tiny-random-gpt2"\
    evaluation.local_results="./my-result-db"

Now, you could share the resulting JSON placed in my-result-db to get it included in the Danoliterate benchmark, or you can satisfy your curiosity and score it yourself

# Calculates scoring metrics
python -m danoliterate do=score\
    evaluation.local_results="./my-result-db"
# Prints them for you
python -m danoliterate do=report\
    evaluation.local_results="./my-result-db"

Contact

Please reach here using GitHub issues or on mail to Søren Vejlgaard Holm either at swiho@dtu.dk or swh@alvenir.ai.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

danoliterate-0.1.0.tar.gz (84.4 kB view details)

Uploaded Source

Built Distribution

danoliterate-0.1.0-py3-none-any.whl (94.9 kB view details)

Uploaded Python 3

File details

Details for the file danoliterate-0.1.0.tar.gz.

File metadata

  • Download URL: danoliterate-0.1.0.tar.gz
  • Upload date:
  • Size: 84.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.8

File hashes

Hashes for danoliterate-0.1.0.tar.gz
Algorithm Hash digest
SHA256 141edbc8be6a5d85606b202e56b8fa2f05f56daee90e4f6ab2aa4c150300f157
MD5 bf1125191a80c6879d15b5a630dc4535
BLAKE2b-256 f0e08d91c54091c2859139dc5178f968ce92691a3ab271d71e2ecea28f7af373

See more details on using hashes here.

File details

Details for the file danoliterate-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for danoliterate-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 83c1f967133c4cba0fb3ed280ed8ddccfb377c4a4ecae2db7d55b519a120348a
MD5 c1444d7242a40f09313fedcb9993fc8d
BLAKE2b-256 36d8b45089d677e7c8576ca83033d73f7138659bac4f55b206ff4d5bf408ff1e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page