Skip to main content

Benchmark of Generative Large Language Models in Danish

Project description

Are LLMs Danoliterate?

A benchmark for Generative Large Language Models in Danish. To see results and and get more details, check out the leaderboard site:

danoliterate.compute.dtu.dk

The project is maintained by Søren Vejlgaard Holm at DTU Compute, supported by the Danish Pioneer Centre for AI and with most of the work done as part of the Master's thesis ''Are GLLMs Danoliterate? Benchmarking Generative NLP in Danish'' supervised by Lars Kai Hansen from DTU Compute and Martin Carsten Nielsen from Alvenir.

Installation

The package has been developed and used with Python 3.11. To install the package in a base version, enabling model execution, install

pip install danoliterate

Note: Some features need a full install to run:

pip install danoliterate[full]

Usage

See options with

python -m danoliterate do=evaluate

A typical use would be to run your own model hosted on the Huggingface Hub on a scenario, for example the Citizenship Test Scenario (see the frontend for scenario descriptions). Skip the line scenarios= to make it run on all scenarios instead.

python -m danoliterate do=evaluate\
    scenarios="citizenship-test"\
    model.name="MyLittleGPT"\
    model.path="hf-internal-testing/tiny-random-gpt2"\
    evaluation.local_results="./my-result-db"

Now, you could share the resulting JSON placed in my-result-db to get it included in the Danoliterate benchmark, or you can satisfy your curiosity and score it yourself

# Calculates scoring metrics
python -m danoliterate do=score\
    evaluation.local_results="./my-result-db"
# Prints them for you
python -m danoliterate do=report\
    evaluation.local_results="./my-result-db"

Contact

Please reach here using GitHub issues or on mail to Søren Vejlgaard Holm either at swiho@dtu.dk or swh@alvenir.ai.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

danoliterate-0.0.4.tar.gz (84.4 kB view details)

Uploaded Source

Built Distribution

danoliterate-0.0.4-py3-none-any.whl (94.9 kB view details)

Uploaded Python 3

File details

Details for the file danoliterate-0.0.4.tar.gz.

File metadata

  • Download URL: danoliterate-0.0.4.tar.gz
  • Upload date:
  • Size: 84.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.8

File hashes

Hashes for danoliterate-0.0.4.tar.gz
Algorithm Hash digest
SHA256 62e5a6ee42e2cca24b779a1e98bc9e214eec948766829bb63a1d59a669c9ec1d
MD5 4ba4f79af1e5df269272cace8e5d9462
BLAKE2b-256 e019b7dd771a6d5f132f4e344a9a6023d853cc5d4f76cd29f2e1d988da1d990d

See more details on using hashes here.

File details

Details for the file danoliterate-0.0.4-py3-none-any.whl.

File metadata

File hashes

Hashes for danoliterate-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 d59b5cd15863f37c4da3ffa9e895f5940bb8c34da9b1f9b4c4662d06e0f94d50
MD5 0a88e02d3b03f3a87da8486170e28079
BLAKE2b-256 3000ed1371f8b31c453d87c10fd8d9a6729532a6a9157ba6f98daffaa7d420d6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page