
A tool for checking how score differences (deltas) of machine translation metrics translate into system-level accuracy with respect to human judgements.


MT Metrics Thresholds

Code for the paper "Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies" by Tom Kocmi, Vilém Zouhar, Christian Federmann, and Matt Post:

@inproceedings{kocmi-etal-2024-navigating,
    title = "Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies",
    author = "Kocmi, Tom  and Zouhar, Vil{\'e}m  and Federmann, Christian  and Post, Matt",
    editor = "Ku, Lun-Wei  and Martins, Andre  and Srikumar, Vivek",
    booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.acl-long.110",
    doi = "10.18653/v1/2024.acl-long.110",
    pages = "1999--2014",
}

Web frontend

See the MT thresholds tool.

Local tool

pip3 install mt-thresholds

# a BLEU difference of 1.00 corresponds to an accuracy of 63.989%
mt-thresholds bleu 1.00

# ChrF needs a difference of 0.710 to reach the same accuracy as a 1.00 BLEU difference
mt-thresholds chrf 0.63989 --delta

Or use from Python:

import mt_thresholds

mt_thresholds.accuracy(1.0, "bleu") # 0.63989
mt_thresholds.delta(0.63989, "chrf") # 0.665
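
To make the accuracy values above more concrete, here is a minimal sketch of how a system-level pairwise accuracy could be computed. It assumes that accuracy means the share of system pairs for which the sign of the metric difference matches the sign of the human-score difference. The function `pairwise_accuracy` and all scores are hypothetical and are not part of `mt_thresholds`.

```python
from itertools import combinations


def pairwise_accuracy(metric_scores, human_scores):
    """Share of system pairs that the metric orders the same way as humans.

    Both arguments map a system name to its system-level score.
    This is an illustrative helper, not the package's implementation.
    """
    agree = 0
    total = 0
    for sys_a, sys_b in combinations(sorted(metric_scores), 2):
        metric_delta = metric_scores[sys_a] - metric_scores[sys_b]
        human_delta = human_scores[sys_a] - human_scores[sys_b]
        if human_delta == 0:
            # Assumption: pairs that humans score identically are skipped.
            continue
        total += 1
        # The pair counts as agreement when both deltas have the same sign.
        if metric_delta * human_delta > 0:
            agree += 1
    return agree / total if total else float("nan")


# Hypothetical system-level scores for four MT systems.
metric = {"sysA": 31.2, "sysB": 30.1, "sysC": 29.8, "sysD": 27.5}
human = {"sysA": 78.0, "sysB": 80.5, "sysC": 74.0, "sysD": 70.0}

# Only the sysA/sysB pair is ordered differently, so 5 of 6 pairs agree.
print(f"{pairwise_accuracy(metric, human):.3f}")
```

Under this reading, a larger metric delta between two systems makes it more likely that humans agree with the ordering, which is the relationship that `accuracy` and `delta` expose.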

Experiment code

We plan to release the code for replicating the WMT results in the upcoming months.
