
A tool for checking how differences in machine translation metric scores translate into system-level accuracy against human judgments.


MT Metrics Thresholds

Code for "Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies" by Tom Kocmi, Vilém Zouhar, Christian Federmann, and Matt Post.

@inproceedings{kocmi-etal-2024-navigating,
    title = "Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies",
    author = "Kocmi, Tom  and Zouhar, Vil{\'e}m  and Federmann, Christian  and Post, Matt",
    editor = "Ku, Lun-Wei  and Martins, Andre  and Srikumar, Vivek",
    booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.acl-long.110",
    doi = "10.18653/v1/2024.acl-long.110",
    pages = "1999--2014",
}

Web frontend

See the MT thresholds tool.

Local tool

pip3 install mt-thresholds

# accuracy is 63.989%
mt-thresholds bleu 1.00

# ChrF needs 0.710 difference for the same accuracy as BLEU
mt-thresholds chrf 0.63989 --delta

Or use from Python:

import mt_thresholds

mt_thresholds.accuracy(1.0, "bleu") # 0.63989
mt_thresholds.delta(0.63989, "chrf") # 0.665
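The two calls above can be chained to ask which score difference in one metric corresponds to the same accuracy as a given difference in another. The following is a minimal sketch, not part of the package: `equivalent_delta`, `accuracy_fn`, and `delta_fn` are hypothetical names introduced here, and the two functions are passed in as arguments so the helper assumes only the `(value, metric)` argument order shown above.

```python
from typing import Callable

# Both callables take (value, metric_name) and return a float, matching the
# argument order of mt_thresholds.accuracy and mt_thresholds.delta above.
MetricFn = Callable[[float, str], float]


def equivalent_delta(
    source_delta: float,
    source_metric: str,
    target_metric: str,
    accuracy_fn: MetricFn,
    delta_fn: MetricFn,
) -> float:
    """Hypothetical helper: map a score difference in one metric to the
    difference in another metric that corresponds to the same accuracy."""
    # Step 1: score difference in the source metric -> accuracy.
    accuracy = accuracy_fn(source_delta, source_metric)
    # Step 2: that accuracy -> score difference in the target metric.
    return delta_fn(accuracy, target_metric)
```

With the package installed, `equivalent_delta(1.0, "bleu", "chrf", mt_thresholds.accuracy, mt_thresholds.delta)` would run the two lookups in sequence.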

Experiment code

We plan to release the code for replicating the WMT results in the coming months.

