Tool to check how metric deltas for machine translation reflect on system-level human accuracies.
Project description
MT Metrics Thresholds

Code for Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies by Tom Kocmi, Vilém Zouhar, Christian Federmann, and Matt Post.
@inproceedings{kocmi-etal-2024-navigating,
title = "Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies",
author = "Kocmi, Tom and Zouhar, Vil{\'e}m and Federmann, Christian and Post, Matt",
editor = "Ku, Lun-Wei and Martins, Andre and Srikumar, Vivek",
booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
month = aug,
year = "2024",
address = "Bangkok, Thailand",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.acl-long.110",
doi = "10.18653/v1/2024.acl-long.110",
pages = "1999--2014",
}
Web frontend
See the MT thresholds tool.
Local tool
pip3 install mt-thresholds
# accuracy is 63.989%
mt-thresholds bleu 1.00
# ChrF needs 0.710 difference for the same accuracy as BLEU
mt-thresholds chrf 0.63989 --delta
Or use from Python:
import mt_thresholds
mt_thresholds.accuracy(1.0, "bleu") # 0.63989
mt_thresholds.delta(0.63989, "chrf") # 0.665
Experiment code
We plan to release the code for replicating WMT results in upcoming months.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mt_thresholds-1.0.4.tar.gz.
File metadata
- Download URL: mt_thresholds-1.0.4.tar.gz
- Upload date:
- Size: 3.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6c1288a41437001d6f668a2ff33d498fa93b1b93c67e02c1ed2b520ef9561fa0
|
|
| MD5 |
1a3648c42c889dc4192be45f91b30c30
|
|
| BLAKE2b-256 |
25072dc668e2c74bd63a3fbfc6c93803dd93f9261ac24c5aef62272a2dba3b26
|
File details
Details for the file mt_thresholds-1.0.4-py3-none-any.whl.
File metadata
- Download URL: mt_thresholds-1.0.4-py3-none-any.whl
- Upload date:
- Size: 4.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
91bbd53edef1ab94018948583d360ae899b106918f7099796eb7e430d1e72a79
|
|
| MD5 |
b97917e2639ef162e9f5f4e15f158cda
|
|
| BLAKE2b-256 |
0dfcc335f623aa160f40cc6fc8b9af3a57ddaa796fca0cbd33ef82395be13e74
|