Tool to check how metric deltas for machine translation reflect on system-level human accuracies.
Project description
MT Metrics Thresholds

Code for Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies by Tom Kocmi, Vilém Zouhar, Christian Federmann, and Matt Post.
@inproceedings{kocmi-etal-2024-navigating,
title = "Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies",
author = "Kocmi, Tom and Zouhar, Vil{\'e}m and Federmann, Christian and Post, Matt",
editor = "Ku, Lun-Wei and Martins, Andre and Srikumar, Vivek",
booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
month = aug,
year = "2024",
address = "Bangkok, Thailand",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.acl-long.110",
doi = "10.18653/v1/2024.acl-long.110",
pages = "1999--2014",
}
Web frontend
See the MT thresholds tool.
Local tool
pip3 install mt-thresholds
# accuracy is 63.989%
mt-thresholds bleu 1.00
# ChrF needs 0.710 difference for the same accuracy as BLEU
mt-thresholds chrf 0.63989 --delta
Or use from Python:
import mt_thresholds
mt_thresholds.accuracy(1.0, "bleu") # 0.63989
mt_thresholds.delta(0.63989, "chrf") # 0.665
Experiment code
We plan to release the code for replicating WMT results in upcoming months.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
mt_thresholds-1.0.4.tar.gz
(3.7 kB
view details)
Built Distribution
File details
Details for the file mt_thresholds-1.0.4.tar.gz
.
File metadata
- Download URL: mt_thresholds-1.0.4.tar.gz
- Upload date:
- Size: 3.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
6c1288a41437001d6f668a2ff33d498fa93b1b93c67e02c1ed2b520ef9561fa0
|
|
MD5 |
1a3648c42c889dc4192be45f91b30c30
|
|
BLAKE2b-256 |
25072dc668e2c74bd63a3fbfc6c93803dd93f9261ac24c5aef62272a2dba3b26
|
File details
Details for the file mt_thresholds-1.0.4-py3-none-any.whl
.
File metadata
- Download URL: mt_thresholds-1.0.4-py3-none-any.whl
- Upload date:
- Size: 4.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
91bbd53edef1ab94018948583d360ae899b106918f7099796eb7e430d1e72a79
|
|
MD5 |
b97917e2639ef162e9f5f4e15f158cda
|
|
BLAKE2b-256 |
0dfcc335f623aa160f40cc6fc8b9af3a57ddaa796fca0cbd33ef82395be13e74
|