Automatic Evaluation for SignWriting Machine Learning Outputs
Project description
SignWriting Evaluation
The lack of automatic SignWriting evaluation metrics is a major obstacle in the development of SignWriting transcription and translation1 models.
Goals
The primary objective of this repository is to house a suite of automatic evaluation metrics specifically tailored for SignWriting. This includes standard metrics like BLEU2, chrF3, and CLIPScore4, as well as custom-developed metrics unique to our approach. We recognize the distinct challenges in evaluating single signs versus continuous signing, and our methods reflect this differentiation.
To qualitatively demonstrate the efficacy of these evaluation metrics, we implement a nearest-neighbor search for selected signs from the SignBank corpus. The rationale is straightforward: the closer the sign is to its nearest neighbor in the corpus, the more effective the evaluation metric is in capturing the nuances of sign language transcription and translation.
Evaluation Metrics
- Tokenized BLEU - BLEU score for tokenized SignWriting FSW strings.
- chrF - chrF score for untokenized SignWriting FSW strings.
- CLIPScore - CLIPScore between SignWriting images. (Using the original CLIP model)
- Similarity - symbol distance score for SignWriting FSW strings.
- Similarity v2 - the improved, name-aware symbol distance (README).
Qualitative Evaluation
Distribution of Scores
Using a sample of the corpus, we compute the any-to-any scores for each metric. Intuitively, we expect a good metric given any two random signs to produce a bad score, since most signs are unrelated. This should be reflected in the distribution of scores, which should be skewed towards lower scores.
Nearest Neighbor Search
It is well-known that the SignBank corpus contains many forms of the sign for "hello". We carefully select some of these signs to evaluate our metrics, by looking for their closest matches in the corpus, which contains around 230k single signs.
The problems of each metric are revealed when comparing the top 10 nearest neighbors for each sign. For each sign and metric, either the first match is incorrect, or there is a more correct match further down the list. The table compares the name-aware Similarity v2, the original Similarity, Tokenized BLEU, and chrF; CLIPScore is omitted here, as encoding ~230k SignWriting images per query is prohibitively slow.
| SymbolsDistancesV2 | SymbolsDistances | TokenizedBLEU | CHRF | SymbolsDistancesV2 | SymbolsDistances | TokenizedBLEU | CHRF | SymbolsDistancesV2 | SymbolsDistances | TokenizedBLEU | CHRF | |
| 1 | ||||||||||||
| 2 | ||||||||||||
| 3 | ||||||||||||
| 4 | ||||||||||||
| 5 | ||||||||||||
| 6 | ||||||||||||
| 7 | ||||||||||||
| 8 | ||||||||||||
| 9 | ||||||||||||
| 10 | ||||||||||||
Cite
If you use our toolkit in your research or projects, please consider citing the work.
@misc{signwriting-evaluation2024,
title={SignWriting Evaluation: Metrics for Evaluating SignWriting Transcription and Translation Models},
author={Moryossef, Amit and Zilberman, Rotem and Langer, Ohad},
howpublished={\url{https://github.com/sign-language-processing/signwriting-evaluation}},
year={2024}
}
References
-
Amit Moryossef, Zifan Jiang. 2023. SignBank+: Preparing a Multilingual Sign Language Dataset for Machine Translation Using Large Language Models. ↩
-
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. Bleu: a Method for Automatic Evaluation of Machine Translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 311–318, Philadelphia, Pennsylvania, USA. Association for Computational Linguistics. ↩
-
Maja Popović. 2015. chrF: character n-gram F-score for automatic MT evaluation. In Proceedings of the Tenth Workshop on Statistical Machine Translation, pages 392–395, Lisbon, Portugal. Association for Computational Linguistics. ↩
-
Jack Hessel, Ari Holtzman, Maxwell Forbes, Ronan Le Bras, and Yejin Choi. 2021. CLIPScore: A Reference-free Evaluation Metric for Image Captioning. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 7514–7528, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics. ↩
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file signwriting_evaluation-0.1.0.tar.gz.
File metadata
- Download URL: signwriting_evaluation-0.1.0.tar.gz
- Upload date:
- Size: 274.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4bf7d75c6b1c1320cbf5a7b3f2c2e5a4bb954e723baad31b5b47cf57789e59fc
|
|
| MD5 |
b65a9d9c473f4814c2153780087a870b
|
|
| BLAKE2b-256 |
d510a396407c0c8a5d60c9e10017b712aa9f713226408b33111f4de65a4bcd5b
|
Provenance
The following attestation bundles were made for signwriting_evaluation-0.1.0.tar.gz:
Publisher:
release.yaml on sign-language-processing/signwriting-evaluation
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
signwriting_evaluation-0.1.0.tar.gz -
Subject digest:
4bf7d75c6b1c1320cbf5a7b3f2c2e5a4bb954e723baad31b5b47cf57789e59fc - Sigstore transparency entry: 1859132431
- Sigstore integration time:
-
Permalink:
sign-language-processing/signwriting-evaluation@1d4b81ce5b9755790d21acdbf4516a1b73668b6d -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/sign-language-processing
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@1d4b81ce5b9755790d21acdbf4516a1b73668b6d -
Trigger Event:
release
-
Statement type:
File details
Details for the file signwriting_evaluation-0.1.0-cp39-abi3-win_amd64.whl.
File metadata
- Download URL: signwriting_evaluation-0.1.0-cp39-abi3-win_amd64.whl
- Upload date:
- Size: 461.1 kB
- Tags: CPython 3.9+, Windows x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f6a64056c5503e660ce5c0859ee027496bbd2549deca8910484ad17919440570
|
|
| MD5 |
d281f17216f790ade25f3e4643c42229
|
|
| BLAKE2b-256 |
a705b102023dea80b25649b63f7a3e127cb9becd5ca9bee7a265087f796ed229
|
Provenance
The following attestation bundles were made for signwriting_evaluation-0.1.0-cp39-abi3-win_amd64.whl:
Publisher:
release.yaml on sign-language-processing/signwriting-evaluation
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
signwriting_evaluation-0.1.0-cp39-abi3-win_amd64.whl -
Subject digest:
f6a64056c5503e660ce5c0859ee027496bbd2549deca8910484ad17919440570 - Sigstore transparency entry: 1859089174
- Sigstore integration time:
-
Permalink:
sign-language-processing/signwriting-evaluation@8e17a7400cc5a06d57e5c2b1faddc29285611347 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/sign-language-processing
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@8e17a7400cc5a06d57e5c2b1faddc29285611347 -
Trigger Event:
release
-
Statement type:
File details
Details for the file signwriting_evaluation-0.1.0-cp39-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.
File metadata
- Download URL: signwriting_evaluation-0.1.0-cp39-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 593.2 kB
- Tags: CPython 3.9+, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f3bf1e11e2a6d061431c8ec90acb1ef2f78b517c66f03bc846caf63c29619b93
|
|
| MD5 |
2640023892ecc6d7f063773cb210d080
|
|
| BLAKE2b-256 |
6cbb41c061bf97880507aa9666adba253dc2fc4393d6e38a91b62388fe4f5898
|
Provenance
The following attestation bundles were made for signwriting_evaluation-0.1.0-cp39-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl:
Publisher:
release.yaml on sign-language-processing/signwriting-evaluation
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
signwriting_evaluation-0.1.0-cp39-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl -
Subject digest:
f3bf1e11e2a6d061431c8ec90acb1ef2f78b517c66f03bc846caf63c29619b93 - Sigstore transparency entry: 1859088975
- Sigstore integration time:
-
Permalink:
sign-language-processing/signwriting-evaluation@8e17a7400cc5a06d57e5c2b1faddc29285611347 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/sign-language-processing
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@8e17a7400cc5a06d57e5c2b1faddc29285611347 -
Trigger Event:
release
-
Statement type:
File details
Details for the file signwriting_evaluation-0.1.0-cp39-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.
File metadata
- Download URL: signwriting_evaluation-0.1.0-cp39-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
- Upload date:
- Size: 582.9 kB
- Tags: CPython 3.9+, manylinux: glibc 2.17+ ARM64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dd068fe5b10a14aec2d456f18b88f2eff7be6d4036964a71ded3fce6d3236ce1
|
|
| MD5 |
e78bfa197468130a61466eaecbb6bba8
|
|
| BLAKE2b-256 |
cb7aa7a8f3ee758ba889ff443e3507c5213e3fba512fabe7c5ac294dc61c3258
|
Provenance
The following attestation bundles were made for signwriting_evaluation-0.1.0-cp39-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl:
Publisher:
release.yaml on sign-language-processing/signwriting-evaluation
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
signwriting_evaluation-0.1.0-cp39-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl -
Subject digest:
dd068fe5b10a14aec2d456f18b88f2eff7be6d4036964a71ded3fce6d3236ce1 - Sigstore transparency entry: 1859088868
- Sigstore integration time:
-
Permalink:
sign-language-processing/signwriting-evaluation@8e17a7400cc5a06d57e5c2b1faddc29285611347 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/sign-language-processing
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@8e17a7400cc5a06d57e5c2b1faddc29285611347 -
Trigger Event:
release
-
Statement type:
File details
Details for the file signwriting_evaluation-0.1.0-cp39-abi3-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl.
File metadata
- Download URL: signwriting_evaluation-0.1.0-cp39-abi3-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl
- Upload date:
- Size: 1.0 MB
- Tags: CPython 3.9+, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
78e83eb0737bca4d3ad32e710250bbeea208f88755803b2e5cc3fd7058535d81
|
|
| MD5 |
609c7e1a7a33183309a52abe1522dd7a
|
|
| BLAKE2b-256 |
bb9c07b0a365df58a1027ca39c9e5c2423cff8bc14d6f3fb1f4417ddb8b079ac
|
Provenance
The following attestation bundles were made for signwriting_evaluation-0.1.0-cp39-abi3-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl:
Publisher:
release.yaml on sign-language-processing/signwriting-evaluation
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
signwriting_evaluation-0.1.0-cp39-abi3-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl -
Subject digest:
78e83eb0737bca4d3ad32e710250bbeea208f88755803b2e5cc3fd7058535d81 - Sigstore transparency entry: 1859088752
- Sigstore integration time:
-
Permalink:
sign-language-processing/signwriting-evaluation@8e17a7400cc5a06d57e5c2b1faddc29285611347 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/sign-language-processing
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@8e17a7400cc5a06d57e5c2b1faddc29285611347 -
Trigger Event:
release
-
Statement type: