Skip to main content

Transformer based translation quality estimation

Project description

License Downloads

TransQuest: Translation Quality Estimation with Cross-lingual Transformers

The goal of quality estimation (QE) is to evaluate the quality of a translation without having access to a reference translation. High-accuracy QE that can be easily deployed for a number of language pairs is the missing piece in many commercial translation workflows as they have numerous potential uses. They can be employed to select the best translation when several translation engines are available or can inform the end user about the reliability of automatically translated content. In addition, QE systems can be used to decide whether a translation can be published as it is in a given context, or whether it requires human post-editing before publishing or translation from scratch by a human. The quality estimation can be done at different levels: document level, sentence level and word level.

With TransQuest, we have opensourced our research in translation quality estimation which also won the sentence-level direct assessment quality estimation shared task in WMT 2020. TransQuest outperforms current open-source quality estimation frameworks such as OpenKiwi and DeepQuest.

Features

  • Sentence-level translation quality estimation on both aspects: predicting post editing efforts and direct assessment.
  • Word-level translation quality estimation capable of predicting quality of source words, target words and target gaps.
  • Perform significantly better than current state-of-the-art quality estimation methods like DeepQuest and OpenKiwi in all the languages experimented.
  • Pre-trained quality estimation models for fifteen language pairs.

Table of Contents

  1. Installation - Install TransQuest locally using pip.
  2. Architectures - Checkout the architectures implemented in TransQuest
    1. Sentence-level Architectures - We have released two architectures; MonoTransQuest and SiameseTransQuest to perform sentence level quality estimation.
    2. Word-level Architecture - We have released MicroTransQuest to perform word level quality estimation.
  3. Examples - We have provided several examples on how to use TransQuest in recent WMT quality estimation shared tasks.
    1. Sentence-level Examples
    2. Word-level Examples
  4. Pre-trained Models - We have provided pretrained quality estimation models for fifteen language pairs covering both sentence-level and word-level
    1. Sentence-level Models
    2. Word-level Models
  5. Contact - Contact us for any issues with TransQuest

Resources

Citations

If you are using the package, please consider citing this paper which is accepted to COLING 2020

@InProceedings{transquest:2020a,
author = {Ranasinghe, Tharindu and Orasan, Constantin and Mitkov, Ruslan},
title = {TransQuest: Translation Quality Estimation with Cross-lingual Transformers},
booktitle = {Proceedings of the 28th International Conference on Computational Linguistics},
year = {2020}
}

If you are using the task specific fine tuning, please consider citing this which is accepted to WMT 2020 at EMNLP 2020.

@InProceedings{transquest:2020b,
author = {Ranasinghe, Tharindu and Orasan, Constantin and Mitkov, Ruslan},
title = {TransQuest at WMT2020: Sentence-Level Direct Assessment},
booktitle = {Proceedings of the Fifth Conference on Machine Translation},
year = {2020}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transquest-1.1.1.tar.gz (94.9 kB view details)

Uploaded Source

Built Distribution

transquest-1.1.1-py3-none-any.whl (211.9 kB view details)

Uploaded Python 3

File details

Details for the file transquest-1.1.1.tar.gz.

File metadata

  • Download URL: transquest-1.1.1.tar.gz
  • Upload date:
  • Size: 94.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.4

File hashes

Hashes for transquest-1.1.1.tar.gz
Algorithm Hash digest
SHA256 9b7a4bdd07ccfb8cfa11043785542c08201d1744e596ca4c4df8e89ab065696b
MD5 340b200ffa7589b236970e59f0673399
BLAKE2b-256 94d20ab0d1dd3b2506e01b8944953fb9b6f6fb4f94b0c555d1d36c97f4ec6395

See more details on using hashes here.

File details

Details for the file transquest-1.1.1-py3-none-any.whl.

File metadata

  • Download URL: transquest-1.1.1-py3-none-any.whl
  • Upload date:
  • Size: 211.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.4

File hashes

Hashes for transquest-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4face1c4e06ada0dfad90cf2270c772dd16b40f36f3acd0302952866d8c06a33
MD5 9ee16714a403747dbbc9488b979ca86a
BLAKE2b-256 1cadf52c71a5084a417d3fb1893b9c555f65ca93e13887f3fe4ee61aeaa9a271

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page