arXiv Analytics

Sign in

arXiv:2106.00143 [cs.CL]AbstractReferencesReviewsResources

An Exploratory Analysis of Multilingual Word-Level Quality Estimation with Cross-Lingual Transformers

Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

Published 2021-05-31Version 1

Most studies on word-level Quality Estimation (QE) of machine translation focus on language-specific models. The obvious disadvantages of these approaches are the need for labelled data for each language pair and the high cost required to maintain several language-specific models. To overcome these problems, we explore different approaches to multilingual, word-level QE. We show that these QE models perform on par with the current language-specific models. In the cases of zero-shot and few-shot QE, we demonstrate that it is possible to accurately predict word-level quality for any given new language pair from models trained on other language pairs. Our findings suggest that the word-level QE models based on powerful pre-trained transformers that we propose in this paper generalise well across languages, making them more useful in real-world scenarios.

Comments: Accepted to appear at the ACL-IJCNLP 2021 Main conference
Categories: cs.CL, cs.AI, cs.LG
Related articles: Most relevant | Search more
arXiv:2304.00235 [cs.CL] (Published 2023-04-01)
What Does the Indian Parliament Discuss? An Exploratory Analysis of the Question Hour in the Lok Sabha
arXiv:2011.01536 [cs.CL] (Published 2020-11-01)
TransQuest: Translation Quality Estimation with Cross-lingual Transformers
arXiv:2010.10239 [cs.CL] (Published 2020-10-20)
Complete Multilingual Neural Machine Translation