It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Documents

The performance of neural machine translation systems is commonly evaluated in terms of BLEU. However, due to its reliance on target language properties and generation, the BLEU metric does not allow an assessment of which translation directions are more difficult to model. In this paper, we propose cross-mutual information (XMI): an asymmetric information-theoretic metric of machine translation difficulty that exploits the probabilistic nature of most neural machine translation models. XMI allows us to better evaluate the difficulty of translating text into the target language while controlling for the difficulty of the target-side generation component independent of the translation task. We then present the first systematic and controlled study of cross-lingual translation difficulties using modern neural translation systems. Code for replicating our experiments is available online at https://github.com/e-bug/nmt-difficulty.
Original languageEnglish
Title of host publicationProceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Place of PublicationOnline
PublisherAssociation for Computational Linguistics (ACL)
Publication date1 Jul 2020
Pages1640-1649
DOIs
Publication statusPublished - 1 Jul 2020
Event58th Annual Meeting of the Association for Computational Linguistics - Online
Duration: 5 Jul 202010 Jul 2020

Conference

Conference58th Annual Meeting of the Association for Computational Linguistics
ByOnline
Periode05/07/202010/07/2020

Number of downloads are based on statistics from Google Scholar and www.ku.dk


No data available

ID: 255126547