Abstract
This paper describes the joint participation of University of Helsinki and Aalto University to two shared tasks of WMT 2020: the news translation between Inuktitut and English and the low-resource translation between German and Upper Sorbian. For both tasks, our efforts concentrate on efficient use of monolingual and related bilingual corpora with scheduled multi-task learning as well as an optimized subword segmentation with sampling. Our submission obtained the highest score for Upper Sorbian -> German and was ranked second for German -> Upper Sorbian according to BLEU scores. For English–Inuktitut, we reached ranks 8 and 10 out of 11 according to BLEU scores.
Original language | English |
---|---|
Title of host publication | Proceedings of the Fifth Conference on Machine Translation |
Pages | 1129-1138 |
Number of pages | 10 |
ISBN (Electronic) | 978-1-948087-81-0 |
Publication status | Published - 19 Nov 2020 |
MoE publication type | A4 Article in a conference publication |
Event | Conference on Machine Translation - Virtual, Online Duration: 19 Nov 2020 → 20 Nov 2020 |
Conference
Conference | Conference on Machine Translation |
---|---|
Abbreviated title | WMT |
City | Virtual, Online |
Period | 19/11/2020 → 20/11/2020 |
Keywords
- neural machine translation
- Low-resource languages
- Subword segmentation
- Inuktitut
- Upper Sorbian