跳到主要导航 跳到搜索 跳到主要内容

Explicit cross-lingual pre-training for unsupervised machine translation

  • Shuo Ren*
  • , Yu Wu
  • , Shujie Liu
  • , Ming Zhou
  • , Shuai Ma
  • *此作品的通讯作者
  • Beihang University
  • Beijing Advanced Innovation Center for Big Data and Brain Computing
  • Microsoft USA

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Pre-training has proven to be effective in unsupervised machine translation due to its ability to model deep context information in cross-lingual scenarios. However, the cross-lingual information obtained from shared BPE spaces is inexplicit and limited. In this paper, we propose a novel cross-lingual pre-training method for unsupervised machine translation by incorporating explicit cross-lingual training signals. Specifically, we first calculate cross-lingual n-gram embeddings and infer an n-gram translation table from them. With those n-gram translation pairs, we propose a new pre-training model called Cross-lingual Masked Language Model (CMLM), which randomly chooses source n-grams in the input text stream and predicts their translation candidates at each time step. Experiments show that our method can incorporate beneficial cross-lingual information into pre-trained models. Taking pre-trained CMLM models as the encoder and decoder, we significantly improve the performance of unsupervised machine translation. Our code is available at https://github.com/Imagist-Shuo/CMLM.

源语言英语
主期刊名EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference
出版商Association for Computational Linguistics
770-779
页数10
ISBN(电子版)9781950737901
DOI
出版状态已出版 - 2019
活动2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019 - Hong Kong, 中国
期限: 3 11月 20197 11月 2019

出版系列

姓名EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference

会议

会议2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019
国家/地区中国
Hong Kong
时期3/11/197/11/19

指纹

探究 'Explicit cross-lingual pre-training for unsupervised machine translation' 的科研主题。它们共同构成独一无二的指纹。

引用此