跳到主要导航 跳到搜索 跳到主要内容

Alternating language modeling for cross-lingual pre-training

  • Jian Yang
  • , Shuming Ma
  • , Dongdong Zhang
  • , Shuang Zhi Wu
  • , Zhoujun Li*
  • , Ming Zhou
  • *此作品的通讯作者
  • Microsoft USA
  • SPPD of Tencent Inc.

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Language model pre-training has achieved success in many natural language processing tasks. Existing methods for cross-lingual pre-training adopt Translation Language Model to predict masked words with the concatenation of the source sentence and its target equivalent. In this work, we introduce a novel cross-lingual pre-training method, called Alternating Language Modeling (ALM). It code-switches sentences of different languages rather than simple concatenation, hoping to capture the rich cross-lingual context of words and phrases. More specifically, we randomly substitute source phrases with target translations to create code-switched sentences. Then, we use these code-switched data to train ALM model to learn to predict words of different languages. We evaluate our pre-training ALM on the downstream tasks of machine translation and cross-lingual classification. Experiments show that ALM can outperform the previous pretraining methods on three benchmarks.

源语言英语
主期刊名AAAI 2020 - 34th AAAI Conference on Artificial Intelligence
出版商AAAI press
9386-9393
页数8
ISBN(电子版)9781577358350
出版状态已出版 - 2020
活动34th AAAI Conference on Artificial Intelligence, AAAI 2020 - New York, 美国
期限: 7 2月 202012 2月 2020

出版系列

姓名AAAI 2020 - 34th AAAI Conference on Artificial Intelligence

会议

会议34th AAAI Conference on Artificial Intelligence, AAAI 2020
国家/地区美国
New York
时期7/02/2012/02/20

指纹

探究 'Alternating language modeling for cross-lingual pre-training' 的科研主题。它们共同构成独一无二的指纹。

引用此