Skip to main navigation Skip to search Skip to main content

Learning to Select Relevant Knowledge for Neural Machine Translation

  • Jian Yang
  • , Juncheng Wan
  • , Shuming Ma
  • , Haoyang Huang
  • , Dongdong Zhang
  • , Yong Yu
  • , Zhoujun Li*
  • , Furu Wei
  • *Corresponding author for this work
  • Shanghai Jiao Tong University
  • Microsoft USA

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Most memory-based methods use encoded retrieved pairs as the translation memory (TM) to provide external guidance, but there still exist some noisy words in the retrieved pairs. In this paper, we propose a simple and effective end-to-end model to select useful sentence words from the encoded memory and incorporate them into the NMT model. Our model uses a novel memory selection mechanism to avoid the noise from similar sentences and provide external guidance simultaneously. To verify the positive influence of selected retrieved words, we evaluate our model on the single-domain dataset namely JRC-Acquis and multi-domain dataset comprised of existing benchmarks including WMT, IWSLT, JRC-Acquis, and OpenSubtitles. Experimental results demonstrate our method can improve the translation quality under different scenarios.

Original languageEnglish
Title of host publicationNatural Language Processing and Chinese Computing - 10th CCF International Conference, NLPCC 2021, Proceedings
EditorsLu Wang, Yansong Feng, Yu Hong, Ruifang He
PublisherSpringer Science and Business Media Deutschland GmbH
Pages79-91
Number of pages13
ISBN (Print)9783030884796
DOIs
StatePublished - 2021
Event10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021 - Qingdao, China
Duration: 13 Oct 202117 Oct 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13028 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021
Country/TerritoryChina
CityQingdao
Period13/10/2117/10/21

Keywords

  • Neural machine translation
  • Selective translation memory

Fingerprint

Dive into the research topics of 'Learning to Select Relevant Knowledge for Neural Machine Translation'. Together they form a unique fingerprint.

Cite this