跳到主要导航 跳到搜索 跳到主要内容

Unsupervised Sentence Representation via Contrastive Learning with Mixing Negatives

  • Beihang University
  • University of Sheffield
  • University of Ottawa

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Unsupervised sentence representation learning is a fundamental problem in natural language processing. Recently, contrastive learning has made great success on this task. Existing constrastive learning based models usually apply random sampling to select negative examples for training. Previous work in computer vision has shown that hard negative examples help contrastive learning to achieve faster convergency and better optimization for representation learning. However, the importance of hard negatives in contrastive learning for sentence representation is yet to be explored. In this study, we prove that hard negatives are essential for maintaining strong gradient signals in the training process while random sampling negative examples is ineffective for sentence representation. Accordingly, we present a contrastive model, MixCSE, that extends the current state-of-the-art Sim- CSE by continually constructing hard negatives via mixing both positive and negative features. The superior performance of the proposed approach is demonstrated via empirical studies on Semantic Textual Similarity datasets and Transfer task datasets.

源语言英语
主期刊名AAAI-22 Technical Tracks 10
出版商Association for the Advancement of Artificial Intelligence
11730-11738
页数9
ISBN(电子版)1577358767, 9781577358763
DOI
出版状态已出版 - 30 6月 2022
活动36th AAAI Conference on Artificial Intelligence, AAAI 2022 - Virtual, Online
期限: 22 2月 20221 3月 2022

出版系列

姓名Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022
36

会议

会议36th AAAI Conference on Artificial Intelligence, AAAI 2022
Virtual, Online
时期22/02/221/03/22

指纹

探究 'Unsupervised Sentence Representation via Contrastive Learning with Mixing Negatives' 的科研主题。它们共同构成独一无二的指纹。

引用此