跳到主要导航 跳到搜索 跳到主要内容

Auto-dialabel: Labeling dialogue data with unsupervised learning

  • Chen Shi
  • , Qi Chen
  • , Lei Sha
  • , Sujian Li
  • , Xu Sun
  • , Houfeng Wang*
  • , Lintao Zhang
  • *此作品的通讯作者
  • Peking University
  • Microsoft USA

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The lack of labeled data is one of the main challenges when building a task-oriented dialogue system. Existing dialogue datasets usually rely on human labeling, which is expensive, limited in size, and in low coverage. In this paper, we instead propose our framework auto-dialabel to automatically cluster the dialogue intents and slots. In this framework, we collect a set of context features, leverage an autoencoder for feature assembly, and adapt a dynamic hierarchical clustering method for intent and slot labeling. Experimental results show that our framework can promote human labeling cost to a great extent, achieve good intent clustering accuracy (84.1%), and provide reasonable and instructive slot labeling results.

源语言英语
主期刊名Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018
编辑Ellen Riloff, David Chiang, Julia Hockenmaier, Jun'ichi Tsujii
出版商Association for Computational Linguistics
684-689
页数6
ISBN(电子版)9781948087841
出版状态已出版 - 2018
已对外发布
活动2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018 - Brussels, 比利时
期限: 31 10月 20184 11月 2018

出版系列

姓名Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018

会议

会议2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018
国家/地区比利时
Brussels
时期31/10/184/11/18

指纹

探究 'Auto-dialabel: Labeling dialogue data with unsupervised learning' 的科研主题。它们共同构成独一无二的指纹。

引用此