Improving short text clustering performance with keyword expansion

  • Jun Wang
  • , Yiming Zhou
  • , Lin Li
  • , Biyun Hu
  • , Xia Hu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Most of traditional text clustering methods are based on bag of words representation, which ignore the important information on semantic relationship between key terms. To overcome this problem, researchers have recently proposed several new methods for improving short text clustering accuracy based on enriching short text representation. However, the computational costs of these methods based on expanding words appeared in short texts are usually time-consuming. In this paper, we improve previous work by enriching short text representation with keyword expansion. Empirical results show that the proposed method can greatly save time without sacrificing clustering accuracy.

Original languageEnglish
Title of host publicationAdvances in Intelligent and Soft Computing
EditorsHongwei Wang, Yi Shen, Zhigang Zeng, Tingwen Huang
PublisherSpringer Verlag
Pages291-298
Number of pages8
ISBN (Electronic)9783642012150
DOIs
StatePublished - 2009
Event6th International Symposium of Neural Networks, ISNN 2009 - Wuhan, China
Duration: 26 May 200929 May 2009

Publication series

NameAdvances in Intelligent and Soft Computing
Volume56
ISSN (Print)1867-5662
ISSN (Electronic)1860-0794

Conference

Conference6th International Symposium of Neural Networks, ISNN 2009
Country/TerritoryChina
CityWuhan
Period26/05/0929/05/09

Keywords

  • Keyword expansion
  • Short text clustering
  • Text representation

Fingerprint

Dive into the research topics of 'Improving short text clustering performance with keyword expansion'. Together they form a unique fingerprint.

Cite this