跳到主要导航 跳到搜索 跳到主要内容

A novel text representation model for text classification

  • Jun Wang*
  • , Yiming Zhou
  • *此作品的通讯作者
  • Beihang University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The text representation in text classification is usually a sequence of terms. As the number of terms becomes very high, it is greatly time-consuming to perform existed text categorization tasks. In this paper we presented a novel text representation model for text classification which greatly reduced the required resources. This model represents text with several features. Each feature corresponds to a theme that emerged from a set of related articles. We also introduce an efficient way to build the model. The proposed model has been applied to naïve bayes classifier and experiments on Reuters-21578 corpus have shown that the efficiency is greatly improved without sacrificing classification accuracy even when the dimension of the input space is significantly reduced.

源语言英语
主期刊名Proceedings - The 1st International Conference on Intelligent Networks and Intelligent Systems, ICINIS 2008
702-705
页数4
DOI
出版状态已出版 - 2008
活动1st International Conference on Intelligent Networks and Intelligent Systems, ICINIS 2008 - Wuhan, 中国
期限: 1 11月 20083 11月 2008

出版系列

姓名Proceedings - The 1st International Conference on Intelligent Networks and Intelligent Systems, ICINIS 2008

会议

会议1st International Conference on Intelligent Networks and Intelligent Systems, ICINIS 2008
国家/地区中国
Wuhan
时期1/11/083/11/08

指纹

探究 'A novel text representation model for text classification' 的科研主题。它们共同构成独一无二的指纹。

引用此