跳到主要导航 跳到搜索 跳到主要内容

An efficient algorithm for mining compressed sequential patterns

  • Yongxin Tong*
  • , Yuanyuan Zhang
  • , Mei Yuan
  • , Shilong Ma
  • , Dan Yu
  • , Li Zhao
  • *此作品的通讯作者
  • China Academy of Telecommunication Technology
  • Beijing Union University
  • Beihang University

科研成果: 期刊稿件文章同行评审

摘要

Mining frequent sequential patterns from sequence databases has been a central research topic in data mining and various efficient algorithms for mining sequential patterns have been proposed and studied. Recently, many researchers have not focused on the efficiency of sequential patterns mining algorithms, but have paid attention to how to make users understand the result set of sequential patterns easily, due to the huge number of frequent sequential patterns generated by the mining process. In this paper, the problem of compressing frequent sequential patterns is studied. Inspired by the ideas of compressing frequent itemsets, an algorithm, CFSP (compressing frequent sequential patterns), is developed to mine a few representative sequential patterns to express all the information of all frequent sequential patterns and eliminate a large number of redundant sequential patterns. The CFSP adopts a two-steps approach: in the first step, all closed sequential patterns as the candidate set of representative sequential patterns are obtained, and at the same time most of the representative sequential patterns are obtained; in the second step, finding the remaining representative sequential patterns takes only a little time. An empirical study with both real and synthetic data sets proves that the CFSP has good performance.

源语言英语
页(从-至)72-80
页数9
期刊Jisuanji Yanjiu yu Fazhan/Computer Research and Development
47
1
出版状态已出版 - 1月 2010

指纹

探究 'An efficient algorithm for mining compressed sequential patterns' 的科研主题。它们共同构成独一无二的指纹。

引用此