Abstract
Mining frequent sequential patterns from sequence databases has been a central research topic in data mining, and various efficient algorithms for mining sequential patterns have been proposed and studied. Recently, because the mining process generates a huge number of frequent sequential patterns, many researchers have shifted their attention from the efficiency of sequential pattern mining algorithms to making the result set easier for users to understand. This paper studies the problem of compressing frequent sequential patterns. Inspired by ideas for compressing frequent itemsets, an algorithm, CFSP (compressing frequent sequential patterns), is developed to mine a small number of representative sequential patterns that express the information of all frequent sequential patterns and eliminate a large number of redundant ones. CFSP adopts a two-step approach: in the first step, all closed sequential patterns are obtained as the candidate set of representative sequential patterns, and most of the representative sequential patterns are obtained at the same time; in the second step, the remaining representative sequential patterns are found in only a little time. An empirical study with both real and synthetic data sets shows that CFSP performs well.
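To make the notion of a closed sequential pattern concrete, here is a minimal Python sketch of the candidate set used in the first step. It is not the CFSP algorithm itself, whose details the abstract does not give. It assumes each sequence element is a single item rather than an itemset, enumerates patterns by brute force, and uses the standard definition that a frequent pattern is closed when no proper super-sequence has the same support. The function names and the toy database are hypothetical.

```python
from itertools import combinations


def is_subsequence(pattern, sequence):
    """Return True if pattern occurs in sequence in order (gaps allowed)."""
    it = iter(sequence)
    return all(item in it for item in pattern)


def support(pattern, database):
    """Count the sequences in the database that contain the pattern."""
    return sum(is_subsequence(pattern, seq) for seq in database)


def frequent_patterns(database, min_support):
    """Brute-force enumeration; only practical for tiny toy inputs."""
    candidates = set()
    for seq in database:
        for length in range(1, len(seq) + 1):
            for idx in combinations(range(len(seq)), length):
                candidates.add(tuple(seq[i] for i in idx))
    supports = {p: support(p, database) for p in candidates}
    return {p: s for p, s in supports.items() if s >= min_support}


def closed_patterns(frequent):
    """Keep a pattern only if no longer super-sequence has equal support."""
    return {
        p: s
        for p, s in frequent.items()
        if not any(
            len(q) > len(p) and t == s and is_subsequence(p, q)
            for q, t in frequent.items()
        )
    }


# Hypothetical toy database of four sequences.
database = [("a", "b", "c"), ("a", "c"), ("a", "b", "c"), ("b", "c")]
frequent = frequent_patterns(database, min_support=2)
closed = closed_patterns(frequent)
print(len(frequent), "frequent patterns,", len(closed), "closed")
# prints: 7 frequent patterns, 4 closed
```

On this toy input, seven frequent patterns reduce to four closed ones, which illustrates why closed patterns are a natural starting point for choosing representatives. A real miner would avoid the exhaustive enumeration used here.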
| Original language | English |
|---|---|
| Pages (from-to) | 72-80 |
| Number of pages | 9 |
| Journal | Jisuanji Yanjiu yu Fazhan/Computer Research and Development |
| Volume | 47 |
| Issue number | 1 |
| State | Published - Jan 2010 |
Keywords
- Association rule
- Compression
- Data mining
- Frequent pattern mining
- Mining sequential pattern
Fingerprint
Dive into the research topics of 'An efficient algorithm for mining compressed sequential patterns'. Together they form a unique fingerprint.