TY - GEN
T1 - Mining top-K frequent closed patterns from gene expression data
AU - Ji, Shufan
AU - Wang, Xuejiao
AU - Zong, Yi
AU - Gao, Xiaopeng
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2015/1/26
Y1 - 2015/1/26
N2 - Analyzing microarray gene expression data provides biologists deep insights into gene functions and gene regulatory network. In this paper, we propose a novel efficient algorithm FCPminer to mine top-k frequent closed patterns (FCPs) of higher support with length no less than minL from gene expression data. FCPminer employs a prefix fp-tree data structure, with top-down best first search strategy, such that FCPs of adequate length with highest supports are firstly mined. Compared with existing top-k FCP mining algorithms, FCPminer is much more efficient as it avoids expanding nodes with inadequate length (less than minL) or low support (ranked below top-k) during mining process. In addition, FCPminer further improves mining efficiency by employing a hash-based closedness checking method. Experimental results on real biological and synthetic data show that our proposed FCPminer outperforms existing state-of the art algorithms with high efficiency, especially for large and dense datasets.
AB - Analyzing microarray gene expression data provides biologists deep insights into gene functions and gene regulatory network. In this paper, we propose a novel efficient algorithm FCPminer to mine top-k frequent closed patterns (FCPs) of higher support with length no less than minL from gene expression data. FCPminer employs a prefix fp-tree data structure, with top-down best first search strategy, such that FCPs of adequate length with highest supports are firstly mined. Compared with existing top-k FCP mining algorithms, FCPminer is much more efficient as it avoids expanding nodes with inadequate length (less than minL) or low support (ranked below top-k) during mining process. In addition, FCPminer further improves mining efficiency by employing a hash-based closedness checking method. Experimental results on real biological and synthetic data show that our proposed FCPminer outperforms existing state-of the art algorithms with high efficiency, especially for large and dense datasets.
UR - https://www.scopus.com/pages/publications/84936885579
U2 - 10.1109/ICDMW.2014.61
DO - 10.1109/ICDMW.2014.61
M3 - 会议稿件
AN - SCOPUS:84936885579
T3 - IEEE International Conference on Data Mining Workshops, ICDMW
SP - 732
EP - 739
BT - Proceedings - 14th IEEE International Conference on Data Mining Workshops, ICDMW 2014
A2 - Zhou, Zhi-Hua
A2 - Wang, Wei
A2 - Kumar, Ravi
A2 - Toivonen, Hannu
A2 - Pei, Jian
A2 - Zhexue Huang, Joshua
A2 - Wu, Xindong
PB - IEEE Computer Society
T2 - 14th IEEE International Conference on Data Mining Workshops, ICDMW 2014
Y2 - 14 December 2014
ER -