Skip to main navigation Skip to search Skip to main content

Parallel outlier detection using KD-tree based on MapReduce

  • Qing He*
  • , Yunlong Ma
  • , Qun Wang
  • , Fuzhen Zhuang
  • , Zhongzhi Shi
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Distributed and Parallel algorithms have attracted a vast amount of interest and research in recent decades, to handle large-scale data set in real-world applications. In this paper, we focus on a parallel implementation of KD-Tree based outlier detection method to deal with large-scale data set. As one of the state-of-the-art outlier detection methods, KD-Tree based has been approved to be an effective algorithm. However, it still cannot process large-scale data set efficiently due to its serial implementation. Based on the current and powerful parallel programming framework - MapReduce, we propose to implement the parallel KD-Tree based outlier detection algorithm (e.g., PKDTree for short). Experimental results demonstrate the efficiency of PKDTree according to the evaluation criterions of scaleup, speedup and sizeup.

Original languageEnglish
Title of host publicationProceedings - 2011 3rd IEEE International Conference on Cloud Computing Technology and Science, CloudCom 2011
PublisherIEEE Computer Society
Pages75-80
Number of pages6
ISBN (Print)9780769546223
DOIs
StatePublished - 2011
Externally publishedYes
Event3rd IEEE International Conference on Cloud Computing Technology and Science, CloudCom 2011 - Athens, Greece
Duration: 29 Nov 20111 Dec 2011

Publication series

NameProceedings - 2011 3rd IEEE International Conference on Cloud Computing Technology and Science, CloudCom 2011

Conference

Conference3rd IEEE International Conference on Cloud Computing Technology and Science, CloudCom 2011
Country/TerritoryGreece
CityAthens
Period29/11/111/12/11

Keywords

  • Data mining
  • KDTree
  • MapReduce
  • Parallel Outlier Detection

Fingerprint

Dive into the research topics of 'Parallel outlier detection using KD-tree based on MapReduce'. Together they form a unique fingerprint.

Cite this