
Parallel decision tree with application to water quality data analysis

  • Qing He*
  • Zhi Dong
  • Fuzhen Zhuang
  • Tianfeng Shang
  • Zhongzhi Shi

* Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Decision trees are a popular classification technique in many applications, such as retail target marketing, fraud detection, and the design of telecommunication service plans. With the information explosion, existing classification algorithms cannot cope well with large data sets. To address this problem, many researchers have tried to design efficient parallel classification algorithms. Based on MapReduce, a current and powerful parallel programming framework, we propose a parallel ID3 classification algorithm (PID3 for short). As experimental data, we use water quality monitoring data from the Changjiang River, which contains 17 branches. Because the data are time series, we convert them to attribute data before applying the decision tree. The experimental results demonstrate that the proposed algorithm scales well and efficiently processes large datasets on commodity hardware.
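
The abstract does not spell out how PID3 divides the work, so the following is only a minimal sketch of one common way an ID3 split can be phrased in MapReduce terms, not the authors' implementation. It assumes a map step that emits a count for each (attribute, value, class label) triple and a reduce step that sums those counts, after which the information gain of each attribute is computed from the totals alone. The function names, the in-memory simulation of map and reduce, and the sample attributes (`ph`, `oxygen`, `grade`) are all hypothetical.

```python
from collections import Counter
from math import log2


def map_record(record, label_key):
    """Map step: emit ((attribute, value, label), 1) for each attribute of one record."""
    label = record[label_key]
    for attr, value in record.items():
        if attr != label_key:
            yield (attr, value, label), 1


def reduce_counts(pairs):
    """Reduce step: sum the counts emitted for each (attribute, value, label) key."""
    totals = Counter()
    for key, count in pairs:
        totals[key] += count
    return totals


def entropy(class_counts):
    """Shannon entropy (in bits) of a collection of class counts."""
    counts = list(class_counts)
    total = sum(counts)
    return -sum(c / total * log2(c / total) for c in counts if c)


def information_gain(totals, attr):
    """ID3 information gain of `attr`, computed only from the reduced counts."""
    by_value = {}         # attribute value -> Counter(label -> count)
    by_label = Counter()  # label -> count over all records
    for (a, value, label), n in totals.items():
        if a == attr:
            by_value.setdefault(value, Counter())[label] += n
            by_label[label] += n
    total = sum(by_label.values())
    remainder = sum(
        sum(c.values()) / total * entropy(c.values()) for c in by_value.values()
    )
    return entropy(by_label.values()) - remainder


def best_split(records, label_key):
    """Simulate one map/reduce round in memory and pick the highest-gain attribute."""
    pairs = (kv for r in records for kv in map_record(r, label_key))
    totals = reduce_counts(pairs)
    attrs = sorted({a for (a, _, _) in totals})
    return max(attrs, key=lambda a: information_gain(totals, a))


# Hypothetical, already-discretized records (not data from the paper).
SAMPLE = [
    {"ph": "high", "oxygen": "low", "grade": "bad"},
    {"ph": "high", "oxygen": "high", "grade": "good"},
    {"ph": "low", "oxygen": "low", "grade": "bad"},
    {"ph": "low", "oxygen": "high", "grade": "good"},
]
```

Calling `best_split(SAMPLE, "grade")` selects `oxygen`, since that attribute separates the two classes perfectly in this toy data. Because the gain depends only on summed counts, the map step can run independently on each data partition, which is the property that makes this style of split selection a natural fit for MapReduce.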

Original language: English
Title of host publication: Advances in Neural Networks, ISNN 2012 - 9th International Symposium on Neural Networks, Proceedings
Pages: 628-637
Number of pages: 10
Edition: PART 2
State: Published - 2012
Externally published: Yes
Event: 9th International Symposium on Neural Networks, ISNN 2012 - Shenyang, China
Duration: 11 Jul 2012 to 14 Jul 2012

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number: PART 2
Volume: 7368 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 9th International Symposium on Neural Networks, ISNN 2012
Country/Territory: China
City: Shenyang
Period: 11/07/12 to 14/07/12

Keywords

  • Data mining
  • MapReduce
  • PID3
  • Parallel decision tree
