Naive bayes classification given probability estimation trees

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

Tree induction is one of the most effective and widely used approaches to classification. Unfortunately, decision trees such as C4.5 [9] have been found to provide poor probability estimates. Through empirical studies, Provost and Domingos [6] found that Probability Estimation Trees (PETs) give fairly good probability estimates. However, unlike ordinary decision trees, PETs perform worse when pruned. Good probability estimates therefore usually require large trees, which hurt model transparency. This paper proposes two hybrid models that combine the Naive Bayes classifier with PETs, with the aim of building a model that performs well without losing too much transparency. The first model uses Naive Bayes estimation given a PET, and the second uses a group of small PETs as Naive Bayes estimators. Empirical studies show that the first model outperforms the PET model at shallow depths and that the second model performs equivalently to Naive Bayes and PETs.
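
To make the probability-estimation issue concrete, the sketch below contrasts raw leaf frequencies with Laplace-corrected frequencies, the smoothing commonly associated with PETs. It is only an illustration under stated assumptions: the function name `leaf_probabilities`, the toy leaf, and the class labels are hypothetical, and the code does not reproduce either hybrid model proposed in the paper.

```python
from collections import Counter


def leaf_probabilities(leaf_labels, classes, laplace=True):
    """Estimate class probabilities from the training labels that reach one leaf.

    Hypothetical helper for illustration only; not the paper's implementation.
    """
    counts = Counter(leaf_labels)
    n = len(leaf_labels)
    k = len(classes)
    if laplace:
        # Laplace correction: add one pseudo-count per class.
        return {c: (counts[c] + 1) / (n + k) for c in classes}
    if n == 0:
        # No examples reached the leaf: fall back to a uniform estimate.
        return {c: 1 / k for c in classes}
    # Raw relative frequencies, as an ordinary decision tree would report.
    return {c: counts[c] / n for c in classes}


# A small, pure leaf: three training examples, all labelled "pos".
leaf = ["pos", "pos", "pos"]
raw = leaf_probabilities(leaf, ["pos", "neg"], laplace=False)
smoothed = leaf_probabilities(leaf, ["pos", "neg"])
print(raw)       # raw frequencies are extreme for a small pure leaf
print(smoothed)  # smoothed estimates are pulled toward uniform
```

With only three examples the raw estimate is an overconfident 1.0 for "pos", while the corrected estimate is (3 + 1) / (3 + 2) = 0.8. Small leaves gain the most from the correction, which is one reason probability estimates are sensitive to tree size.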

Original language: English
Title of host publication: Proceedings - 5th International Conference on Machine Learning and Applications, ICMLA 2006
Pages: 34-39
Number of pages: 6
State: Published - 2006
Externally published: Yes
Event: 5th International Conference on Machine Learning and Applications, ICMLA 2006 - Orlando, FL, United States
Duration: 14 Dec 2006 to 16 Dec 2006

Keywords

  • Classification
  • Decision tree
  • Hybrid classification model
  • Naive bayes
  • Probability estimation tree
