On the classification performance of TAN and general Bayesian networks

Research output: Contribution to a Journal (Peer & Non Peer)Articlepeer-review

97 Citations (Scopus)

Abstract

Over a decade ago, Friedman et al. introduced the Tree Augmented Naïve Bayes (TAN) classifier, with experiments indicating that it significantly outperformed Naïve Bayes (NB) in terms of classification accuracy, whereas general Bayesian network (GBN) classifiers performed no better than NB. This paper challenges those claims, using a careful experimental analysis to show that GBN classifiers significantly outperform NB on datasets analyzed, and are comparable to TAN performance. It is found that the poor performance reported by Friedman et al. are not attributable to the GBN per se, but rather to their use of simple empirical frequencies to estimate GBN parameters, whereas basic parameter smoothing (used in their TAN analyses but not their GBN analyses) improves GBN performance significantly. It is concluded that, while GBN classifiers may have some limitations, they deserve greater attention, particularly in domains where insight into classification decisions, as well as good accuracy, is required.

Original languageEnglish
Pages (from-to)489-495
Number of pages7
JournalKnowledge-Based Systems
Volume22
Issue number7
DOIs
Publication statusPublished - 1 Oct 2009

Keywords

  • Bayesian networks
  • Classification
  • Inductive learning
  • Naïve Bayes
  • Parameter estimation
  • TAN

Authors (Note for portal: view the doc link for the full list of authors)

  • Authors
  • Madden, MG

Fingerprint

Dive into the research topics of 'On the classification performance of TAN and general Bayesian networks'. Together they form a unique fingerprint.

Cite this