Boosting the discriminant power of naive Bayes

Shihe Wang, Jianfeng Ren, Xiaoyu Lian, Ruibin Bai, Xudong Jiang

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

Naive Bayes has been widely used in many applications because of its simplicity and ability in handling both numerical data and categorical data. However, lack of modeling of correlations between features limits its performance. In addition, noise and outliers in the real-world dataset also greatly degrade the classification performance. In this paper, we propose a feature augmentation method employing a stack auto-encoder to reduce the noise in the data and boost the discriminant power of naive Bayes. The proposed stack auto-encoder consists of two auto-encoders for different purposes. The first encoder shrinks the initial features to derive a compact feature representation in order to remove the noise and redundant information. The second encoder boosts the discriminant power of the features by expanding them into a higher-dimensional space so that different classes of samples could be better separated in the higher-dimensional space. By integrating the proposed feature augmentation method with the regularized naive Bayes, the discrimination power of the model is greatly enhanced. The proposed method is evaluated on a set of machine-learning benchmark datasets. The experimental results show that the proposed method significantly and consistently outperforms the state-of-the-art naive Bayes classifiers.
Original languageEnglish
Title of host publicationInternational Conference on Pattern Recognition (ICPR)
PublisherIEEE
Pages4906-4912
Number of pages7
ISBN (Electronic)9781665490627
DOIs
Publication statusPublished - 29 Nov 2022
EventInternational Conference on Pattern Recognition - Montreal, Canada
Duration: 21 Aug 202225 Aug 2022
Conference number: 26th

Conference

ConferenceInternational Conference on Pattern Recognition
Abbreviated titleICPR
Country/TerritoryCanada
CityMontreal
Period21/08/2225/08/22

Keywords

  • Correlation
  • Benchmark testing
  • Feature extraction
  • Boosting
  • Data models
  • Numerical models

Fingerprint

Dive into the research topics of 'Boosting the discriminant power of naive Bayes'. Together they form a unique fingerprint.

Cite this