Open Access Open Access  Restricted Access Subscription Access

A Novel Methodology for Constructing Rule-Based Naive Bayesian Classifiers


Affiliations
1 Applied Science University, Amman, Jordan
 

Classification is an important data mining technique that is used by many applications. Several types of classifiers have been described in the research literature. Example classifiers are decision tree classifiers, rule-based classifiers, and neural networks classifiers. Another popular classification technique is naïve Bayesian classification. Naive Bayesian classification is a probabilistic classification approach that uses Bayesian Theorem to predict the classes of unclassified records. A drawback of Naive Bayesian Classification is that every time a new data record is to be classified, the entire dataset needs to be scanned in order to apply a set of equations that perform the classification. Scanning the dataset is normally a very costly step especially if the dataset is very large. To alleviate this problem, a new approach for using naive Bayesian classification is introduced in this study. In this approach, a set of classification rules is constructed on top of naive Bayesian classifier. Hence we call this approach Rule-based Naïve Bayesian Classifier (RNBC). In RNBC, the dataset is canned only once, off-line, at the time of building the classification rule set. Subsequent scanning of the dataset, is avoided. Furthermore, this study introduces a simple three-step methodology for constructing the classification rule set.

Keywords

Data Mining, Classification, Bayes Theorem, Rule-Based Systems, Machine Learning.
User
Notifications
Font Size

Abstract Views: 197

PDF Views: 100




  • A Novel Methodology for Constructing Rule-Based Naive Bayesian Classifiers

Abstract Views: 197  |  PDF Views: 100

Authors

Abdallah Alashqur
Applied Science University, Amman, Jordan

Abstract


Classification is an important data mining technique that is used by many applications. Several types of classifiers have been described in the research literature. Example classifiers are decision tree classifiers, rule-based classifiers, and neural networks classifiers. Another popular classification technique is naïve Bayesian classification. Naive Bayesian classification is a probabilistic classification approach that uses Bayesian Theorem to predict the classes of unclassified records. A drawback of Naive Bayesian Classification is that every time a new data record is to be classified, the entire dataset needs to be scanned in order to apply a set of equations that perform the classification. Scanning the dataset is normally a very costly step especially if the dataset is very large. To alleviate this problem, a new approach for using naive Bayesian classification is introduced in this study. In this approach, a set of classification rules is constructed on top of naive Bayesian classifier. Hence we call this approach Rule-based Naïve Bayesian Classifier (RNBC). In RNBC, the dataset is canned only once, off-line, at the time of building the classification rule set. Subsequent scanning of the dataset, is avoided. Furthermore, this study introduces a simple three-step methodology for constructing the classification rule set.

Keywords


Data Mining, Classification, Bayes Theorem, Rule-Based Systems, Machine Learning.