Abstract
When our learning task is to build a model with accurate classification, C4.5 and NB are two very important algorithms for achieving this task because of their simplicity and high performance. In this paper, we present a combined classification algorithm based on C4.5 and NB, simply C4.5-NB. In C4.5-NB, the class probability estimates of C4.5 and NB are weighted according to their classification accuracy on the training data. We experimentally tested C4.5-NB in Weka system using the whole 36 UCI data sets selected by Weka, and compared it with C4.5 and NB. The experimental results show that C4.5-NB significantly outperforms C4.5 and NB in terms of classification accuracy. Besides, we also observe the ranking performance of C4.5-NB in terms of AUC (the area under the Receiver Operating Characteristics curve). Fortunately, C4.5-NB also significantly outperforms C4.5 and NB.
Original language | English |
---|---|
Title of host publication | Advances in Computation and Intelligence |
Subtitle of host publication | Third International Symposium, ISICA 2008, Proceedings |
Editors | Lishan Kang, Zhihua Cai, Xuesong Yan, Yong Liu |
Place of Publication | Berlin |
Publisher | Springer, Springer Nature |
Pages | 350-359 |
Number of pages | 10 |
ISBN (Electronic) | 9783540921370 |
ISBN (Print) | 9783540921363 |
DOIs | |
Publication status | Published - 2008 |
Externally published | Yes |
Event | 3rd International Symposium on Intelligence Computation and Applications (ISICA 2008) - Wuhan, China Duration: 19 Dec 2008 → 21 Dec 2008 |
Conference
Conference | 3rd International Symposium on Intelligence Computation and Applications (ISICA 2008) |
---|---|
Country/Territory | China |
City | Wuhan |
Period | 19/12/08 → 21/12/08 |
Keywords
- decision trees
- naive Bayes
- combined algorithms
- weights
- classification
- ranking
- data mining