Abstract
In this work, we present a semi-naive Bayes classifier that searches for dependent attributes using different filter approaches. In order to avoid that the number of cases of the compound attributes be too high, a grouping procedure is applied each time after two variables are merged. This method tries to group two or more cases of the new variable into an unique value. In an emperical study, we show as this approach outperforms the naive Bayes classifier in a very robust way and reaches the performance of the Pazzani’s semi-naive Bayes [1] without the high cost of a wrapper search.
This work has been supported by the Spanish ‘Consejer’ía de Innovación, Ciencia y Empresa de la Junta de Andalucía’ and ‘Ministerio de Educación y Ciencia’, under Projects TIC-276 and TIN2004-06204-C03-02 and FPU scholarship (AP2004-4678).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Pazzani, M.J.: Searching for dependencies in bayesian classifiers. Lecture Notes in Statistics, vol. 112, pp. 239–248 (1995)
Abellán, J.: Application of Uncertainty Measures on Credal Sets on the Naive Bayes Classifier. Int. J. of General Systems 35(6), 675–686 (2006)
Abellán, J., Moral, S.: Upper entropy of credal sets. Applications to credal classification. Int. J. of Approximate Reasoning 39(2-3), 235–255 (2005)
Abellán, J., Klir, G.J., Moral, S.: Disaggregated total uncertainty measure for credal sets. Int. J. of General Systems 35(1), 29–44 (2006)
Kononenko, I.: Semi-naive bayesian classifier. In: EWSL 1991: Proceedings of the European working session on learning on Machine learning, pp. 206–219. Springer, Heidelberg (1991)
Zheng, F., Webb, G.: A comparative study of semi-naive bayes methods in classification learning. In: Proc. 4th Australasian Data Mining conference, pp. 141–156 (2005)
Domingos, P., Pazzani, M.J.: On the optimality of the simple bayesian classifier under zero-one loss. Machine Learning 29(2-3), 103–130 (1997)
Heckerman, D., Geiger, D., Chickering, D.M.: Learning bayesian networks: The combination of knowledge and statistical data. In: KDD Workshop, pp. 85–96 (1994)
Neapolitan, R.E.: Learning Bayesian Networks. Prentice-Hall, Englewood Cliffs (2004)
Wilks, S.S.: The large-sample distribution of the likelihood ratio for testing composite hypotheses. Annals of Mathematical Statistics 9, 60–62 (1938)
Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proc. of 13th Int. Joint Conf. on AI (1993)
Consortium, E.: Elvira: An environment for probabilistic graphical models. In: Proceedings of the 1st European Workshop on Probabilistic Graphical Models (2002)
Witten, I.H., Frank, E.: Data mining: practical machine learning tools and techniques with Java implementations. Morgan Kaufmann, San Francisco (2000)
Duda: Pattern Classification and SceneAnalysis. John Wiley & Sons, Chichester (1973)
Webb, G.I., Boughton, J.R., Wang, Z.: Not so naive bayes: Aggregating onedependence estimators. Mach. Learn. 58(1), 5–24 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abellán, J., Cano, A., Masegosa, A.R., Moral, S. (2007). A Semi-naive Bayes Classifier with Grouping of Cases. In: Mellouli, K. (eds) Symbolic and Quantitative Approaches to Reasoning with Uncertainty. ECSQARU 2007. Lecture Notes in Computer Science(), vol 4724. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75256-1_43
Download citation
DOI: https://doi.org/10.1007/978-3-540-75256-1_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75255-4
Online ISBN: 978-3-540-75256-1
eBook Packages: Computer ScienceComputer Science (R0)