Abstract
Discovering knowledge from huge databases with missing values is a challenging problem in Data Mining. In this paper, a novel hybrid algorithm for learning knowledge represented in Bayesian Networks is discussed. The new algorithm combines an evolutionary algorithm with the Expectation-Maximization (EM) algorithm to overcome the problem of getting stuck in sub-optimal solutions which occurs in most existing learning algorithms. The experimental results on the databases generated from several benchmark network structures illustrate that our system outperforms some state-of-the-art algorithms. We also apply our system to a direct marketing problem, and compare the performance of the discovered Bayesian networks with the response models obtained by other algorithms. In the comparison, the Bayesian networks learned by our system outperform others.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Jensen, F.V.: An Introduction to Bayesian Network. University of College London Press (1996)
Andreassen, S., Woldbye, M., Falck, B., Andersen, S.: MUNIN: A Causal Probabilistic Network for Interpretation of Electromyographic Findings. In: Proceedings of the Tenth International Joint Conference on Artificial Intelligence, pp. 366–372 (1987)
Cheeseman, P., Kelly, J., Self, M., Stutz, J., Taylor, W., Freeman, D.: AutoClass: a Bayesian classification system. In: Proceedings of the Fifth International Workshop on Machine Learning, pp. 54–64 (1988)
Heckerman, D., Horvitz, E.: Inferring Informational Goals from Free-Text Queries: A Bayesian Approach. In: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, pp. 230–237 (1998)
Heckerman, D., Wellman, M.P.: Bayesian Networks. Communications of the ACM 38(3), 27–30 (1995)
Cheng, J., Greiner, R., Kelly, J., Bell, D., Liu, W.: Learning Bayesian Networks from Data: An Information-Theory Based Approach. Artificial Intelligence 137, 43–90 (2002)
Spirtes, P., Glymour, C., Scheines, R.: Causation, Prediction, and Search. MIT Press, Cambridge (2000)
Cooper, G., Herskovits, E.: A Bayesian Method for the Induction of Probabilistic Networks from Data. Machine Learning 9(4), 309–347 (1992)
Heckerman, D.: A Tutorial on Learning Bayesian Networks. Tech. Rep. MSR-TR-95-06. Microsoft Research Adv. Technol. Div., Redmond, WA (1995)
Lam, W., Bacchus, F.: Learning Bayesian belief networks: an approach based on the MDL principle. Computational Intelligence 10(4), 269–293 (1994)
Larrañaga, P., Poza, M., Yurramendi, Y., Murga, R., Kuijpers, C.: Structure Learning of Bayesian Network by Genetic Algorithms: A Performance Analysis of Control Parameters. IEEE Transactions on Pattern Analysis and Machine Intelligence 18(9), 912–926 (1996)
Larrañaga, P., Kuijpers, C., Mura, R., Yurramendi, Y.: Learning Bayesian Network Structures by Searching for The Best Ordering with Genetic Algorithms. IEEE Transactions on System, Man and Cybernetics 26(4), 487–493 (1996)
Wong, M.L., Lam, W., Leung, K.S.: Using Evolutionary Programming and Minimum Description Length principle for data mining of Bayesian networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 21(2), 174–178 (1999)
Wong, M.L., Leung, K.S.: An Efficient Data Mining Method for Learning Bayesian Networks Using an Evolutionary Algorithm-Based Hybrid Approach. IEEE Transactions on Evolutionary Computation 8(4), 378–404 (2004)
Schafer, J.L., Graham, J.W.: Missing data: Our View of the State of the Art. Psychological Methods 7(2), 147–177 (2002)
Ramoni, M., Sebastiani, P.: Efficient Parameter Learning in Bayesian Networks from Incomplete Databases. Tech. Rep. KMI-TR-41 (1997)
Ramoni, M., Sebastiani, P.: The Use of Exogenous Knowledge to Learn Bayesian Networks from Incomplete Databases. Tech. Rep. KMI-TR-44 (1997)
Friedman, N.: Learning Belief Networks in the Presence of Missing Values and Hidden Variables. In: Proceedings of the Fourteenth International Conference on Machine Learning, pp. 125–133 (1997)
Friedman, N.: The Bayesian Structural EM Algorithm. In: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, pp. 80–89 (1998)
Peña, J.M., Lozano, J.A., Larrañaga, P.: An Improved Bayesian Structural EM Algorithm for Learning Bayesian Networks for Clustering. Pattern Recognition Letters 21, 779–786 (2000)
Peña, J.M., Lozano, J.A., Larrañaga, P.: Learning Recursive Bayesian Multinets for Data Clustering by Means of Constructive Induction. Machine Learning 47, 63–89 (2002)
Myers, J., Laskey, K., DeJong, K.: Learning Bayesian Networks from Incomplete Data using Evolutionary Algorithms. In: Proceedings of the First Annual Conference on Genetic and Evolutionary Computation Conference, pp. 458–465 (1999)
Pearl, J.: Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, San Mateo (1998)
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society(B) 39(1), 1–38 (1977)
Lauritzen, S.: The EM Algorithm for Graphical Association Models with Missing Data. Computational Statistics and Data Analysis 19, 191–201 (1995)
Huang, C., Darwiche, A.: Inference in Belief Networks: a Procedural Guide. International Journal of Approximate Reasoning 15(3), 225–263 (1996)
Bayesware Discoverer, http://www.bayesware.com/frontpage.html
Norsys Bayes Net Library, http://www.norsys.com/net_library.htm
Chickering, D.M.: Learning Equivalence Classes of Bayesian Network Structures. Journal of Machine Learning Research 2, 445–498 (2002)
Beaumont, G.P., Knowles, J.D.: Statistical Tests: An Introduction with MINITAB Commentary. Prentice-Hall, Englewood Cliffs (1996)
Zahavi, J., Levin, N.: Issues and Problems in Applying Neural Computing to Target Marketing. Journal of Direct Marketing 11(4), 63–75 (1997)
Bhattacharyya, S.: Direct Marketing Response Models using Genetic Algorithms. In: Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pp. 144–148 (1998)
Cabena, P., Hadjinian, P., Stadler, R., Verhees, J., Zanasi, A.: Discovering Data Mining: From Concept to Implementation. Prentice-Hall, Englewood Cliffs (1997)
Petrison, L.A., Blattberg, R.C., Wang, P.: Database Marketing: Past, Present, and Future. Journal of Direct Marketing 11(4), 109–125 (1997)
Bhattacharyya, S.: Evolutionary Algorithms in Data Mining: Multi-Objective Performance Modeling for Direct Marketing. In: Proceedings of the Sixth International Conference on Knowledge Discovery and Data Mining, pp. 465–473 (2000)
Zahavi, J., Levin, N.: Applying Neural Computing to Target Marketing. Journal of Direct Marketing 11(4), 76–93 (1997)
Ling, C.X., Li, C.H.: Data Mining for Direct Marketing: Problems and Solutions. In: Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pp. 73–79 (1998)
Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian Network Classifiers. Machine Learning 29, 131–163 (1997)
Rud, O.P.: Data Mining Cookbook: Modeling Data for Marketing, Risk and Customer Relationship Management. Wiley, New York (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Guo, Y.Y., Wong, M.L. (2009). Mining Bayesian Networks from Direct Marketing Databases with Missing Values. In: Gen, M., et al. Intelligent and Evolutionary Systems. Studies in Computational Intelligence, vol 187. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-95978-6_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-95978-6_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-95977-9
Online ISBN: 978-3-540-95978-6
eBook Packages: EngineeringEngineering (R0)