Bayesian Learning of Markov Network Structure

Jakulin, Aleks; Rish, Irina

doi:10.1007/11871842_22

Aleks Jakulin²¹ &
Irina Rish²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4212))

Included in the following conference series:

European Conference on Machine Learning

5518 Accesses
4 Citations

Abstract

We propose a simple and efficient approach to building undirected probabilistic classification models (Markov networks) that extend naïve Bayes classifiers and outperform existing directed probabilistic classifiers (Bayesian networks) of similar complexity. Our Markov network model is represented as a set of consistent probability distributions on subsets of variables. Inference with such a model can be done efficiently in closed form for problems like class probability estimation. We also propose a highly efficient Bayesian structure learning algorithm for conditional prediction problems, based on integrating along a hill-climb in the structure space. Our prior based on the degrees of freedom effectively prevents overfitting.

Download to read the full chapter text

Chapter PDF

Efficient parameter learning of Bayesian network classifiers

Article 26 January 2017

A survey on Bayesian network structure learning from data

Article 29 May 2019

Learning Bayesian Network Structures When Discrete and Continuous Variables Are Present

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian network classifiers. Machine Learning 29, 131–163 (1997)
Article MATH Google Scholar
Salojärvi, J., Puolamäki, K., Kaski, S.: On discriminative joint density modeling. In: Jeckle, M., Kowalczyk, R., Braun, P. (eds.) GSEM 2004. LNCS, vol. 3270, pp. 341–352. Springer, Heidelberg (2004)
Google Scholar
Pernkopf, F., Bilmes, J.: Discriminative versus generative parameter and structure learning of Bayesian network classifiers. In: Proc. 22nd ICML, Bonn, Germany, pp. 657–664. ACM Press, New York (2005)
Google Scholar
Grossman, D., Domingos, P.: Learning Bayesian network classifiers by maximizing conditional likelihood. In: Proc. 21st ICML, Banff, Canada, pp. 361–368. ACM Press, New York (2004)
Google Scholar
Chow, C.K., Liu, C.N.: Approximating discrete probability distributions with dependence trees. IEEE Trans. on Information Theory 14(3), 462–467 (1968)
Article MATH MathSciNet Google Scholar
Meilă, M., Jordan, M.I.: Learning with mixtures of trees. Journal of Machine Learning Research 1, 1–48 (2000)
Article Google Scholar
Srebro, N.: Maximum likelihood bounded tree-width Markov networks. In: Proceedings of the 17th Conference on Uncertainty in Artificial Intelligence (UAI), pp. 504–511 (2001)
Google Scholar
Bach, F., Jordan, M.: Thin junction trees. In: Advances in Neural Information Processing Systems, vol. 14, pp. 569–576 (2002)
Google Scholar
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. of the International Conference on Machine Learning (ICML), pp. 282–289 (2001)
Google Scholar
Yedidia, J.S., Freeman, W.T., Weiss, Y.: Constructing free-energy approximations and generalized belief propagation algorithms. IEEE Transactions on Information Theory 51(7), 2282–2312 (2005)
Article MathSciNet Google Scholar
Jakulin, A., Rish, I., Bratko, I.: Kikuchi-Bayes: Factorized models for approximate classification in closed form. Technical Report RC23314, IBM (2004)
Google Scholar
Santana, R.: Estimation of distribution algorithms with Kikuchi approximations. Evolutionary Computation 13(1), 67–97 (2005)
Article Google Scholar
Pearl, J.: Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Francisco (1988)
Google Scholar
Friedman, N., Koller, D.: Being Bayesian about network structure: A Bayesian approach to structure discovery in Bayesian networks. Machine Learning 50, 95–126 (2003)
Article MATH Google Scholar
Cerquides, J., López de Màntaras, R.: Tractable Bayesian learning of tree augmented naive Bayes classifiers. In: Proc. 20th ICML, pp. 75–82 (2003)
Google Scholar
Hoeting, J.A., Madigan, D., Raftery, A.E., Volinsky, C.T.: Bayesian model averaging: A tutorial. Statistical Science 14(4), 382–417 (1999)
Article MATH MathSciNet Google Scholar
Krippendorff, K.: Information Theory: Structural Models for Qualitative Data. vol. 07–062. Sage Publications Inc., Beverly Hills, CA (1986)
Google Scholar
Jakulin, A., Bratko, I.: Analyzing attribute dependencies. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds.) PKDD 2003. LNCS, vol. 2838, pp. 229–240. Springer, Heidelberg (2003)
Chapter Google Scholar
Greiner, R., Su, X., Shen, B., Zhou, W.: Structural extension to logistic regression: Discriminative parameter learning of belief net classifiers. Machine Learning 59, 297–322 (2005)
Article MATH Google Scholar
Jing, Y., Pavlovic, V., Rehg, J.M.: Efficient discriminative learning of Bayesian network classifiers via boosted augmented naive Bayes. In: Proc. 22nd ICML, Bonn, Germany, pp. 369–376. ACM Press, New York (2005)
Google Scholar
Roos, T., Wettig, H., Grünwald, P., Myllymäki, P., Tirri, H.: On discriminative Bayesian network classifiers and logistic regression. Machine Learning 59, 267–296 (2005)
MATH Google Scholar
Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. In: IJCAI 1993, pp. 1022–1027. AAAI Press, Menlo Park (1993)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Statistics, Columbia University, 1255 Amsterdam Ave, New York, NY, 10027, USA
Aleks Jakulin
IBM T.J. Watson Research Center, 19 Skyline Drive, Hawthorne, NY, 10532, USA
Irina Rish

Authors

Aleks Jakulin
View author publications
You can also search for this author in PubMed Google Scholar
Irina Rish
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Knowledge Engineering Group, Technische Universität Darmstadt,
Johannes Fürnkranz
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jakulin, A., Rish, I. (2006). Bayesian Learning of Markov Network Structure. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science(), vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_22

Download citation

DOI: https://doi.org/10.1007/11871842_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Bayesian Learning of Markov Network Structure

Abstract

Chapter PDF

Similar content being viewed by others

Efficient parameter learning of Bayesian network classifiers

A survey on Bayesian network structure learning from data

Learning Bayesian Network Structures When Discrete and Continuous Variables Are Present

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Bayesian Learning of Markov Network Structure

Abstract

Chapter PDF

Similar content being viewed by others

Efficient parameter learning of Bayesian network classifiers

A survey on Bayesian network structure learning from data

Learning Bayesian Network Structures When Discrete and Continuous Variables Are Present

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation