Abstract
Lifted inference approaches have rendered large, previously intractable probabilistic inference problems quickly solvable by exploiting symmetries to handle whole sets of indistinguishable random variables at once. Still, in many, if not most, situations, training relational models will not benefit from lifting: symmetries within models break easily, since variables become correlated by virtue of depending asymmetrically on evidence. An appealing idea for such situations is to train and recombine local models. This breaks long-range dependencies and makes it possible to exploit lifting within and across the local training tasks. Moreover, it naturally paves the way for online training of relational models. Specifically, we develop the first lifted stochastic gradient optimization method with gain vector adaptation, which processes each lifted piece one after the other. On several datasets, the resulting optimizer converges to a solution of the same quality over an order of magnitude faster, simply because, unlike batch training, it starts optimizing long before having seen the entire mega-example even once.
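The optimization scheme the abstract describes can be illustrated generically. The sketch below is a minimal, hypothetical stand-in, not the paper's lifted algorithm: it runs stochastic gradient descent over local "pieces" of a decomposed model, maintaining a per-parameter gain vector that grows when successive gradient components agree in sign and shrinks when they disagree (a classic adaptive-gain rule). In the lifted setting, each piece would be a local model over a cluster of indistinguishable variables; here the pieces, the gradient function, and all constants are illustrative assumptions.

```python
import numpy as np

def sgd_with_gains(pieces, grad_fn, w, eta=0.1, up=1.05, down=0.5,
                   epochs=30, seed=0):
    """Stochastic gradient descent over data pieces with a per-parameter
    gain vector (hypothetical sketch, not the paper's lifted variant).

    pieces  : list of local training pieces, visited one after the other
    grad_fn : grad_fn(w, piece) -> gradient of the local objective at w
    """
    rng = np.random.default_rng(seed)
    gains = np.ones_like(w)      # per-parameter step-size multipliers
    prev_grad = np.zeros_like(w)
    for _ in range(epochs):
        # shuffle the pieces each pass, then process them one by one
        for i in rng.permutation(len(pieces)):
            g = grad_fn(w, pieces[i])
            # grow gains where the gradient sign is stable,
            # shrink them where it flips (oscillation)
            gains = np.where(g * prev_grad > 0, gains * up, gains)
            gains = np.where(g * prev_grad < 0, gains * down, gains)
            gains = np.clip(gains, 1e-3, 10.0)
            w = w - eta * gains * g
            prev_grad = g
    return w
```

As a toy usage, one can let each piece be a target vector `c` with local objective `||w - c||^2`, so `grad_fn = lambda w, c: 2.0 * (w - c)`; the iterate then settles between the targets, near their mean, long before any batch method would have aggregated all pieces.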
© 2012 Springer-Verlag Berlin Heidelberg
Cite this paper
Ahmadi, B., Kersting, K., Natarajan, S. (2012). Lifted Online Training of Relational Models with Stochastic Gradient Methods. In: Flach, P.A., De Bie, T., Cristianini, N. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2012. Lecture Notes in Computer Science(), vol 7523. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33460-3_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33459-7
Online ISBN: 978-3-642-33460-3