Skip to main content

Probabilistic Inductive Querying Using ProbLog

  • Chapter
  • First Online:
Inductive Databases and Constraint-Based Data Mining

Abstract

We study how probabilistic reasoning and inductive querying can be combined within ProbLog, a recent probabilistic extension of Prolog. ProbLog can be regarded as a database system that supports both probabilistic and inductive reasoning through a variety of querying mechanisms. After a short introduction to ProbLog, we provide a survey of the different types of inductive queries that ProbLog supports, and show how it can be applied to the mining of large biological networks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Suciu, D.: Probabilistic databases. SIGACT News 39(2) (2008) 111–124

    Article  Google Scholar 

  2. Imielinski, T., Mannila, H.: A database perspective on knowledge discovery. Commun. ACM 39(11) (1996) 58–64

    Article  Google Scholar 

  3. De Raedt, L., Kimmig, A., Toivonen, H.: ProbLog: A probabilistic Prolog and its application in link discovery. In Veloso, M., ed.: IJCAI. (2007) 2462–2467

    Google Scholar 

  4. Dantsin, E.: Probabilistic logic programs and their semantics. In Voronkov, A., ed.: Proc. 1st Russian Conf. on Logic Programming. Volume 592 of LNCS. (1992) 152–164

    Google Scholar 

  5. Dalvi, N.N., Suciu, D.: Efficient query evaluation on probabilistic databases. In: VLDB. (2004) 864–875

    Google Scholar 

  6. Fuhr, N.: Probabilistic Datalog: Implementing logical information retrieval for advanced applications. Journal of the American Society for Information Science 51(2) (2000) 95–110

    Article  MathSciNet  Google Scholar 

  7. Poole, D.: Probabilistic Horn abduction and Bayesian networks. Artificial Intelligence 64 (1993) 81–129

    Article  MATH  Google Scholar 

  8. Sato, T., Kameya, Y.: Parameter learning of logic programs for symbolic-statistical modeling. J. Artif. Intell. Res. (JAIR) 15 (2001) 391–454

    MATH  MathSciNet  Google Scholar 

  9. Poole, D.: Logic programming, abduction and probability. New Generation Computing 11 (1993) 377–400

    Article  MATH  Google Scholar 

  10. Wrobel, S.: First order theory refinement. In De Raedt, L., ed.: Advances in Inductive Logic Programming. IOS Press, Amsterdam (1996) 14 – 33

    Google Scholar 

  11. Richards, B.L., Mooney, R.J.: Automated refinement of first-order horn-clause domain theories. Machine Learning 19(2) (1995) 95–131

    Google Scholar 

  12. Sevon, P., Eronen, L., Hintsanen, P., Kulovesi, K., Toivonen, H.: Link discovery in graphs derived from biological databases. In: DILS. Volume 4075 of LNCS., Springer (2006) 35–49

    Google Scholar 

  13. Kimmig, A., De Raedt, L., Toivonen, H.: Probabilistic explanation based learning. In Kok, J.N., Koronacki, J., de Mantaras, R.L., Matwin, S., Mladenic, D., Skowron, A., eds.: 18th European Conference on Machine Learning (ECML). Volume 4701 of LNCS., Springer (2007) 176–187

    Google Scholar 

  14. Sato, T.: A statistical learning method for logic programs with distribution semantics. In Sterling, L., ed.: ICLP, MIT Press (1995) 715–729

    Google Scholar 

  15. Bryant, R.E.: Graph-based algorithms for boolean function manipulation. IEEE Trans. Computers 35(8) (1986) 677–691

    Article  MATH  Google Scholar 

  16. Poole, D.: Abducing through negation as failure: stable models within the independent choice logic. Journal of Logic Programming 44(1–3) (2000) 5–35

    Article  MathSciNet  Google Scholar 

  17. Lloyd, J.W.: Foundations of Logic Programming. 2. edn. Springer, Berlin (1989)

    Google Scholar 

  18. Valiant, L.G.: The complexity of enumeration and reliability problems. SIAM Journal on Computing 8(3) (1979) 410–421

    Article  MATH  MathSciNet  Google Scholar 

  19. Gutmann, B., Kimmig, A., De Raedt, L., Kersting, K.: Parameter learning in probabilistic databases: A least squares approach. In Daelemans, W., Goethals, B., Morik, K., eds.: Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2008), Part I. Volume 5211 of LNCS (Lecture Notes In Computer Science)., Antwerp, Belgium, Springer Berlin/Heidelberg (September 2008) 473–488

    Google Scholar 

  20. Kimmig, A., Santos Costa, V., Rocha, R., Demoen, B., De Raedt, L.: On the Efficient Execution of ProbLog Programs. In de la Banda, M.G., Pontelli, E., eds.: International Conference on Logic Programming. Number 5366 in LNCS, Springer (December 2008) 175–189

    Google Scholar 

  21. Santos Costa, V.: The life of a logic programming system. In de la Banda, M.G., Pontelli, E., eds.: Logic Programming, 24th International Conference, ICLP 2008, Udine, Italy, December 9–13 2008, Proceedings. Volume 5366 of Lecture Notes in Computer Science., Springer (2008) 1–6

    Google Scholar 

  22. Fredkin, E.: Trie Memory. Communications of the ACM 3 (1962) 490–499

    Article  Google Scholar 

  23. Ramakrishnan, I.V., Rao, P., Sagonas, K., Swift, T., Warren, D.S.: Efficient Access Mechanisms for Tabled Logic Programs. Journal of Logic Programming 38(1) (January 1999) 31–54

    Article  MATH  Google Scholar 

  24. Mantadelis, T., Demoen, B., Janssens, G.: A simplified fast interface for the use of CUDD for binary decision diagrams (2008) http://people.cs.kuleuven.be/˜theofrastos.mantadelis/tools/simplecudd.html.

  25. Kimmig, A., Demoen, B., De Raedt, L., Santos Costa, V., Rocha, R.: On the implementation of the probabilistic logic programming language ProbLog. Theory and Practice of Logic Programming (TPLP) (2010) to appear; https://lirias.kuleuven.be/handle/123456789/259607.

  26. Mitchell, T.M., Keller, R.M., Kedar-Cabelli, S.T.: Explanation-based generalization: A unifying view. Machine Learning 1(1) (1986) 47–80

    Google Scholar 

  27. DeJong, G., Mooney, R.J.: Explanation-based learning: An alternative view. Machine Learning 1(2) (1986) 145–176

    Google Scholar 

  28. Hirsh, H.: Explanation-based generalization in a logic-programming environment. In: IJCAI’87: Proceedings of the 10th international joint conference on Artificial intelligence, San Francisco, CA, USA, Morgan Kaufmann Publishers Inc. (1987) 221–227

    Google Scholar 

  29. Van Harmelen, F., Bundy, A.: Explanation-based generalisation = partial evaluation. Artificial Intelligence 36(3) (1988) 401–412

    Article  MATH  Google Scholar 

  30. Langley, P.: Unifying themes in empirical and explanation-based learning. In: Proceedings of the sixth international workshop on Machine learning, San Francisco, CA, USA, Morgan Kaufmann Publishers Inc. (1989) 2–4

    Google Scholar 

  31. Kimmig, A., De Raedt, L.: Local query mining in a probabilistic Prolog. In Boutilier, C., ed.: International Joint Conference on Artificial Intelligence. (2009) 1095–1100

    Google Scholar 

  32. Dehaspe, L., Toivonen, H., King, R.D.: Finding frequent substructures in chemical compounds. In Agrawal, R., Stolorz, P., Piatetsky-Shapiro, G., eds.: Proceedings of the 4th ACMSIGKDD International Conference on Knowledge Discovery and Data Mining, AAAI Press (1998) 30–36

    Google Scholar 

  33. Tsur, S., Ullman, J.D., Abiteboul, S., Clifton, C., Motwani, R., Nestorov, S., Rosenthal, A.: Query flocks: A generalization of association-rule mining. In: SIGMOD Conference. (1998) 1–12

    Google Scholar 

  34. De Raedt, L., Ramon, J.: Condensed representations for inductive logic programming. In Dubois, D., Welty, C.A., Williams, M.A., eds.: Proceedings of the 9th International Conference on Principles and Practice of Knowledge Representation. AAAI Press (2004) 438–446

    Google Scholar 

  35. Esposito, F., Fanizzi, N., Ferilli, S., Semeraro, G.: Ideal refinement under object identity. In Langley, P., ed.: Proceedings of the 17th International Conference on Machine Learning, Morgan Kaufmann (2000) 263–270

    Google Scholar 

  36. Morishita, S., Sese, J.: Traversing itemset lattice with statistical metric pruning. In: Proceedings of the 19th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, ACM Press (2000) 226–236

    Google Scholar 

  37. Mannila, H., Toivonen, H.: Levelwise search and borders of theories in knowledge discovery. Data Mining and Knowledge Discovery 1(3) (1997) 241–258

    Article  Google Scholar 

  38. Chui, C.K., Kao, B., Hung, E.: Mining frequent itemsets from uncertain data. In Zhou, Z.H., Li, H., Yang, Q., eds.: PAKDD. Volume 4426 of Lecture Notes in Computer Science., Springer (2007) 47–58

    Google Scholar 

  39. De Raedt, L., Kersting, K., Kimmig, A., Revoredo, K., Toivonen, H.: Compressing probabilistic Prolog programs. Machine Learning 70(2–3) (2008) 151–168

    Article  Google Scholar 

  40. Koppel, M., Feldman, R., Segre, A.M.: Bias-driven revision of logical domain theories. J. Artif. Intell. Res. (JAIR) 1 (1994) 159–208

    Google Scholar 

  41. Zelle, J., Mooney, R.: Inducing deterministic Prolog parsers from treebanks: A machine learning approach. In: Proceedings of the 12th National Conference on Artificial Intelligence (AAAI-94). (1994) 748–753

    Google Scholar 

  42. Paes, A., Revoredo, K., Zaverucha, G., Santos Costa, V.: Probabilistic first-order theory revision from examples. In Kramer, S., Pfahringer, B., eds.: ILP. Volume 3625 of Lecture Notes in Computer Science., Springer (2005) 295–311

    Google Scholar 

  43. Kersting, K., De Raedt, L.: Basic principles of learning bayesian logic programs. [50] 189–221

    Google Scholar 

  44. Gupta, R., Sarawagi, S.: Creating probabilistic databases from information extraction models. In: VLDB. (2006) 965–976

    Google Scholar 

  45. Chen, J., Muggleton, S., Santos, J.: Learning probabilistic logic models from probabilistic examples (extended abstract). In: ILP. (2007) 22–23

    Google Scholar 

  46. Gutmann, B., Kimmig, A., Kersting, K., De Raedt, L.: Parameter estimation in ProbLog from annotated queries. Technical Report CW 583, Department of Computer Science, Katholieke Universiteit Leuven, Belgium (April 2010)

    Google Scholar 

  47. Saul, L., Jaakkola, T., Jordan, M.: Mean field theory for sigmoid belief networks. JAIR 4 (1996) 61–76

    MATH  Google Scholar 

  48. Perez-Iratxeta, C., Bork, P., Andrade, M.: Association of genes to genetically inherited diseases using data mining. Nature Genetics 31 (2002) 316–319

    Google Scholar 

  49. Getoor, L., Taskar, B., eds.: Statistical Relational Learning. The MIT press (2007)

    Google Scholar 

  50. De Raedt, L., Frasconi, P., Kersting, K., Muggleton, S., eds.: Probabilistic Inductive Logic Programming — Theory and Applications. Volume 4911 of Lecture Notes in Artificial Intelligence. Springer (2008)

    Google Scholar 

  51. De Raedt, L., Demoen, B., Fierens, D., Gutmann, B., Janssens, G., Kimmig, A., Landwehr, N., Mantadelis, T., Meert, W., Rocha, R., Santos Costa, V., Thon, I., Vennekens, J.: Towards digesting the alphabet-soup of statistical relational learning. In Roy, D., Winn, J., McAllester, D., Mansinghka, V., Tenenbaum, J., eds.: Proceedings of the 1st Workshop on Probabilistic Programming: Universal Languages, Systems and Applications, Whistler, Canada (December 2008)

    Google Scholar 

  52. Riguzzi, F.: A top down interpreter for LPAD and CP-logic. In: AI*IA 2007: Artificial Intelligence and Human-Oriented Computing. Volume 4733 of LNCS. (2007)

    Google Scholar 

  53. Ishihata, M., Kameya, Y., Sato, T., ichi Minato, S.: Propositionalizing the EM algorithm by BDDs. In Železný, F., Lavrač, N., eds.: Proceedings of Inductive Logic Programming (ILP 2008), Late Breaking Papers, Prague, Czech Republic (September 2008) 44–49

    Google Scholar 

  54. Muggleton, S.: Stochastic logic programs. In De Raedt, L., ed.: ILP. (1995)

    Google Scholar 

  55. Santos Costa, V., Page, D., Cussens, J.: Clp(bn): Constraint logic programming for probabilistic knowledge. In: In Proceedings of the 19th Conference on Uncertainty in Artificial Intelligence (UAI03, Morgan Kaufmann (2003) 517–524

    Google Scholar 

  56. Vennekens, J., Verbaeten, S., Bruynooghe, M.: Logic programs with annotated disjunctions. In Demoen, B., Lifschitz, V., eds.: ICLP. Volume 3132 of LNCS., Springer, Heidelberg (2004) 431–445

    Google Scholar 

  57. Gutmann, B., Thon, I., De Raedt, L.: Learning the parameters of probabilistic logic programs from interpretations. Technical Report CW 584, Department of Computer Science, Katholieke Universiteit Leuven, Belgium (April 2010)

    Google Scholar 

  58. Domingos, P., Lowd, D.: Markov Logic: an interface layer for AI. Morgan & Claypool (2009)

    Google Scholar 

  59. Getoor, L., Friedman, N., Koller, D., Pfeffer, A.: Learning probabilistic relational models. In Džeroski, S., Lavrač, N., eds.: Relational Data Mining. Springer (2001) 307–335

    Google Scholar 

  60. De Raedt, L., Thon, I.: Probabilistic rule learning. Technical Report CW 580, Department of Computer Science, Katholieke Universiteit Leuven, Belgium (April 2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luc De Raedt .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

De Raedt, L., Kimmig, A., Gutmann, B., Kersting, K., Costa, V., Toivonen, H. (2010). Probabilistic Inductive Querying Using ProbLog. In: Džeroski, S., Goethals, B., Panov, P. (eds) Inductive Databases and Constraint-Based Data Mining. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-7738-0_10

Download citation

  • DOI: https://doi.org/10.1007/978-1-4419-7738-0_10

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4419-7737-3

  • Online ISBN: 978-1-4419-7738-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics