
Constructing Markov Logic Networks from First-Order Default Rules

  • Conference paper

Inductive Logic Programming (ILP 2015)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9575)

Abstract

Expert knowledge can often be represented using default rules of the form “if A then typically B”. In a probabilistic framework, such default rules can be seen as constraints on what should be derivable by MAP inference. We exploit this idea for constructing a Markov logic network \(\mathcal {M}\) from a set of first-order default rules D, such that MAP inference from \(\mathcal {M}\) exactly corresponds to default reasoning from D, where we view first-order default rules as templates for the construction of propositional default rules. In particular, to construct appropriate Markov logic networks, we lift three standard methods for default reasoning. The resulting Markov logic networks could then be refined based on available training data. Our method thus offers a convenient way of using expert knowledge for constraining or guiding the process of learning Markov logic networks.
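To make the MAP-inference reading concrete, the following is a minimal sketch (a toy example of ours, not the paper's transformations): a brute-force MAP check over a propositional "Tweety" MLN whose weights are picked by hand to mimic the specificity ordering that the transformations in the paper derive automatically. Here \((\mathcal {M}, \{\alpha \}) \vdash _{\textit{MAP}}\beta \) is read as "\(\beta \) holds in every most probable world satisfying the evidence \(\alpha \)".

```python
# A minimal sketch (ours, not the paper's implementation): brute-force
# MAP inference over a toy propositional MLN encoding the defaults
#   bird |~ flies,  penguin |~ ~flies,  penguin -> bird (hard rule),
# with hand-picked weights mimicking the specificity ordering.
from itertools import product

ATOMS = ["bird", "penguin", "flies"]
INF = float("inf")  # marker for hard rules

# (weight, clause) pairs; a clause is a test on a world (dict atom -> bool).
MLN = [
    (1.0, lambda w: not w["bird"] or w["flies"]),         # bird => flies
    (2.0, lambda w: not w["penguin"] or not w["flies"]),  # penguin => ~flies
    (INF, lambda w: not w["penguin"] or w["bird"]),       # penguin => bird
]

def map_worlds(evidence):
    """Most probable worlds consistent with the evidence and hard rules."""
    scored = []
    for bits in product([False, True], repeat=len(ATOMS)):
        w = dict(zip(ATOMS, bits))
        if any(w[a] != v for a, v in evidence.items()):
            continue  # contradicts the evidence
        if any(wt == INF and not cl(w) for wt, cl in MLN):
            continue  # violates a hard rule
        scored.append((sum(wt for wt, cl in MLN if wt != INF and cl(w)), w))
    best = max(s for s, _ in scored)
    return [w for s, w in scored if s == best]

# (M, {bird}) |-_MAP flies, while (M, {penguin}) |-_MAP ~flies:
print(all(w["flies"] for w in map_worlds({"bird": True})))         # True
print(all(not w["flies"] for w in map_worlds({"penguin": True})))  # True
```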


Notes

  1. Here, we are just ordering the constants by the lexical ordering of their names.

  2. We will omit “w.r.t. \(\mathcal {I}\)” when it is clear from the context.

  3. With a slight abuse of terminology, we will call \(\varDelta _1^* \cup \dots \cup \varDelta _k^*\) the partition of \(\varDelta \cup \varTheta \) even though it is, strictly speaking, only a partition of \(\varDelta ^*\).

  4. Due to numerical issues, existing MLN systems are not able to work with weights as large as those that are sometimes produced; we have therefore implemented an MLN system based on cutting-plane MAP inference which can work with arbitrarily large weights.

  5. https://github.com/supertweety/mln2poss.

  6. Our implementation is based on a cutting-plane method for MAP inference, implemented using the SAT4J library [5] and the MLN system Tuffy [15].

  7. Note that using AUC as an evaluation metric would not make sense in this case because of the way our approach constructs the MLNs. The construction can produce MLNs which make sensible predictions when used together with MAP inference but which need not be meaningful as probability distributions for the given datasets. After all, our MLN construction methods do not assume any information from which probabilities could be inferred, except the qualitative information on rankings of possible worlds expressed by the default rules.

  8. We had to remove \(D_3\) for efficiency reasons, though.

  9. For brevity we omit hard rules here, because generalizing the proofs to include hard rules is rather straightforward, but a bit too verbose.

  10. Recall that the formulas in \(\varDelta ^*\) are typed according to interchangeability of constants. The groundings must respect the typing information; this will be the case whenever we speak of groundings in this section.

  11. However, this does not mean that we need to ground this theory completely in order to solve it; using cutting-plane inference, for instance, we can avoid the need to ground it completely (see the sketch below).
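The following is a hedged sketch of the generic cutting-plane loop referred to in notes 4, 6 and 11 (cf. Riedel [21]); it is not the authors' SAT4J/Tuffy-based code, and `solve_weighted_maxsat` and `violated_groundings` are hypothetical helpers standing in for a weighted MaxSAT solver and a relational grounder.

```python
# Hedged sketch of cutting-plane MAP inference (cf. Riedel [21]); not
# the authors' implementation. `solve_weighted_maxsat` and
# `violated_groundings` are hypothetical stand-ins for a MaxSAT solver
# (e.g. one built on SAT4J) and a relational grounder.

def cutting_plane_map(mln, evidence, solve_weighted_maxsat, violated_groundings):
    """Lazily grounded MAP inference.

    Starts from the evidence alone, repeatedly solves the current
    partial ground program, and adds only those ground formulas that
    the candidate solution violates; it terminates when the candidate
    violates no further groundings, i.e. when it is a MAP state of the
    fully grounded program even though that program was never built.
    """
    ground_program = list(evidence)
    while True:
        candidate = solve_weighted_maxsat(ground_program)
        cuts = violated_groundings(mln, candidate)
        if not cuts:
            return candidate
        ground_program.extend(cuts)
```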

References

  1. Benferhat, S., Bonnefon, J.F., da Silva Neves, R.: An overview of possibilistic handling of default reasoning, with experimental studies. Synthese 146(1–2), 53–70 (2005)

  2. Benferhat, S., Dubois, D., Prade, H.: Nonmonotonic reasoning, conditional objects and possibility theory. Artif. Intell. 92(1–2), 259–276 (1997)

  3. Benferhat, S., Dubois, D., Prade, H.: Practical handling of exception-tainted rules and independence information in possibilistic logic. Appl. Intell. 9(2), 101–127 (1998)

  4. Benferhat, S., Dubois, D., Prade, H.: Possibilistic and standard probabilistic semantics of conditional knowledge bases. J. Log. Comput. 9(6), 873–895 (1999)

  5. Le Berre, D., Parrain, A.: The Sat4j library, release 2.2. J. Satisfiability, Boolean Model. Comput. 7, 50–64 (2010)

  6. de Salvo Braz, R., Amir, E., Roth, D.: Lifted first-order probabilistic inference. In: Kaelbling, L.P., Saffiotti, A. (eds.) Proceedings of the 19th International Joint Conference on Artificial Intelligence, p. 1319 (2005)

  7. de Saint-Cyr, F.D., Lang, J., Schiex, T.: Penalty logic and its link with Dempster-Shafer theory. In: Proceedings of the 10th International Conference on Uncertainty in Artificial Intelligence, pp. 204–211 (1994)

  8. Friedman, N., Halpern, J.Y., Koller, D.: First-order conditional logic for default reasoning revisited. ACM Trans. Comput. Log. 1(2), 175–207 (2000)

  9. Geffner, H., Pearl, J.: Conditional entailment: bridging two approaches to default reasoning. Artif. Intell. 53(2), 209–244 (1992)

  10. Goldszmidt, M., Morris, P., Pearl, J.: A maximum entropy approach to nonmonotonic reasoning. IEEE Trans. Pattern Anal. Mach. Intell. 15(3), 220–232 (1993)

  11. Halpern, J.Y.: An analysis of first-order logics of probability. Artif. Intell. 46(3), 311–350 (1990)

  12. Kern-Isberner, G., Thimm, M.: A ranking semantics for first-order conditionals. In: Proceedings of the 20th European Conference on Artificial Intelligence, pp. 456–461 (2012)

  13. Kraus, S., Lehmann, D., Magidor, M.: Nonmonotonic reasoning, preferential models and cumulative logics. Artif. Intell. 44(1–2), 167–207 (1990)

  14. Lehmann, D., Magidor, M.: What does a conditional knowledge base entail? Artif. Intell. 55(1), 1–60 (1992)

  15. Niu, F., Ré, C., Doan, A., Shavlik, J.W.: Tuffy: scaling up statistical inference in Markov logic networks using an RDBMS. PVLDB 4(6), 373–384 (2011)

  16. Noessner, J., Niepert, M., Stuckenschmidt, H.: RockIt: exploiting parallelism and symmetry for MAP inference in statistical relational models. In: Proceedings of the 27th AAAI Conference on Artificial Intelligence (2013)

  17. Pápai, T., Ghosh, S., Kautz, H.: Combining subjective probabilities and data in training Markov logic networks. In: Flach, P.A., De Bie, T., Cristianini, N. (eds.) ECML PKDD 2012, Part I. LNCS, vol. 7523, pp. 90–105. Springer, Heidelberg (2012)

  18. Pearl, J.: Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmann, Burlington (1988)

  19. Pearl, J.: System Z: a natural ordering of defaults with tractable applications to nonmonotonic reasoning. In: Proceedings of the 3rd Conference on Theoretical Aspects of Reasoning about Knowledge, pp. 121–135 (1990)

  20. Richardson, M., Domingos, P.: Markov logic networks. Mach. Learn. 62(1–2), 107–136 (2006)

  21. Riedel, S.: Improving the accuracy and efficiency of MAP inference for Markov logic. In: Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence, pp. 468–475 (2008)

  22. Tversky, A., Kahneman, D.: Extensional versus intuitive reasoning: the conjunction fallacy in probability judgment. Psychol. Rev. 90(4), 293 (1983)


Acknowledgement

We thank the anonymous reviewers for their detailed comments. This work has been supported by a grant from the Leverhulme Trust (RPG-2014-164).

Author information


Corresponding author

Correspondence to Ondřej Kuželka.


A Proofs

Here we provide formal justifications for the transformations presented in this paper (see footnote 9). We start by proving correctness of the ground transformations.

Proposition 1

Let \(\varDelta \) be a set of default rules. Let \(\varDelta ^{rat}\), \(\varDelta ^{lex}\) and \(\varDelta ^{ent}\) be the rational, lexicographic and maximum entropy closure of \(\varDelta \), respectively. Let \(\mathcal {M}^{rat}\), \(\mathcal {M}^{lex}\) and \(\mathcal {M}^{ent}\) be Markov logic networks obtained from \(\varDelta \) by Transformations 1, 2 and 3, respectively. Then the following holds for any default rule \(\alpha {\,\mid \!\sim \,}\beta \):

  1. \(\alpha {\,\mid \!\sim \,}\beta \in \varDelta ^{rat}\) if and only if \((\mathcal {M}^{rat}, \{ \alpha \}) \vdash _{\textit{MAP}}\beta \),

  2. \(\alpha {\,\mid \!\sim \,}\beta \in \varDelta ^{lex}\) if and only if \((\mathcal {M}^{lex}, \{ \alpha \}) \vdash _{\textit{MAP}}\beta \),

  3. \(\alpha {\,\mid \!\sim \,}\beta \in \varDelta ^{ent}\) if and only if \((\mathcal {M}^{ent}, \{ \alpha \}) \vdash _{\textit{MAP}}\beta \).

Proof

Throughout the proof, let \(\varDelta = \varDelta _1 \cup \varDelta _2 \cup \dots \cup \varDelta _k\) be the Z-ordering of \(\varDelta \).

1. Let \(\alpha {\,\mid \!\sim \,}\beta \in \varDelta ^{rat}\) be a default rule. Let j be the smallest index such that \(\varDelta _{\alpha }^{rat} = \{\lnot \gamma \vee \delta \mid \gamma {\,\mid \!\sim \,}\delta \in \varDelta _j \cup \dots \cup \varDelta _k \} \cup \{ \alpha \}\) is consistent. Recall that \(\alpha {\,\mid \!\sim \,}\beta \in \varDelta ^{rat}\) if and only if \(\varDelta _\alpha ^{rat} \models \beta \). By the construction of the MLN \(\mathcal {M}^{rat}\), it must hold that \((\mathcal {M}^{rat},\{ \alpha \}) \vdash _{\textit{MAP}}\lnot a_i\) for every \(i < j\), and also \((\mathcal {M}^{rat},\{ \alpha \}) \vdash _{\textit{MAP}}a_i\) for all \(i \ge j\). Therefore every \(\lnot \gamma \vee \delta \) with \(\gamma {\,\mid \!\sim \,}\delta \in \varDelta _i\), \(i \ge j\), must be true in all most probable worlds of \((\mathcal {M}^{rat},\{\alpha \})\). But then, necessarily, if \(\varDelta _\alpha ^{rat} \models \beta \) then \((\mathcal {M}^{rat},\{ \alpha \}) \vdash _{\textit{MAP}}\beta \). To show the other direction of the implication, assume that \((\mathcal {M}^{rat},\{ \alpha \}) \vdash _{\textit{MAP}}\beta \). Using essentially the same reasoning as for the first direction, we can then show that the set of formulas \(\lnot \gamma \vee \delta \) which must be satisfied in all most probable worlds of \((\mathcal {M}^{rat},\{\alpha \})\) is equivalent to the set of formulas in \(\varDelta _\alpha ^{rat}\).
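For reference, here is a hedged propositional sketch of the Z-ordering (tolerance-based partitioning, Pearl [19]) used in step 1 above: a rule is placed in the lowest stratum whose remaining rules tolerate it. The representation and the brute-force satisfiability test are ours, chosen for brevity.

```python
# Hedged sketch (ours) of Z-ordering for propositional defaults.
# A rule (alpha, beta), read "alpha |~ beta", is *tolerated* by a set
# of rules if alpha & beta is satisfiable together with the material
# counterparts ~gamma v delta of all rules in the set.
from itertools import product

def satisfiable(tests, atoms):
    """Brute-force SAT over dict-valued worlds; fine for tiny examples."""
    return any(all(t(dict(zip(atoms, bits))) for t in tests)
               for bits in product([False, True], repeat=len(atoms)))

def tolerated(rule, rules, atoms):
    alpha, beta = rule
    material = [lambda w, g=g, d=d: not g(w) or d(w) for g, d in rules]
    return satisfiable(material + [alpha, beta], atoms)

def z_ordering(rules, atoms):
    """Partition rules into strata Delta_1, ..., Delta_k."""
    strata, remaining = [], list(rules)
    while remaining:
        stratum = [r for r in remaining if tolerated(r, remaining, atoms)]
        if not stratum:
            raise ValueError("default theory is not consistent")
        strata.append(stratum)
        remaining = [r for r in remaining if r not in stratum]
    return strata
```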

2. It holds that \(\alpha {\,\mid \!\sim \,}\beta \in \varDelta ^{lex}\) if and only if \(\beta \) is true in all lex-preferred models of \(\alpha \), i.e. \(\forall \omega \in \llbracket \alpha \rrbracket : (\omega \not \models \beta ) \Rightarrow \exists \omega '\in \llbracket \alpha \rrbracket : \omega ' \prec \omega \), where \(\prec \) is the lex-preference relation based on Z-ordering defined in Sect. 2.2. What we need to show is that for any possible worlds \(\omega \), \(\omega '\) it holds that \(\omega \prec \omega '\) if and only if \(P_{\mathcal {M}^{lex}}(\omega ) > P_{\mathcal {M}^{lex}}(\omega ')\), where \(P_{\mathcal {M}^{lex}}\) is the probability given by the MLN \(\mathcal {M}^{lex}\); correctness of the lexicographic transformation then follows. This follows immediately from the way we set the weights in Transformation 2, as the penalty for not satisfying one formula corresponding to a default rule in \(\varDelta _i\) is greater than the sum of the penalties for not satisfying all formulas corresponding to the default rules in \(\bigcup _{j < i} \varDelta _j\).
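As a concrete illustration of this dominance condition (our numbers; not necessarily the exact weights of Transformation 2), one valid choice with \(n_i = |\varDelta _i|\) formulas per stratum is:

```latex
% One weight assignment satisfying the dominance condition in the
% proof above (an illustration; Transformation 2's exact choice may
% differ): with n_i = |\Delta_i| formulas in stratum i, set
\[
  w_1 = 1, \qquad w_{i+1} = 1 + \sum_{j \le i} n_j\, w_j ,
\]
% so that violating a single formula from \Delta_{i+1} is penalized
% more than violating every formula of \Delta_1, \dots, \Delta_i
% combined.
```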

3. This follows directly from the results of [10], in which the maximum entropy closure was introduced; that paper derives an explicit ranking function on possible worlds, which we use directly in the transformation.

Next we show correctness of the non-ground transformations. We start by proving properties of the non-ground counterpart of Z-ordering.

Proposition 2

Let \(\varDelta ^*\) be a default theory and \(\mathcal {C}\) a set of constants (the universe). Let \(\varDelta \) be the set of groundings (see footnote 10) of the default rules from \(\varDelta ^*\), and let \(\varDelta = \varDelta _1 \cup \dots \cup \varDelta _k\) be the Z-ordering of \(\varDelta \). Let \(\varDelta _1^* \cup \dots \cup \varDelta _k^*\) be as defined by Eq. 3. Then a ground default rule \(\alpha {\,\mid \!\sim \,}\beta \) is in \(\varDelta _i\) if and only if a rule isomorphic to \(\textit{variabilize}(\alpha {\,\mid \!\sim \,}\beta )\) is in \(\varDelta _i^*\).

Proof

(Sketch) This proposition follows from the simple observation that Eq. 3 is equivalent to checking whether the ground default rule \(\alpha {\,\mid \!\sim \,}\beta \) is tolerated by the set of groundings of the default rules \(\gamma {\,\mid \!\sim \,}\delta \in \varDelta ^* \setminus (\varDelta ^*_1 \cup \dots \cup \varDelta ^*_{j-1})\), because we explicitly ask there about the existence of a Herbrand model (see footnote 11) with universe \(\mathcal {C}\). Since the answer, i.e. whether the rule is tolerated or not, must be the same for every default rule weakly isomorphic to \(\alpha {\,\mid \!\sim \,}\beta \), this is equivalent to checking the condition for all groundings of \(\textit{variabilize}(\alpha {\,\mid \!\sim \,}\beta )\), which necessarily gives a result equivalent to the Z-ordering performed on the explicitly enumerated groundings. The statement of the proposition follows.

In other words, the above proposition states that if we replace the non-ground rules in the individual \(\varDelta _i^*\)’s by all their groundings, then the resulting partitioning of ground default rules is equivalent to the one obtained by directly Z-ordering the ground default rules in \(\varDelta \).
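For concreteness, here is a hedged sketch of the variabilization step as we read it (the atom representation and helper are ours, for illustration only): each distinct constant of a ground rule is consistently replaced by a fresh variable.

```python
# Hedged sketch (ours) of variabilize: consistently replace each
# distinct constant of a ground rule by a fresh variable. Atoms are
# represented as (predicate, args) tuples for illustration only.

def variabilize(ground_atoms, constants):
    mapping = {}  # constant -> fresh variable, assigned on first use

    def rename(term):
        if term in constants:
            mapping.setdefault(term, "X%d" % len(mapping))
            return mapping[term]
        return term

    return [(pred, tuple(rename(a) for a in args))
            for pred, args in ground_atoms]

# e.g. the ground rule bird(tweety) |~ flies(tweety) becomes
# bird(X0) |~ flies(X0):
print(variabilize([("bird", ("tweety",)), ("flies", ("tweety",))],
                  {"tweety"}))
```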

Proposition 3

Let \(\varDelta ^*\) be a set of non-ground default rules and \(\mathcal {C}\) be a set of constants (universe). Let \(\varDelta ^{rat}\), \(\varDelta ^{lex}\) and \(\varDelta ^{ent}\) be the rational, lexicographic and maximum entropy closure, respectively, of the set of default rules obtained by grounding \(\varDelta ^*\). Let \(\mathcal {M}^{rat}\), \(\mathcal {M}^{lex}\) and \(\mathcal {M}^{ent}\) be Markov logic networks obtained from \(\varDelta ^*\) by Transformations 4, 5 and 6, respectively. Then the following holds for any ground default rule \(\alpha {\,\mid \!\sim \,}\beta \):

  1. \(\alpha {\,\mid \!\sim \,}\beta \in \varDelta ^{rat}\) if and only if \((\mathcal {M}^{rat}, \{ \alpha \}) \vdash _{\textit{MAP}}\beta \),

  2. \(\alpha {\,\mid \!\sim \,}\beta \in \varDelta ^{lex}\) if and only if \((\mathcal {M}^{lex}, \{ \alpha \}) \vdash _{\textit{MAP}}\beta \),

  3. \(\alpha {\,\mid \!\sim \,}\beta \in \varDelta ^{ent}\) if and only if \((\mathcal {M}^{ent}, \{ \alpha \}) \vdash _{\textit{MAP}}\beta \).

Proof

1. This case follows from Propositions 1 and 2 by noticing that the constructed MLNs, when grounded, are the same as what the ground Transformation 1 would produce if applied to all groundings of the default rules from \(\varDelta ^*\).

2. From Proposition 2 we have that, if we ground the MLN produced by Transformation 5, its structure will be identical to what we would obtain by applying Transformation 2 to all groundings of the default rules from \(\varDelta ^*\). While the weights of the formulas are not the same, it is still guaranteed that \(\omega \prec \omega '\) if and only if \(P_{\mathcal {M}^{lex}}(\omega ) > P_{\mathcal {M}^{lex}}(\omega ')\), where \(\prec \) is the lex-preference relation. This is because the term \(|\mathcal {C}|^{|vars(\alpha {\,\mid \!\sim \,}\beta )|}\), which is used to define the weights in Transformation 5, is an upper bound on the number of groundings of a default rule \(\alpha {\,\mid \!\sim \,}\beta \); this implies that the sum of the weights of all groundings of formulas in the MLN which correspond to default rules from \(\bigcup _{i < j} \varDelta _i\) is smaller than the weight of a single formula corresponding to a default rule from \(\varDelta _j\), which is what we need.
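A worked instance of this bound (our numbers, for illustration only):

```latex
% Worked instance of the grounding bound used above (our numbers):
% a default rule with two logical variables over a universe of
% |\mathcal{C}| = 10 constants has at most
\[
  |\mathcal{C}|^{|vars(\alpha {\,\mid\!\sim\,} \beta)|} = 10^{2} = 100
\]
% groundings, so scaling the stratum-j weights to exceed 100 times
% the total weight of all lower strata ensures that one violated
% grounding from \Delta_j outweighs all violated groundings from
% \bigcup_{i<j} \Delta_i.
```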

3. (Sketch) To show the last part of this proposition, we would essentially need to replicate, in more detail, the reasoning from the proof of Proposition 2, because the maximum entropy closure needs to create a partitioning of the set of default rules which refines the Z-ordering. Since no new ideas are needed for this proof, and because of space limitations, we omit the details. The basic idea is the same as for the non-ground Z-ordering: we only process representatives of the non-ground default rules, and we can show that we would obtain an equivalent result if we processed all groundings of the default rules with the procedure from Transformation 3.


Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Kuželka, O., Davis, J., Schockaert, S. (2016). Constructing Markov Logic Networks from First-Order Default Rules. In: Inoue, K., Ohwada, H., Yamamoto, A. (eds) Inductive Logic Programming. ILP 2015. Lecture Notes in Computer Science, vol. 9575. Springer, Cham. https://doi.org/10.1007/978-3-319-40566-7_7

  • DOI: https://doi.org/10.1007/978-3-319-40566-7_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-40565-0

  • Online ISBN: 978-3-319-40566-7
