Discovering Relational Phrases for Qualia Roles Through Open Information Extraction

Siragusa, Giovanni; Leone, Valentina; Di Caro, Luigi; Schifanella, Claudio

doi:10.1007/978-3-319-69548-8_6

Giovanni Siragusa¹¹,
Valentina Leone¹¹,
Luigi Di Caro¹¹ &
…
Claudio Schifanella¹¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 786))

Included in the following conference series:

International Conference on Knowledge Engineering and the Semantic Web

740 Accesses

Abstract

In Generative Lexicon [17], Pustejovsky defined the Qualia Structure which organizes the semantic meaning carried by nouns through four roles: formal, telic, agentive and constitutive. Despite their expressive power, to the best of our knowledge no actual NLP system uses qualia structures possibly due to the large effort needed to construct such knowledge bases. Some researchers have tried to circumvent this obstacle using lexico-syntactic patterns based on Hearst idea [11]. In this paper, we propose an Open Information Extraction method to automatically acquire a set of relational phrases from a large corpus, starting with a small set of nouns and their qualia elements. Our idea is that the relational phrases unveil the relations between the nouns and their qualia elements. We compared our method with Reverb [10], Ollie [18] and ClausIE [9] in terms of patterns quality and the relative qualia elements extraction.

The major part of this work has been carried out by the first two authors, equally. The work has been funded by the project Semantic Burst: Embodying Semantic Resources in Vector Space Models, financed by Compagnia di San Paolo.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Note that we have two possible cases: (argument1 = noun, argument2 = qualia element) and (argument1 = qualia element, argument2 = noun).
2.
An exception is ClausIE [9] which can extract pairs of the form (argument, relational phrase) and N-ary relations composed by a triple and a set of additional information.
3.
http://www.unimorph.org.
4.
We used Mate-Tools parser (http://code.google.com/p/mate-tools).
5.
If the term referring to the argument has an empty babelnetids attribute in the qualia structure, we consider the argument as correct.
6.
The resource is available at the following url: http://lcl.uniroma1.it/babelfied-wikipedia/.
7.
We could not manage to solve this issue with the help of the available documentation.
8.
In case of a missing score, we assumed it as 0.
9.
http://www.clips.ua.ac.be/pages/pattern-en.
10.
In case of a missing evaluation of a qualia element, we assumed that it does not belong to the role.

References

Akbik, A., Löser, A.: Kraken: N-ary facts in open information extraction. In: Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction, pp. 52–56. Association for Computational Linguistics (2012)
Google Scholar
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The berkeley framenet project. In: Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, vol. 1, pp. 86–90. Association for Computational Linguistics (1998)
Google Scholar
Banko, M., Cafarella, M.J., Soderland, S., Broadhead, M., Etzioni, O.: Open information extraction from the web. IJCAI 7, 2670–2676 (2007)
Google Scholar
Bovi, C.D., Telesca, L., Navigli, R.: Large-scale information extraction from textual definitions through deep syntactic and semantic analysis. Trans. Assoc. Comput. Linguist. 3, 529–543 (2015)
Google Scholar
Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka Jr., E.R., Mitchell, T.M.: Toward an architecture for never-ending language learning. In: AAAI, vol. 5, p. 3 (2010)
Google Scholar
Caselli, T., Rise, T., Russo, I.: From glosses to qualia: qualia extraction from senso comune. In: 6th International Conference on Generative Approaches to the Lexicon, p. 37 (2013)
Google Scholar
Cimiano, P., Wenderoth, J.: Automatically learning qualia structures from the web. In: Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition, pp. 28–37. Association for Computational Linguistics (2005)
Google Scholar
Cimiano, P., Wenderoth, J.: Automatic acquisition of ranked qualia structures from the web. In: ACL 2007, Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (2007)
Google Scholar
Del Corro, L., Gemulla, R.: Clausie: clause-based open information extraction. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 355–366. ACM (2013)
Google Scholar
Fader, A., Soderland, S., Etzioni, O.: Identifying relations for open information extraction. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1535–1545. Association for Computational Linguistics (2011)
Google Scholar
Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th Conference on Computational Linguistics, vol. 2, pp. 539–545. Association for Computational Linguistics (1992)
Google Scholar
Miller, G.A.: Wordnet: a lexical database for english. Commun. ACM 38(11), 39–41 (1995)
Article Google Scholar
Moro, A., Navigli, R.: Integrating syntactic and semantic analysis into the open information extraction paradigm. In: IJCAI (2013)
Google Scholar
Nakashole, N., Weikum, G., Suchanek, F.: Patty: a taxonomy of relational patterns with semantic types. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 1135–1145. Association for Computational Linguistics (2012)
Google Scholar
Navigli, R., Ponzetto, S.P.: Babelnet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artif. Intell. 193, 217–250 (2012)
Article MathSciNet MATH Google Scholar
Oltramari, A., Vetere, G., Chiari, I., Jezek, E., Zanzotto, F.M., Nissim, M., Gangemi, A.: Senso comune: a collaborative knowledge resource for italian. In: Gurevych, I., Kim, J. (eds.) The Peoples Web Meets NLP. Theory and Applications of Natural Language Processing, pp. 45–67. Springer, Heidelberg (2013). doi:10.1007/978-3-642-35085-6_2
Pustejovsky, J.: The generative lexicon. Comput. Linguist. 17(4), 409–441 (1991)
Google Scholar
Schmitz, M., Bart, R., Soderland, S., Etzioni, O., et al.: Open language learning for information extraction. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 523–534. Association for Computational Linguistics (2012)
Google Scholar
Scozzafava, F., Raganato, A., Moro, A., Navigli, R.: Automatic identification and disambiguation of concepts and named entities in the multilingual wikipedia. In: Gavanelli, M., Lamma, E., Riguzzi, F. (eds.) AI*IA 2015. LNCS, vol. 9336, pp. 357–366. Springer, Cham (2015). doi:10.1007/978-3-319-24309-2_27
Chapter Google Scholar
Speer, R., Havasi, C.: Representing general relational knowledge in conceptnet 5. In: LREC, pp. 3679–3686 (2012)
Google Scholar
Wu, F., Weld, D.S.: Open information extraction using wikipedia. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 118–127. Association for Computational Linguistics (2010)
Google Scholar
Yamada, I., Baldwin, T.: Automatic discovery of telic and agentive roles from corpus data. In: PACLIC (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Turin, Turin, Italy
Giovanni Siragusa, Valentina Leone, Luigi Di Caro & Claudio Schifanella

Authors

Giovanni Siragusa
View author publications
You can also search for this author in PubMed Google Scholar
Valentina Leone
View author publications
You can also search for this author in PubMed Google Scholar
Luigi Di Caro
View author publications
You can also search for this author in PubMed Google Scholar
Claudio Schifanella
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Giovanni Siragusa .

Editor information

Editors and Affiliations

West Pomeranian University of Technology in Szczecin, Szczecin, Poland
Przemysław Różewski
University of Bonn, Bonn, Germany
Christoph Lange

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Siragusa, G., Leone, V., Di Caro, L., Schifanella, C. (2017). Discovering Relational Phrases for Qualia Roles Through Open Information Extraction. In: Różewski, P., Lange, C. (eds) Knowledge Engineering and Semantic Web. KESW 2017. Communications in Computer and Information Science, vol 786. Springer, Cham. https://doi.org/10.1007/978-3-319-69548-8_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-69548-8_6
Published: 18 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69547-1
Online ISBN: 978-3-319-69548-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics