Abstract
Semantic resources (WordNet, Wikidata, BabelNet, ...) offer invaluable knowledge that can be exploited by humans and machines to solve a variety of tasks. Among these, we address here the one called entity set expansion: extend a given a set of words –called seeds– with new ones being of the same “sort”. Differently from classical approaches, we determine “optimal” common categories of the given seeds by analyzing the semantic relations among the objects these seeds refer to. In particular, we define the notion of an entity network to integrate information from different semantic resources, and show how to use such networks to disambiguate word senses. Finally, we propose a proof-of-concept implementation in answer set programming with external predicates to query online semantic resources and perform optimization tasks.
The paper has been partially supported by the Italian Ministry for Economic Development (MISE) under project “PIUCultura – Paradigmi Innovativi per l’Utilizzo della Cultura” (n. F/020016/01-02/X27), and under project “Smarter Solutions in the Big Data World (S2BDW)” (n. F/050389/01-03/X32) funded within the call “HORIZON2020” PON I&C 2014-2020.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
There exist satellite projects for other languages, not integrated with the core system.
- 3.
See online demo at: http://webisadb.webdatacommons.org/webisadb/.
- 4.
- 5.
- 6.
If there are more possible combinations of senses, a user can add more seeds and repeat the process, or just select the intended meaning.
- 7.
See DL navigator at: http://www.cs.man.ac.uk/~ezolin/dl/.
- 8.
- 9.
References
Baader, F., Sertkaya, B., Turhan, A.Y.: Computing the least common subsumer w.r.t. a background terminology. J. Appl. Logic 5(3), 392–420 (2007)
Brewka, G., Eiter, T., Truszczynski, M.: Answer set programming at a glance. Commun. ACM 54(12), 92–103 (2011)
Calimeri, F., Fuscà, D., Perri, S., Zangari, J.: I-DLV: the new intelligent grounder of DLV. Intelligenza Artificiale 11(1), 5–20 (2017)
Camacho-Collados, J., Pilehvar, M.T., Navigli, R.: A unified multilingual semantic representation of concepts. In: Proceedings of ACL 2015, pp. 741–751 (2015)
Carlson, A., Betteridge, J., Wang, R.C., Hruschka Jr., E.R., Mitchell, T.M.: Coupled semi-supervised learning for information extraction. In: Proceedings of WSDM 2010, pp. 101–110 (2010)
Curran, J.R., Murphy, T., Scholz, B.: Minimising semantic drift with mutual exclusion bootstrapping. In: Proceedings of PACLING 2007, pp. 172–180 (2007)
Etzioni, O., Cafarella, M., Downey, D., Popescu, A.M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Unsupervised named-entity extraction from the web: an experimental study. Artif. Intell. 165(1), 91–134 (2005)
Gupta, S., Manning, C.: Improved pattern learning for bootstrapped entity extraction. In: Proceedings of CoNLL 2014, pp. 98–108 (2014)
Gupta, S., Manning, C.D.: Distributed representations of words to guide bootstrapped entity classifiers. In: Proceedings of HLT-NAACL 2015, pp. 1215–1220 (2015)
Huang, R., Riloff, E.: Inducing domain-specific semantic class taggers from (almost) nothing. In: Proceedings of ACL 2010, pp. 275–285 (2010)
Iacobacci, I., Pilehvar, M.T., Navigli, R.: Sensembed: learning sense embeddings for word and relational similarity. In: Proceedings of ACL 2015, pp. 95–105 (2015)
Kozareva, Z., Riloff, E., Hovy, E.: Semantic class learning from the web with hyponym pattern linkage graphs. In: Proceedings of ACL 2008, pp. 1048–1056 (2008)
Leacock, C., Chodorow, M.: Combining local context and wordnet similarity for word sense identification. WordNet Electron. Lex. Database 49(2), 265–283 (1998)
Lehmann, J., et al.: deqa: deep web extraction for question answering. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012. LNCS, vol. 7650, pp. 131–147. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35173-0_9
Mcintosh, T., Curran, J.R.: Weighted mutual exclusion bootstrapping for domain independent lexicon and template acquisition. In: Proceedings of ALTA 2010, pp. 97–105 (2008)
Miller, G.A.: Wordnet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)
Navigli, R.: Word sense disambiguation: a survey. ACM Comput. Surv. 41(2), 10 (2009)
Navigli, R., Ponzetto, S.P.: Babelnet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artif. Intell. 193, 217–250 (2012)
Pantel, P., Crestan, E., Borkovsky, A., Popescu, A.M., Vyas, V.: Web-scale distributional similarity and entity set expansion. In: Proceedings of EMNLP 2009, pp. 938–947 (2009)
Rada, R., Mili, H., Bicknell, E., Blettner, M.: Development and application of a metric on semantic nets. IEEE Trans. Syst. Man Cybern. 19(1), 17–30 (1989)
Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. arXiv preprint cmp-lg/9511007 (1995)
Riloff, E., Jones, R.: Learning dictionaries for information extraction by multi-level bootstrapping. In: Proceedings of AAAI 1999 and IAAI 1999, pp. 474–479 (1999)
Roark, B., Charniak, E.: Noun-phrase co-occurrence statistics for semiautomatic semantic lexicon construction. In: Proceedings of ACL 1998, pp. 1110–1116 (1998)
Sarmento, L., Jijkoun, V., de Rijke, M., Oliveira, E.: “More like these”: growing entity classes from seeds. In: Proceedings of CIKM 2007, pp. 959–962 (2007)
Seitner, J., Bizer, C., Eckert, K., Faralli, S., Meusel, R., Paulheim, H., Ponzetto, S.P.: A large database of hypernymy relations extracted from the web. In: Proceedings of LREC 2016 (2016)
Sussna, M.: Word sense disambiguation for free-text indexing using a massive semantic network. In: Proceedings of CIKM 1993, pp. 67–74. ACM (1993)
Thelen, M., Riloff, E.: A bootstrapping method for learning semantic lexicons using extraction pattern contexts. In: Proceedings of EMNLP 2002, pp. 214–221 (2002)
Tong, S., Dean, J.: System and methods for automatically creating lists, 25 March 2008, US Patent 7,350,187
Wang, R.C., Cohen, W.W.: Language-independent set expansion of named entities using the web. In: Proceedings of ICDM 2007, pp. 342–350. IEEE (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Adrian, W.T., Manna, M. (2018). Navigating Online Semantic Resources for Entity Set Expansion. In: Calimeri, F., Hamlen, K., Leone, N. (eds) Practical Aspects of Declarative Languages. PADL 2018. Lecture Notes in Computer Science(), vol 10702. Springer, Cham. https://doi.org/10.1007/978-3-319-73305-0_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-73305-0_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73304-3
Online ISBN: 978-3-319-73305-0
eBook Packages: Computer ScienceComputer Science (R0)