Skip to main content

Navigating Online Semantic Resources for Entity Set Expansion

  • Conference paper
  • First Online:
Practical Aspects of Declarative Languages (PADL 2018)

Abstract

Semantic resources (WordNet, Wikidata, BabelNet, ...) offer invaluable knowledge that can be exploited by humans and machines to solve a variety of tasks. Among these, we address here the one called entity set expansion: extend a given a set of words –called seeds– with new ones being of the same “sort”. Differently from classical approaches, we determine “optimal” common categories of the given seeds by analyzing the semantic relations among the objects these seeds refer to. In particular, we define the notion of an entity network to integrate information from different semantic resources, and show how to use such networks to disambiguate word senses. Finally, we propose a proof-of-concept implementation in answer set programming with external predicates to query online semantic resources and perform optimization tasks.

The paper has been partially supported by the Italian Ministry for Economic Development (MISE) under project “PIUCultura – Paradigmi Innovativi per l’Utilizzo della Cultura” (n. F/020016/01-02/X27), and under project “Smarter Solutions in the Big Data World (S2BDW)” (n. F/050389/01-03/X32) funded within the call “HORIZON2020” PON I&C 2014-2020.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 44.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 60.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    See http://babelnet.org/search?word=prater&lang=EN.

  2. 2.

    There exist satellite projects for other languages, not integrated with the core system.

  3. 3.

    See online demo at: http://webisadb.webdatacommons.org/webisadb/.

  4. 4.

    See http://wordnet.princeton.edu/wordnet/man/wngloss.7WN.html.

  5. 5.

    See http://babelnet.org/guide.

  6. 6.

    If there are more possible combinations of senses, a user can add more seeds and repeat the process, or just select the intended meaning.

  7. 7.

    See DL navigator at: http://www.cs.man.ac.uk/~ezolin/dl/.

  8. 8.

    See https://github.com/DeMaCS-UNICAL/I-DLV.

  9. 9.

    See https://github.com/alviano/wasp.

References

  1. Baader, F., Sertkaya, B., Turhan, A.Y.: Computing the least common subsumer w.r.t. a background terminology. J. Appl. Logic 5(3), 392–420 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  2. Brewka, G., Eiter, T., Truszczynski, M.: Answer set programming at a glance. Commun. ACM 54(12), 92–103 (2011)

    Article  Google Scholar 

  3. Calimeri, F., Fuscà, D., Perri, S., Zangari, J.: I-DLV: the new intelligent grounder of DLV. Intelligenza Artificiale 11(1), 5–20 (2017)

    Article  Google Scholar 

  4. Camacho-Collados, J., Pilehvar, M.T., Navigli, R.: A unified multilingual semantic representation of concepts. In: Proceedings of ACL 2015, pp. 741–751 (2015)

    Google Scholar 

  5. Carlson, A., Betteridge, J., Wang, R.C., Hruschka Jr., E.R., Mitchell, T.M.: Coupled semi-supervised learning for information extraction. In: Proceedings of WSDM 2010, pp. 101–110 (2010)

    Google Scholar 

  6. Curran, J.R., Murphy, T., Scholz, B.: Minimising semantic drift with mutual exclusion bootstrapping. In: Proceedings of PACLING 2007, pp. 172–180 (2007)

    Google Scholar 

  7. Etzioni, O., Cafarella, M., Downey, D., Popescu, A.M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Unsupervised named-entity extraction from the web: an experimental study. Artif. Intell. 165(1), 91–134 (2005)

    Article  Google Scholar 

  8. Gupta, S., Manning, C.: Improved pattern learning for bootstrapped entity extraction. In: Proceedings of CoNLL 2014, pp. 98–108 (2014)

    Google Scholar 

  9. Gupta, S., Manning, C.D.: Distributed representations of words to guide bootstrapped entity classifiers. In: Proceedings of HLT-NAACL 2015, pp. 1215–1220 (2015)

    Google Scholar 

  10. Huang, R., Riloff, E.: Inducing domain-specific semantic class taggers from (almost) nothing. In: Proceedings of ACL 2010, pp. 275–285 (2010)

    Google Scholar 

  11. Iacobacci, I., Pilehvar, M.T., Navigli, R.: Sensembed: learning sense embeddings for word and relational similarity. In: Proceedings of ACL 2015, pp. 95–105 (2015)

    Google Scholar 

  12. Kozareva, Z., Riloff, E., Hovy, E.: Semantic class learning from the web with hyponym pattern linkage graphs. In: Proceedings of ACL 2008, pp. 1048–1056 (2008)

    Google Scholar 

  13. Leacock, C., Chodorow, M.: Combining local context and wordnet similarity for word sense identification. WordNet Electron. Lex. Database 49(2), 265–283 (1998)

    Google Scholar 

  14. Lehmann, J., et al.: deqa: deep web extraction for question answering. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012. LNCS, vol. 7650, pp. 131–147. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35173-0_9

    Chapter  Google Scholar 

  15. Mcintosh, T., Curran, J.R.: Weighted mutual exclusion bootstrapping for domain independent lexicon and template acquisition. In: Proceedings of ALTA 2010, pp. 97–105 (2008)

    Google Scholar 

  16. Miller, G.A.: Wordnet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)

    Article  Google Scholar 

  17. Navigli, R.: Word sense disambiguation: a survey. ACM Comput. Surv. 41(2), 10 (2009)

    Article  Google Scholar 

  18. Navigli, R., Ponzetto, S.P.: Babelnet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artif. Intell. 193, 217–250 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  19. Pantel, P., Crestan, E., Borkovsky, A., Popescu, A.M., Vyas, V.: Web-scale distributional similarity and entity set expansion. In: Proceedings of EMNLP 2009, pp. 938–947 (2009)

    Google Scholar 

  20. Rada, R., Mili, H., Bicknell, E., Blettner, M.: Development and application of a metric on semantic nets. IEEE Trans. Syst. Man Cybern. 19(1), 17–30 (1989)

    Article  Google Scholar 

  21. Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. arXiv preprint cmp-lg/9511007 (1995)

    Google Scholar 

  22. Riloff, E., Jones, R.: Learning dictionaries for information extraction by multi-level bootstrapping. In: Proceedings of AAAI 1999 and IAAI 1999, pp. 474–479 (1999)

    Google Scholar 

  23. Roark, B., Charniak, E.: Noun-phrase co-occurrence statistics for semiautomatic semantic lexicon construction. In: Proceedings of ACL 1998, pp. 1110–1116 (1998)

    Google Scholar 

  24. Sarmento, L., Jijkoun, V., de Rijke, M., Oliveira, E.: “More like these”: growing entity classes from seeds. In: Proceedings of CIKM 2007, pp. 959–962 (2007)

    Google Scholar 

  25. Seitner, J., Bizer, C., Eckert, K., Faralli, S., Meusel, R., Paulheim, H., Ponzetto, S.P.: A large database of hypernymy relations extracted from the web. In: Proceedings of LREC 2016 (2016)

    Google Scholar 

  26. Sussna, M.: Word sense disambiguation for free-text indexing using a massive semantic network. In: Proceedings of CIKM 1993, pp. 67–74. ACM (1993)

    Google Scholar 

  27. Thelen, M., Riloff, E.: A bootstrapping method for learning semantic lexicons using extraction pattern contexts. In: Proceedings of EMNLP 2002, pp. 214–221 (2002)

    Google Scholar 

  28. Tong, S., Dean, J.: System and methods for automatically creating lists, 25 March 2008, US Patent 7,350,187

    Google Scholar 

  29. Wang, R.C., Cohen, W.W.: Language-independent set expansion of named entities using the web. In: Proceedings of ICDM 2007, pp. 342–350. IEEE (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Weronika T. Adrian .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Adrian, W.T., Manna, M. (2018). Navigating Online Semantic Resources for Entity Set Expansion. In: Calimeri, F., Hamlen, K., Leone, N. (eds) Practical Aspects of Declarative Languages. PADL 2018. Lecture Notes in Computer Science(), vol 10702. Springer, Cham. https://doi.org/10.1007/978-3-319-73305-0_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-73305-0_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73304-3

  • Online ISBN: 978-3-319-73305-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics