Skip to main content

Mashups for Web Search Engines

  • Chapter
Semantic Mashups

Abstract

Current practice in looking for information on the web states that searchers rely on large-scale web search engines to get assistance. The quality of the search results is analogous to the ability of the searchers to accurately express their information needs as keywords in the search engine’s input box. In this chapter, an attempt is made to explore the various efforts that have been made regarding the query construction/refinement phase of a search session on the web. Along these lines, a number of cases are presented that are based on intuitively created mashups for the underlying web search engine. Particular attention is given to two query construction/refinement mashups that integrate various DBpedia datasets with the web search engine provided by Google.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 59.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://alexa.com/topsites, accessed: 29.6.2012.

  2. 2.

    http://yippy.com.

  3. 3.

    http://www.stumpedia.com.

  4. 4.

    http://www.anoox.com.

  5. 5.

    http://www.irazoo.com.

  6. 6.

    http://www.mysidekick.com.

  7. 7.

    http://search.wikia.com.

  8. 8.

    http://www.dmoz.org.

  9. 9.

    http://www.alexa.com, accessed: 29.6.2012.

  10. 10.

    http://www.google.com/psearch.

  11. 11.

    http://www.google.com/instant.

  12. 12.

    http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnwui/html/iuiguidelines.asp.

  13. 13.

    http://tweetnews.appspot.com.

  14. 14.

    http://developer.yahoo.com/search/boss/.

  15. 15.

    http://news.yahoo.com/.

  16. 16.

    DBpedia ver. 3.8: http://blog.dbpedia.org/2012/08/06/dbpedia-38-released-including-enlarged-ontology-and-additional-localized-versions/.

  17. 17.

    http://dbpedia.org/sparql.

  18. 18.

    N3 notation (Berners-Lee T. Notation 3 (N3): a readable RDF syntax. http://www.w3.org/DesignIssues/Notation3.html).

  19. 19.

    http://wordnet.princeton.edu.

  20. 20.

    DBpedia’s dataset dumps are readily available for downloading from: http://wiki.dbpedia.org/Downloads32.

  21. 21.

    Demo, available at: http://thalassa.ionio.gr/snh/entry/.

  22. 22.

    For more details about the specific mashup, the reader is prompted to read [25].

  23. 23.

    http://www.google.com/cse/.

  24. 24.

    http://dbpedia.org/sparql.

  25. 25.

    http://dbpedia.org/lookup.

  26. 26.

    The mechanics of a highly competitive service like Google’s auto-suggest service are not formally described. Consequently, any attempt to interpret them is subjective.

  27. 27.

    How Google Instant auto-complete suggestions work: http://searchengineland.com/how-Google-instant-autocomplete-suggestions-work-62592.

  28. 28.

    Apart from the above factors there may be other, statistical factors that affect the quality of Google’s suggestions.

  29. 29.

    Wikipedia’s ‘ajax’ disambiguation article: http://en.wikipedia.org/wiki/Ajax.

  30. 30.

    http://lod-cloud.net.

  31. 31.

    http://www.freebase.com.

References

  1. Aghaee S, Pautasso C (2012) End-user programming for web mashups. In: Haarth A, Koch N (eds) Current trends in web engineering. Lecture notes in computer science, vol 7059, pp 347–351. doi:10.1007/978-3-642-27997-3_38

    Chapter  Google Scholar 

  2. Aktas M, Nacar M, Menczer F (2004) Using hyperlink features to personalize web search. In: Advances in web mining and web usage analysis. Proceedings of the 6th international workshop on knowledge discovery from the web, WebKDD 2004, Seattle

    Google Scholar 

  3. Apostolatos I, Papadakis I (2012) GContext: context-based search provided by Wikipedia. In: AI mashup challenge 2012, ESWC 2012. https://sites.google.com/site/aimashup12/home/gcontext/

  4. Auer S, Bizer C, Lehmann J, Kobilarov G, Cyganiak R, Ives Z (2007) DBpedia: a nucleus for a web of open data. In: Proceedings of international semantic web conference, ISWC’07

    Google Scholar 

  5. Baeza-Yates R, Ciaramita M, Mika P, Zaragoza H (2008) Towards semantic search. In: Natural language and information systems. Lecture notes in computer science, vol 5039, pp 4–11

    Chapter  Google Scholar 

  6. Bates M (1986) Subject access in online catalogs: a design model. J Am Soc Inf Sci 11:357–376

    Google Scholar 

  7. Beckers T, Fuhr N (2012) Search system functions for supporting search modes. In: Proceedings of the 2nd European workshop on human–computer interaction and information retrieval, EuroHCIR 2012. http://ceur-ws.org/Vol-909/paper8.pdf

  8. Belkin NJ (1980) Anomalous states of knowledge as the basis for information retrieval. C J Inf Sci 5:133–143

    Google Scholar 

  9. Bhogal J, Macfarlane A, Smith P (2007) A review of ontology based query expansion. Inf Process Manag 43(4):866–886

    Article  Google Scholar 

  10. Billerbeck B, Scholer F, Williams HE, Zobel J (2003) Query expansion using associated queries. In: Proceedings of the ACM CIKM conference

    Google Scholar 

  11. Bizer C, Cyganiak R, Gau T (2007) The RDF book mashup: from web APIs to a web of data. In: Proceedings of the 3rd workshop on scripting for the semantic web, SFSW 07. http://ceur-ws.org/Vol-248/paper4.pdf

  12. Bradley P (2008) Human-powered search engines: an overview and roundup. Ariadne 54. http://www.ariadne.ac.uk/issue54/search-engines/

  13. Daoud M, Tamine L, Boughanem M, Chebaro B (2009) A session based personalized search using an ontological user profile. In: ACM symposium on applied computing, Hawaii. ACM, London, pp 1031–1035

    Google Scholar 

  14. Dervin B (1982) Useful theory for librarianship: communication, not information. Drexel Libr Q 13:16–32

    Google Scholar 

  15. Diaz F, Metzler D (2006) Improving the estimation of relevance models using large external corpora. In: Proceedings of the SIGIR conference, pp 154–161

    Google Scholar 

  16. Foskett DJ (1997) Readings in information retrieval. In: Jones KS, Willet P (eds) Thesaurus. Morgan Kaufman, San Francisco, pp 111–134

    Google Scholar 

  17. Koolen M, Kazai G, Craswell N (2009) Wikipedia pages as entry points for book search. In: Proceedings of the 2nd ACM international conference on web search and data mining, WSDM’09, pp 44–53

    Chapter  Google Scholar 

  18. Lenat D, Guha R, Pittman K, Pratt D, Shepherd M (1990) CYC: toward programs with common sense. Commun ACM 33(8):30–49

    Article  Google Scholar 

  19. Li Y, Luk RWP, Ho EKS, Chung FL (2007) Improving weak ad-hoc queries using Wikipedia as external corpus. In: Proceedings of the SIGIR conference, pp 797–798

    Google Scholar 

  20. Mandala R, Tokunaga T, Tanaka H (1999) Combining multiple evidence from different types of thesaurus for query expansion. In: Proceedings of the 22nd annual international conference on research and development in information retrieval. ACM, Berkeley

    Google Scholar 

  21. Marchionini G (1995) Information seeking in electronic environments. Cambridge series on human–computer interaction. Cambridge University Press, New York

    Book  Google Scholar 

  22. Miller GA (1995) WordNet: a lexical database for English. Commun ACM 38(11):39–41

    Article  Google Scholar 

  23. Milne DN, Witten IH, Nichols DM (2007) A knowledge-based search engine powered by Wikipedia. In: Proceedings of the sixteenth ACM conference on information and knowledge management, CIKM’07, Lisbon, pp 445–454

    Chapter  Google Scholar 

  24. Papadakis I, Stefanidakis M, Stamou S, Andreou I (2009) A query construction service for large-scale web search engines. In: Web intelligence and intelligent agent technologies. IEEE/WIC/ACM international joint conferences, WI-IAT’09, pp 96–99

    Chapter  Google Scholar 

  25. Papadakis I, Stefanidakis M, Stamou S, Andreou I (2012) Semantifying queries over large-scale web search engines. J Internet Serv Appl. doi:10.1007/s13174-012-0068-9

    Google Scholar 

  26. Pazzani M, Muramatsu J, Billsus D (1996) Syskill & Webert: identifying interesting web sites. In: Proceedings of the 13th national conference on artificial intelligence, pp 54–61

    Google Scholar 

  27. Salton G, Buckley C (1990) Improving retrieval performance by relevance feedback. J Am Soc Inf Sci 41(4):288–297

    Article  Google Scholar 

  28. Shneiderman B (1997) Direct manipulation for comprehensible, predictable and controllable user interfaces. In: Moore J, Edmonds E, Puerta A (eds) Proceedings of international conference on intelligent IJser interfaces (IUI97). ACM, Orlando

    Google Scholar 

  29. Sieg A, Mobasher B, Burke R (2007) Web search personalization with ontological user profiles. In: Proceedings of the ACM conference on information and knowledge management, CIKM’07. ACM, New York, pp 525–534

    Google Scholar 

  30. Spencer D (2006) Four modes of seeking information and how to design for them, boxes and arrows. http://www.boxesandarrows.com/view/four_modes_of_seeking_information_and_how_to_design_for_them

  31. Suchanek FM, Kasneci G, Weikum G (2007) YAGO: a core of semantic knowledge-unifying WordNet and Wikipedia. In: Proceedings of the WWW conference, pp 697–706

    Google Scholar 

  32. Sun JT (2005) CubeSVD: a novel approach to personalized web search. In: Proceedings of WWW’05, pp 382–390

    Google Scholar 

  33. Surowiecki J (2004) The wisdom of crowds. Random House, New York. ISBN 978-0385721707

    Google Scholar 

  34. Voorhees EM (2003) Overview of TREC 2003. In: REC, pp 1–13

    Google Scholar 

  35. White MD, Livonen M (2001) Questions as a factor in web search strategy. Inf Process Manag 37(5):721–740 (p 723)

    Article  MATH  Google Scholar 

  36. Wu F, Weld DS (2008) Automatically refining the Wikipedia infobox ontology. In: Proceedings of the 17th international world wide web conference, pp 635–644

    Chapter  Google Scholar 

  37. Yokoi T (1995) The EDR electronic dictionary. Commun ACM 38(11):42–44

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ioannis Papadakis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Papadakis, I., Apostolatos, I. (2013). Mashups for Web Search Engines. In: Endres-Niggemeyer, B. (eds) Semantic Mashups. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36403-7_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-36403-7_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-36402-0

  • Online ISBN: 978-3-642-36403-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics