Abstract
Perhaps the most significant tool of our internet age is the web search engine, providing a powerful interface for accessing the vast amount of information available on the world wide web and beyond. While still in its infancy compared to the knowledge tools that precede it – such as the dictionary or encyclopedia – the impact of web search engines on society and culture has already received considerable attention from a variety of academic disciplines and perspectives. This article aims to organize a meta-discipline of “web search studies,” centered around a nucleus of major research on web search engines from five key perspectives: technical foundations and evaluations; transaction log analyses; user studies; political, ethical, and cultural critiques; and legal and policy analyses.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
According to the Pew internet & American Life Project, 84% of American adult internet users have used a search engine to seek information online (Fallows, 2005: 1), making searching the web the second most popular online activity (behind using e-mail) (Rainie, 2005). In August 2007, over 750 million people worldwide over the age of 15 conducted a search, totaling more than 61 billion searches (Burns, 2007).
- 2.
The first full-text web search engine was WebCrawler, launched in 1994 (InfoSpace, 2007), making search engines relative “teenagers” compared to other tools and technologies for organizing and retrieving information. Encyclopedias, for example, date back to the first century AD.
- 3.
These categories are not necessarily mutually exclusive and are not put forth as airtight ontological divisions. They are meant simply to help organize this interdisciplinary collection of studies to aid discussion.
- 4.
Critical responses to PageRank, and Google overall, will be discussed below.
- 5.
For a comparison of web search engine ranking algorithms, see Borodin, Roberts, Rosenthal, and Tsaparas (2001).
- 6.
For a summary of literature on search engine design, see Arasu, Cho, Garcia-Molina, Paepcke, and Raghavan (2001).
- 7.
A library of research on information retrieval systems can be found at the website for the ACM Special Interest Group on Information Retrieval: http://www.sigir.org/proceedings/Proc-Browse.html.
- 8.
These concerns relate to some of the cultural and social issues that will be discussed in more detail below.
- 9.
See “Regulating Search?: A Symposium on Search Engines, Law, and Public Policy” (http://isp.law.yale.edu/regulatingsearch/overview/).
- 10.
See “The Law of Search Engines” (http://law.haifa.ac.il/events/event_sites/se/).
- 11.
References
Albrechtslund, A. (2006). Surveillance in searching. Paper presented at the EASST 2006, Lausanne.
Arasu, A., Cho, J., Garcia-Molina, H., Paepcke, A., & Raghavan, S. (2001). Searching the Web. ACM Transactions on Internet Technology, 1(1), 2–43.
Bar-Ilan, J., Mat-Hassan, M., & Levene, M. (2006). Methods for comparing rankings of search engine results. Computer Networks, 50(10), 1448–1463.
Bergman, M. (2001). The deep web: Surfacing hidden value. Journal of Electronic Publishing, 7(1). http://dx.doi.org/10.3998/3336451.0007.104.
Borodin, A., Roberts, G. O., Rosenthal, J. S., & Tsaparas, P. (2001). Finding authorities and hubs from link structures on the world wide Web. Proceedings of the 10th international conference on world wide web. Hong Kong (pp. 415–429).
Brin, S., & Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. WWW7/Computer Networks, 30(1–7), 107–117.
Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., et al. (2000). Graph structure in the Web. Computer Networks, 33(1–6), 309–320.
Burns, E. (2007). Worldwide internet: Now serving 61 billion searches per month. SearchEngineWatch. Retrieved November 2, 2007, from http://searchenginewatch.com/showPage.html?page=3627304
Chandler, J. (2008). A right to reach an audience: An approach to intermediary bias on the internet. Hofstra Law Review, 35(3), 1095–1138.
Choo, C. W., Detlor, B., & Turnbull, D. (1998). A behavioral model of information seeking on the Web: Preliminary results of a study of how managers and IT specialists use the Web. Proceedings of the 61st Annual Meeting of the American Society for Information Science, 35, 290–302.
Choo, C. W., Detlor, B., & Turnbull, D. (2000). Information seeking on the Web: An integrated model of browsing and searching. First Monday, 5(2), 2000.
Chopra, S., & White, L. (2007). Privacy and artificial agents, or, is google reading my email? Paper presented at the IJCAI 2007, Hyderabad, India.
Deleuze, G., & Guattari, F. (1987). A thousand plateaus: Capitalism and schizophrenia (B. Massumi, Trans.). Minneapolis, MN: University of Minnesota Press.
Dennis, S., Bruza, P., & McArthur, R. (2002). Web searching: A process-oriented experimental study of three interactive search paradigms. Journal of the American Society for Information Science and Technology, 53(2), 120–133.
Diaz, A. (2008). Through the Google goggles: Sociopolitical bias in search engine design. In A. Spink & M. Zimmer (Eds.), Web searching: Multidisciplinary perspectives (pp. 11–34). Dordrecht, The Netherlands: Springer.
Eichmann, D. (1994). The rbse spider – balancing effective search against Web load. Proceedings of the 1st international world wide web conference. Geneva (pp. 113–120).
Elkin-Koren, N. (2001). Let the crawlers crawl: On virtual gatekeepers and the right to exclude indexing. University of Dayton Law Review, 26, 180–209.
Fallows, D. (2005). Search engine users: Internet searchers are confident, satisfied and trusting – but they are also unaware and naïve. Pew Internet & American Life Project. Retrieved October 15, 2005, from http://www.pewinternet.org/pdfs/PIP_Searchengine_users.pdf
Federal Trade Commission. (2007). Ftc to host town hall to examine privacy issues and online behavioral advertising. Retrieved November 8, 2007, from http://ftc.gov/opa/2007/08/ehavioral.shtm
Fitzgerald, B., O’Brien, D., & Fitzgerald, A. (2008). Search engine liability for copyright infringement. In A. Spink & M. Zimmer (Eds.), Web searching: Multidisciplinary perspectives (pp. 103–120). Dordrecht, The Netherlands: Springer.
Fortunato, S., Flammini, A., Menczer, F., & Vespignani, A. (2005). The egalitarian effect of search engines. Arxiv preprint cs.CY/0511005. http://arxiv.org/pdf/cs.CY/0511005
Gasser, U. (2006). Regulating search engines: Taking stock and looking ahead. Yale Journal of Law & Technology, 9, 124–157.
Goldman, E. (2006).Search engine bias and the demise of search engine utopianism. Yale Journal of Law & Technology, 8, 188–200.
Grimmelmann, J. (2007). The structure of search engine law. Iowa Law Review, 93(1), 1–63.
Gulli, A., & Signorini, A. (2005). The indexableWeb is more than 11.5 billion pages. International World Wide Web Conference. Chiba: ACM Press (pp. 902–903).
Hansell, S. (2006). AOL removes search data on vast group of Web users. The New York Times, C4.
Hargittai, E. (2002). Beyond logs and surveys: In-depth measures of people’s Web use skills. Journal of the American Society for Information Science and Technology, 53(14), 1239–1244.
Hargittai, E. (2004a). Informed Web surfing: The social context of user sophistication. In Society Online: the Internet in Context (pp. 257–274). Thousand Oaks: Sage Publications, Inc.
Hargittai, E. (2004b). The changing online landscape: From free-for-all to commercial gatekeeping. Retrieved October 14, 2006, from http://www.eszter.com/research/c03-onlinelandscape.html
Heintz, C. (2006). Web search engines and distributed assessment systems. Pragmatics & Cognition, 14(2), 387–409.
Hellsten, I., Leydesdorff, L., &Wouters, P. (2006). Multiple presents: How search engines re-write the past. New Media & Society, 8(6), 901–924.
Hendry, D., & Efthimiadis, E. (2008). Conceptual models for search engines. In A. Spink & M. Zimmer (Eds.), Web searching: Multidisciplinary perspectives (pp. 277–307). Dordrecht, The Netherlands: Springer.
Herlocker, J. L., Konstan, J. A., Terveen, L. G., & Riedl, J. T. (2004). Evaluating collaborative filtering recommender systems. ACM Transactions on Information Systems (TOIS), 22(1), 5–53.
Heydon, A., & Najork, M. (1999). Mercator: A scalable, extensible Web crawler. world wide web, 2(4), 219–229.
Hindman, M., Tsioutsiouliklis, K., & Johnson, J. A. (2003). Googlearchy: How a few heavilylinked sites dominate politics on the Web. Annual meeting of the Midwest Political Science Association. Chicago, IL.
Hinman, L. (2005). Esse est indicato in Google: Ethical and political issues in search engines. International Review of Information Ethics, 3, 19–25.
Höelscher, C. (1998). How internet experts search for information on the Web. World conference of the world wide web, Internet, and Intranet. Orlando, FL.
Höelscher, C., & Strube, G. (2000).Web search behavior of internet experts and newbies. Computer Networks, 33(1–6), 337–346.
InfoSpace. (2007). About webcrawler. Retrieved November 3, 2007, from http://www.webcrawler.com/webcrawler/ws/about/_iceUrlFlag=11?_IceUrl=true
Introna, L., & Nissenbaum, H. (2000). Shaping theWeb:Why the politics of search engines matters. The Information Society, 16(3), 169–185.
Jansen, B. J., & Pooch, U. (2001). A review of Web searching studies and a framework for future research. Journal of the American Society for Information Science and Technology, 52(3), 235–246.
Jansen, B. J., & Spink, A. (2005). How are we searching the world wide Web? A comparison of nine search engine transaction logs. Information Processing & Management, 42(1), 248–263.
Jansen, B. J., Spink, A., & Saracevic, T. (2000). Real life, real users, and real needs: A study and analysis of user queries on theWeb. Information Processing and Management, 36(2), 207–227.
Keenoy, K., & Levene, M. (2005). Personalisation of Web search. In Intelligent Techniques for Web Personalization (pp. 201–228). Berlin: Springer.
Khopkar, Y., Spink, A., Giles, C. L., Shah, P., & Debnath, S. (2003). Search engine personalization: An exploratory study. FirstMonday. Retrieved October 23, 2007, from http://www.firstmonday.org/issues/issue8_7/khopkar/
Kleinberg, J. (1999). Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM), 46(5), 604–632.
Kleinberg, J., & Lawrence, S. (2001). The structure of the Web. Science, 294, 1849–1850.
Kleinberg, J. M., Kumar, R., Raghavan, P., Rajagopalan, S., & Tomkins, A. (1999). The Web as a graph: Measurements, models and methods. Proceedings of the International Conference on Combinatorics and Computing, 6(1), 1–18.
Kopytoff, V. (2006). Google says no to data demand. San Francisco Chronicle, A1.
Lawrence, S., & Giles, C. L. (1998). Searching the world wide Web. Science, 280(5360), 98–100.
Lawrence, S., & Giles, L. (2000). Accessibility of information on the Web. Intelligence, 11(1), 32–39.
Lempel, R., & Moran, S. (2000). The stochastic approach for link-structure analysis (salsa) and the tkc effect. Computer Networks, 33(1–6), 387–401.
Lev-On, A. (2008). The democratizing effects of search engine use: On chance exposures and organizational hubs. In A. Spink & M. Zimmer (Eds.), Web searching: Multidisciplinary perspectives (pp. 135–149). Dordrecht, The Netherlands: Springer.
Lewandowski, D., Wahlig, H., & Meyer-Bautor, G. (2006). The freshness of Web search engine databases. Journal of Information Science, 32(2), 131.
Machill, M., Neuberger, C., Schweiger,W., &Wirth,W. (2004). Navigating the internet. European Journal of Communication, 19(3), 321–347.
Martey, R. M. (2008). Exploring gendered notions: Gender, job hunting and Web search engines. In A. Spink & M. Zimmer (Eds.), Web searching: Multidisciplinary perspectives (pp. 51–65). Dordrecht, The Netherlands: Springer.
McBryan, O. A. (1994). Genvl and wwww: Tools for taming the Web. Proceedings of the 1st International world wide web conference. Geneva pp. 79–90).
Menczer, F., Fortunato, S., Flammini, A., & Vespignani, A. (2006). Googlearchy or googlocracy? IEEE Spectrum, 43(2). http://www. spectrum.ieee.org/feb06/2787.
Nagenborg, M. (2005). The ethics of search engines (special issue). International Review of Information Ethics, 3.
Norvig, P., Winograd, T., & Bowker, G. (2006). The ethics and politics of search engines. Panel at Santa Clara University Markkula Center for Applied Ethics. Retrieved March 1, 2006, from http://www.scu.edu/sts/Search-Engine-Event.cfm
O’Brien, D., & Fitzgerald, B. (2006). Digital copyright law in a YouTube world. Internet Law Bulletin, 9(6/7), 71, 73–74.
Page, L., Brin, S., Motwani, R., & Winograd, T. (1998). The PageRank citation ranking: Bringing order to the Web. Retrieved January 12, 2007, from http://dbpubs.stanford.edu/pub/1999-66
Pan, B., Hembrooke, H., Joachims, T., Lorigo, L., Gay, G., & Granka, L. (2004). In Google we trust: Users’ decisions on rank, position, and relevance. Journal of Computer-Mediated Communication, 12(3), 801–823.
Pasquale, F., & Bracha, O. (2007). Federal search commission? Access, fairness and accountability in the law of search. U of Texas Law, Public Law Research Paper No. 123. Retrieved August 15, 2007, from http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1002453
Pinkerton, B. (1994). Finding what people want: Experiences with the webcrawler. Proceedings of the 2nd International world wide web conference. Elsevier Science
Raghavan, S., & Garcia-Molina, H. (2001). Crawling the hidden Web. Proceedings of the 27th International conference on very large data bases. Rome (pp. 129–138).
Rainie, L. (2005). Search engine use shoots up in the past year and edges towards e-mail as the primary internet application. Pew Internet and American Life Project. Retrieved September 15, 2006, from http://www.pewinternet.org/pdfs/PIP_SearchData_1105.pdf
Rieder, B. (2005). Networked control: Search engines and the symmetry of confidence. International Review of Information Ethics, 3, 26–32.
Röhle, T. (2007). Desperately seeking the consumer: Personalized search engines and the commercial exploitation of user data. First Monday. Retrieved October 23, 2007, from http://www.firstmonday.org/issues/issue12_9/rohle/index.html
Roy, M., & Chi, M. T. C. (2003). Gender differences in patterns of searching the Web. Journal of Educational Computing Research, 29(3), 335–348.
Sanderson, M. (2005). Information retrieval system evaluation: Effort, sensitivity, and reliability. Proceedings of the 28th annual international ACM SIGIR conference on research and development in information retrieval. Salvador (pp. 162–169).
Silverstein, C., Henzinger, M. R., Marais, H., & Moricz, M. (1999). Analysis of a very large Web search engine query log. SIGIR Forum, 33(1), 6–12.
Spink, A., & Jansen, B. J. (2004). Web search: Public searching of the web. New York, NY: Kluwer Academic Publishers.
Spink, A., Jansen, B. J., Blakely, C., & Koshman, S. (2006). A study of results overlap and uniqueness among major Web search engines. Information Processing & Management, 42(5), 1379–1391.
Steinbach, M., Karypis, G., & Kumar, V. (2000). A comparison of document clustering techniques. KDD Workshop on Text Mining, 34, 35.
Tavani, H. T. (2005). Search engines, personal information and the problem of privacy in public. International Review of Information Ethics, 3, 39–45.
Teevan, J., Dumais, S. T., & Horvitz, E. (2005). Personalizing search via automated analysis of interests and activities. Proceedings of the 28th annual international ACM SIGIR conference on research and development in information retrieval. ACM Press (pp. 449–456).
Travis, H. (2006). Google Book search and fair use. University of Miami Law Review, 61, 601–681.
Vaidhyanathan, S. (2007). The googlization of everything and the future of copyright. University of California Davis Law Review, 40(3), 1207–1231.
Van Couvering, E. (2004). New media? The political economy of internet search engines. Annual conference of the International Association of Media & Communications Researchers. Porto Alegre, Brazil (pp. 7–14).
Van Couvering, E. (2008). The history of the internet search engine: Navigational media and the traffic commodity. In A. Spink & M. Zimmer (Eds.), Web searching: Multidisciplinary perspectives (pp. 77–206). Dordrecht, The Netherlands: Springer.
Vaughan, L. (2004). New measurements for search engine evaluation proposed and tested. Information Processing and Management: An International Journal, 40(4), 677–691.
Welp, C., & Machill, M. (2005). Code of conduct: Transparency in the net: Search engines. International Review of Information Ethics, 3, 18.
Wirth, W., Böcking, T., Karnowski, V., & von Pape, T. (2007). Heuristic and systematic use of search engines. Journal of Computer-Mediated Communication, 12(3), 778–800.
Wouters, P., Hellsten, I., & Leydesdorff, L. (2004). Internet time and the reliability of search engines. FirstMonday. Retrieved December 24, 2006, from http://www.firstmonday.org/issues/issue9_10/wouters/index.html
Zimmer, M. (2006). The value implications of the practice of paid search. Bulletin of the American Society for Information Science and Technology. Retrieved April 3, 2006, from http://www.asis.org/Bulletin/Dec-05/zimmer.html
Zimmer, M. (2008a). Privacy on planet Google: Using the theory of “contextual integrity” to clarify the privacy threats of Google’s quest for the perfect search engine. Journal of Business & Technology Law, 3(1), 109–126.
Zimmer, M. (2008b). The gaze of the perfect search engine: Google as an infrastructure of dataveillance. In A. Spink & M. Zimmer (Eds.), Web searching: Multidisciplinary perspectives (pp. 77–99). Dordrecht, The Netherlands: Springer.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media B.V.
About this chapter
Cite this chapter
Zimmer, M. (2009). Web Search Studies: Multidisciplinary Perspectives on Web Search Engines. In: Hunsinger, J., Klastrup, L., Allen, M. (eds) International Handbook of Internet Research. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-9789-8_31
Download citation
DOI: https://doi.org/10.1007/978-1-4020-9789-8_31
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-9788-1
Online ISBN: 978-1-4020-9789-8
eBook Packages: Computer ScienceComputer Science (R0)