Web Page Quality Metrics

Kumar, Ravi

doi:10.1007/978-1-4899-7993-3_460-2

Ravi Kumar³

105 Accesses

Synonyms

Link analysis

Definition

The primary mission of web search engines is to obtain the best possible results for a given user query. To accomplish this effectively, they rely on two crucial pieces of information: the relevance of a web page to the query and some aspect of the quality of the web page that is independent of the query. Relevance, the extent to which the query matches the content of the web page, is formalized and extensively studied in the field of information retrieval. Quality, on the other hand, is more nebulous and less well-defined. Nevertheless, one can identify three concrete and somewhat complementary aspects to the quality of a web page. The first is based on the absolute goodness of the web page and its associated meta-data. This might depend on a variety of parameters, including the worth of content that exists on the web page, the reputation of the person who authored the web page, the importance of the web site that hosts the web page, and so on. The...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Recommended Reading

Bar-Yossef Z, Broder A, Kumar R, Tomkins A. Sic transit gloria telae: towards and understanding of the web’s decay. In: Proceedings of 12th International World Wide Web Conference; 2004. p. 328–37.
Google Scholar
Bharat K, Henzinger M. Improved algorithms for topic distillation in a hyperlinked environment. In: Proceedings of 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1998. p. 104–11.
Google Scholar
Borodin A, Roberts GO, Rosenthal JS, Tsaparas P. Link analysis ranking algorithms, theory, and experiments. ACM Trans Internet Tech. 2005;5:231–97.
Article Google Scholar
Brin S, Page L. The anatomy of a large-scale hypertextual web search engine. Comput Netw. 1998;30:107–17.
Google Scholar
Chakrabarti S, Dom B, Gibson D, Kleinberg J, Raghavan P, Rajagopalan S. Automatic resource compilation by analyzing hyperlink structure and associated text. Comput Netw. 1998;30:65–74.
Google Scholar
Chakrabarti S, Dom B, Gibson D, Kumar R, Raghavan P, Rajagopalan S, Tomkins A. Spectral filtering for resource discovery. In: Proceedings of ACM SIGIR Workshop on Hypertext Analysis; 1998, p. 13–21.
Google Scholar
Garfield E. Citation analysis as a tool in journal evaluation. Science. 1972;178:471–9.
Article Google Scholar
Gibson D, Kleinberg J, Raghavan P. Inferring Web communities from link topology. In: Proceedings of ACM Conference on Hypertext; 1998. p. 225–34.
Google Scholar
Gyöngyi Z, Garcia-Molina H, Pedersen J. Combating web spam with TrustRank. In: Proceedings of 30th International Conference on Very Large Data Bases; 2004. p. 576–87.
Google Scholar
Haveliwala TH. Topic-sensitive PageRank: a context-sensitive ranking algorithm for web search. IEEE Trans Knowl Data Eng. 2003;15:784–96.
Article Google Scholar
Kessler MM. Bibliographic coupling between scientific papers. Am Doc. 1963;14:10–25.
Article Google Scholar
Kleinberg J. Authoritative sources in a hyperlinked environment. J ACM. 2000;46:604–32.
Article MathSciNet MATH Google Scholar
Lempel R, Moran S. SALSA: the stochastic approach for link-structure analysis. ACM Trans Inform Syst. 2001;19:131–60.
Article Google Scholar
Rafiei D, Mendelzon AO. What is this page known for? Computing web page reputations. Comput Netw. 2000;33:823–35.
Article Google Scholar
Small H. Co-citation in the scientific literature: a new measure of the relationship between two documents. J Am Soc Inform Sci. 1973;24:265–9.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Yahoo Research, Santa Clara, CA, USA
Ravi Kumar

Authors

Ravi Kumar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ravi Kumar .

Editor information

Editors and Affiliations

Georgia Institute of Technology College of Computing, Atlanta, Georgia, USA
Ling Liu
University of Waterloo School of Computer Science, Waterloo, Ontario, Canada
M. Tamer Özsu

Section Editor information

Google Research, 76th 9th Ave, 10018, New York, NY, USA
Cong Yu Research Scientist

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Kumar, R. (2016). Web Page Quality Metrics. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_460-2

Download citation

DOI: https://doi.org/10.1007/978-1-4899-7993-3_460-2
Received: 26 April 2016
Accepted: 14 June 2016
Published: 16 November 2016
Publisher Name: Springer, New York, NY
Online ISBN: 978-1-4899-7993-3
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering

Publish with us

Policies and ethics