The hw-rank: an h-index variant for ranking web pages
We introduce a novel ranking of search results based on a variant of the h-index for directed information networks such as the Web. The h-index was originally introduced to measure an individual researcher’s scientific output and influence; here a variant of it is applied to assess the “importance” of web pages. As with PageRank, the “importance” of a page is defined by the “importance” of the pages linking to it. However, unlike the computation of PageRank, which involves the whole web graph, computing the h-index for web pages (the hw-rank) is a local computation: only the neighbors of the neighbors of the given node are considered. Preliminary results show a strong correlation between ranking with the hw-rank and ranking with PageRank; moreover, the hw-rank is simpler to compute than PageRank. Further, larger-scale experiments are needed to assess the applicability of the method.
Keywords: hw-Rank, h-Index, PageRank, Ranking of search results
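The abstract does not spell out the exact definition of the hw-rank, so the following is only a minimal sketch of the kind of local computation it describes. It assumes (this is our assumption, not a definition taken from the abstract) that the hw-rank of a page is the largest h such that at least h of the pages linking to it each have at least h in-links themselves. The function names `h_index`, `build_inlinks`, and `hw_rank_sketch`, and the toy graph, are hypothetical and serve illustration only.

```python
from collections import defaultdict


def h_index(values):
    """Return the largest h such that at least h of the values are >= h."""
    h = 0
    for rank, value in enumerate(sorted(values, reverse=True), start=1):
        if value >= rank:
            h = rank
        else:
            break
    return h


def build_inlinks(edges):
    """Map each page to the set of pages linking to it (self-links ignored)."""
    inlinks = defaultdict(set)
    for source, target in edges:
        if source != target:
            inlinks[target].add(source)
    return inlinks


def hw_rank_sketch(page, inlinks):
    """ASSUMED definition: h-index over the in-degrees of the pages linking to `page`.

    Only the in-neighbors of `page` and their in-neighbors are inspected,
    so the computation never touches the rest of the graph.
    """
    neighbour_indegrees = [
        len(inlinks.get(linker, ())) for linker in inlinks.get(page, ())
    ]
    return h_index(neighbour_indegrees)


# Hypothetical toy graph: (source, target) means "source links to target".
toy_edges = [
    ("a", "p"), ("b", "p"), ("c", "p"),   # p is linked from a, b, c
    ("x", "a"), ("y", "a"), ("z", "a"),   # a has 3 in-links
    ("x", "b"), ("y", "b"),               # b has 2 in-links; c has none
]
toy_inlinks = build_inlinks(toy_edges)
toy_score = hw_rank_sketch("p", toy_inlinks)
```

In the toy graph the pages linking to `p` have in-degrees 3, 2, and 0, so under the assumed definition the score of `p` is 2. In contrast to an iterative PageRank computation over the whole graph, each score here depends only on the two-step in-neighborhood of the page.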
The authors thank Chatzimichalis Konstantinos for carrying out the comparisons between the hw-index and PageRank while studying at Birkbeck University of London.