Abstract
PageRank is one of the most popular link analysis algorithms that have shown their effectiveness in web search. However, PageRank only consider hyperlink information. In this paper, we propose several novel ranking algorithms, which make use of both hyperlink and site structure information to measure the importance of each web page. Specifically, two kinds of methodologies are adopted to refine the PageRank algorithm: one combines hyperlink information and website structure information together by graph fusion to refine PageRank algorithm, while the other re-ranks the pages within the same site by quadratic optimization based on original PageRank values. Experiments show that both two methodologies effectively improve the retrieval performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Boyd, S., Vandenberghe, L.: Convex optimization. Course notes for EE364. Stanford University, Stanford (2003)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. In: The Seventh International World Wide Web Conference (1998)
Chakrabarti, S., Joshi, M., Tawde, V.: Enhanced topic distillation using text, markup tags, and hyperlinks. In: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 208–216. ACM Press, New York (2001)
Feng, G., Liu, T.Y., Zhang, X.D., Qin, T., Gao, B., Ma, W.Y.: Level-Based Link Analysis. In: Zhang, Y., Tanaka, K., Yu, J.X., Wang, S., Li, M. (eds.) APWeb 2005. LNCS, vol. 3399, pp. 183–194. Springer, Heidelberg (2005)
Haveliwala, T.H.: Topic-sensitive pagerank. In: Proc. of the 11th Int. World Wide Web Conference (May 2002)
Kleinberg, J.: Authoritative sources in a hyperlinked environment. Journal of the ACM 46(5), 604–622 (1999)
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: Bringing order to the web, Technical report, Stanford University, Stanford, CA (1998)
Robertson, S.E.: Overview of the okapi projects. Journal of Pageation 53(1), 3–7 (1997)
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)
Simon, H.A.: The Sciences of the Artificial, 3rd edn. MIT Press, Cambridge (1981)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yan, HM., Qin, T., Liu, TY., Zhang, XD., Feng, G., Ma, WY. (2005). Calculating Webpage Importance with Site Structure Constraints. In: Lee, G.G., Yamada, A., Meng, H., Myaeng, S.H. (eds) Information Retrieval Technology. AIRS 2005. Lecture Notes in Computer Science, vol 3689. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562382_48
Download citation
DOI: https://doi.org/10.1007/11562382_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29186-2
Online ISBN: 978-3-540-32001-2
eBook Packages: Computer ScienceComputer Science (R0)