Temporal Ranking of Search Engine Results

Jatowt, Adam; Kawai, Yukiko; Tanaka, Katsumi

doi:10.1007/11581062_4

Temporal Ranking of Search Engine Results

Adam Jatowt²¹,
Yukiko Kawai²¹ &
Katsumi Tanaka²²

Conference paper

1251 Accesses
9 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3806))

Abstract

Existing search engines contain the picture of the Web from the past and their ranking algorithms are based on data crawled some time ago. However, a user requires not only relevant but also fresh information. We have developed a method for adjusting the ranking of search engine results from the point of view of page freshness and relevance. It uses an algorithm that post-processes search engine results based on the changed contents of the pages. By analyzing archived versions of web pages we estimate temporal qualities of pages, that is, general freshness and relevance of the page to the query topic over certain time frames. For the top quality web pages, their content differences between past snapshots of the pages indexed by a search engine and their present versions are analyzed. Basing on these differences the algorithm assigns new ranks to the web pages without the need to maintain a constantly updated index of web documents.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Amitay, E., Carmel, D., Herscovici, M., Lempel, R., Soffer, A.: Trend Detection Through Temporal Link Analysis. Journal of The American Society for Information Science and Technology 55, 1–12 (2004)
Article Google Scholar
Baeza-Yates, R., Saint-Jean, F., Castillo, C.: Web Structure, Age and Page Quality. In: Laender, A.H.F., Oliveira, A.L. (eds.) SPIRE 2002. LNCS, vol. 2476, pp. 117–130. Springer, Heidelberg (2002)
Chapter Google Scholar
Boyapati, V., Chevrier, K., Finkel, A., Glance, N., Pierce, T., Stokton, R., Whitmer, C.: ChangeDetector^TM: A site level monitoring tool for WWW. In: Proceedings of 11^th International WWW Conference, Honolulu, Hawaii, USA, pp. 570–579 (2002)
Google Scholar
Brewington, E.B., Cybenko, G.: How Dynamic is the Web? In: Proceedings of the 9^th International World Wide Web Conference, Amsterdam, The Netherlands, pp. 257–276 (2000)
Google Scholar
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. In: Proceedings of the 7th World Wide Web Conference, Australia, pp. 107–117 (1998)
Google Scholar
Cho, J., Garcia-Molina, H.: The Evolution of the Web and Implications for an Incremental Crawler. In: Proceedings of the 26^th International Conference on Very Large Databases (VLDB), Cairo, Egypt, pp. 200–209 (2000)
Google Scholar
Cho, J., Ntoulas, A.: Effective Change Detection Using Sampling. In: Proceedings of the 28^th VLDB Conference, Hong Kong, SAR China (2002)
Google Scholar
Douglis, F., et al.: AT&T Internet difference engine: Tracking and Viewing Changes on the Web. World Wide Web 1(1), 27–44 (1998)
Article Google Scholar
Francisco-Revilla, L., Shipman, F., Furuta, R., Karadkar, U., Arora, A.: Perception of Content, Structure, and Presentation Changes in Web-based Hypertext. In: Proceedings of the 12^th ACM Conference on Hypertext and Hypermedia (Hypertext 2001), Aarhus, Denmark, pp. 205–214. ACM Press, New York (2001)
Chapter Google Scholar
Google News: http://news.google.com
Google Search Engine: http://www.google.com
Internet Archive: http://www.archive.org
Jacob, J., et al.: WebVigiL: An approach to just-in-time information propagation in large network-centric environments. Web Dynamics Book. Springer, Heidelberg (2003)
Google Scholar
JTidy: http://jtidy.sourceforge.net
Liu, L., Pu, C., Tang, W.: Continual Queries for Internet Scale Event-Driven Information Delivery. IEEE Knowledge and Data Engineering 11(4), 610–628 (1999), Special Issue on Web Technology
Article Google Scholar
MSN search: http://search.msn.com
Porter Stemmer in Java: http://www.tartarus.org/~martin/PorterStemmer/java.txt
Sato, N., Uehara, M., Sakai, Y.: Temporal Ranking for Fresh Information Retrieval. In: Proceedings of the 6^th International Workshop on Information Retrieval with Asian Languages, Sapporo, Japan, pp. 116–123 (2003)
Google Scholar
Search Engine Statistics: Freshness Showdown, http://searchengineshowdown.com/stats/freshness.shtml
Tomcat Apache: http://jakarta.apache.org/tomcat/

Download references

Author information

Authors and Affiliations

National Institute of Information and Communications Technology, 3-5 Hikaridai, Seika-cho, Soraku-gun, 619-0289, Kyoto, Japan
Adam Jatowt & Yukiko Kawai
Graduate School of Informatics, Kyoto University, Yoshida-Honmachi, Sakyo-ku, 606-8501, Kyoto, Japan
Katsumi Tanaka

Authors

Adam Jatowt
View author publications
You can also search for this author in PubMed Google Scholar
Yukiko Kawai
View author publications
You can also search for this author in PubMed Google Scholar
Katsumi Tanaka
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Texas State University, San Marcos, TX,
Anne H. H. Ngu
Institute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, 153-8505, Tokyo, Japan
Masaru Kitsuregawa
University of Vienna, Vienna, Austria
Erich J. Neuhold
IBM Research Division, Thomas J. Watson Research Center, P.O. Box 218, 10598, New York, Yorktown Heights, USA
Jen-Yao Chung
School of Computer Science and Engineering, University of New South Wales, NSW 2052, Sydney, Australia
Quan Z. Sheng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jatowt, A., Kawai, Y., Tanaka, K. (2005). Temporal Ranking of Search Engine Results. In: Ngu, A.H.H., Kitsuregawa, M., Neuhold, E.J., Chung, JY., Sheng, Q.Z. (eds) Web Information Systems Engineering – WISE 2005. WISE 2005. Lecture Notes in Computer Science, vol 3806. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581062_4

Download citation

DOI: https://doi.org/10.1007/11581062_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30017-5
Online ISBN: 978-3-540-32286-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics