Patterns on the Web

  • Krishna Bharat
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2857)

Abstract

The web is the product of a planet-wide, implicit collaboration among content creators on an unprecedented scale. Although authors on the web come from diverse backgrounds and often operate independently, their collective work embodies surprising regularities at various levels. In this paper we describe patterns in both the structural and the temporal properties of the web.

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Krishna Bharat, Google Inc., Mountain View, USA
