Skip to main content
Log in

Web caching: A way to improve web QoS

  • Computer Network and Internet
  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

As the Internet and World Wide Web grow at a fast pace, it is essential that the Web's performance should keep up with increased demand and expectations. Web Caching technology has been widely accepted as one of the effective approaches to alleviating Web traffic and increase the Web Quality of Service (QoS). This paper provides an up-to-date survey of the rapidly expanding Web Caching literature. It discusses the state-of-the-art web caching schemes and techniques, with emphasis on the recent developments in Web Caching technology such as the differentiated Web services, heterogenous caching network structures, and dynamic content caching.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Zari M, Saiedian H, Naeem M. Understanding and reducting Web delays.IEEE Computer Magazine, Dec. 2001, 34(12): 30–37.

    Google Scholar 

  2. Abrams M, Standridge C R, Abdulla G Williams S, Fox E A Limitations and potentials. InProc. the 4th International WWW Conference, 1995, pp. 119–133.

  3. Aggarwal C, Wolf J L, Yu P S. Caching on the World Wide Web.IEEE Trans. Knowledge and Data Engineering, 1999, 11(1): 94–107.

    Article  Google Scholar 

  4. Davison B D. A Web caching primer.IEEE Internet Computing, July/Aug. 2001, 5(4): 38–45.

    Article  Google Scholar 

  5. Feldman A, Caceres R, Douglis F, Glass G, Rabinovich M. Performance of Web proxy caching in heterogeneous bandwidth environments. InProc. the IEEE Infocom'99 Conference, March 1999, pp. 107–116.

  6. Rabinovich M, Spatscheck O. Web Caching and Replication. first Edition. Addison Wesley Publishing Co., New York, Dec. 2001.

    Google Scholar 

  7. Wang J. A survey of Web caching, schemes for the Internet.ACM Computer Communication Review, Oct. 1999, 29(5): 36–46.

    Article  Google Scholar 

  8. Moon S M. Increasing cache bandwidth using multiport caches for exploiting ILP in non-numerical code. InIEE Proc. Computers and Digital Techniques, Sept. 1997, 144(5): 295–303.

  9. Thiebaut D, Stone H S, Wolf J L. Improving disk cache hit-ratios through cache partitioning.IEEE Trans. Computers, June 1992, 41(6): 665–676.

    Article  Google Scholar 

  10. Stiliadis D, Varma A. Selective victim caching: A method to improve the performance of direct-mapped caches.IEEE Trans. Computers, 1997, 46(5): 603–610.

    Article  Google Scholar 

  11. Kangasharju J, Kwon Y G, Ortega A. Design and implementation of a soft caching proxy.Journal of Computer Networks and ISDN Systems, 1998, (30): 2113–2121.

    Article  Google Scholar 

  12. Rabinovich M, Aggarwal A. RaDaR: A scalable architecture for a global Web hosting service. InProc. the 8th World Wide Web Conference, 1999, pp.467–483.

  13. Rabinovich M, Rabinovich I, Rajaraman R, Aggarwal A. A dynamic object replication and migration protocol for an Internet hosting service. InProc. the 19th IEEE Int. Conf. Distributed Computing Systems, 1999, pp. 101–113.

  14. Cooper I, Melve I, Tomlinson G. RFC 3040 Internet Web Replication and Caching Taxonomy, Jan. 2001.

  15. Chankhunthod A, Danzig P B, Neerdaeis C M F S, Worrell K J. A hierarchical Internet object cache. InProc. the USENIX 1996 Annual Technical Conference, Jan. 1996, pp. 153–163.

  16. Malpani R, Lorch J, Berger D. Making World Wide Web caching servers cooperate. In4th Int. WWW Conf., Dec. 1995, pp. 107–117.

  17. Chankhunthod A, Danzig P B, Neerdaels C, Schwartatz M F, Worrell K J. A hierarchical Internet object cache. InProc. USENIX'96, January 1996, pp. 22–26.

  18. Tewari R, Vin H, Dahlin M, Kay J S. Beyond hierarchies: Design considerations for distributed caching on the Internet. Technical Report TR98-04, Department of Computer Science, University of Texas at Austin, Feb. 1998.

  19. Wessels D, Claffy K. Application of Internet cache protocol (ICP). Internet Draft : draft-wessels-icp-v2-appl-00, Internet Engineering Task Force, RFC 2187, 1997.

  20. Michel S, Nguyen K, Rosenstein A, Zhang L. Adaptive Web caching: Towards a new caching architecture. In3rd Int. Caching Workshop, June 1998, pp. 2041–2046.

  21. Povey D, Harrison J. A distributed Internet cache. InProc. the 20th Australian Computer Science Conference, Sydney, Australia, Feb. 1997, pp. 175–184.

  22. Rodriguez P, Spanner C, Biersack E W. Web caching architectures: Hierarchical and distributed caching. In4th International Caching Workshop, 1999, pp. 37–48.

  23. Fan L, Cao P, Almeida J, Broder A Z. Summary cache: A scalable wide-area Web caching sharing protocol.ACM/IEEE Trans. Networking, 2000, 8(3): 281–293.

    Article  Google Scholar 

  24. Wang Z. Cachemesh: A distributed cache system for World Wide Web.97's Web Cache Workshop, 1997.

  25. Karger D, Lehman E, Leighton T, Levine M, Lewin D, Panigraphy R. Consistent hashing and random trees: Distributed caching protocols for relieving hot spots on the World Wide Web. InProc. the 29th Annual ACM Symposium on Theory of Computing, 1997, pp. 654–663.

  26. Ross K W. Hash-routing for collections for shared Web caches.IEEE Network, Nov./Dec. 1997, 11(6): 37–44.

    Article  Google Scholar 

  27. Wu K L, Yu P S. Latency-sensitive hashing for collaborative Web caching.Computer Networks, 2000, 33: 633–644.

    Article  Google Scholar 

  28. Rabinovich M, Chasse J, Gadde S. Not all hits are created equal: Cooperative proxy caching over a wide-area network.Computer Networks and ISDN Systems, Nov. 1998, 30(22–23): 2253–2259.

    Article  Google Scholar 

  29. Tsui K C, Liu J, Liu H L. Autonomy oriented load balancing in proxy cache servers. InFirst Asia-Pacific Conference on Web Intelligence: Research and Development, 2001, pp. 115–124.

  30. Wang F Y, Liu M K, Zeng D J. ISAAC: Intelligent strategies and architectures for adaptive caching. InTechnical Report 01-0402, PARCS Lab, The University of Arizona, 2002.

  31. Casilari E, Lecuona A R, Gonalez F J, Estrella A Det al. Characterization of Web traffic. InProc. the 2001 IEEE Global Telecommunications Conference, 2001, 3: 1862–1866.

  32. Abdulla G. Analysis and modeling of World Wide Web traffic [Dissertation]. Virginia Polytechnic Institute and State University, May 1998.

  33. Barford P, Bestarvros A, Bradley A, Crovella M. Changes in Web client access patterns: Characteristics and caching implications.World Wide Web Journal, Special Issue on Characterization and Performance Evaluation, 1999, 2(1): 15–28.

    Google Scholar 

  34. Breslau L, Cao P, Fan L, Phillips G, Shenker S. Web caching and Zipf-like distributions: Evidence and implications.IEEE INFOCOM 1999, 1999, 1: 127–134.

    Google Scholar 

  35. Chen X, Mohaptra P. Lifetime behavior and its impact on Web caching. InIEEE Workshop on Internet Application, 1999, pp. 54–61.

  36. Swaminathan N, Raghavan S V. Intelligent prefetch in WWW using client behavior characterization. InProc. 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, 2000.

  37. Cunha C R. Trace analysis and its applications to performance enhancements of distributed information systems [Dissertation]. Boston University, 1997.

  38. Leland W E, Taqqu M S, Willinger W, Wilson D V. On the self-similar nature of Ethernet traffic (extended version).IEEE/ACM Transactions on Networking, 1994, 2: 1–15.

    Article  Google Scholar 

  39. Dilley J. The effect of consistency on cache response time. Technical Report HPL-1999-107, Hewlett-Packard Labs, 1999.

  40. Douglis F, Feldmann A, Krishnamurthy B. Rate of change and other metrics: A live study of the World Wide Web. InProc. the USENIX Symposium on Internet Technologies and Systems, 1997, pp. 147–158.

  41. Eden A N, Joh B W, Mudge T. Web latency reduction via client-side prefetching. InProc. 2000 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS-2000), April 2000, pp. 193–200.

  42. Jiang Z, Kleinrock L. An adaptive network prefetch scheme.IEEE Journal of Selected Areas in Communications, April 1998, 6(3): 358–368.

    Article  Google Scholar 

  43. Kroeger T M, Long D D E, Mogul J C. Exploring the bounds of Web latency reduction from caching and prefetching. InProc. USENIX, Symposium on Internet Technology and Systems, Dec. 1997, pp. 13–22.

  44. Markatos E P, Chronaki C E. A top-10 approach to prefetching on the Web. InProc. the INET 98 Conference, 1998.

  45. Wang L. A study of measurement-based Web prefetch control.Presentation at CCECE2000 Halifax, May 7–10, 2000.

  46. Fan L, Cao P, Lin W, Jacobson Q. Web prefetching between low-bandwidth clients and proxies: Potential and performance. InProc. the Joint International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS'99), May 1999, pp. 178–187.

  47. Jiang Z, Kleinrock L. Prefetching links on the WWW. InProc. the IEEE Int. Conf. Communications (ICC'97), Montreal, Canada, June 1997, pp. 483–489.

  48. Li J Q, Wang F Y. Consistency and prefetching for effective Web caching: An qualitative analysis.Technical Report 04-1201, PARCS Lab, the Univ. Arizona, 2001.

  49. Yu S, Kobayashi H. A New Prefetch Cache Scheme. InProc. 2000 IEEE Global Telecommunications Conference (GLOBECOM'00), 2000, 1: 350–355.

  50. Nanopoulos A, Katsaros D, Manolopoulos Y. Effective prediction of Web-user accesses: A data mining approach. In:Proc. the Workshop WEBKDD 2001, 2001.

  51. Mobasher B, Jain N, Han E, Srivastava J. Web mining: Pattern discovery from World Wide Web transactions.Technical Report TR-96050, Department of Computer Science, University of Minnesota, 1996.

  52. Pal S K, Talwar V, Mitra P. Web mining in soft computing framework: Relevance, state of the art and future directions.IEEE Trans. Neural Networks, 2002, 13(5): 1163–1177.

    Article  Google Scholar 

  53. Yang Q, Zhang Z. Model based predictive prefetching. InProc. 12th International Workshop on Database and Expert Systems Applications, 2001.

  54. Cheng K, Kambayashi Y. LRU-SP: A size-adjusted and popularity-aware LRU replacement algorithm for Web caching. InProc. 24th Annual International Computer Software and Applications Conference, 2000, pp. 48–53.

  55. O'Neil E J, O'Neil P E, Weikum G. The LRU-k page replacement algorithm for database disk buffering. InProc. ACM SIGMOD International Conference on Management of Data, New York, 1993, pp. 297–306.

  56. Kim K, Park D. Least popularity-per-byte replacement algorithm for a proxy cache. InProc. 8th International Conference on Parallel and Distributed Systems, 2001, pp. 780–787.

  57. Robinson J T, Devarkonda M V. Data cache management using frequency-based replacement.Performance Evaluation Review, May 1990, 18(1): 134–142.

    Article  Google Scholar 

  58. Williams S, Abrams M, Standridge C Ret al. Removal policies in network caches for World Wide Web documents. InProc. SIGCOMM'96, 1996.

  59. Michel B S, Nikoloudakis K, Reiher P, Zhang L. URL forwarding and compression in adaptive Web caching. InProc. IEEE INFOCOM 2000, Mar. 2000, 2: 670–678.

  60. Hosseini-Khayat S. On optimal replacement of, nonuniform cache objects.IEEE Trans. Computers, August 2000, 49(8): 769–778.

    Article  Google Scholar 

  61. Jin S, Bestavros A. Popularity-aware greedy dual-size Web caching algorithms.Technical Report TR-99/09, Computer Science Department, Boston University, 1999.

  62. Zeng D J, Wang F Y, Fang B. Multiple-queues: An adaptive document replacement policy in Web caching.Technical Report 03-0402, PARCS Lab, the University of Arizona, 2002.

  63. Cao P, Irani S. Cost-aware WWW proxy caching algorithms. InProc. the 1997 USENIX Symp. Internet Technology and Systems, Dec. 1997, pp.193–206.

  64. Mahanti A, Williamson C, Eager D. Traffic analysis of a Web proxy caching hierarchy.IEEE Network May/June 2000, 14(3): 16–23.

    Article  Google Scholar 

  65. Che H, Wang Z, Tung Y. Analysis and design of hierarchical Web caching systems. InProc. IEEE INFOCOM'2001, Anchorage, Alaska, April 2001, pp.1416–1424.

  66. Weikle D, McKee S, Wulf W. Caches as filters: A new approach to cache analysis. InProc. MASCOTS'98, Montreal, PQ, July 1998, pp.2–12.

  67. Busari M, Williamson C. Simulation evaluation of a heterogeneous Web proxy caching hierarchy. In9th Int. Symp. Modeling, Analysis and Simulation of Computer and Telecommunication System, 2001, pp.379–388.

  68. Gwertzman J, Seltzer M. World Wide Web cache consistency. InProc. the USENIX 1996 Annual Technical Conference, Jan, 1996, pp.141–152.

  69. Howard J, Kazar M, Menees S, Nichols D, Satyanarayanan M, Sidebotham R, West M. Scale and performance in a distributed file system.ACM Trans. Computer Systems, Feb. 1998, 6(1): 51–81.

    Article  Google Scholar 

  70. Krishnamurthy B, Wills C E. Piggyback server invalidation for proxy cache coherency. InProc. the WWW-7 Conference, 1998, pp.185–194.

  71. Yin J, Alvisi L, Dahlin M, Lin C. Volume leases for consistency in large-scale systems.IEEE Trans. Knowledge Data Engineering, July 1999, 11(4): 563–576.

    Article  Google Scholar 

  72. Cao P, Liu O. Maintaining strong cache consistency in the World Wide Web.IEEE Trans. Computers. April 1998, 47(4): 445–457.

    Article  MathSciNet  Google Scholar 

  73. Krishnamurthy B. Wills C E. Study of piggyback cache validation for proxy caches in the World Wide Web. InProc. the 1997 USENIX Symposium Internet Technology and Systems, Dec. 1997, pp.1–12.

  74. Gray C, Cheriton D. Leases: An efficient fault-tolerant mechanism for distributed file cache consistency. InProc. 12th ACM Symposium Operating System Principles, 1989, pp.202–210.

  75. Duvvuri V, Shenoy P, Tewari R. Adaptive leases: A strong consistency mechanism for the World Wide Web. In19th. Annual Joint Conference of the IEEE Computer and Communications Societies, INFOCOM 2000, 2000, 2: 834–843.

  76. Krishman P, Raz D, Shavitt Y. Transparent, en-route cache location for regular networks. InDIMACS Workshop on Robust Communication Networks: Interconnection and Survivability, DIMACS Book series, New Brunswick, NJ, USA, Nov. 1998.

  77. Valloppillil V, Ross K W. Cache Array Routing Protocol V1.0. Internet Draft: draft-vinod-carp-v1-03.txt, June 1997.

  78. Almeida J, Cao P. Measuring proxy performance with the Wisconsin proxy benchmark.Journal of Computer Networks and ISDN Systems, 1998, 30: 2179–2192.

    Article  Google Scholar 

  79. Chen Y. Experimental study of Internet traffic modeling and bandwidth allocation.2001 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, 2001, 2, 587–590

    Google Scholar 

  80. Lai G, Liu M K, Wang F Y, Zeng D J. Web Caching: Architectures and Performance Evaluation Survey.2001 IEEE International Conference on Systems, Man, and Cybernetics 2001, 5: 3039–3044.

    Google Scholar 

  81. Maltzahn C, Richarson K J. Performance issues of enterprise level Web proxies. InProc. the ACM SIGMETRICS Int. Conference, June 1997, pp.13–23.

  82. Lu Y, Saxena A, Abdelzaher T F. Differentiated caching services: A control-theoretical approach.IEEE 21st International Conference on Distributed Computing Systems, 2001, pp.615–622.

  83. Cao P, Zhang J, Beach K. Active cache: Caching dynamic contents on the Web. InProc. IFIP Int. Conf. Distributed Systems Platforms and Open Distributed Processing (Middleware'98) 1998, pp.373–388.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ming-Kuan Liu.

Additional information

This work is supported in part by the Outstanding Oversea Scholar Award and the Outstanding Young Scientist Research Fund from the Chinese Academy of Sciences

The authors of this, paper are listed alphabetially, all equally contributed to its completion.

Ming-Kuan Liu is a Ph.D. candidate in the Electrical and Computer Engineering Department at the University of Arizona. His research interests include intelligent control, speech recognition, web caching, voice over IP, network traffic modeling and simulation. He received an M.S. degree in information science from the Institute of Automation (IA), CAS and an M.S. degree in industrial engineering from the University of Arizona in 2000 and 2002, respectively.

Fei-Yue Wang received his Ph.D. degree in electrical, computer and systems engineering from the Rensselaer Polytechnic Institute, Troy, New York, in 1990 and currently is a professor at Systems and Industrial Engineering Department of the University of Arizona, and a research scientist at the Institute of Automation, the Chinese Academy of Sciences. His research interests include modeling, analysis, and control mechanism of complex systems, linguistic dynamic systems, agent-based control systems, intelligent control, real-time embedded systems, application specific operating systems (ASOS), intelligent transportation systems, intelligent vehicles and telematics, web caching and service caching, smart appliances and home systems, and network-based automation systems. He has published over 200 books, book chapters, and papers in those areas since 1984. In 1996 he received Caterpillar Research Invention Award and in 2001 the National Outstanding Young Scientist Research Award from the National Natural Science Foundation of China. He was the Editor-in-Chief of the International Journal of Intelligent Control and Systems from 1995 to 2000, and is the Editor-in-Charge of Series in Intelligent Control and Intelligent Automation, the Editor of IEEE Intelligent systems ITS Department, and an Associate Editor of the IEEE Transactions, on Systems, Man, and Cybernetics (SMC), ITS, and Robotics and Automation (R&A). He was the Program Chair of the 1998 IEEE Int. Symposium on Intelligent Control and the 2001 IEEE Int. Conference on Systems, Man, and Cybernetics, the General Chair of the 2003 IEEE Int. Conference on Intelligent Transportation Systems and will be the Co-Program Chair and General Chair of the 2004 and 2005 IEEE Int. Conf. on Intelligent Vehicles. He is the Vice President of the American Zhu Kezhen Education Foundation, Vice President of IEEE ITS Society. He is a IEEE Fellow.

Daniel Dajun Zeng received his Ph.D. degree from the Graduate School of Industrial Administration and the Robotics Institute at Carnegie Mellon University and currently is an assistant professor at Management Information Systems Department of the University of Arizona, and the associate director of Hoffman E-Commerce Lab at Eller College of Business and Public Administration. He has co-edited one book and published about 50 peer-reviewed articles in Management Information Systems and Computer, Science journals, edited books, and conference proceedings. He is currently directing four US's National Science Foundation-funded research projects as PI or co-PI. His research areas include multi-agent systems, distributed optimization, computational support for auctions and negotiations, intelligent information integration and caching, and recommender systems.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, MK., Wang, FY. & Zeng, D.D. Web caching: A way to improve web QoS. J. Comput. Sci. & Technol. 19, 113–127 (2004). https://doi.org/10.1007/BF02944789

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02944789

Keywords

Navigation