
Workload generation for YouTube

Multimedia Tools and Applications

Abstract

This paper presents a workload characterization study of YouTube, the most popular short video sharing service of Web 2.0. Using data gathered over a five-month period, we analyzed the characteristics of around 250,000 popular and regular YouTube videos. In particular, we recursively collected the list of related videos for each video clip and analyzed their statistical behavior. Understanding the traffic of YouTube and similar Web 2.0 video sharing sites is crucial for developing synthetic workload generators. Workload simulators are required to evaluate methods that address the high bandwidth usage and scalability problems of Web 2.0 sites such as YouTube. The distribution models, in particular the Zipf-like behavior of popular YouTube video files, suggest that proxy caching of popular YouTube videos can reduce network traffic and increase the scalability of the YouTube Web site. The workload characteristics provided in this work enabled us to develop a workload generator to evaluate the effectiveness of this approach.
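The link the abstract draws between Zipf-like popularity and proxy caching can be illustrated with a minimal sketch. This is not the authors' workload generator: the function names, the skew parameter `alpha`, the catalog size, the cache size, and the choice of LRU replacement are all assumptions made for illustration. The sketch draws synthetic video requests from a Zipf-like rank distribution and replays them against a small cache to estimate a hit ratio.

```python
import random
from collections import OrderedDict
from itertools import accumulate


def zipf_weights(n_videos, alpha):
    """Unnormalized Zipf-like weights: rank r gets weight 1 / r**alpha."""
    return [1.0 / (rank ** alpha) for rank in range(1, n_videos + 1)]


def generate_requests(n_videos, n_requests, alpha, seed=0):
    """Draw video ranks (1 = most popular) from a Zipf-like distribution.

    alpha = 0 gives a uniform workload; larger alpha gives more skew.
    All parameters are illustrative, not values fitted in the paper.
    """
    rng = random.Random(seed)
    cum = list(accumulate(zipf_weights(n_videos, alpha)))
    ranks = range(1, n_videos + 1)
    return rng.choices(ranks, cum_weights=cum, k=n_requests)


def lru_hit_ratio(requests, cache_size):
    """Replay a request trace against an LRU cache and return the hit ratio."""
    cache = OrderedDict()
    hits = 0
    for video in requests:
        if video in cache:
            hits += 1
            cache.move_to_end(video)       # mark as most recently used
        else:
            cache[video] = True
            if len(cache) > cache_size:
                cache.popitem(last=False)  # evict least recently used
    return hits / len(requests) if requests else 0.0


if __name__ == "__main__":
    # Hypothetical sizes: 10,000 videos, 50,000 requests, cache of 500 videos.
    skewed = generate_requests(10_000, 50_000, alpha=0.8)
    uniform = generate_requests(10_000, 50_000, alpha=0.0)
    print("Zipf-like hit ratio:", lru_hit_ratio(skewed, 500))
    print("Uniform hit ratio:  ", lru_hit_ratio(uniform, 500))
```

Under these assumptions, the skewed trace should yield a noticeably higher hit ratio than the uniform one for the same cache size, which is the intuition behind caching popular videos at a proxy.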





Acknowledgements

We would like to thank the anonymous reviewers of this paper for their suggestions.

Author information

Corresponding author

Correspondence to Abdolreza Abhari.

About this article

Cite this article

Abhari, A., Soraya, M. Workload generation for YouTube. Multimed Tools Appl 46, 91–118 (2010). https://doi.org/10.1007/s11042-009-0309-5

  • DOI: https://doi.org/10.1007/s11042-009-0309-5
