Analyzing the Video Popularity Characteristics of Large-Scale User Generated Content Systems
- 16 March 2009
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE/ACM Transactions on Networking
- Vol. 17 (5), 1357-1370
- https://doi.org/10.1109/tnet.2008.2011358
Abstract
User generated content (UGC), now with millions of video producers and consumers, is reshaping the way people watch video and TV. In particular, UGC sites are creating new viewing patterns and social interactions, empowering users to be more creative, and generating new business opportunities. Compared to traditional video-on-demand (VoD) systems, UGC services allow users to request videos from a potentially unlimited selection in an asynchronous fashion. To better understand the impact of UGC services, we have analyzed the world's largest UGC VoD system, YouTube, and a popular similar system in Korea, Daum Videos. In this paper, we first empirically show how UGC services are fundamentally different from traditional VoD services. We then analyze the intrinsic statistical properties of UGC popularity distributions and discuss opportunities to leverage the latent demand for niche videos (the so-called "Long Tail" potential), which is not reached today due to information filtering or other system scarcity distortions. Based on traces collected across multiple days, we study the popularity lifetime of UGC videos and the relationship between requests and video age. Finally, we measure the level of content aliasing and illegal content in the system and show the problems aliasing creates in ranking video popularity accurately. The results presented in this paper are crucial to understanding UGC VoD systems and may have major commercial and technical implications for site administrators and content owners.