Zoomable Video Playback on Mobile Devices by Selective Decoding

  • Conference paper
Advances in Multimedia Information Processing – PCM 2012 (PCM 2012)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7674)

Abstract

Modern mobile devices support multi-touch gestures that let users naturally zoom into and pan around Web pages, photos, and videos. When a user zooms into a video, only part of each video frame is displayed. Ideally, only the regions that the user is viewing would be decoded, reducing computation time (and hence increasing the playback frame rate) and power consumption. We call this selective decoding. We have implemented a system, consisting of an offline analyzer and a mobile video player, that implements selective decoding for the MPEG-4 Part 2 Simple Profile codec. The analyzer traces the various dependency relationships among the macroblocks of a given video and produces a meta-data file. The mobile video player supports zoom and pan gestures and uses the meta-data to trace the macroblocks needed to decode the region of interest (RoI). The player then uses a modified decoding process to decode macroblocks selectively based on the trace. Our experiments show that selective decoding can improve the playback frame rate by up to 193.3% and reduce energy consumption by up to 64.5%.
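
The tracing step described in the abstract can be illustrated with a small sketch. The abstract does not specify the meta-data format or the tracing algorithm, so everything below is an assumption made for illustration: macroblocks are identified by hypothetical `(frame, row, col)` tuples, the meta-data is modeled as a dictionary mapping each macroblock to the macroblocks it depends on, and the function names `roi_macroblocks` and `trace_needed` are invented here. The only codec fact relied on is that MPEG-4 Part 2 macroblocks cover 16×16 luma pixels.

```python
from collections import deque

MB_SIZE = 16  # MPEG-4 Part 2 macroblocks cover 16x16 luma pixels


def roi_macroblocks(frame, x, y, width, height, mb_size=MB_SIZE):
    """Return the (frame, row, col) macroblocks that overlap a pixel rectangle.

    The rectangle is the region of interest (RoI) in pixel coordinates.
    """
    first_col, last_col = x // mb_size, (x + width - 1) // mb_size
    first_row, last_row = y // mb_size, (y + height - 1) // mb_size
    return {
        (frame, row, col)
        for row in range(first_row, last_row + 1)
        for col in range(first_col, last_col + 1)
    }


def trace_needed(deps, roots):
    """Return every macroblock that must be decoded to reconstruct `roots`.

    `deps` is a hypothetical stand-in for the analyzer's meta-data: it maps
    a macroblock to the macroblocks it depends on. The trace is a
    breadth-first walk over those dependencies.
    """
    needed = set(roots)
    queue = deque(roots)
    while queue:
        mb = queue.popleft()
        for ref in deps.get(mb, ()):
            if ref not in needed:
                needed.add(ref)
                queue.append(ref)
    return needed


if __name__ == "__main__":
    # Toy dependency data for two frames; not taken from the paper.
    deps = {
        (1, 0, 1): [(0, 0, 0), (0, 0, 1)],
        (1, 0, 2): [(0, 0, 2), (1, 0, 1)],
        (0, 0, 1): [(0, 0, 0)],
    }
    roi = roi_macroblocks(frame=1, x=16, y=0, width=32, height=16)
    for mb in sorted(trace_needed(deps, roi)):
        print(mb)
```

In this toy example the RoI covers two macroblocks of frame 1, and the trace pulls in three further macroblocks from frame 0. Any macroblock outside the returned set could be skipped by a selective decoder, which is where the savings described in the abstract would come from.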

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Liu, F., Ooi, W.T. (2012). Zoomable Video Playback on Mobile Devices by Selective Decoding. In: Lin, W., et al. Advances in Multimedia Information Processing – PCM 2012. PCM 2012. Lecture Notes in Computer Science, vol 7674. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34778-8_23

  • DOI: https://doi.org/10.1007/978-3-642-34778-8_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34777-1

  • Online ISBN: 978-3-642-34778-8

  • eBook Packages: Computer Science, Computer Science (R0)
