Skip to main content

Measuring Bitrate and Quality Trade-Off in a Fast Region-of-Interest Based Video Coding

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6524))

Abstract

Prevailing video adaptation solutions change the quality of the video uniformly throughout the whole frame in the bitrate adjustment process; while region-of-interest (ROI)-based solutions selectively retains the quality in the areas of the frame where the viewers are more likely to pay more attention to. ROI-based coding can improve perceptual quality and viewer satisfaction while trading off some bandwidth. However, there has been no comprehensive study to measure the bitrate vs. perceptual quality trade-off so far. The paper proposes an ROI detection scheme for videos, which is characterized with low computational complexity and robustness, and measures the bitrate vs. quality trade-off for ROI-based encoding using a state-of-the-art H.264/AVC encoder to justify the viability of this type of encoding method. The results from the subjective quality test reveal that ROI-based encoding achieves a significant perceptual quality improvement over the encoding with uniform quality at the cost of slightly more bits. Based on the bitrate measurements and subjective quality assessments, the bitrate and the perceptual quality estimation models for non-scalable ROI-based video coding (AVC) are developed, which are found to be similar to the models for scalable video coding (SVC).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abdollahian, G., Taskiran, C.M., Pizlo, Z., Delp, E.J.: Camera Motion-based Analysis of User Generated Video. IEEE Trans. on Multimedia 12(1), 28–41 (2010)

    Article  Google Scholar 

  2. Ahmad, A.M.A.: Content-based Video Streaming Approaches and Challenges. In: Ibrahim, I.K. (ed.) Handbook of Research on Mobile Multimedia. Idea group Reference, London, pp. 357–367 (2006)

    Google Scholar 

  3. Azad, S., Song, W., Tjondronegoro, D.: Bitrate Modeling of Scalable Videos Using Quantization Parameter, Frame rate and Spatial Resolution. In: Proc. of ICASSP 2010, pp. 2334–2337. IEEE Press, Los Alamitos (2010)

    Google Scholar 

  4. Wandell, B.A.: Foundations of Vision. Sinauer, Sunderland (1995)

    Google Scholar 

  5. Chi, M., Chen, M., Yeh, C., Jhu, J.: Region-of-Interest Video Coding Based on Rate and Distortion Variations for H. 263+. Image Commun. 23(2), 127–142 (2008)

    Google Scholar 

  6. Ciubotaro, B., Muntean, G.-M., Ghinea, G.: Objective Assessment of Region of Interest-aware Adaptive Multimedia Streaming Quality. IEEE Trans. on Broadcasting. 55(2), 202–212 (1982)

    Article  Google Scholar 

  7. Deng, Y., Manjunath, B.S.: Unsupervised Segmentation of Color-Texture Regions in Images and Video. IEEE Trans. on Pattern Analysis and Machine Intelligence 23(8), 800–810 (2001)

    Article  Google Scholar 

  8. Ding, W., Lu, B.: Rate Control of MPEG Video Coding and Recording by Rate Quantization Modeling. IEEE Trans. on Circuits and Sys. for Video Technology 6, 12–20 (1996)

    Article  Google Scholar 

  9. Eadie, W.T., Drijard, D., James, F.E., Roos, M., Sadoulet, B.: Statistical Methods in Experimental Physics, pp. 269–271. North-Holland, Amsterdam (1971)

    MATH  Google Scholar 

  10. Gulliver, S., Ghinea, G.: Stars in Their Eyes: What Eye Tracking Reveals about Multimedia Perceptual Quality. IEEE Trans. on Sys., Man and Cybernetics 34(4), 472–482 (2004)

    Article  Google Scholar 

  11. Guo, C., Zhang, L.: A novel Multiresolution Spatiotemporal Saliency Detection Model and Its Applications in Image and Video Compression. IEEE Trans. on Image Processing. 19(1), 185–198 (2010)

    Article  MathSciNet  Google Scholar 

  12. X264 codec, http://www.videolan.org/developers/x264.html

  13. ITU-T: Subjective video quality assessment methods for multimedia applications. P.910 Recommendation (1999)

    Google Scholar 

  14. Ou, Y.-F., Ma, Z., Wang, Y.: A novel quality metric for compressed video considering both frame rate and quantization artefacts. In: Proc. of Intl. Workshop Video Processing and Quality Metrics for Consumer, VPQM (2009)

    Google Scholar 

  15. Peer, P., Solina, F.: An automatic human face detection method. In: Proc. of CVWW 1999, pp. 122–130 (1999)

    Google Scholar 

  16. Solina, F., Peer, P., Batagelj, B., Juvan, S.: 15 seconds of fame - an interactive computer vision-based art installation. In: Proc. of ICARCV 2002, pp. 198–204 (2002)

    Google Scholar 

  17. Sullivan, G.J., Topiwala, P., Luthra, A.: The H.264/AVC Advanced Video Coding Standard: Overview and Introduction to the Fidelity Range Extensions. In: Pro. of the SPIE Conf. on Applications of Digital Image Processing, pp. 1–22 (2004)

    Google Scholar 

  18. Sullivan, G.J., Wiegand, T., Schwarz, H.: Amd.3 Scalable video coding, ISO/IEC JTC1/SC29/WG11, MPEG08/N9574, Antalya, TR (2008)

    Google Scholar 

  19. Wang, Y., Ma, Z., Ou, Y.-F.: Modeling rate and perceptual quality of scalable videos as functions of quantization and frame rate and its application in scalable video adaptation. In: Proc. of 7th International Packet Video Workshop (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Azad, S., Song, W., Tjondronegoro, D. (2011). Measuring Bitrate and Quality Trade-Off in a Fast Region-of-Interest Based Video Coding. In: Lee, KT., Tsai, WH., Liao, HY.M., Chen, T., Hsieh, JW., Tseng, CC. (eds) Advances in Multimedia Modeling. MMM 2011. Lecture Notes in Computer Science, vol 6524. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17829-0_42

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17829-0_42

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17828-3

  • Online ISBN: 978-3-642-17829-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics