Abstract
The concept of Regions of Interest (ROIs) within a video sequence is useful for many application scenarios. This paper concentrates on the exploitation of ROI coding within the H.264/AVC specification by making use of Flexible Macroblock Ordering. It shows how ROIs can be coded in an H.264/AVC compliant bitstream and how the MPEG-21 BSDL framework can be used for the extraction of the ROIs.
The first type of ROI extraction that is described, is simply dropping the slices that are not part of one of the ROIs. The second type is the replacement of these slices with so-called placeholder slices, the latter being implemented as P slices containing only macroblocks that are marked as ‘skipped’. The exploitation of ROI scalability, as achieved by the presented methodology, illustrates the possibilities that are offered by the single-layered H.264/AVC specification for content adaptation.
The results show that the bit rate needed to transmit the adapted bitstreams can be reduced significantly. Especially in the case of a static camera and a fixed background, this bit rate reduction has very little impact on the visual quality. Another advantage of the adaptation process is the fact that the execution speed of the receiving decoder fairly increases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cimprich, P.: Streaming transformations for XML (STX) version 1.0 working draft (2004), http://stx.sourceforge.net/documents/spec-stx-20040701.html
De Neve, W., De Schrijver, D., Van de Walle, D., Lambert, P., Van de Walle, R.: Description-based substitution methods for emulating temporal scalability in state-of-the-art video coding formats. In: Proceedings of the 7th International Workshop on Image Analysis for Multimedia Interactive Services, Korea (accepted, 2006)
De Neve, W., Van Deursen, D., De Schrijver, D., De Wolf, K., Van de Walle, R.: Using Bitstream Structure Descriptions for the Exploitation of Multi-layered Temporal Scalability in H.264/AVC’s Base Specification. In: Ho, Y.-S., Kim, H.J. (eds.) PCM 2005. LNCS, vol. 3767, pp. 641–652. Springer, Heidelberg (2005)
De Schrijver, D., Poppe, C., Lerouge, S., Neve, W.D., Walle, R.V.d.: MPEG-21 bitstream syntax descriptions for scalable video codecs (article in press). Multimedia Systems (2006), http://dx.doi.org/10.1007/s00530-006-0021-5
Devillers, S., Timmerer, C., Heuer, J., Hellwagner, H.: Bitstream syntax description-based adaptation in streaming and constrained environments. IEEE Trans. Multimedia 7(3), 463–470 (2005)
Dhondt, Y., Lambert, P., Notebaert, S., Van de Walle, R.: Flexible macroblock ordering as a content adaptation tool in H.264/AVC. In: Proceedings of the SPIE/Optics East conference, Boston (2005)
Hannuksela, M.M., Wang, Y.-K., Gabbouj, M.: Isolated regions in video coding. IEEE Transactions on Multimedia 6(2), 259–267 (2004)
Ichimura, D., Honda, Y., Sun, H., Lee, M., Shen, S.: A tool for interactive ROI scalability. In: JVT-Q020 (2005), http://ftp3.itu.ch/av-arch/jvt-site/2005_10_Nice/JVT-Q020.doc
ISO/IEC JTC1/SC29/WG11 : Applications and requirements for scalable video coding. ISO/IEC JTC1/SC29/WG11 N6880 (2005), http://www.chiariglione.org/mpeg/working_documents/mpeg-04/svc/requirements.zip
Kay, M.: XSLT Programmer’s Reference, 2nd edn. Wrox Press Ltd., Birmingham, UK (2001)
Lambert, P., De Neve, W., Dhondt, Y., Van de Walle, R.: Flexible macroblock ordering in H.264/AVC. Journal of Visual Communication and Image Representation 17(2), 358–375 (2006)
Li, W.: Overview of fine granularity scalability in MPEG-4 video standard. IEEE Trans. Circuits Syst. Video Technol. 11(3), 301–317 (2001)
Reichel, J., Schwarz, H., Wien, M.: Joint scalable video model JSVM-4. JVT-Q202 (2005), http://ftp3.itu.ch/av-arch/jvt-site/2005_10_Nice/JVT-Q202.zip
Taubman, D., Marcellin, M.: JPEG 2000: Image Compression Fundamentals, Standards and Practice. Kluwer Academic Publishers, Dordrecht (2002)
Thang, T.C., Kim, D., Bae, T.M., Kang, J.W., Ro, Y.M., Kim, J.G.: Show case of ROI extraction using scalability information SEI message. JVT-Q077 (2005), http://ftp3.itu.ch/av-arch/jvt-site/2005_10_Nice/JVT-Q077.doc
Vetro, A., Timmerer, C.: Text of ISO/IEC 21000-7 FCD - part 7: Digital item adaptation. ISO/IEC JTC1/SC29/WG11 N5845 (2003), http://www.chiariglione.org/mpeg/working_documents/mpeg-21/dia/dia_fcd.zip
Wiegand, T., Sullivan, G.J., Bjøntegaard, G., Luthra, A.: Overview of the H.264/AVC video coding standard. IEEE Trans. Circuits Syst. Video Technol. 13(7), 560–576 (2003)
Yin, P., Boyce, J., Pandit, P.: FMO and ROI scalability. JVT-Q029 (2005), http://ftp3.itu.ch/av-arch/jvt-site/2005_10_Nice/JVT-Q029.doc
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lambert, P., De Neve, W., De Schrijver, D., Dhondt, Y., Van de Walle, R. (2008). Using Placeholder Slices and MPEG-21 BSDL for ROI Extraction in H.264/AVC FMO-Encoded Bitstreams. In: Filipe, J., Obaidat, M.S. (eds) E-Business and Telecommunication Networks. ICETE 2006. Communications in Computer and Information Science, vol 9. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70760-8_14
Download citation
DOI: https://doi.org/10.1007/978-3-540-70760-8_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70759-2
Online ISBN: 978-3-540-70760-8
eBook Packages: Computer ScienceComputer Science (R0)