Abstract
Video sequences have the rich texture information in practical applications, which makes the extraction of the semantic objects of interest more difficult. This paper presents a video object extraction algorithm based on depth map for multi-view video coding in three-dimensional video system. First of all, gradient operators are used to roughly segment color image into flat and texture regions with threshold, so object contours are extracted, while The OTSU algorithm is used to distinguish backgrounds and foregrounds in the color image, which can fill the pixels of semantic objects. Then the corresponding depth map is processed to highlight the human vision interested regions. At the same time, inter-frame difference is taken into account, which joins the moving objects into foregrounds, and extracts the interested region with morphological operations. Finally, object of block level is obtained though combination of operators outlined above and block-process though threshold. Compared with the existing algorithms, the proposed algorithm does not adopt popular clustering scheme but joins the OTSU algorithm, thus it can effectively avoid lots of computational complexity which the clustering algorithm brings. Experimental results show that the proposed algorithm can not only extract accurately the semantic objects, but also reduce the computational complexity. Whether the objects are static or not, the proposed algorithm can get good efficient segmentation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Xu, H., Younis, A., Kabuka, M.: Automatic Moving Object Extraction for Content-Based Applications. IEEE Transactions on Circuits and Systems for Video Technology 14(6), 796–812 (2004)
Haseyama, M., Yokoyama, Y.: Moving Object Extraction Using a Shape Constraint-Based Splitting Active Contour Model. In: 2005 IEEE Int. Conf. on Image Processing, Genoa, Italy, vol. 3, pp. 1260–1263 (2005)
Yang, H., Chang, Y.L., Huo, J., et al.: Depth Characteristic-Based Image Region Partition and Regional Disparity Estimation for Multi-View Video Coding. ACTA Optica Sinica 28, 1073–1078 (2010)
Sum, K., Cheung, P.: A Fast Parametric Snake Model with Enhanced Concave Object Extraction Capability. In: IEEE International Symposium on Signal Processing and Information Technology, pp. 454–457 (2007)
Song, X., Fan, G.: Joint Key-Frame Extraction and Object Segmentation for Content-Based Video Analysis. IEEE Transactions on Circuits and Systems for Video Technology 16, 904–914 (2006)
Pan, C., Zhang, Z., Shi, X., Shen, L.: An Approach to Video Object Extraction in the Compressed Domain. Journal of Image and Graphics 14, 904–914 (2009)
Zhang, Y.: Research on MVD Representation Based Multi-view Video Coding. Ph.D. dissertation, Graduate School of Chinese Academy of Sciences, Beijing (2010)
Mao, X.Y., Yu, M., Wang, X., et al.: Stereoscopic Image Quality Assessment Model with Three-Component Weighted Structure Similarity. In: International Conference on Audio, Language and Image Processing, Shanghai (2010)
Mei, T., Hua, X., Zhu, C., et al.: Home Video Visual Quality Assessment with Spatio-Temporal Factors. IEEE Transactions on Circuits System for Video Technology 17(6), 699–706 (2007)
Lu, Z., Lin, W., Yang, X., et al.: Modeling Visual Attention’s Modulatory Aftereffects on Visual Sensitivity and Quality Evaluation. IEEE Transactions on Image Process 14(11), 1928–1942 (2009)
Zitnick, C., Kang, S., Uyttendaele, M.: High-Quality Video View Interpolation Using a Layered Representation. ACM SIGGRAPH and ACM Transactions on Graphics 4, 600–608 (2004)
Tanimoto, M., Fujii, T., Fukushima, N.: 1D Parallel Test Sequences for MPEG-FTV. ISO/IEC JTC1/SC29/WG11, M15378, Archamps, France (April 2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag GmbH Berlin Heidelberg
About this paper
Cite this paper
Xiaoliang, Z. et al. (2012). A Video Object Extraction Algorithm Based on Depth Map for Multi-view Video. In: Zhang, Y. (eds) Future Wireless Networks and Information Systems. Lecture Notes in Electrical Engineering, vol 143. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27323-0_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-27323-0_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27322-3
Online ISBN: 978-3-642-27323-0
eBook Packages: EngineeringEngineering (R0)