Stereoscopic Video Coding
Coding of stereoscopic video sources has received significant interest recently. The MPEG committee decided to form an ad hoc group to define a new profile, referred to as the Multiview Profile (MVP). The importance of multiview video representation is also recognized by the MPEG-4 committee as one of the eight functionalities to be addressed in the near future. In this paper, we first review the technical results obtained using temporal scalability (disparity analysis) in MPEG-2, as pioneered by [9] and [10]. Building on temporal scalability, the concept is then generalized to an affine transformation that accounts for the deformation and foreshortening caused by the change of viewpoint. Estimation of the affine parameters is crucial for the performance of the estimator. We propose a novel technique to find a convergent solution that yields the least mean square error. Our results show that about 40 percent of the macroblocks in a picture benefit from the affine transformation. The additional computational complexity of our approach is minimal because a pyramidal scheme is used. In one of our experiments, only four iterations were necessary to find a convergent solution. The improvement in prediction gain is found to be around 0.77 dB.
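To make the idea of least-squares affine parameter estimation concrete, the following is a minimal sketch and not the method of the paper. The abstract does not describe the iterative pyramidal scheme, so this example substitutes a closed-form least-squares fit from point correspondences. It assumes the six-parameter model x' = a1·x + a2·y + a3, y' = a4·x + a5·y + a6. The function names `fit_affine` and `apply_affine`, the sample points, and the parameter values are all hypothetical and chosen only for illustration.

```python
import numpy as np


def fit_affine(src_pts, dst_pts):
    """Least-squares fit of x' = a1*x + a2*y + a3, y' = a4*x + a5*y + a6.

    src_pts, dst_pts: arrays of shape (N, 2) holding matched (x, y) points,
    with N >= 3 and the points not all collinear.
    Returns a (2, 3) array whose rows are (a1, a2, a3) and (a4, a5, a6).
    """
    src = np.asarray(src_pts, dtype=float)
    dst = np.asarray(dst_pts, dtype=float)
    # Design matrix with rows [x, y, 1], shared by both output coordinates.
    design = np.column_stack([src, np.ones(len(src))])
    # lstsq minimises the squared error for both columns of dst at once.
    params, *_ = np.linalg.lstsq(design, dst, rcond=None)
    return params.T


def apply_affine(params, pts):
    """Map (x, y) points through the (2, 3) affine parameter array."""
    pts = np.asarray(pts, dtype=float)
    return pts @ params[:, :2].T + params[:, 2]


# Hypothetical data: corners and centre of a 16x16 macroblock in one view,
# mapped into the other view with a slight horizontal stretch and shift.
true_params = np.array([[1.02, 0.01, -3.0],
                        [0.00, 1.00, 0.0]])
src = np.array([[0, 0], [15, 0], [0, 15], [15, 15], [8, 8]], dtype=float)
dst = apply_affine(true_params, src)

est = fit_affine(src, dst)
```

With noise-free correspondences such as these, `est` recovers `true_params` up to floating-point error. With noisy disparity vectors, the same call returns the parameters that minimise the mean square position error.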
Keywords: Affine Transformation, Enhancement Layer, Convergent Solution, Affine Parameter, Stereoscopic Video
- 1. ISO, “Information Technology - Generic Coding of Moving Pictures and Associated Audio Information: Video”, Recommendation H.262, Paris, May 1994.
- 2. F. Chassaing, B. Choquet, D. Pele, “A stereoscopic television system (3D-TV) and compatible transmission on a MAC channel (3D-MAC)”, Image Communication, Nov. 1991.
- 3. International Organisation for Standardisation, “Report of the ad hoc group on MPEG-2 applications for multi-viewpoint pictures”, ISO/IEC JTC/SC29/WG11 No. 861, March 1995.
- 4. International Organisation for Standardisation, “Status Report on the study of Multi-viewpoint pictures”, ISO/IEC JTC/SC29/WG11 No. 906, March 1995.
- 5. A. Zakhor, F. Lari, “Edge-Based 3-D Camera Motion Estimation with Application to Video Coding”, IEEE Trans. on Image Processing, Vol. 2, No. 4, 1993.
- 6. Roger Y. Tsai, Thomas S. Huang, “Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces”, IEEE Trans. on PAMI, Vol. 6, No. 1, Jan. 1984.
- 7. Randall B. Perlow, “The Application of Stereoscopic Techniques to High Resolution Radar Images for Improved Detection of Targets in Clutter”, Ph.D. Dissertation, University of Pennsylvania, 1994.
- 8. Tihao Chiang, “Hierarchical Coding of Digital Television”, Ph.D. Dissertation, Columbia University, 1995.
- 9. A. Puri, V. Kollarits, B.G. Haskell, “Stereoscopic Video Compression using temporal scalability”, SPIE Visual Communications and Image Processing, Taipei, Taiwan, May 1995.
- 10. B. L. Tseng, D. Anastassiou, “Compatible video coding of stereoscopic sequences using MPEG-2’s scalability and interlaced structure”, Workshop on HDTV’94, Torino, Oct. 1994.