Dimensionality Reduction in HRTF by Using Multiway Array Analysis
In a human-centered robotic system, it is important to provide the robotic platform with multimodal human-like sensing, e.g. haptics, vision and audition, in order to improve interaction between the human and the robot. Recently, techniques based on Head Related Transfer Functions (HRTFs) have become a promising methodology for robotic binaural hearing, which is a prominent concept in human-robot communication. In complex and dynamic applications, using the original HRTFs directly is inefficient because of their high dimensionality. To cope with this difficulty, Principal Component Analysis (PCA) has been successfully used to reduce the dimensionality of HRTF datasets. However, PCA generally requires vectorizing the original dataset, which is a three-way array, and may consequently lose structural information in the dataset. In this paper we apply two multi-way array analysis methods, namely Generalized Low Rank Approximations of Matrices (GLRAM) and the Tensor Singular Value Decomposition (Tensor-SVD), to dimensionality reduction in HRTF-based applications. Our experimental results indicate that an optimized GLRAM significantly outperforms PCA and performs nearly as well as Tensor-SVD at lower computational complexity.
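To make the contrast between vectorized PCA and a multi-way method concrete, the following is a minimal NumPy sketch, not the paper's implementation. It assumes the dataset is stored as an array of shape `(n, r, c)` (one `r x c` matrix per sample), uses the standard alternating eigenvector updates for GLRAM, and leaves any mean-centering for GLRAM to the caller. The function names `glram`, `glram_reconstruct` and `pca_reconstruct`, the identity-based initialization and the fixed iteration count are illustrative assumptions.

```python
import numpy as np

def _top_eigvecs(M, k):
    # eigh returns eigenvalues in ascending order, so reverse the columns
    # to put the eigenvectors of the k largest eigenvalues first.
    _, V = np.linalg.eigh(M)
    return V[:, ::-1][:, :k]

def glram(A, l1, l2, n_iter=20):
    """Alternating GLRAM sketch: A_i is approximated by L @ core_i @ R.T.

    A has shape (n, r, c); L is (r, l1), R is (c, l2), cores is (n, l1, l2).
    """
    n, r, c = A.shape
    R = np.eye(c)[:, :l2]                          # assumed initial guess
    for _ in range(n_iter):
        AR = A @ R                                 # (n, r, l2)
        M_L = np.einsum('nij,nkj->ik', AR, AR)     # sum_i A_i R R^T A_i^T
        L = _top_eigvecs(M_L, l1)
        LA = np.einsum('ri,nrc->nic', L, A)        # L^T A_i, shape (n, l1, c)
        M_R = np.einsum('nic,nid->cd', LA, LA)     # sum_i A_i^T L L^T A_i
        R = _top_eigvecs(M_R, l2)
    cores = np.einsum('ri,nrc,cj->nij', L, A, R)   # L^T A_i R
    return L, R, cores

def glram_reconstruct(L, cores, R):
    # Map the reduced cores back to the original r x c matrices.
    return np.einsum('ri,nij,cj->nrc', L, cores, R)

def pca_reconstruct(A, k):
    """PCA baseline: each r x c matrix is flattened to a vector of length r*c."""
    n, r, c = A.shape
    X = A.reshape(n, r * c)                        # vectorization step
    mean = X.mean(axis=0)
    U, s, Vt = np.linalg.svd(X - mean, full_matrices=False)
    Xk = (U[:, :k] * s[:k]) @ Vt[:k] + mean        # rank-k reconstruction
    return Xk.reshape(n, r, c)
```

In this sketch GLRAM only ever solves eigenproblems of size `r x r` and `c x c`, whereas the PCA baseline factorizes a matrix with `r*c` columns, which is one way to see why keeping the array structure can be cheaper. Comparing `np.linalg.norm(A - glram_reconstruct(L, cores, R))` with `np.linalg.norm(A - pca_reconstruct(A, k))` at a matched storage budget would be a natural way to use it.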
Keywords: Dimensionality Reduction; Principal Component Analysis; Sound Localization; Robotic Platform; Dimensionality Reduction Method