Dimensionality Reduction in HRTF by Using Multiway Array Analysis

  • Martin Rothbucher
  • Hao Shen
  • Klaus Diepold
Part of the Cognitive Systems Monographs book series (COSMOS, volume 6)


In a human centered robotic system, it is important to provide the robotic platform with multimodal human-like sensing, e.g. haptic, vision and audition, in order to improve interactions between the human and the robot. Recently, Head Related Transfer Functions (HRTFs) based techniques have become a promising methodology for robotic binaural hearing, which is a most prominent concept in human robot communication. In complex and dynamical applications, due to its high dimensionality, it is inefficient to utilize the originial HRTFs. To cope with this difficulty, Principle Component Analysis (PCA) has been successfully used to reduce the dimensionality of HRTF datasets. However, it requires in general a vectorization process of the original dataset, which is a three-way array, and consequently might cause loss of structure information of the dataset. In this paper we apply two multi-way array analysis methods, namely the Generalized Low Rank Approximations of Matrices (GLRAM) and the Tensor Singular Value Decomposition (Tensor-SVD), to dimensionality reductions in HRTF based applications. Our experimental results indicate that an optimized GLRAM outperforms significantly the PCA and performs nearly as well as Tensor-SVD with less computational complexity.


Dimensionality Reduction Principle Component Analysis Sound Localization Robotic Platform Dimensionality Reduction Method 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Algazi, V.R., Duda, R.O., Thompson, D.M., Avendano, C.: The CIPIC HRTF database. In: IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, pp. 21–24 (2001)Google Scholar
  2. 2.
    Blauert, J.: An introduction to binaural technology. In: Gilkey, G., Anderson, T. (eds.) Binaural and Spatial Hearing, pp. 593–609. Lawrence Erlbaum, Hilldale (1997)Google Scholar
  3. 3.
    Grindlay, G., Vasilescu, M.A.O.: A multilinear (tensor) framework for HRTF analysis and synthesis. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, Honolulu, Hawaii, USA, vol. 1, pp. 161–164 (2007)Google Scholar
  4. 4.
    Hu, L., Chen, H., Wu, Z.: The estimation of personalized HRTFs in individual VAS. In: Proceedings of the 2008 Fourth International Conference on Natural Computation, Washington, DC, USA, pp. 203–207 (2008)Google Scholar
  5. 5.
    Jolliffe, I.T.: Principal Component Analysis, 2nd edn. Springer, Heidelberg (2002)zbMATHGoogle Scholar
  6. 6.
    Kistler, D.J., Wightman, F.L.: A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. Journal of the Acoustical Society of America 91(3), 1637–1647 (1992)CrossRefGoogle Scholar
  7. 7.
    Lathauwer, L., Moor, B., Vandewalle, J.: A multilinear singular value decomposition. SIAM Journal on Matrix Analysis and Applications 21(4), 1253–1278 (2000)zbMATHCrossRefMathSciNetGoogle Scholar
  8. 8.
    Middlebrooks, J., Green, D.: Observations on a principal components analysis of head-related transfer functions. Journal of the Acoustical Society of America 92(1), 597–599 (1992)CrossRefGoogle Scholar
  9. 9.
    Savas, B., Lim, L.: Best multilinear rank approximation of tensors with quasi-Newton methods on Grassmannians. Technical Report LITH-MAT-R-2008-01-SE, Department of Mathematics, Linköpings University (2008)Google Scholar
  10. 10.
    Sheehan, B.N., Saad, Y.: Higher order orthogonal iteration of tensors (HOOI) and its relation to PCA and GLRAM. In: Proceedings of the 2007 SIAM International Conference on Data Mining, Minnenpolis, Minnesota, USA, pp. 355–366 (2007)Google Scholar
  11. 11.
    Usman, M., Keyrouz, F., Diepold, K.: Real time humanoid sound source localization and tracking in a highly reverberant environment. In: Proceedings of 9th International Conference on Signal Processing, Beijing, China, pp. 2661–2664 (2008)Google Scholar
  12. 12.
    Wenzel, E., Arruda, M., Kistler, D., Wightman, F.: Localization using nonindividualized head-related transfer functions. Journal of the Acoustical Society of America 94, 111–123 (1993)CrossRefGoogle Scholar
  13. 13.
    Ye, J.: Generalized low rank approximations of matrices. Machine Learning 61(1-3), 167–191 (2005)zbMATHCrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Martin Rothbucher
    • 1
  • Hao Shen
    • 1
  • Klaus Diepold
    • 1
  1. 1.Institute for Data ProcessingTechnische Universität MünchenMünchenGermany

Personalised recommendations