Learning to Predict Where People Look with Tensor-Based Multi-view Learning

Pasupa, Kitsuchart; Szedmak, Sandor

doi:10.1007/978-3-319-26532-2_47

Learning to Predict Where People Look with Tensor-Based Multi-view Learning

Kitsuchart Pasupa¹⁷ &
Sandor Szedmak¹⁸

Conference paper
First Online: 12 November 2015

2076 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9489))

Abstract

Eye movements data collection is very expensive and laborious. Moreover, there are usually missing values. Assuming that we are collecting eye movements data on a set of images from different users (views). There is a possibility that we are not able to collect eye movements of all users on all images. One or more views are not represented in the image. We assume that the relationships among the views can be learnt from the complete items. The task is then to reproduce the missing part of the incomplete items from the relationships derived from the complete items and the known part of these items. Using the properties of tensor algebra we show that this problem can be formulated consistently as a regression type learning task. Furthermore, there is a maximum margin based optimisation framework where this problem can be solved in a tractable way. This problem is similar to learning to predict where human look. The proposed algorithm is proved to be more effective than well-known saliency detection techniques.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
The website of the authors provides an open source implementation to this problem.

References

Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell. 20, 1254–1259 (1998)
Article Google Scholar
Harel, J., Koch, C., Perona, P.: Graph-based visual saliency. In: Advances in Neural Information Processing Systems, pp. 545–552 (2006)
Google Scholar
Judd, T., Ehinger, K., Durand, F., Torralba, A.: Learning to predict where humans look. In: IEEE 12th International Conference on Computer Vision, pp. 2106–2113 (2009)
Google Scholar
Henderson, J.M., Brockmole, J.R., Castelhano, M.S., Mack, M.: Visual saliency does not account for eye movements during visual search in real-world scenes. In: Eye Movements: A Window on Mind and Brain, pp. 537–562 (2007)
Google Scholar
Liu, J., Musialski, P., Wonka, P., Ye, J.: Tensor completion for estimating missing values in visual data. IEEE Trans. Pattern Anal. Mach. Intell. 35, 208–220 (2013)
Article Google Scholar
Chen, C.Y., Grauman, K.: Inferring unseen views of people. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2011–2018 (2014)
Google Scholar
Itskov, M.: Tensor Algebra and Tensor Analysis for Engineers With Applications to Continuum Mechanics. 2nd edn. Springer, Heidelberg (2009)
Google Scholar
Synge, J., Schild, A.: Tensor Calculus. Dover, New York (1978)
MATH Google Scholar
Astikainen, K., Holm, L., Pitkänen, E., Szedmak, S., Rousu, J.: Towards structured output prediction of enzyme function. In: BMC Proceedings, vol. 2(Suppl 4:S2) (2008)
Google Scholar
Szedmak, S., De Bie, T., Hardoon, D.R.: A metamorphosis of canonical correlation analysis into multivariate maximum margin learning. In: The 15th European Symposium on Artificial Neural Networks, pp. 211–216 (2007)
Google Scholar
Briët, J., Harremoës, P.: Properties of classical and quantum Jensen-Shannon divergence. Phys. Rev. A 79, 052311 (2009)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Information Technology, King Mongkut’s Institute of Technology Ladkrabang, Bangkok, 10520, Thailand
Kitsuchart Pasupa
Institute of Computer Science, University of Innsbruck, 6020, Innsbruck, Austria
Sandor Szedmak

Authors

Kitsuchart Pasupa
View author publications
You can also search for this author in PubMed Google Scholar
Sandor Szedmak
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kitsuchart Pasupa .

Editor information

Editors and Affiliations

University of Istanbul, Istanbul, Turkey
Sabri Arik
University at Qatar, Doha, Qatar
Tingwen Huang
Tunku Abdul Rahman University College, Kuala Lumpur, Malaysia
Weng Kin Lai
University of Science Technology, Wuhan, China
Qingshan Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pasupa, K., Szedmak, S. (2015). Learning to Predict Where People Look with Tensor-Based Multi-view Learning. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science(), vol 9489. Springer, Cham. https://doi.org/10.1007/978-3-319-26532-2_47

Download citation

DOI: https://doi.org/10.1007/978-3-319-26532-2_47
Published: 12 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26531-5
Online ISBN: 978-3-319-26532-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics