Abstract
The paper presents the concept, implementation, and a feasibility study of a user interface technique, named VAVS (“voice-assisted visual search”). VAVS employs user’s voice input for assisting the user in searching for objects of interest in complex displays. User voice input is compared with attributes of visually presented objects and, if there is a match, the matching object is highlighted to help the user visually locate the object. The paper discusses differences between, on the one hand, VAVS and, on the other hand, voice commands and multimodal input techniques. An interactive prototype implementing the VAVS concept and employing a standard voice recognition program is described. The paper reports an empirical study, in which an object location task was carried out with and without VAVS. It was found that the VAVS condition was associated with higher performance and use satisfaction. The paper concludes with a discussion of directions for future work.
Chapter PDF
References
Andrews, C., Endert, A., North, C.: Space to think: Large, high-resolution displays for sensemaking. In: Proc. CHI 2010, pp. 55–64. ACM Press, New York (2010)
Benyon, D., Turner, P., Turner, S.: Designing Interactive Systems: People, Activities, Contexts, Technologies. Addison-Wesley, NY (2005)
Bolt, R.A.: “Put-that-there”: Voice and gesture at the graphics interface. In: Proc. of the 7th Annual Conference on Computer Graphics and Interactive Techniques, pp. 262–270. ACM Press, New York (1980)
Fabiani, M., Low, K.A., Wee, E., Sable, J.J., Gratton, G.: Reduced Suppression or Labile Memory? Mechanisms of Inefficient Filtering of Irrelevant Information in Older Adults. J. Cogn. Neurosci. 18(4), 637–650 (2006)
Lau, T., Reed, D.: Speech-activated user interfaces and climbing Mt. Exascale. Communications of the ACM 52(6), 10–11 (2009)
Manaris, B.: Natural Language Processing: A Human-Computer Interaction Perspective. Advances in Computers 47, 2–68 (1998)
Nielsen, J.: Voice Interfaces: Assessing the Potential, http://www.useit.com/alertbox/20030127.html
Voida, S., Podlaseck, M., Kjeldsen, R., Pinhanez, C.: A study on the manipulation of 2D objects in a projector/camera-based augmented reality environment. In: Proc. CHI 2005, pp. 611–620. ACM Press, New York (2005)
Ware, C.: Information Visualization. Morgan Kaufmann, Amsterdam (2004)
What can I do with Windows Speech recognition?, http://windows.microsoft.com/en-US/windows7/What-can-I-do-with-Speech-Recognition
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 IFIP International Federation for Information Processing
About this paper
Cite this paper
Kaptelinin, V., Wåhlen, H. (2011). Speaking to See: A Feasibility Study of Voice-Assisted Visual Search. In: Campos, P., Graham, N., Jorge, J., Nunes, N., Palanque, P., Winckler, M. (eds) Human-Computer Interaction – INTERACT 2011. INTERACT 2011. Lecture Notes in Computer Science, vol 6946. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23774-4_37
Download citation
DOI: https://doi.org/10.1007/978-3-642-23774-4_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23773-7
Online ISBN: 978-3-642-23774-4
eBook Packages: Computer ScienceComputer Science (R0)