Abstract
We present a multimodal media center user interface with a hands-free speech recognition input method for users with physical disabilities. In addition to speech input, the application features a zoomable context + focus graphical user interface and several other modalities, including speech output, haptic feedback, and gesture input. These features have been developed in co-operation with representatives from the target user groups. In this article, we focus on the speech input interface and its evaluations. We discuss the user interface design and results from a long-term pilot study taking place in homes of physically disabled users, and compare the results to a public pilot study and laboratory studies carried out with non-disabled users.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bederson, B.B.: Fisheye menus. In: 13th Annual ACM Symposium on User interface Software and Technology, pp. 217–225. ACM, New York (2000)
Bederson, B.B., Clamage, A., Czerwinski, M.P., Robertson, G.G.: DateLens: A fisheye calendar interface for PDAs. ACM Trans. Comput.-Hum. Interact. 11(1), 90–119 (2004)
Bederson, B.B., Grosjean, J., Meyer, J.: Toolkit Design for Interactive Structured Graphics. IEEE Trans. Softw. Eng. 30(8), 535–546 (2004)
Berglund, A., Qvarfordt, P.: Error Resolution Strategies for Interactive Television Speech Interfaces. In: Rauterberg, M., Menozzi, M., Wesson, J. (eds.) Human-Computer Interaction: INTERACT 2003, pp. 105–112. IOS Press, Amsterdam (2003)
Hassenzahl, M.: The thing and I: understanding the relationship between user and product. In: Blythe, M.A., Overbeeke, K., Monk, A.F., Wright, P.C. (eds.) Funology: From Usability To Enjoyment, pp. 31–42. Kluwer Academic Publishers, Norwell (2004)
Ibrahim, A., Johansson, P.: Multimodal Dialogue Systems: A Case Study for Interactive TV. In: Carbonell, N., Stephanidis, C. (eds.) UI4ALL 2002. LNCS, vol. 2615, pp. 209–218. Springer, Heidelberg (2003)
Jetter, H.-C., Gerken, J.: A Simplified Model of user Experience for Practical Application. In: Second COST294-MAUSE International Open Workshop “User eXperience - Towards a unified view” (2006)
Pertilä, P., Korhonen, T., Visa, A.: Measurement Combination for Acoustic Source Localization in a Room Environment. EURASIP Journal on Audio, Speech, and Music Processing 2008, Article ID 278185 (2008)
Reichheld, F.F.: The One Number You Need to Grow. Harvard Business Review 81, 47–54 (2003)
Soronen, H., Turunen, M., Hakulinen, J.: Voice Commands in Home Environment - a Consumer Survey. In: INTERSPEECH 2008, pp. 2078–2081. ISCA (2008)
Turunen, M., Hakulinen, J., Melto, A., Heimonen, T., Laivo, T., Hella, J.: SUXES – User Experience Evaluation Method for Spoken and Multimodal Interaction. In: INTERSPEECH 2009. ISCA (2009)
Turunen, M., Hakulinen, J., Melto, A., Hella, J., Rajaniemi, J.-P., Mäkinen, E., Rantala, J., Heimonen, T., Laivo, T., Soronen, H., Hansen, M., Valkama, P., Miettinen, T., Raisamo, R.: Speech-based and Multimodal Media Center for Different User Groups. In: INTERSPEECH 2009. ISCA (2009)
Turunen, M., Melto, A., Hello, J., Heimonen, T., Hakulinen, J., Mäkinen, E., Laivo, T., Soronen, H.: User Expectations and User Experience with Different Modalities in a Mobile Phone Controlled Home Entertainment System. In: 11th International Conference on Human-Computer Interaction with Mobile Devices and Services, pp. 1–4. ACM, New York (2009)
Turunen, M., Melto, A., Hakulinen, J., Kainulainen, A., Heimonen, T.: User Expectations, User Experiences and Objective Metrics in a Multimodal Mobile Application. In: Third Workshop on Speech in Mobile and Pervasive Environments (2008)
Turunen, M., Hakulinen, J., Kainulainen, A.: Evaluation of a Spoken Dialogue System with Usability Tests and Long-term Pilot Studies: Similarities and Differences. In: INTERSPEECH 2006, pp. 1057–1060. ISCA (2006)
TÄPLÄ – Ambient Intelligence based on Sound, Speech and Multisensor Interaction, http://tapla.cs.tut.fi
Wittenburg, K., Lanning, T., Schwenke, D., Shubin, H., Vetro, A.: The prospects for unrestricted speech input for TV content search. In: Working Conference on Advanced Visual Interfaces, pp. 352–359. ACM, New York (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Turunen, M. et al. (2010). Accessible Speech-Based and Multimodal Media Center Interface for Users with Physical Disabilities. In: Esposito, A., Campbell, N., Vogel, C., Hussain, A., Nijholt, A. (eds) Development of Multimodal Interfaces: Active Listening and Synchrony. Lecture Notes in Computer Science, vol 5967. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12397-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-12397-9_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12396-2
Online ISBN: 978-3-642-12397-9
eBook Packages: Computer ScienceComputer Science (R0)