Abstract
This paper describes the development of a voice user interface (VUI) for Korean users with dysarthria. The development process, from target application decisions to prototype system evaluation, focuses on improving the usability of the interface by reflecting user needs. The first step of development is to decide target VUI application and its functions. 25 dysarthric participants (5 middle school students and 20 adults) are asked to list the devices they want to use with a VUI interface and what purposes they would use VUI devices for. From this user study, SMS sending, web searching and voice dialing on mobile phones and tablet PCs are decided as the target application and its functions. The second step is to design the system of the target application in order to improve usability. 120 people with dysarthria are asked to state the main problems of currently available VUI devices, and it is found that speech recognition failure (88%) is the main problem. This result indicates high speech recognition rate will improve usability. Therefore, to improve the recognition rate, an isolated word recognition based system with a customizable command list and a built-in word prediction function is designed for the target VUI devices. The final step is to develop and evaluate a prototype system. In this study, a prototype is developed for Apple iOS and Android platform devices, and then the system design is modified based on the evaluation results of 5 dysarthric evaluators.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Darley, F.L., Aronson, A.E., Brown, J.R.: Differential diagnostic patterns of dysarthria. Journal of Speech and Hearing Research 12, 246–269 (1969)
Kotler, A., Thomas-Stonell, N.: Effects of speech training on the accuracy of speech recognition for an individual with a speech impairment. Augmentative and Alternative Communication 13, 71–80 (1997)
Hux, K., Rankin-Erickson, J., Manasse, N., Lauritzen, E.: Accuracy of three speech recognition systems: Case study of dysarthric speech. Augmentative and Alternative Communication 16(3), 186–196 (2000)
Rosen, K., Yamplosky, S.: Automatic speech recognition and a review of its functioning with dysarthric speech. Augmentative and Alternative Communication 16(1), 48–60 (2000)
Blaney, B., Wilson, J.: Acoustic variability in dysarthria and computer speech recognition. Clinical Linguistics & Phonetics 14, 307–327 (2000)
Rudzicz, F.: Production knowledge in the recognition of dysarthric speech. Doctoral dissertation, Department of Computer Science of University of Toronto (2011), retrieved, http://www.cs.toronto.edu/~frank/Download/Papers/
Hamidi, F., Baljko, M., Livingston, N., Spalteholz, L.: CanSpeak: a Customizable Speech Interface for People with Dysarthric Speech. In: Miesenberger, K., Klaus, J., Zagler, W., Karshmer, A. (eds.) ICCHP 2010. LNCS, vol. 6179, pp. 605–612. Springer, Heidelberg (2010)
Hosom, J.P., Jakobs, T., Baker, A., Fager, S.: Automatic speech recognition for assistive writing in speech supplemented word prediction. In: Proceedings of Interspeech 2010, pp. 2674–2677 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hwang, Y. et al. (2012). Developing a Voice User Interface with Improved Usability for People with Dysarthria. In: Miesenberger, K., Karshmer, A., Penaz, P., Zagler, W. (eds) Computers Helping People with Special Needs. ICCHP 2012. Lecture Notes in Computer Science, vol 7383. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31534-3_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-31534-3_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31533-6
Online ISBN: 978-3-642-31534-3
eBook Packages: Computer ScienceComputer Science (R0)