Developing a Voice User Interface with Improved Usability for People with Dysarthria

Hwang, Yumi; Shin, Daejin; Yang, Chang-Yeal; Lee, Seung-Yeun; Kim, Jin; Kong, Byunggoo; Chung, Jio; Kim, Sunhee; Chung, Minhwa

doi:10.1007/978-3-642-31534-3_18

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7383)

Abstract

This paper describes the development of a voice user interface (VUI) for Korean users with dysarthria. The development process, from choosing the target application to evaluating a prototype system, focuses on improving the usability of the interface by reflecting user needs. The first step is to decide on the target VUI application and its functions. Twenty-five participants with dysarthria (5 middle school students and 20 adults) are asked to list the devices they want to operate with a VUI and the purposes for which they would use them. Based on this user study, SMS sending, web searching, and voice dialing on mobile phones and tablet PCs are chosen as the target application and its functions. The second step is to design the target application's system so as to improve usability. When 120 people with dysarthria are asked to state the main problems of currently available VUI devices, speech recognition failure (88%) emerges as the main problem, which indicates that a higher speech recognition rate would improve usability. Therefore, to improve the recognition rate, a system based on isolated word recognition, with a customizable command list and a built-in word prediction function, is designed for the target VUI devices. The final step is to develop and evaluate a prototype system. In this study, a prototype is developed for Apple iOS and Android devices, and the system design is then modified based on the evaluation results of 5 evaluators with dysarthria.
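The abstract does not say how the customizable command list or the word prediction function is implemented. As a rough illustration only, the sketch below shows one way the two ideas could fit together: a small user-editable vocabulary whose entries are suggested by prefix and ranked by how often they have been used. The class name `CommandList`, its methods, and the frequency-based ranking are all assumptions made for this example, and it operates on text rather than on speech recognition output.

```python
from collections import Counter


class CommandList:
    """Hypothetical user-editable command vocabulary with prefix prediction.

    Not the paper's implementation: the names and the ranking rule are
    assumptions chosen to illustrate the general idea.
    """

    def __init__(self, commands=()):
        # Maps each command word to the number of times it has been used.
        self._usage = Counter()
        for word in commands:
            self.add(word)

    def add(self, word):
        # Register a command; an existing usage count is left unchanged.
        self._usage.setdefault(word, 0)

    def remove(self, word):
        # Drop a command from the vocabulary if it is present.
        self._usage.pop(word, None)

    def record_use(self, word):
        # Count one use of a known command so it ranks higher later.
        if word not in self._usage:
            raise KeyError(word)
        self._usage[word] += 1

    def predict(self, prefix, limit=3):
        # Suggest commands starting with the prefix, most used first,
        # breaking ties alphabetically.
        matches = [w for w in self._usage if w.startswith(prefix)]
        matches.sort(key=lambda w: (-self._usage[w], w))
        return matches[:limit]


commands = CommandList(["call", "camera", "search", "send"])
commands.record_use("camera")
print(commands.predict("ca"))  # ['camera', 'call']
```

Keeping the vocabulary small and user-defined is in line with the abstract's reasoning that restricting recognition to isolated words from a known list should raise the recognition rate; the ranking by usage count is simply one plausible prediction heuristic.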

Author information

Authors and Affiliations

Interdisciplinary Program in Cognitive Science, Seoul National University, Seoul, Korea
Yumi Hwang
Weavers TnC, Seoul, Korea
Daejin Shin
GO Design Studio, Seoul, Korea
Chang-Yeal Yang
E-ROOM Consulting, Seoul, Korea
Seung-Yeun Lee
Infinity Telecom, Seoul, Korea
Jin Kim & Byunggoo Kong
HCILAB, Seoul, Korea
Jio Chung
Center for Humanities and Information, Seoul National University, Seoul, Korea
Sunhee Kim
Department of Linguistics, Seoul National University, Seoul, Korea
Minhwa Chung

Editor information

Editors and Affiliations

University of Linz, Institut Integriert Studieren, Altenbergerstraße 69, 4040, Linz, Austria
Klaus Miesenberger
University of San Francisco, 2130 Fulton St, 94117, San Francisco, CA, USA
Arthur Karshmer
Support Centre for Students with Special Needs, Masaryk University, Botanická 68A, 602 00, Brno, Czech Republic
Petr Penaz
Institute “integriert studieren”, Vienna University of Technology, Favoritenstr. 11/029, 1040, Vienna, Austria
Wolfgang Zagler

About this paper

Cite this paper

Hwang, Y. et al. (2012). Developing a Voice User Interface with Improved Usability for People with Dysarthria. In: Miesenberger, K., Karshmer, A., Penaz, P., Zagler, W. (eds) Computers Helping People with Special Needs. ICCHP 2012. Lecture Notes in Computer Science, vol 7383. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31534-3_18

DOI: https://doi.org/10.1007/978-3-642-31534-3_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31533-6
Online ISBN: 978-3-642-31534-3
eBook Packages: Computer Science, Computer Science (R0)
