Abstract
We present VoxBoox, a system that makes digital books accessible to visually impaired individuals through audio and voice. VoxBoox automatically translates a book published in HTML into VoiceXML, then enhances the VoiceXML rendering to support listener-controlled dynamic aural navigation. The system has three salient features: (i) it leverages existing infrastructure, since the book need only be published in HTML on the visual Web; (ii) it is based on the accepted Web standards HTML and VoiceXML, so books can be made accessible inexpensively; and (iii) it is user-centered, in that the listener (the user) has complete control over aural navigation of the book. This paper describes the technologies that make VoxBoox possible, as well as the system itself. A prototype of VoxBoox is operational.
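To make the HTML-to-VoiceXML translation step concrete, the following is a minimal sketch and not the VoxBoox translator itself. It assumes a book page made only of `h1`/`h2` headings and `p` paragraphs, and it maps each heading to one VoiceXML `form` whose prompts are read in order. The names `SectionExtractor` and `html_to_vxml` and the `secN` form ids are hypothetical, introduced here for illustration. Listener-controlled navigation is not covered; the sketch only chains sections with `goto`.

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape


class SectionExtractor(HTMLParser):
    """Collects [heading, [paragraphs]] pairs from a simple HTML page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.sections = []  # each entry: [title, [paragraph, ...]]
        self._tag = None    # the h1/h2/p element currently being read, if any
        self._buf = []      # text fragments collected for that element

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "h2", "p"):
            self._tag = tag
            self._buf = []

    def handle_data(self, data):
        if self._tag is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag != self._tag:
            return  # closing an inline tag such as <em>; keep collecting text
        text = " ".join("".join(self._buf).split())  # normalize whitespace
        if tag in ("h1", "h2"):
            self.sections.append([text, []])
        elif text:
            if not self.sections:
                # Paragraph before any heading: file it under a placeholder title.
                self.sections.append(["Untitled", []])
            self.sections[-1][1].append(text)
        self._tag = None


def html_to_vxml(html):
    """Render each extracted section as a VoiceXML 2.0 form (illustrative mapping only)."""
    parser = SectionExtractor()
    parser.feed(html)
    parser.close()
    out = ['<?xml version="1.0" encoding="UTF-8"?>',
           '<vxml version="2.0" xmlns="http://www.w3.org/2001/vxml">']
    count = len(parser.sections)
    for i, (title, paragraphs) in enumerate(parser.sections):
        out.append(f'  <form id="sec{i}">')
        out.append('    <block>')
        out.append(f'      <prompt>{escape(title)}</prompt>')
        for paragraph in paragraphs:
            out.append(f'      <prompt>{escape(paragraph)}</prompt>')
        if i + 1 < count:
            # Fall through to the next section once this one has been read.
            out.append(f'      <goto next="#sec{i + 1}"/>')
        out.append('    </block>')
        out.append('  </form>')
    out.append('</vxml>')
    return "\n".join(out)


if __name__ == "__main__":
    page = "<h1>Chapter 1</h1><p>It was a dark &amp; stormy night.</p><h1>Chapter 2</h1><p>Morning came.</p>"
    print(html_to_vxml(page))
```

Giving each section its own form id is one plausible design choice: a navigation layer could later jump to any section by id, which is the kind of hook that listener-controlled navigation would need.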
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jain, A., Gupta, G. (2007). VoxBoox: A System for Automatic Generation of Interactive Talking Books. In: Stephanidis, C. (eds) Universal Access in Human-Computer Interaction. Applications and Services. UAHCI 2007. Lecture Notes in Computer Science, vol 4556. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73283-9_37
DOI: https://doi.org/10.1007/978-3-540-73283-9_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73282-2
Online ISBN: 978-3-540-73283-9
eBook Packages: Computer Science, Computer Science (R0)