VoxBoox: A System for Automatic Generation of Interactive Talking Books

Jain, Aanchal; Gupta, Gopal

doi:10.1007/978-3-540-73283-9_37

Aanchal Jain¹ &
Gopal Gupta¹

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 4556))

Included in the following conference series:

International Conference on Universal Access in Human-Computer Interaction

1798 Accesses
3 Citations

Abstract

We present the VoxBoox system, a system for making digital books accessible to visually impaired individuals via audio and voice. This is accomplished by automatically translating a book published in HTML to VoiceXML, and then further enhancing this VoiceXML rendering of the book to enable listener-controlled dynamic aural navigation. The VoxBoox system has the following salient features: (i) it leverages existing infrastructure since the book that is to be made accessible need only be published digitally using HTML on the visual Web, (ii) it is based on accepted Web standards of HTML and VoiceXML and thus books can be made accessible inexpensively, and (iii) it is user-centered in that the listener (the user) has complete control over (aural) navigation of the book. In this paper, we present details of the technologies that make the VoxBoox system possible, as well as the details of the system itself. A prototype of the VoxBoox system is operational.

Download to read the full chapter text

Chapter PDF

Babel VR: Multimodal Virtual Reality Environment for Shelf Browsing and Book Discovery

VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance

Automatic Generation of 3D Animations from Text and Images

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Bingham, H.: Digital Talking Books Expanded Document Type Definitions (2002) http://www.loc.gov/nls/z3986/v100/dtbook110doc.htm
Cookson, J. et al.: Digital Talking Books: Planning for the Future (1998) http://www.loc.gov/nls/dtb.html#activity
The DAISY Consortium http://www.daisy.org/about_us/default.asp
American National Standards Institute. Specification of the Digital Talking Book (2002) http://www.niso.org/standards/resources/Z39-86-2002.html#Strategy
Nichols, M., Wang, Q., Gupta, G.: A VoiceXML-based Spoken Scripting Language for Voice-based Web Navigation. In: Human Computer Interaction Conference, July 2005, Lawrence Erlbaum, Mahwah (2005)
Google Scholar
McGlashan, S., et al. (eds.) Voice Extensible Mark Language (Version 2.0) http://www.w3.org/TR/VoiceXML20/
Gupta, G., Sunderraman, S., Nichols, M.: DAWN: Dynamic Aural Web Navigation. In: Proceedings of, International Conference on Human Computer Interaction (2005)
Google Scholar
Annamalai, N., Gupta, G., Prabhakaran, B.: An Extensible Translator for translating HTML to VoiceXML. In: Proc 9th International Conference on Computers Helping People. LNCS, vol. 3118, pp. 339–346. Springer, Heidelberg (2004)
Google Scholar
Reddy, H., Annamalai, N., Gupta, G.: Listener-controlled Dynamic Navigation of VoiceXML documents. In: Proc 9th International Conference on Computers Helping People. LNCS, vol. 3118, pp. 337–354. Springer, Heidelberg (2004)
Google Scholar
VoiceXML Review http://www.voicexmlreview.org

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Texas at Dallas, Richardson, TX 75080,
Aanchal Jain & Gopal Gupta

Authors

Aanchal Jain
View author publications
You can also search for this author in PubMed Google Scholar
Gopal Gupta
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Constantine Stephanidis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jain, A., Gupta, G. (2007). VoxBoox: A System for Automatic Generation of Interactive Talking Books. In: Stephanidis, C. (eds) Universal Access in Human-Computer Interaction. Applications and Services. UAHCI 2007. Lecture Notes in Computer Science, vol 4556. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73283-9_37

Download citation

DOI: https://doi.org/10.1007/978-3-540-73283-9_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73282-2
Online ISBN: 978-3-540-73283-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics