Abstract
Advances in voice-recognition platforms have led to new possibilities in deploying automated voice-interactive engines for Web content. We present Voice2Web, an architecture allowing to manage access to the resources of the World Wide Web using voice interaction. It rests on the VoiceXML standard and enables rapid composition of dynamic services querying the Web resources. We demonstrate its use on practical examples, discuss architecture implications and invite further platform experimentation.
Chapter PDF
References
Mobile cellular and Internet user penetration worldwide, ITU 1997-2007 ICT Market Information and Statistics, http://www.itu.int/ITU-D/ict/statistics/maps.html
Roe, D.B., Wilpon, J.G. (eds.): Voice Communication Between Humans and Machines. The National Academies Press, Washington (1994)
Voice Extensible Markup Language (VoiceXML) Version 2.0, W3C Recommendation (March 16, 2004), http://www.w3.org/TR/voicexml20/
Voice Extensible Markup Language (VoiceXML) 2.1, W3C Recommendation (June 19, 2007), http://www.w3.org/TR/voicexml21/
VoiceXML Forum, http://www.voicexml.org/
World Wide Web Consortium (W3C), http://www.w3.org/
González-Ferreras, C., Cardeñoso-Payo, V.: Building Voice Applications From Web Content. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 587–594. Springer, Heidelberg (2004)
Vankayala, R.R., Shi, H.: Dynamic Voice User Interface Using VoiceXML and Active Server Pages. In: Zhou, X., Li, J., Shen, H.T., Kitsuregawa, M., Zhang, Y. (eds.) APWeb 2006. LNCS, vol. 3841, pp. 1181–1184. Springer, Heidelberg (2006)
Agarwal, S., Kumar, A., Nanavati, A.A., Rajput, N.: The World Wide Telecom Web Browser. Poster at WWW 2008, Beijing, China, April 21-25 (2008)
Kumar, A., Rajput, N., Chakraborty, D., Agarwal, S.K., Nanavati, A.A.: WWTW: The World Wide Telecom Web. In: NSDR, Kyoto, Japan, August 27 (2007)
Goldman, E.L., Panttaja, E., Wojcikowski, A., Braudes, R.: Voice Portals - Where Theory Meets Practice. Int. Journal of Speech Technology 4, 227–240 (2001)
Agarwal, S.K., Chakraborty, D., Kumar, A., Nanavati, A.A., Rajput, N.: HSTP: Hyperspeech Transfer Protocol. In: ACM Hypertext, Manchester, UK, September 10-12 (2007)
Shrestha, S.: Mobile Web Browsing: Usability Study. In: Proceedings of ACM Mobility, Singapore, September 10-12 (2007)
Yin, M., Zhai, S.: The Benefits of Augmenting Telephone Voice Menu Navigation with Visual Browsing and Search. In: Proceedings of ACM CHI:Managing Voice Input, Montreal, Quebec, Canada, April 22-27 (2006)
Hanson, V.L., Richards, J.T., Lee, C.C.: Web Access for Older Adults: Voice Browsing? In: Stephanidis, C. (ed.) HCI 2007. LNCS, vol. 4554, pp. 904–913. Springer, Heidelberg (2007)
Christian, K., Kules, B., Shneiderman, B., Youssef, A.: A Comparison of Voice Controlled and Mouse Controlled Web Browsing. In: ASSETS 2000, Arlington, VA, USA (2000)
Ramakrishnan, I.V., Stent, A., Yang, G.: HearSay: Enabling Audio Browsing on Hypertext Content. In: WWW 2004, New York, NY, USA, May 17-22 (2004)
Sun, Z., Stent, A., Ramakrishnan, I.V.: Dialog Generation for Voice Browsing. In: W4A Workshop at WWW 2006, Edinburgh, UK, May 23-26 (2006)
Borodin, Y., Dausch., G., Ramakrishnan, I.V.: TeleWeb: Accessible Service for Web Browsing via Phone. In: W4A2009 collocated with WWW 2009, Madrid, Spain, April 20-21 (2009)
Frost, R.A., Ma, X., Shi, Y.: A browser for a public-domain SpeechWeb. In: Proceedings of the ACM WWW 2007, Banff, Alberta, Canada (2007)
Frost, R.A., et al.: MySpeechWeb: Software to Facilitate the Construction and Deployment of Speech Applications on the Web. In: Proceedings of ACM SIGACCESS ASSETS 2008, Halifax, Canada (October 2008)
Kolias, C., Kolias, V., Anagnostopoulos, I., Kambourakis, G., Kayafas, E.: A pervasive Wiki application based on VoiceXML. In: Proceedings of PETRA 2008, Athens, Greece, July 15-19, ACM, New York (2008)
Kawanaka, S., Masatomo, K., Takagi, H., Asakawa, C.: Accessibility Commons: A Metadata Repository for Web Accessibility. In: SIGWEB Newsletter, Issue Summer, June 2009. ACM, New York (2009)
Guoqiang, D., Yaoyao, L., Lingchao, H., Jianping, W.: Design and Implementation of Voice Web Pages for Online Shopping Based on .NET and Streaming Media. In: Management of e-Commerce and e-Government, 2008, ICMECG 2008, Nanchang, China, October 17-19, 2008, pp. 226–229 (2008)
SIPp test tool and traffic generator, http://sipp.source.forge
Asterisk Private Branch eXchange, http://www.asterisk.org/
IBM WebSphere Voice Server, http://www-01.ibm.com/software/voice/
Gulbrandsen, A., Vixie, P., Esibov, L.: A DNS RR for specifying the location of services (DNS SRV). IETF RFC 2782 (February 2000), http://tools.ietf.org/html/rfc2782
Shanmugham, S., Monaco, P., Eberman, B.: A Media Resource Control Protocol (MRCP), IETF RFC 4463 (April 2006), http://tools.ietf.org/html/rfc4463
Voice Browser Call Control: CCXML Version 1.0, W3C Working Draft (January 19, 2007), http://www.w3.org/TR/ccxml/
Voice2Web VoiceXML IDE, http://bolek.feld.cvut.cz:8080/vxmlide/
Yahoo! Weather, http://weather.yahoo.com
Weather Underground, http://www.wunderground.com
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 IFIP International Federation for Information Processing
About this paper
Cite this paper
Rudinsky, J., Mikula, T., Kencl, L., Dolezal, J., Garcia, X. (2009). Voice2Web: Architecture for Managing Voice-Application Access to Web Resources. In: Pfeifer, T., Bellavista, P. (eds) Wired-Wireless Multimedia Networks and Services Management. MMNS 2009. Lecture Notes in Computer Science, vol 5842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04994-1_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-04994-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04993-4
Online ISBN: 978-3-642-04994-1
eBook Packages: Computer ScienceComputer Science (R0)