Abstract
This paper lays the foundation for a new delivery paradigm for web-accessible content and functionality, i.e., conversational interaction. Instead of asking users to read text, click through links and type on the keyboard, the vision is to enable users to “speak to a website” and to obtain natural language, spoken feedback. The paper describes how state-of-the-art chatbot technology can enable a dialog between the user and the website, proposes a reference architecture for the automated inference of site-specific chatbots able to mediate between the user and the website, and discusses open challenges and research questions. The envisioned, bidirectional dialog paradigm advances current screen reader technology and aims to benefit both regular users in eyes-free usage scenarios as well as visually impaired users in everyday scenarios.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Ahmed, F., Borodin, Y., Soviak, A., Islam, M., Ramakrishnan, I., Hedgpeth, T.: Accessible skimming: faster screen reading of web pages. In: UIST, pp. 367–378. ACM (2012)
Akpinar, M.E., Yeşilada, Y.: Discovering visual elements of web pages and their roles: users’ perception. Interact. Comput. 29(6), 845–867 (2017)
Amershi, S., et al.: Guidelines for human-AI interaction. In: Proceedings of CHI 2019, p. 3. ACM (2019)
Asakawa, C.: What’s the web like if you can’t see it? In: W4A, pp. 1–8. ACM (2005)
Ashok, V., Borodin, Y., Puzis, Y., Ramakrishnan, I.: Capti-speak: a speech-enabled web screen reader. In: W4A, p. 22. ACM (2015)
Ashok, V., Puzis, Y., Borodin, Y., Ramakrishnan, I.: Web screen reading automation assistance using semantic abstraction. In: IUI, pp. 407–418. ACM (2017)
Asri, L.E., et al.: Frames: a corpus for adding memory to goal-oriented dialogue systems. arXiv preprint arXiv:1704.00057 (2017)
Bigham, J.P.: Accessmonkey: enabling and sharing end user accessibility improvements. ACM SIGACCESS Access. Comput. 89, 3–6 (2007)
Bigham, J.P., Lau, T., Nichols, J.: Trailblazer: enabling blind users to blaze trails through the web. In: IUI, pp. 177–186. ACM (2009)
Bigham, J.P., Lin, I., Savage, S.: The effects of not knowing what you don’t know on web accessibility for blind web users. In: ACM SIGACCESS. ACM (2017)
Billah, S.M., Ashok, V., Porter, D.E., Ramakrishnan, I.: Ubiquitous accessibility for people with visual impairments: are we there yet? In: CHI, pp. 5862–5868. ACM (2017)
Borodin, Y.: Automation of repetitive web browsing tasks with voice-enabled macros. In: Proceedings of SIGACCESS 2008, pp. 307–308. ACM (2008)
Borodin, Y., et al.: Hearsay: a new generation context-driven multi-modal assistive web browser. In: WWW, pp. 1233–1236. ACM (2010)
Christian, K., Kules, B., Shneiderman, B., Youssef, A.: A comparison of voice controlled and mouse controlled web browsing. In: Proceedings of the Fourth International ACM Conference on Assistive Technologies, pp. 72–79. ACM (2000)
Conradi, J., Alexander, T.: Analysis of visual performance during the use of mobile devices while walking. In: Harris, D. (ed.) EPCE 2014. LNCS (LNAI), vol. 8532, pp. 133–142. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-07515-0_14
De Rosa, A., Justice, D.: WebReader: a screen reader for everyone, everywhere. In: Proceedings of the 13th Web for All Conference, p. 10. ACM (2016)
Diggs, J., Craig, J., McCarron, S., Cooper, M.: Accessible rich Internet applications (WAI-ARIA) 1.1 (2016). Accessed 24 Apr 2016
Fecke, A., Jeleniowski, S., Joisten, M.: Accessible websites for the visually impaired: guidelines for designers. In: Mensch & Computer Workshopband, pp. 419–422 (2015)
Giraud, S., Thérouanne, P., Steiner, D.D.: Web accessibility: filtering redundant and irrelevant information improves website usability for blind users. Int. J. Hum Comput Stud. 111, 23–35 (2018)
Guerreiro, J., Gonçalves, D.: Faster text-to-speeches: enhancing blind people’s information scanning with faster concurrent speech. In: ACM SIGACCESS, pp. 3–11. ACM (2015)
Guha, R.V., Brickley, D., Macbeth, S.: Schema.org: evolution of structured data on the web. Commun. ACM 59(2), 44–51 (2016)
Harper, S., Patel, N.: Gist summaries for visually impaired surfers. In: ACM SIGACCESS Conference on Computers and Accessibility, pp. 90–97. ACM (2005)
Hyman Jr., I.E., Boss, S.M., Wise, B.M., McKenzie, K.E., Caggiano, J.M.: Did you see the unicycling clown? Inattentional blindness while walking and talking on a cell phone. Appl. Cogn. Psychol. 24(5), 597–607 (2010)
Inan, F.A., Namin, A.S., Pogrund, R.L., Jones, K.S.: Internet use and cybersecurity concerns of individuals with visual impairments. J. Educ. Technol. Soc. 19(1), 28–40 (2016)
Insurance, L.M.: Pedestrian safety survey (2013). https://bit.ly/2WIDeOM
Lau, T., Cerruti, J., Manzato, G., Bengualid, M., Bigham, J.P., Nichols, J.: A conversational interface to web automation. In: UIST, pp. 229–238. ACM (2010)
Lazar, J., Allen, A., Kleinman, J., Malarkey, C.: What frustrates screen reader users on the web: a study of 100 blind users. Int. J. Hum.-Comput. Interact. 22(3), 247–269 (2007)
Leshed, G., Haber, E.M., Matthews, T., Lau, T.: CoScripter: automating & sharing how-to knowledge in the enterprise. In: CHI, pp. 1719–1728. ACM (2008)
Mahmud, J.U., Borodin, Y., Ramakrishnan, I.: Csurf: a context-driven non-visual web-browser. In: Proceedings of WWW 2007, pp. 31–40. ACM (2007)
Michail, S., Christos, K.: Adaptive browsing shortcuts: personalising the user interface of a specialised voice web browser for blind people. In: Data Engineering Workshop, 2007, pp. 818–825. IEEE (2007)
Mika, P.: On schema.org and why it matters for the web. IEEE Internet Comput. 19(4), 52–55 (2015)
Murphy, E., Kuber, R., McAllister, G., Strain, P., Yu, W.: An empirical investigation into the difficulties experienced by visually impaired Internet users. Univ. Access Inf. Soc. 7(1–2), 79–91 (2008)
Nasar, J.L., Troyer, D.: Pedestrian injuries due to mobile phone use in public places. Accid. Anal. Prev. 57, 91–95 (2013)
Oney, S., Lundgard, A., Krosnick, R., Nebeling, M., Lasecki, W.S.: Arboretum and arbility: improving web accessibility through a shared browsing architecture. In: UIST, pp. 937–949. ACM (2018)
Oshry, M., Auburn, R., Baggia, P., Bodell, M., Burke, D., Burnett, D., et al.: Voice extensible markup language (voicexml) 2.1. w3c recommendation (2007)
Plessers, P., et al.: Accessibility: a web engineering approach. In: Proceedings of WWW 2005, pp. 353–362. ACM (2005)
Power, C., Freire, A., Petrie, H., Swallow, D.: Guidelines are only half of the story: accessibility problems encountered by blind users on the web. In: Proceedings of CHI 2012, pp. 433–442. ACM (2012)
Rohrbach, M., Qiu, W., Titov, I., Thater, S., Pinkal, M., Schiele, B.: Translating video content to natural language descriptions. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 433–440 (2013)
Ronallo, J.: Html5 microdata and schema.org. Code4Lib J. (16) (2012)
Sato, D., Takagi, H., Kobayashi, M., Kawanaka, S., Asakawa, C.: Exploratory analysis of collaborative web accessibility improvement. ACM TACCESS 3(2), 5 (2010)
Sears, A., Lin, M., Jacko, J., Xiao, Y.: When computers fade: pervasive computing and situationally-induced impairments and disabilities. HCI Int. 2, 1298–1302 (2003)
Takagi, H., Kawanaka, S., Kobayashi, M., Itoh, T., Asakawa, C.: Social accessibility: achieving accessibility through collaborative metadata authoring. In: ACM SIGACCESS Conference on Computers and Accessibility, pp. 193–200. ACM (2008)
Tran, K., et al.: Rich image captioning in the wild. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 49–56 (2016)
Voykinska, V., Azenkot, S., Wu, S., Leshed, G.: How blind people interact with visual content on social networking services. In: CSCW, pp. 1584–1595. ACM (2016)
Wang, K.: SALT: a spoken language interface for web-based multimodal dialog systems. In: Seventh International Conference on Spoken Language Processing (2002)
WWW-Consortium et al.: Web content accessibility guidelines (WCAG) 2.1 w3c candidate recommendation (2018). Accessed 30 Jan 2018
Yesilada, Y.: Web page segmentation: a review (2011)
Zhu, S., Sato, D., Takagi, H., Asakawa, C.: Sasayaki: an augmented voice-based web browsing experience. In: Proceedings of SIGACCESS 2010. ACM (2010)
Acknowledgement
The study was supported by the Russian Science Foundation (project n. 19-18-00282).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Baez, M., Daniel, F., Casati, F. (2020). Conversational Web Interaction: Proposal of a Dialog-Based Natural Language Interaction Paradigm for the Web. In: Følstad, A., et al. Chatbot Research and Design. CONVERSATIONS 2019. Lecture Notes in Computer Science(), vol 11970. Springer, Cham. https://doi.org/10.1007/978-3-030-39540-7_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-39540-7_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-39539-1
Online ISBN: 978-3-030-39540-7
eBook Packages: Computer ScienceComputer Science (R0)