Abstract
We have developed an automatic text visualization (ATV) system, named Preksha, that takes natural language text in Hindi as input and produces a run-time interactive 3D scene based on it. Preksha is the only ATV system which deals with complex processing of morphologically rich input in Hindi, a language of free-word-order nature. Its design and approach make Preksha extendible to other Indian languages. In this paper, we present challenges for evaluation of an ATV system and propose a subjective evaluation methodology. This evaluation design includes intelligibility, fidelity and complexity aspects of the scenes generated. Subsequently, Preksha is evaluated by using a total of 10,220 user responses through an online evaluation survey. The results show that Preksha is able to generate scenes with very high levels of intelligibility and fidelity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sevens, L.: ‘Words divide, pictographs unite’: pictograph communication technologies for people with an intellectual disability. Ph.D. thesis, University of Leuven, jury member (2018)
Coyne, B., Sproat, R.: WordsEye an automatic text-to-scene conversion system. In: Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH ’01, Computer Graphics Proceedings, New York, USA, pp. 487–496 (2001)
Karkar, G.A.: An ontology based text-to-picture multimedia m-learning system. Ph.D. Thesis, Khalifa Al-Khalifa, Dean, College of Engineering, Qatar University (2018)
Seversky, L.M., Yin, L.: Real-time automatic 3D scene generation from natural language voice and text descriptions. In: Multimedia ’06 Proceedings of the 14th Annual ACM International Conference on Multimedia, New York, USA, pp. 61–64 (2006)
Jain, P., Bhavsar, R.P., Kumar, A., Pawar, B.V., Darbari, H., Bhavsar, V.C.: Tree adjoining grammar based parser for a Hindi text-to-scene conversion system. In: 4th International Conference for Convergence in Technology (I2CT), and IEEE Xplore (2018)
Chang, A.X., Funkhouser, T., Guibas, L., Hanrahan, P., Huang, Q., Li, Z., Savarese, S., Savva, M., Song, S., Su, H., Xiao, J., Yi, L., Yu, F.: Shapenet, an information-rich 3d model repository. arXiv:151203012 (2015)
Coyne, B., Eric, R., Cecilia, S., Michael, B., Hirschberg, J.: Evaluating a text-to-scene generation system as an aid to literacy. Workshop on Speech and Language Technology in Education (SlaTE) at Interspeech (2011)
Bharati, A., Sangal, R.: Parsing free word order languages in the paninian framework. In: Proceedings of ACL (1993)
Husain, S.: A generalized parsing framework based on computational Paninian grammar. Ph.D. thesis, IIIT-Hyderabad, India (2011)
Likert, R.: A technique for the measurement of attitudes. Arch. Psychol. 140, 155 (1932)
Zitnick, C.L., Parikh, D., Vanderwende, L.: Learning the visual interpretation of sentences. In: IEEE International Conference on Computer Vision, IEEE Xplore and Digital Library (2013)
Hassani, K., Lee, W.S.: Visualizing natural language descriptions, a survey. In: ACM Computing Surveys (CSUR) Surveys Homepage archive, vol. 49, issue 1, article no. 17, NY, USA (2016)
Ulinski, M.: Leveraging text-to-scene generation for language elicitation and documentation. Ph.D. thesis, Columbia University (2019)
Scriven, M.: Evaluation as a cognitive process. School of Behavioral and Organizational Sciences. J. MultiDiscipl. Eval. 4(8) (2007). ISSN 1556–8180
Runco, M.A.: Creativity and cognition. In: International Encyclopedia of the Social & Behavioral Sciences. Elsevier, pp. 2892–2895 (2001)
Jain, P., Darbari, H., Bhavsar, V.C.: Vishit: a visualizer for Hindi text. In: 4th International Conference on Communication Systems and Network Technologies (CSNT), IEEE Xplore, pp. 886–890 (2014)
Li, W., Zhang, P., Zhang, L.: Object-driven text-to-image synthesis via adversarial training. arXiv:1902.10740v1 [cs.CV] 27 Feb 2019
Dong, H., Zhang, J., McIlwraith, D., Guo, Y.: I2T2I: learning text oo image synthesis with textual data augmentation. arXiv:1703.06676v3 [cs.CV] 3 Jun 2017
DAR, P.: Thanks to AI, you can now create cartoons from text based descriptions. https://www.analyticsvidhya.com/blog/2018/04/this-ai-create-cartoons-text-description/. Accessed 13 Oct 2019
Jain, P., Darbari, H., Bhavsar, V.C.: Cognitive support by language visualization a case study with Hindi Language. In: 2nd International Conference for Convergence in Technology (I2CT), IEEE Xplore (2017)
Jain, P., Shaik, K., Kumar, A., Darbari, H., Bhavsar, V.C.: Cascaded finite-state chunk parsing for Hindi language. In: International Conference on Communication and Information Processing (ICCIP) 17th–18th, May 2019 (to be published in Elsevier-SSRN)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Jain, P. et al. (2021). Evaluation of Automatic Text Visualization Systems: A Case Study. In: Hassanien, A., Bhatnagar, R., Darwish, A. (eds) Advanced Machine Learning Technologies and Applications. AMLTA 2020. Advances in Intelligent Systems and Computing, vol 1141. Springer, Singapore. https://doi.org/10.1007/978-981-15-3383-9_3
Download citation
DOI: https://doi.org/10.1007/978-981-15-3383-9_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-3382-2
Online ISBN: 978-981-15-3383-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)