Abstract
Automatic word sense disambiguation (WSD) from text is a task of great importance in various applications of natural language processing, for example, in machine translation, question answering, automatic summarization or sentiment analysis. There are different approaches to finding the meaning of a word within a context, whether using supervised, unsupervised, semi-supervised or knowledge-based methods. Several studies have been conducted to automatically translate from text to sign language, reproducing the result of the translation with a signing avatar, in a way that deaf users have access to informative contents that otherwise are highly inaccessible, because sign language is their mother tongue. The many proposals that have been made look forward to minimize these informative and communicative barriers. Sign languages, however, do not have as many words as the spoken languages, so an automatic translation must be as accurate and free of ambiguities as possible. In this paper, we propose to evaluate the use of public access big data resources, as well as appropriate techniques to access this type of resources for WSD tasks, illustrating their effects in a translation system from text in Spanish to Costa Rican Sign Language (LESCO). The architecture of the actual system incorporates the use of a folksonomy, from which the disambiguation process will benefit. When an exact word is not found for a given detected sense in the source text, the ontology will be fed back with a new relationship of hyperonymy, to alert the curator on the need to propose a new sign in that category, thus promoting an enrichment in a key component of the architecture. As a result of the evaluation, the most appropriate big data public resources and techniques for WSD for sign language will be elucidated.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Bel-Enguix, G., Jiménez, M.: Language as a Complex System: Interdisciplinary Approaches. Cambridge Scholars Publishing (2010)
Kumar, R., Khanna, R., Goyal, V.: A review of literature on word sense disambiguation. Res. Cell Int. J. Eng. Sci. 6 (2012)
Lomotey, R., Deters, R.: Towards knowledge discovery in big data. In: IEEE 8th International Symposium on Service Oriented System Engineering (2014)
Simonini G., Guerra, F.: Using big data to support automatic word sense disambiguation. In: International Conference on High Performance Computing & Simulation (HPCS) (2014)
Belsare, R., Akarte, S.: A review on strategies of word sense disambiguation. Int. J. Sci. Eng. Appl. 5(4) (2016)
Kumar, A., Kumar, S.: Word sense disambiguation using association rules: a review. Int. J. Innov. Res. Stud. 2(2) (2013)
Siddiqui, T.: Review on word sense disambiguation techniques. Aryabhatta J. Math. Inform. 7(2) (2015)
Yarowsky, D.: Unsupervised word sense disambiguation rivaling supervised methods. In: Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, Cambridge, MA, pp. 189–96 (1995)
Rawat, S., Chandak, M.: Word sense disambiguation and classification algorithms: a review. Int. J. Comput. Sci. Appl. 8(1) (2015)
Guthrie, J., Guthrie, L., Wilks, Y., Aidinejad, H.: Subject dependent co-occurrence and word sense disambiguation. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics. ACL (1991)
Cowie, J., Guthrie, J., Guthrie, L.: Lexical disambiguation using simulated annealing. In: Proceedings of the International Conference on Computational Linguistics, COLING (1992)
Vasilescu, F., Langlais, P., Lapalme, G.: Evaluating variants of the Lesk approach for disambiguation words. In: Proceedings of Language Resources and Evaluation Conference (LREC), pp. 633–636 (2004)
DBPedia Homepage, http://es.dbpedia.org/. Last accessed 12 Jan 2018
Moore, R.: On log-likelihood-ratios and the significance of rare events. In: EMNLP (2004). http://aclweb.org/anthology/W04-3243
Yang, J., McAuley, J., Leskovec, J.: Community detection in networks with node attributes (2012). https://arxiv.org/pdf/1401.7267.pdf
Acknowledgements
The authors thank the School of Computing and the Computer Research Center of the Technological Institute of Costa Rica for the financial support, as well as ONICIT (Consejo Nacional para Investigaciones Científicas y Tecnológicas), Costa Rica, under grant 290-2006. This work was supported in part by the Spanish Ministry of Science, Innovation and Universities through the Project ECLIPSE-UA under Grant RTI2018-094283-B-C32, the Project RESCATA under Grant TIN2015-65100-R, the Project PROMETEO/2018/089, and the Lucentia AGI Grant.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Naranjo-Zeledón, L., Ferrández, A., Peral, J., Chacón-Rivas, M. (2019). Big Data-Assisted Word Sense Disambiguation for Sign Language. In: Visvizi, A., Lytras, M. (eds) Research & Innovation Forum 2019. RIIFORUM 2019. Springer Proceedings in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-030-30809-4_40
Download citation
DOI: https://doi.org/10.1007/978-3-030-30809-4_40
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30808-7
Online ISBN: 978-3-030-30809-4
eBook Packages: EducationEducation (R0)