Abstract
Word Sense Disambiguation (WSD) is a task to identify the sense of a polysemy in given context. Recently, word embeddings are applied to WSD, as additional input features of a supervised classifier. However, previous approaches narrowly use word embeddings to represent surrounding words of target words. They may not make sufficient use of word embeddings in representing different features like dependency relations, word order and global contexts (the whole document). In this work, we combine local and global features to perform WSD. We explore utilizing word embeddings to leverage word order and dependency features. We also use word embeddings to represent global contexts as global features. We conduct experiments to evaluate our methods and find out that our methods outperform the state-of-the-art methods on Lexical Sample WSD datasets.
Notes
- 1.
- 2.
The sense inventory is from WordNet3.0 (wordnet.princeton.edu/)
References
Basile, P., Caputo, A., Semeraro, G.: An enhanced lesk word sense disambiguation algorithm through a distributional semantic model. In: COLING, pp. 1591–1600 (2014)
Bengio, Y., Ducharme, R., Vincent, P., Jauvin, C.: A neural probabilistic language model. J. Mach. Learn. Res. 3, 1137–1155 (2003)
Carpuat, M., Dekai, W.: Improving statistical machine translation using word sense disambiguation. EMNLP-CoNLL 7, 61–72 (2007)
Chan, Y.S., Ng, H.T., Chiang, D.: Word sense disambiguation improves statistical machine translation. In: Annual Meeting-Association for Computational Linguistics, vol. 45, p. 33. Citeseer (2007)
Chen, D., Manning, C.D.: A fast and accurate dependency parser using neural networks. In: EMNLP, pp. 740–750 (2014)
Chen, P., Ding, W., Bowes, C., Brown, D.: A fully unsupervised word sense disambiguation method using dependency knowledge. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 28–36. Association for Computational Linguistics (2009)
Chen, X., Liu, Z., Sun, M.: A unified model for word sense representation and disambiguation. In: EMNLP, pp. 1025–1035. Citeseer (2014)
Collobert, R., Weston, J.: A unified architecture for natural language processing: deep neural networks with multitask learning. In: Proceedings of the 25th International Conference on Machine Learning, pp. 160–167. ACM (2008)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391 (1990)
Edmonds, P., Cotton, S.: Senseval-2: overview. In: The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems, pp. 1–5. Association for Computational Linguistics (2001)
Firth, J.R.: A synopsis of linguistic theory, 1930–1955 (1957)
Harris, Z.S.: Distributional structure. Word 10(2–3), 146–162 (1954)
Hofmann, T.: Unsupervised learning by probabilistic latent semantic analysis. Mach. Learn. 42(1), 177–196 (2001)
Iacobacci, I., Pilehvar, M.T., Navigli, R.: Embeddings for word sense disambiguation: an evaluation study. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 897–907 (2016)
Levy, O., Goldberg, Y.: Dependency-based word embeddings. In: ACL (2), pp. 302–308. Citeseer (2014)
Lin, D.: Using syntactic dependency as local context to resolve word sense ambiguity. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, pp. 64–71. Association for Computational Linguistics (1997)
Mihalcea, R., Chklovski, T.A., Kilgarriff, A.: The Senseval-3 English lexical sample task. Association for Computational Linguistics (2004)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Navigli, R.: Word sense disambiguation: a survey. ACM Comput. Surv. (CSUR) 41(2), 10 (2009)
Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: EMNLP, vol. 14, pp. 1532–1543 (2014)
Rothe, S., Schütze, H.: Autoextend: extending word embeddings to embeddings for synsets and lexemes. arXiv preprint arXiv:1507.01127 (2015)
Taghipour, K., Ng, H.T.: Semi-supervised word sense disambiguation using word embeddings in general and specific domains. In: HLT-NAACL, pp. 314–323 (2015)
Vickrey, D., Biewald, L., Teyssier, M., Koller, D.: Word-sense disambiguation for machine translation. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 771–778. Association for Computational Linguistics (2005)
Yuan, D., Richardson, J., Doherty, R., Evans, C., Altendorf, E.: Semi-supervised word sense disambiguation with neural models. In: COLING (2016)
Zhang, D., Chow, C.-Y., Li, Q., Zhang, X., Yinlong, X.: SMashQ: spatial mashup framework for \(k\)-NN queries in time-dependent road networks. Distrib. Parallel Databases 31(2), 259–287 (2013)
Zhang, D., Chow, C.-Y., Li, Q., Zhang, X., Yinlong, X.: A spatial mashup service for efficient evaluation of concurrent \(k\)-NN queries. IEEE Trans. Comput. 65(8), 2428–2442 (2016)
Zhong, Z., Ng, H.T.: It makes sense: a wide-coverage word sense disambiguation system for free text. In: Proceedings of the ACL 2010 System Demonstrations, pp. 78–83. Association for Computational Linguistics (2010)
Zhong, Z., Ng, H.T.: Word sense disambiguation improves information retrieval. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1, pp. 273–282. Association for Computational Linguistics (2012)
Acknowledgement
This work is supported by the Fundamental Research Funds for the Central Universities, SCUT (Nos. 2017ZD048, 2015ZM136), Tiptop Scientific and Technical Innovative Youth Talents of Guangdong special support program (No. 2015TQ01X633), Science and Technology Planning Project of Guangdong Province, China (No. 2016A030310423), Science and Technology Program of Guangzhou (International Science & Technology Cooperation Program No. 201704030076) and Science and Technology Planning Major Project of Guangdong Province (No. 2015A070711001), the Start-Up Research Grant (RG 37/2016-2017R), and a grant from Research Grants Council of Hong Kong Special Administrative Region, China (UGC/FDS11/E03/16). This work is also partially supported by a CUHK Direct Grant for Research (Project Code EE16963).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Lei, X., Cai, Y., Li, Q., Xie, H., Leung, Hf., Wang, F.L. (2017). Combining Local and Global Features in Supervised Word Sense Disambiguation. In: Bouguettaya, A., et al. Web Information Systems Engineering – WISE 2017. WISE 2017. Lecture Notes in Computer Science(), vol 10570. Springer, Cham. https://doi.org/10.1007/978-3-319-68786-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-68786-5_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68785-8
Online ISBN: 978-3-319-68786-5
eBook Packages: Computer ScienceComputer Science (R0)