Combining Local and Global Features in Supervised Word Sense Disambiguation

Lei, Xue; Cai, Yi; Li, Qing; Xie, Haoran; Leung, Ho-fung; Wang, Fu Lee

doi:10.1007/978-3-319-68786-5_10

Xue Lei²⁴,
Yi Cai²⁴,
Qing Li²⁵,
Haoran Xie²⁶,
Ho-fung Leung²⁷ &
…
Fu Lee Wang²⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10570))

Included in the following conference series:

International Conference on Web Information Systems Engineering

1592 Accesses

Abstract

Word Sense Disambiguation (WSD) is a task to identify the sense of a polysemy in given context. Recently, word embeddings are applied to WSD, as additional input features of a supervised classifier. However, previous approaches narrowly use word embeddings to represent surrounding words of target words. They may not make sufficient use of word embeddings in representing different features like dependency relations, word order and global contexts (the whole document). In this work, we combine local and global features to perform WSD. We explore utilizing word embeddings to leverage word order and dependency features. We also use word embeddings to represent global contexts as global features. We conduct experiments to evaluate our methods and find out that our methods outperform the state-of-the-art methods on Lexical Sample WSD datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Notes

1.
code.google.com/archive/p/word2vec.
2.
The sense inventory is from WordNet3.0 (wordnet.princeton.edu/)

References

Basile, P., Caputo, A., Semeraro, G.: An enhanced lesk word sense disambiguation algorithm through a distributional semantic model. In: COLING, pp. 1591–1600 (2014)
Google Scholar
Bengio, Y., Ducharme, R., Vincent, P., Jauvin, C.: A neural probabilistic language model. J. Mach. Learn. Res. 3, 1137–1155 (2003)
MATH Google Scholar
Carpuat, M., Dekai, W.: Improving statistical machine translation using word sense disambiguation. EMNLP-CoNLL 7, 61–72 (2007)
Google Scholar
Chan, Y.S., Ng, H.T., Chiang, D.: Word sense disambiguation improves statistical machine translation. In: Annual Meeting-Association for Computational Linguistics, vol. 45, p. 33. Citeseer (2007)
Google Scholar
Chen, D., Manning, C.D.: A fast and accurate dependency parser using neural networks. In: EMNLP, pp. 740–750 (2014)
Google Scholar
Chen, P., Ding, W., Bowes, C., Brown, D.: A fully unsupervised word sense disambiguation method using dependency knowledge. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 28–36. Association for Computational Linguistics (2009)
Google Scholar
Chen, X., Liu, Z., Sun, M.: A unified model for word sense representation and disambiguation. In: EMNLP, pp. 1025–1035. Citeseer (2014)
Google Scholar
Collobert, R., Weston, J.: A unified architecture for natural language processing: deep neural networks with multitask learning. In: Proceedings of the 25th International Conference on Machine Learning, pp. 160–167. ACM (2008)
Google Scholar
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391 (1990)
Article Google Scholar
Edmonds, P., Cotton, S.: Senseval-2: overview. In: The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems, pp. 1–5. Association for Computational Linguistics (2001)
Google Scholar
Firth, J.R.: A synopsis of linguistic theory, 1930–1955 (1957)
Google Scholar
Harris, Z.S.: Distributional structure. Word 10(2–3), 146–162 (1954)
Article Google Scholar
Hofmann, T.: Unsupervised learning by probabilistic latent semantic analysis. Mach. Learn. 42(1), 177–196 (2001)
Article Google Scholar
Iacobacci, I., Pilehvar, M.T., Navigli, R.: Embeddings for word sense disambiguation: an evaluation study. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 897–907 (2016)
Google Scholar
Levy, O., Goldberg, Y.: Dependency-based word embeddings. In: ACL (2), pp. 302–308. Citeseer (2014)
Google Scholar
Lin, D.: Using syntactic dependency as local context to resolve word sense ambiguity. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, pp. 64–71. Association for Computational Linguistics (1997)
Google Scholar
Mihalcea, R., Chklovski, T.A., Kilgarriff, A.: The Senseval-3 English lexical sample task. Association for Computational Linguistics (2004)
Google Scholar
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
Navigli, R.: Word sense disambiguation: a survey. ACM Comput. Surv. (CSUR) 41(2), 10 (2009)
Article Google Scholar
Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: EMNLP, vol. 14, pp. 1532–1543 (2014)
Google Scholar
Rothe, S., Schütze, H.: Autoextend: extending word embeddings to embeddings for synsets and lexemes. arXiv preprint arXiv:1507.01127 (2015)
Taghipour, K., Ng, H.T.: Semi-supervised word sense disambiguation using word embeddings in general and specific domains. In: HLT-NAACL, pp. 314–323 (2015)
Google Scholar
Vickrey, D., Biewald, L., Teyssier, M., Koller, D.: Word-sense disambiguation for machine translation. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 771–778. Association for Computational Linguistics (2005)
Google Scholar
Yuan, D., Richardson, J., Doherty, R., Evans, C., Altendorf, E.: Semi-supervised word sense disambiguation with neural models. In: COLING (2016)
Google Scholar
Zhang, D., Chow, C.-Y., Li, Q., Zhang, X., Yinlong, X.: SMashQ: spatial mashup framework for \(k\)-NN queries in time-dependent road networks. Distrib. Parallel Databases 31(2), 259–287 (2013)
Article Google Scholar
Zhang, D., Chow, C.-Y., Li, Q., Zhang, X., Yinlong, X.: A spatial mashup service for efficient evaluation of concurrent \(k\)-NN queries. IEEE Trans. Comput. 65(8), 2428–2442 (2016)
Article MathSciNet Google Scholar
Zhong, Z., Ng, H.T.: It makes sense: a wide-coverage word sense disambiguation system for free text. In: Proceedings of the ACL 2010 System Demonstrations, pp. 78–83. Association for Computational Linguistics (2010)
Google Scholar
Zhong, Z., Ng, H.T.: Word sense disambiguation improves information retrieval. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1, pp. 273–282. Association for Computational Linguistics (2012)
Google Scholar

Download references

Acknowledgement

This work is supported by the Fundamental Research Funds for the Central Universities, SCUT (Nos. 2017ZD048, 2015ZM136), Tiptop Scientific and Technical Innovative Youth Talents of Guangdong special support program (No. 2015TQ01X633), Science and Technology Planning Project of Guangdong Province, China (No. 2016A030310423), Science and Technology Program of Guangzhou (International Science & Technology Cooperation Program No. 201704030076) and Science and Technology Planning Major Project of Guangdong Province (No. 2015A070711001), the Start-Up Research Grant (RG 37/2016-2017R), and a grant from Research Grants Council of Hong Kong Special Administrative Region, China (UGC/FDS11/E03/16). This work is also partially supported by a CUHK Direct Grant for Research (Project Code EE16963).

Author information

Authors and Affiliations

School of Software Engineering, South China University of Technology, Guangzhou, China
Xue Lei & Yi Cai
Department of Computer Science, City University of Hong Kong, Hong Kong SAR, China
Qing Li
Department of Mathematics and Information Technology, The Education University of Hong Kong, Hong Kong SAR, China
Haoran Xie
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong SAR, China
Ho-fung Leung
Caritas Institute of Higher Education, Hong Kong SAR, China
Fu Lee Wang

Authors

Xue Lei
View author publications
You can also search for this author in PubMed Google Scholar
Yi Cai
View author publications
You can also search for this author in PubMed Google Scholar
Qing Li
View author publications
You can also search for this author in PubMed Google Scholar
Haoran Xie
View author publications
You can also search for this author in PubMed Google Scholar
Ho-fung Leung
View author publications
You can also search for this author in PubMed Google Scholar
Fu Lee Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yi Cai .

Editor information

Editors and Affiliations

University of Sydney, Darlington, NSW, Australia
Athman Bouguettaya
Zhejiang University, Hangzhou, China
Yunjun Gao
Institute of Computing for Physics and Technology, Protvino, Russia
Andrey Klimenko
Nanyang Technological University, Singapore, Singapore
Lu Chen
King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Xiangliang Zhang
Institute of Computing for Physics and Technology, Protvino, Russia
Fedor Dzerzhinskiy
Shanghai Jiao Tong University, Minhang Qu, China
Weijia Jia
Institute of Computing for Physics and Technology, Protvino, Russia
Stanislav V. Klimenko
City University of Hong Kong, Kowloon, Hong Kong
Qing Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lei, X., Cai, Y., Li, Q., Xie, H., Leung, Hf., Wang, F.L. (2017). Combining Local and Global Features in Supervised Word Sense Disambiguation. In: Bouguettaya, A., et al. Web Information Systems Engineering – WISE 2017. WISE 2017. Lecture Notes in Computer Science(), vol 10570. Springer, Cham. https://doi.org/10.1007/978-3-319-68786-5_10

Download citation

DOI: https://doi.org/10.1007/978-3-319-68786-5_10
Published: 04 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68785-8
Online ISBN: 978-3-319-68786-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics