Abstract
Aiming at the task of Entity Recognition and Linking in Chinese Search Queries in NLP&CC 2015, this paper proposes the solutions to entity recognition, entity linking and entity disambiguation. Dictionary, online knowledge base and SWJTU Chinese word segmentation are used in entity recognition. Synonyms thesaurus, redirect of Wikipedia and the combination of improved PED (Pinyin Edit Distance) algorithm and LCS (Longest Common Subsequence) are applied in entity linking. The methods of suffix supplement and link value computation based on online encyclopedia are adopted in entity disambiguation. The experiment results indicate that the proposed solutions in this paper are effective for the case of short queries and insufficient contexts.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lei, J., Tang, B., Lu, X., et al.: A comprehensive study of named entity recognition in Chinese clinical text. Journal of the American Medical Informatics Association 21(5), 808–814 (2014)
Al-Rfou, R., Skiena, S.: Speedread: a fast named entity recognition pipeline (2013). arXiv preprint arXiv: 1301.2857
Zhao, J., Liu, F.: Product named entity recognition in Chinese text. Language Resources & Evaluation 42(2), 197–217 (2008)
Konkol, M., BrychcÃn, T., KonopÃk, M.: Latent semantics in Named Entity Recognition. Expert Systems with Applications 42(7), 3470–3479 (2015)
Nothman, J., Ringland, N., Radford, W., et al.: Learning multilingual named entity recognition from wikipedia. Artificial Intelligence 194, 151–175 (2013)
Hachey, B., Radford, W., Nothman, J., et al.: Evaluating Entity Linking with wikipedia. Artificial Intelligence 194(194), 130–150 (2013)
Shen, W., Wang, J., Han, J.: Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions. IEEE Transactions on Knowledge and Data Engineering 27(2), 443–460 (2015)
Zhu, M., Jia, Z., Zuo, L., et al.: Research on Entity Linking of Chinese Micro Blog. Acta Scientiarum Naturalium Universitatis Pekinensis 1, 73–78 (2014). (in Chinese)
Gattani, A., Lamba, D.S., Garera, N., et al.: Entity extraction, linking, classification, and tagging for social media: a wikipedia-based approach. Proceedings of the VLDB Endowment 6(11), 1126–1137 (2013)
Yang, X., Li, P., Zhu, Q.: Name Disambiguation Based on Dependency Feature in Web Page Text. Computer Engineering 38(19), 133–136 (2012). (in Chinese)
Nguyen, H.T., Cao, T.H.: Exploring wikipedia and text features for named entity disambiguation. In: Nguyen, N.T., Le, M.T., Świątek, J. (eds.) ACIIDS 2010. LNCS, vol. 5991, pp. 11–20. Springer, Heidelberg (2010)
Zhao, J.: A Survey on Named Entity Recognition, Disambiguation and Cross-Lingual Coreference Resolution. Journal of Chinese information Processing 23(2), 3–13 (2009). (in Chinese)
Cao, J., Wu, X., Xia, Y., et al.: Pinyin-indexed method for approximate matching in Chinese. Journal of Tsinghua University (Science and Technology) 49(S1), 1328–1332 (2009). (in Chinese)
Cucerzan, S.: Large-scale named entity disambiguation based on wikipedia data. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 708–716 (2007)
Zheng, Z., Li, F., Huang, M., et al.: Learning to link entities with knowledge base. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 483–491 (2010)
Han, X., Sun, L., Zhao, J.: Collective entity linking in web text: a graph-based method. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 765–774 (2011)
Chen, Z., Ji, H.: Collaborative ranking: a case study on entity linking. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 771–781 (2011)
Zhang, W., Sim, Y.C., Su, J., Tan, C.L.: Entity linking with effective acronym expansion, instance selection and topic modeling. In: Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, vol. 3, pp. 1909–1914 (2011)
Zou, X., Sun, C., Sun, Y., Liu, B., Lin, L.: Linking entities in tweets to wikipedia knowledge base. In: Zong, C., Nie, J.-Y., Zhao, D., Feng, Y. (eds.) NLPCC 2014. CCIS, vol. 496, pp. 368–378. Springer, Heidelberg (2014)
Davis, A., Veloso, A., Silva, A.S.D., et al.: Named entity disambiguation in streaming data. In: Proceedings of the Conference on European Chapter of the Association for Computational Linguistics, vol. 1, pp. 815–824 (2012)
Han, X., Zhao, J.: Named entity disambiguation by leveraging wikipedia semantic knowledge. In: Proceedings of the 18th ACM Conference on Information and knowledge management, pp. 215–224 (2009)
Meng, Z., Yu, D., Xun, E.: Chinese microblog entity linking system combining wikipedia and search engine retrieval results. In: Zong, C., Nie, J.-Y., Zhao, D., Feng, Y. (eds.) NLPCC 2014. CCIS, vol. 496, pp. 449–456. Springer, Heidelberg (2014)
Kataria, S.S., Kumar, K.S., Rastogi, R.R., et al.: Entity disambiguation with hierarchical topic models. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1037–1045 (2011)
Sen, P.: Collective context-aware topic models for entity disambiguation. In: Proceedings of the 21st International Conference on World Wide Web, pp. 729–738 (2012)
Nadeau, D., Turney, P.: A supervised learning approach to acronym identification. In: The Eighteenth Canadian Conference on Artificial Intelligence (2005)
SWJTU Chinese Word Segmentation System. http://ics.swjtu.edu.cn
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Yuan, J., Yang, Y., Jia, Z., Yin, H., Huang, J., Zhu, J. (2015). Entity Recognition and Linking in Chinese Search Queries. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2015. Lecture Notes in Computer Science(), vol 9362. Springer, Cham. https://doi.org/10.1007/978-3-319-25207-0_47
Download citation
DOI: https://doi.org/10.1007/978-3-319-25207-0_47
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25206-3
Online ISBN: 978-3-319-25207-0
eBook Packages: Computer ScienceComputer Science (R0)