Abstract
There are two major issues of automatic entity relation extraction: human intervention and difficulty in labeling corpus. For these two problems, combined with the characteristics of the tourism domain, this paper adopts a weakly-supervised extraction method of entity relation based on entropy minimization. This method firstly extracts the characteristic words by the idea of scalar clustering with small-scale stratified marked instances, and constructs the initial classifier with maximum entropy machine learning algorithm. Then use the initial classifier of certain accuracy to classify the unlabeled instances, and add the instances of the minimum information entropy to the training corpus set to continually expand the scale of training data set. Finally, repeat the above iterative process until the performance of classifier is to be stabilized, and then a final extraction classifier of entity relation in specific domain will be constructed. Experiments performed on the corpus of tourism domain show that, not only can this method reduce the dependence of entity relation extraction on manual intervention, but it could effectively improve the performance of entity relation extraction, the F value of which is up to 63.69 %.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kambhatla N (2004) Combining lexical, syntactic and semantic features with maximum entropy models for extracting relations. In: Proceedings of the ACL 2004 on interactive poster and demonstration sessions, pp 178–181
Zhou GD, Su J, Zhang J (2005) Exploring various knowledge in relation extraction. In: Proceedings of the 43rd annual meeting on association for computational linguistics, pp 427–434
Zhao SB, Grishman R (2005) Extracting relations with integrated information using kernel methods. In: Proceedings of the 43rd annual meeting on association for computational linguistics, pp 419–426
Wang T, Li YY, Kalina B (2006). Automatic extraction of hierarchical relations from text. In: Proceedings of the third European semantic web conference, pp 401–416
Che WX, Liu T, Li S (2005) Automatic entity relation extraction. J Chin Inf Process 19(2):1–6 (in chinese)
Dong J, Sun L, Feng YY, Huang RH (2007) Chinese automatic entity relation extraction. J Chin Inf Process 21(4):80–85,91
Huang C, Qian LH, Zhou GD, Zhu QM (2010) Research on unsupervised Chinese entity relation extraction based on convolution tree kernel. J Chin Inf Process 24(4):11–17
Xi B, Zhou GD, Qian LH, Pan S (2008) Weakly-supervised semantic relation extraction using stratified strategy. J Guangxi Norm Univ 26(1):178–181 (in Chinese)
He TT, Xu C, Li J (2006) Named entity relation extraction method based on seed self-expansion. Comput Eng 32(21):183–184 (in Chinese)
Zhang Z (2004) Weakly-supervised relation classification for information extraction. In: Proceedings of the 13th conference on information and knowledge management (CIKM’2004), pp 581–588
Acknowledgments
This paper is supported by National Nature Science Foundation (No. 60863011, 61175068), and the Key Project of Yunnan Nature Science Foundation (No. 2008CC023), and the National Innovation Fund for Technology based Firms (No. 11C26215305905), and the Open Fund of Software Engineering Key Laboratory of Yunnan Province (No. 2011SE14), and the Ministry of Education of Returned Overseas Students to Start Research and Fund Projects.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhao, J., Guo, J., Yu, Z., Chen, P., Mao, C. (2013). Research on Weakly-Supervised Entity Relation Extraction of Specific Domain Based on Entropy Minimization. In: Sun, Z., Deng, Z. (eds) Proceedings of 2013 Chinese Intelligent Automation Conference. Lecture Notes in Electrical Engineering, vol 256. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38466-0_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-38466-0_30
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38465-3
Online ISBN: 978-3-642-38466-0
eBook Packages: EngineeringEngineering (R0)