Skip to main content

Research on Weakly-Supervised Entity Relation Extraction of Specific Domain Based on Entropy Minimization

  • Conference paper
  • First Online:
Proceedings of 2013 Chinese Intelligent Automation Conference

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 256))

  • 2182 Accesses

Abstract

There are two major issues of automatic entity relation extraction: human intervention and difficulty in labeling corpus. For these two problems, combined with the characteristics of the tourism domain, this paper adopts a weakly-supervised extraction method of entity relation based on entropy minimization. This method firstly extracts the characteristic words by the idea of scalar clustering with small-scale stratified marked instances, and constructs the initial classifier with maximum entropy machine learning algorithm. Then use the initial classifier of certain accuracy to classify the unlabeled instances, and add the instances of the minimum information entropy to the training corpus set to continually expand the scale of training data set. Finally, repeat the above iterative process until the performance of classifier is to be stabilized, and then a final extraction classifier of entity relation in specific domain will be constructed. Experiments performed on the corpus of tourism domain show that, not only can this method reduce the dependence of entity relation extraction on manual intervention, but it could effectively improve the performance of entity relation extraction, the F value of which is up to 63.69 %.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.homepages.inf.edu.ac.uk/lzhang10/maxent_toolkit.html

References

  1. Kambhatla N (2004) Combining lexical, syntactic and semantic features with maximum entropy models for extracting relations. In: Proceedings of the ACL 2004 on interactive poster and demonstration sessions, pp 178–181

    Google Scholar 

  2. Zhou GD, Su J, Zhang J (2005) Exploring various knowledge in relation extraction. In: Proceedings of the 43rd annual meeting on association for computational linguistics, pp 427–434

    Google Scholar 

  3. Zhao SB, Grishman R (2005) Extracting relations with integrated information using kernel methods. In: Proceedings of the 43rd annual meeting on association for computational linguistics, pp 419–426

    Google Scholar 

  4. Wang T, Li YY, Kalina B (2006). Automatic extraction of hierarchical relations from text. In: Proceedings of the third European semantic web conference, pp 401–416

    Google Scholar 

  5. Che WX, Liu T, Li S (2005) Automatic entity relation extraction. J Chin Inf Process 19(2):1–6 (in chinese)

    Google Scholar 

  6. Dong J, Sun L, Feng YY, Huang RH (2007) Chinese automatic entity relation extraction. J Chin Inf Process 21(4):80–85,91

    Google Scholar 

  7. Huang C, Qian LH, Zhou GD, Zhu QM (2010) Research on unsupervised Chinese entity relation extraction based on convolution tree kernel. J Chin Inf Process 24(4):11–17

    Google Scholar 

  8. Xi B, Zhou GD, Qian LH, Pan S (2008) Weakly-supervised semantic relation extraction using stratified strategy. J Guangxi Norm Univ 26(1):178–181 (in Chinese)

    Google Scholar 

  9. He TT, Xu C, Li J (2006) Named entity relation extraction method based on seed self-expansion. Comput Eng 32(21):183–184 (in Chinese)

    Google Scholar 

  10. Zhang Z (2004) Weakly-supervised relation classification for information extraction. In: Proceedings of the 13th conference on information and knowledge management (CIKM’2004), pp 581–588

    Google Scholar 

Download references

Acknowledgments

This paper is supported by National Nature Science Foundation (No. 60863011, 61175068), and the Key Project of Yunnan Nature Science Foundation (No. 2008CC023), and the National Innovation Fund for Technology based Firms (No. 11C26215305905), and the Open Fund of Software Engineering Key Laboratory of Yunnan Province (No. 2011SE14), and the Ministry of Education of Returned Overseas Students to Start Research and Fund Projects.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianyi Guo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhao, J., Guo, J., Yu, Z., Chen, P., Mao, C. (2013). Research on Weakly-Supervised Entity Relation Extraction of Specific Domain Based on Entropy Minimization. In: Sun, Z., Deng, Z. (eds) Proceedings of 2013 Chinese Intelligent Automation Conference. Lecture Notes in Electrical Engineering, vol 256. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38466-0_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38466-0_30

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38465-3

  • Online ISBN: 978-3-642-38466-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics