Skip to main content

Research on Reliability of Instance and Pattern in Semi-supervised Entity Relation Extraction

  • Conference paper
  • First Online:
Recent Developments in Intelligent Computing, Communication and Devices

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 752))

Abstract

In the current entity relation extraction technology, more and more researchers focus on semi-supervised Bootstrapping method, because it does not require a large number of artificial tagging corpus, needs only a small amount of seed set, by self-iterative extended to obtain large-scale knowledge base. However, after a large number of iterations, there will be “semantic drift,” that is, the accuracy will reduce due to the accumulation of errors. In order to improve the accuracy of the relation instance the quality of the pattern, it is necessary to evaluate the reliability of instances and patterns. This paper uses large-scale news headline sentences set in the search engine, evaluates the reliability of instances by co-occurrence relation between description words and sentences set, then evaluates the reliability of patterns by the number of positive and negative instances in patterns historical matching record, and selects new patterns to extend and optimize. The experimental results show that the reliability evaluation of instances and patterns used in the iteration effectively improves the accuracy of relation extraction and improves the quality of the extracted pattern.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Tan Hongye, Zhao Tiejun, Yao Jianmin, A Study on Pattern Generalization in Extended Named Entity Recognition. Chinese Journal of Electronic, 2007, 4:675–678.

    Google Scholar 

  2. Agichtein, Eugene, Gravano, et al. Snowball: extracting relations from large plain-text collections [J]. 2000:85–94.

    Google Scholar 

  3. Sarhan I, El-Sonbaty Y, El-Nasr M A. Semi-Supervised Pattern Based Algorithm for Arabic Relation Extraction [C]. IEEE, International Conference on TOOLS with Artificial Intelligence. IEEE, 2017:177–183.

    Google Scholar 

  4. Chen C, He L, Lin X. REV: extracting entity relations from World Wide Web [C]. International Conference on Ubiquitous Information Management and Communication. ACM, 2012:8.

    Google Scholar 

  5. Brin S. Extracting Patterns and Relations from the World Wide Web [C]. International Workshop on the World Wide Web and Databases. Springer Berlin Heidelberg, 1998:172–183.

    Google Scholar 

  6. Liu T, Che W, Zhenghua L I. Language Technology Platform [J]. Journal of Chinese Information Processing, 2011, 2(6):13–16.

    Google Scholar 

  7. Tian J L, Wei Z. Words Similarity Algorithm Based on Tongyici Cilin in Semantic Web Adaptive Learning System [J]. Journal of Jilin University, 2010, 28(06).

    Google Scholar 

  8. Mikolov T, Sutskever I, Chen K, et al. Distributed Representations of Words and Phrases and their Compositionality [J]. Advances in neural information processing systems, 2013, 26:3111–3119.

    Google Scholar 

  9. Bille P. A survey on tree edit distance and related problems [J]. Theoretical Computer Science, 2005, 337(1):217–239.

    Google Scholar 

  10. Altınçay H, Erenel Z. Analytical evaluation of term weighting schemes for text categorization [J]. Pattern Recognition Letters, 2010, 31(11):1310–1323.

    Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (No. 61471232).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhentao Qin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Qin, Z., Ye, F. (2019). Research on Reliability of Instance and Pattern in Semi-supervised Entity Relation Extraction. In: Patnaik, S., Jain, V. (eds) Recent Developments in Intelligent Computing, Communication and Devices. Advances in Intelligent Systems and Computing, vol 752. Springer, Singapore. https://doi.org/10.1007/978-981-10-8944-2_44

Download citation

Publish with us

Policies and ethics