Skip to main content

Performance Evaluation of Knowledge Extraction Methods

  • Conference paper
  • First Online:
Trends in Applied Knowledge-Based Systems and Data Science (IEA/AIE 2016)

Abstract

This paper shows the precision, the recall and the F-measure for the knowledge extraction methods (under Open Information Extraction paradigm): ReVerb, OLLIE and ClausIE. For obtaining these three measures a subset of 55 newswires corpus was used. This subset was taken from the Reuters-21578 text categorization and test collection database. A handmade relation extraction was applied for each one of these newswires.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Banko, M., Cafarella, M. J., Soderland, S., Broadhead, M., Etzioni, O.: Open information extraction for the web. In: IJCAI, vol. 7, pp. 2670–2676, January 2007

    Google Scholar 

  • Christensen, J., Soderland, S., Etzioni, O.: An analysis of open information extraction based on semantic role labeling. In: Proceedings of the Sixth International Conference on Knowledge Capture, pp. 113–120. ACM (2011)

    Google Scholar 

  • Del Corro, L., Gemulla, R.: ClausIE: clause-based open information extraction. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 355–366. International World Wide Web Conferences Steering Committee, May 2013

    Google Scholar 

  • Etzioni, O., Cafarella, M., Downey, D., Popescu, A. M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Unsupervised named-entity extraction from the web: an experimental study. Artif. Intell. 165(1), 91–134 (2005)

    Article  Google Scholar 

  • Fader, A., Soderland, S., Etzioni, O.: Identifying relations for open information extraction. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1535–1545. Association for Computational Linguistics, July 2011

    Google Scholar 

  • Hamburg, M.: Basic Statistics: A Modern Approach. Jovanovich, New York (1979)

    MATH  Google Scholar 

  • Joachims, T.: Text categorization with support vector machines. In: Nédellec, C., Rouveirol, C. (eds.) Learning with many relevant features, pp. 137–142. Springer, Heidelberg (1998)

    Google Scholar 

  • Lewis, D.D.: Reuters-21578 text categorization test collection, distribution 1.0. http://www.research.att.com/~lewis/reuters21578.html

  • Mesquita, F., Merhav, Y., Barbosa, D.: Extracting information networks from the blogosphere: State-of-the-art and challenges. In: Proceedings of the Fourth AAAI Conference on Weblogs and Social Media (ICWSM), Data Challenge Workshop (2010)

    Google Scholar 

  • Mirrezaei, S.I., Martins, B., Cruz, I.F.: The triplex approach for recognizing semantic relations from noun phrases, appositions, and adjectives. In: The Workshop on Knowledge Discovery and Data Mining Meets Linked Open Data (Know@LOD) Co-located with Extended Semantic Web Conference (ESWC), Portoroz, Slovenia (2015)

    Google Scholar 

  • Rancan, C., Kogan, A., Pesado, P., García-Martínez, R.: Knowledge discovery for knowledge based systems. Some experimental results. Res. Comput. Sci. J. 27, 3–13 (2007)

    Google Scholar 

  • Rodríguez, J.M., García-Martínez, R., Merlino, H.D.: Revisión Sistemática Comparativa de Evolución de Métodos de Extracción de Conocimiento para la Web. XXI Congreso Argentino de Ciencias de la Computación (CACIC 2015), Buenos Aires, Argentina (2015)

    Google Scholar 

  • Schmitz, M., Bart, R., Soderland, S., Etzioni, O.: Open language learning for information extraction. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 523–534, July 2012

    Google Scholar 

  • Wu, F., Weld, D.S.: Open information extraction using Wikipedia. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 118–127. Association for Computational Linguistics, July 2010

    Google Scholar 

  • Yahya, M., Whang, S.E., Gupta, R., Halevy, A.: Renoun: fact extraction for nominal attributes. In: Proceedings 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, October 2014

    Google Scholar 

Download references

Acknowledgments

The research reported in this paper was partially funded by Projects UNLa-33A205 and UNLa-33B177 of National University of Lanus (Argentina). Authors wish to thank to senior students in our courses within Information Engineering Bachelor Degree at Engineering School - University of Buenos Aires for their help during the experiment.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ramón García-Martínez .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Rodríguez, J.M., Merlino, H.D., Pesado, P., García-Martínez, R. (2016). Performance Evaluation of Knowledge Extraction Methods. In: Fujita, H., Ali, M., Selamat, A., Sasaki, J., Kurematsu, M. (eds) Trends in Applied Knowledge-Based Systems and Data Science. IEA/AIE 2016. Lecture Notes in Computer Science(), vol 9799. Springer, Cham. https://doi.org/10.1007/978-3-319-42007-3_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42007-3_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42006-6

  • Online ISBN: 978-3-319-42007-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics