Performance Evaluation of Knowledge Extraction Methods

Rodríguez, Juan M.; Merlino, Hernán D.; Pesado, Patricia; García-Martínez, Ramón

doi:10.1007/978-3-319-42007-3_2

Juan M. Rodríguez^18,19,20,
Hernán D. Merlino^19,20,
Patricia Pesado²¹ &
…
Ramón García-Martínez²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9799))

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

2693 Accesses
3 Citations

Abstract

This paper shows the precision, the recall and the F-measure for the knowledge extraction methods (under Open Information Extraction paradigm): ReVerb, OLLIE and ClausIE. For obtaining these three measures a subset of 55 newswires corpus was used. This subset was taken from the Reuters-21578 text categorization and test collection database. A handmade relation extraction was applied for each one of these newswires.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Banko, M., Cafarella, M. J., Soderland, S., Broadhead, M., Etzioni, O.: Open information extraction for the web. In: IJCAI, vol. 7, pp. 2670–2676, January 2007
Google Scholar
Christensen, J., Soderland, S., Etzioni, O.: An analysis of open information extraction based on semantic role labeling. In: Proceedings of the Sixth International Conference on Knowledge Capture, pp. 113–120. ACM (2011)
Google Scholar
Del Corro, L., Gemulla, R.: ClausIE: clause-based open information extraction. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 355–366. International World Wide Web Conferences Steering Committee, May 2013
Google Scholar
Etzioni, O., Cafarella, M., Downey, D., Popescu, A. M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Unsupervised named-entity extraction from the web: an experimental study. Artif. Intell. 165(1), 91–134 (2005)
Article Google Scholar
Fader, A., Soderland, S., Etzioni, O.: Identifying relations for open information extraction. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1535–1545. Association for Computational Linguistics, July 2011
Google Scholar
Hamburg, M.: Basic Statistics: A Modern Approach. Jovanovich, New York (1979)
MATH Google Scholar
Joachims, T.: Text categorization with support vector machines. In: Nédellec, C., Rouveirol, C. (eds.) Learning with many relevant features, pp. 137–142. Springer, Heidelberg (1998)
Google Scholar
Lewis, D.D.: Reuters-21578 text categorization test collection, distribution 1.0. http://www.research.att.com/~lewis/reuters21578.html
Mesquita, F., Merhav, Y., Barbosa, D.: Extracting information networks from the blogosphere: State-of-the-art and challenges. In: Proceedings of the Fourth AAAI Conference on Weblogs and Social Media (ICWSM), Data Challenge Workshop (2010)
Google Scholar
Mirrezaei, S.I., Martins, B., Cruz, I.F.: The triplex approach for recognizing semantic relations from noun phrases, appositions, and adjectives. In: The Workshop on Knowledge Discovery and Data Mining Meets Linked Open Data (Know@LOD) Co-located with Extended Semantic Web Conference (ESWC), Portoroz, Slovenia (2015)
Google Scholar
Rancan, C., Kogan, A., Pesado, P., García-Martínez, R.: Knowledge discovery for knowledge based systems. Some experimental results. Res. Comput. Sci. J. 27, 3–13 (2007)
Google Scholar
Rodríguez, J.M., García-Martínez, R., Merlino, H.D.: Revisión Sistemática Comparativa de Evolución de Métodos de Extracción de Conocimiento para la Web. XXI Congreso Argentino de Ciencias de la Computación (CACIC 2015), Buenos Aires, Argentina (2015)
Google Scholar
Schmitz, M., Bart, R., Soderland, S., Etzioni, O.: Open language learning for information extraction. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 523–534, July 2012
Google Scholar
Wu, F., Weld, D.S.: Open information extraction using Wikipedia. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 118–127. Association for Computational Linguistics, July 2010
Google Scholar
Yahya, M., Whang, S.E., Gupta, R., Halevy, A.: Renoun: fact extraction for nominal attributes. In: Proceedings 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, October 2014
Google Scholar

Download references

Acknowledgments

The research reported in this paper was partially funded by Projects UNLa-33A205 and UNLa-33B177 of National University of Lanus (Argentina). Authors wish to thank to senior students in our courses within Information Engineering Bachelor Degree at Engineering School - University of Buenos Aires for their help during the experiment.

Author information

Authors and Affiliations

PhD Program on Computer Science, National University of La Plata, La Plata, Argentina
Juan M. Rodríguez
Intelligent Systems Group, University of Buenos Aires, Buenos Aires, Argentina
Juan M. Rodríguez & Hernán D. Merlino
Information Systems Research Group, National University of Lanús, Lanús, Argentina
Juan M. Rodríguez, Hernán D. Merlino & Ramón García-Martínez
III-LIDI. Computer Science School, National University of La Plata – CIC Bs As, La Plata, Argentina
Patricia Pesado

Authors

Juan M. Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Hernán D. Merlino
View author publications
You can also search for this author in PubMed Google Scholar
Patricia Pesado
View author publications
You can also search for this author in PubMed Google Scholar
Ramón García-Martínez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ramón García-Martínez .

Editor information

Editors and Affiliations

Iwate Prefectural University , Iwate, Japan
Hamido Fujita
Department Computer Science, Texas State University, San Marcos, Texas, USA
Moonis Ali
Universiti Teknologi Malaysis (UTM), Bahru, Malaysia
Ali Selamat
Iwate Prefectural University , Iwate, Japan
Jun Sasaki
Iwate Prefectural University , Iwate, Japan
Masaki Kurematsu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rodríguez, J.M., Merlino, H.D., Pesado, P., García-Martínez, R. (2016). Performance Evaluation of Knowledge Extraction Methods. In: Fujita, H., Ali, M., Selamat, A., Sasaki, J., Kurematsu, M. (eds) Trends in Applied Knowledge-Based Systems and Data Science. IEA/AIE 2016. Lecture Notes in Computer Science(), vol 9799. Springer, Cham. https://doi.org/10.1007/978-3-319-42007-3_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-42007-3_2
Published: 14 July 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42006-6
Online ISBN: 978-3-319-42007-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics