Support Vector Machines for Semantic Relation Extraction in Spanish Language

Torres, Jefferson Peña; de Piñerez Reyes, Raúl Gutierrez; Bucheli, Víctor A.

doi:10.1007/978-3-319-98998-3_26

Support Vector Machines for Semantic Relation Extraction in Spanish Language

Conference paper
First Online: 19 August 2018

1116 Accesses
5 Citations

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 885))

Abstract

Relation Extraction (RE) is one of the most important topics in NLP (Natural Language Processing). Many tasks such as semantic relation extraction, sentiment analysis, opinion mining, question answering systems and text summarization are supported by RE. The aim of this paper is to present a semantic relations classifier in which are incorporate lexical features, named entity features and syntactic structures. Relations between two entities are classified based on the Datasets for Generic Relation Extraction (reACE). We translate the reACE corpus to the Spanish language for all relation types and subtypes. The results shows a F-score of 75.25%, it is a significant improvement of 11.5% over the baseline model. Finally, we discuss the results according to the model and the useful information to support the forecasting process.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
https://www.cs.cornell.edu/people/tj/svm_light/svm_struct.html.

References

Textblob: Simplified text processing. http://textblob.readthedocs.org/. Accessed 22 Feb 2015
Crammer, K., Singer, Y.: On the algorithmic implementation of multiclass kernel-based vector machines. J. Mach. Learn. Res. 2(Dec), 265–292 (2001)
MATH Google Scholar
Culotta, A., Sorensen, J.: Dependency tree kernels for relation extraction. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, ACL 2004, Stroudsburg, PA, USA. Association for Computational Linguistics (2004)
Google Scholar
Doddington, G.R., Mitchell, A., Przybocki, M.A., Ramshaw, L.A., Strassel, S., Weischedel, R.M.: The automatic content extraction (ACE) program-tasks, data, and evaluation (2004)
Google Scholar
Gutiérrez, R., Castillo, A., Bucheli, V., Solarte, O.: Named entity recognition for Spanish language and applications in technology forecasting reconocimiento de entidades nombradas para el idioma español y su aplicación en la vigilancia tecnológica (2015)
Google Scholar
Hachey, B., Grover, C., Tobin, R.: Datasets for generic relation extraction. Nat. Lang. Eng. 18(1), 21–59 (2012)
Article Google Scholar
Joachims, T.: Support vector machines for complex outputs (2008). http://www.cs.cornell.edu/people/tj/svm_light/svm_struct.html
Joachims, T.: Making large-scale SVM learning practical. Technical report, SFB 475: Komplexitätsreduktion in Multivariaten Datenstrukturen, Universität Dortmund (1998)
Google Scholar
Jurafsky, D., Martin, J.H.: Speech and Language Processing. Pearson Education, London (2009). International edition
Google Scholar
Kambhatla, N.: Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations. In: Proceedings of the ACL 2004 on Interactive Poster and Demonstration Sessions, ACLdemo 2004, Stroudsburg, PA, USA. Association for Computational Linguistics (2004)
Google Scholar
Kumar, S.: A survey of deep learning methods for relation extraction. CoRR, abs/1705.03645 (2017)
Google Scholar
Manning, C., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S., McClosky, D.: The Stanford CoreNLP natural language processing toolkit. In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 55–60 (2014)
Google Scholar
Miller, S., Fox, H., Ramshaw, L., Weischedel, R.: A novel use of statistical parsing to extract information from text. In: Proceedings of the 1st North American Chapter of the Association for Computational Linguistics Conference, NAACL 2000, Stroudsburg, PA, USA, pp. 226–233. Association for Computational Linguistics (2000)
Google Scholar
Moschitti, A.: A study on convolution kernels for shallow semantic parsing. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 335. Association for Computational Linguistics (2004)
Google Scholar
Padró, L., Stanilovsky, E.: Freeling 3.0: towards wider multilinguality. In: LREC2012 (2012)
Google Scholar
Song, Z., et al.: ACE 2007 multilingual training corpus LDC2014t18. Linguistic Data Consortium, Philadelphia (2014)
Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995). https://doi.org/10.1007/978-1-4757-3264-1
Book MATH Google Scholar
Walker, C., et al.: ACE 2005 multilingual training corpus LDC2006t06. Linguistic Data Consortium, Philadelphia (2006)
Google Scholar
Zelenko, D., Aone, C., Richardella, A.: Kernel methods for relation extraction. J. Mach. Learn. Res. 3, 1083–1106 (2003)
MathSciNet MATH Google Scholar
Zeng, D., Liu, K., Lai, S., Zhou, G., Zhao, J.: Relation classification via convolutional deep neural network. In: Proceedings of COLING 2014, The 25th International Conference on Computational Linguistics: Technical Papers, pp. 2335–2344 (2014)
Google Scholar
Zhang, Z.: Weakly-supervised relation classification for information extraction. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, CIKM 2004, pp. 581–588. ACM, New York (2004)
Google Scholar

Download references

Acknowledgments

The authors would like to acknowledge the systems and Computing Engineering school, Faculty of Engineer, The Universidad del Valle of Cali, Colombia.

Author information

Authors and Affiliations

Universidad del Valle, Cali, Colombia
Jefferson Peña Torres, Raúl Gutierrez de Piñerez Reyes & Víctor A. Bucheli

Authors

Jefferson Peña Torres
View author publications
You can also search for this author in PubMed Google Scholar
Raúl Gutierrez de Piñerez Reyes
View author publications
You can also search for this author in PubMed Google Scholar
Víctor A. Bucheli
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jefferson Peña Torres .

Editor information

Editors and Affiliations

Universidad Tecnológica de Bolívar, Cartagena, Colombia
Jairo E. Serrano C.
Universidad Tecnológica de Bolívar, Cartagena, Colombia
Juan Carlos Martínez-Santos

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Torres, J.P., de Piñerez Reyes, R.G., Bucheli, V.A. (2018). Support Vector Machines for Semantic Relation Extraction in Spanish Language. In: Serrano C., J., Martínez-Santos, J. (eds) Advances in Computing. CCC 2018. Communications in Computer and Information Science, vol 885. Springer, Cham. https://doi.org/10.1007/978-3-319-98998-3_26

Download citation

DOI: https://doi.org/10.1007/978-3-319-98998-3_26
Published: 19 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98997-6
Online ISBN: 978-3-319-98998-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics