Abstract
Relation Extraction (RE) is one of the most important topics in NLP (Natural Language Processing). Many tasks such as semantic relation extraction, sentiment analysis, opinion mining, question answering systems and text summarization are supported by RE. The aim of this paper is to present a semantic relations classifier in which are incorporate lexical features, named entity features and syntactic structures. Relations between two entities are classified based on the Datasets for Generic Relation Extraction (reACE). We translate the reACE corpus to the Spanish language for all relation types and subtypes. The results shows a F-score of 75.25%, it is a significant improvement of 11.5% over the baseline model. Finally, we discuss the results according to the model and the useful information to support the forecasting process.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Textblob: Simplified text processing. http://textblob.readthedocs.org/. Accessed 22 Feb 2015
Crammer, K., Singer, Y.: On the algorithmic implementation of multiclass kernel-based vector machines. J. Mach. Learn. Res. 2(Dec), 265–292 (2001)
Culotta, A., Sorensen, J.: Dependency tree kernels for relation extraction. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, ACL 2004, Stroudsburg, PA, USA. Association for Computational Linguistics (2004)
Doddington, G.R., Mitchell, A., Przybocki, M.A., Ramshaw, L.A., Strassel, S., Weischedel, R.M.: The automatic content extraction (ACE) program-tasks, data, and evaluation (2004)
Gutiérrez, R., Castillo, A., Bucheli, V., Solarte, O.: Named entity recognition for Spanish language and applications in technology forecasting reconocimiento de entidades nombradas para el idioma español y su aplicación en la vigilancia tecnológica (2015)
Hachey, B., Grover, C., Tobin, R.: Datasets for generic relation extraction. Nat. Lang. Eng. 18(1), 21–59 (2012)
Joachims, T.: Support vector machines for complex outputs (2008). http://www.cs.cornell.edu/people/tj/svm_light/svm_struct.html
Joachims, T.: Making large-scale SVM learning practical. Technical report, SFB 475: Komplexitätsreduktion in Multivariaten Datenstrukturen, Universität Dortmund (1998)
Jurafsky, D., Martin, J.H.: Speech and Language Processing. Pearson Education, London (2009). International edition
Kambhatla, N.: Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations. In: Proceedings of the ACL 2004 on Interactive Poster and Demonstration Sessions, ACLdemo 2004, Stroudsburg, PA, USA. Association for Computational Linguistics (2004)
Kumar, S.: A survey of deep learning methods for relation extraction. CoRR, abs/1705.03645 (2017)
Manning, C., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S., McClosky, D.: The Stanford CoreNLP natural language processing toolkit. In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 55–60 (2014)
Miller, S., Fox, H., Ramshaw, L., Weischedel, R.: A novel use of statistical parsing to extract information from text. In: Proceedings of the 1st North American Chapter of the Association for Computational Linguistics Conference, NAACL 2000, Stroudsburg, PA, USA, pp. 226–233. Association for Computational Linguistics (2000)
Moschitti, A.: A study on convolution kernels for shallow semantic parsing. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 335. Association for Computational Linguistics (2004)
Padró, L., Stanilovsky, E.: Freeling 3.0: towards wider multilinguality. In: LREC2012 (2012)
Song, Z., et al.: ACE 2007 multilingual training corpus LDC2014t18. Linguistic Data Consortium, Philadelphia (2014)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995). https://doi.org/10.1007/978-1-4757-3264-1
Walker, C., et al.: ACE 2005 multilingual training corpus LDC2006t06. Linguistic Data Consortium, Philadelphia (2006)
Zelenko, D., Aone, C., Richardella, A.: Kernel methods for relation extraction. J. Mach. Learn. Res. 3, 1083–1106 (2003)
Zeng, D., Liu, K., Lai, S., Zhou, G., Zhao, J.: Relation classification via convolutional deep neural network. In: Proceedings of COLING 2014, The 25th International Conference on Computational Linguistics: Technical Papers, pp. 2335–2344 (2014)
Zhang, Z.: Weakly-supervised relation classification for information extraction. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, CIKM 2004, pp. 581–588. ACM, New York (2004)
Acknowledgments
The authors would like to acknowledge the systems and Computing Engineering school, Faculty of Engineer, The Universidad del Valle of Cali, Colombia.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Torres, J.P., de Piñerez Reyes, R.G., Bucheli, V.A. (2018). Support Vector Machines for Semantic Relation Extraction in Spanish Language. In: Serrano C., J., Martínez-Santos, J. (eds) Advances in Computing. CCC 2018. Communications in Computer and Information Science, vol 885. Springer, Cham. https://doi.org/10.1007/978-3-319-98998-3_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-98998-3_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98997-6
Online ISBN: 978-3-319-98998-3
eBook Packages: Computer ScienceComputer Science (R0)