Semi-supervised SRL System with Bayesian Inference

Lorenzo, Alejandra; Cerisara, Christophe

doi:10.1007/978-3-642-54906-9_35

Alejandra Lorenzo¹⁷ &
Christophe Cerisara¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8403))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

2036 Accesses

Abstract

We propose a new approach to perform semi-supervised training of Semantic Role Labeling models with very few amount of initial labeled data. The proposed approach combines in a novel way supervised and unsupervised training, by forcing the supervised classifier to overgenerate potential semantic candidates, and then letting unsupervised inference choose the best ones. Hence, the supervised classifier can be trained on a very small corpus and with coarse-grain features, because its precision does not need to be high: its role is mainly to constrain Bayesian inference to explore only a limited part of the full search space. This approach is evaluated on French and English. In both cases, it achieves very good performance and outperforms a strong supervised baseline when only a small number of annotated sentences is available and even without using any previously trained syntactic parser.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Màrquez, L., Carreras, X., Litkowski, K.C., Stevenson, S.: Semantic role labeling: An introduction to the special issue. Comput. Linguist. 34, 145–159 (2008)
Article Google Scholar
Pradhan, S.S., Ward, W., Martin, J.H.: Towards robust semantic role labeling. Comput. Linguist. 34, 289–310 (2008)
Article Google Scholar
He, S., Gildea, H.: Self-training and Cotraining for Semantic Role Labeling: Primary Report. Technical report, TR 891, University of Colorado at Boulder (2006)
Google Scholar
Lee, J.Y., Song, Y.I., Rim, H.C.: Investigation of weakly supervised learning for semantic role labeling. In: ALPIT, pp. 165–170 (2007)
Google Scholar
Daumé III, H.: Semi-supervised or semi-unsupervised? In: Proc. NAACL Wokshop on Semi-supervised Learning for NLP (2009)
Google Scholar
Titov, I., Klementiev, A.: Semi-supervised semantic role labeling: Approaching from an unsupervised perspective. In: Proceedings of the International Conference on Computational Linguistics (COLING), Bombay, India (2012)
Google Scholar
Jain, D., Beetz, M.: Soft evidential update via markov chain monte carlo inference. In: Dillmann, R., Beyerer, J., Hanebeck, U.D., Schultz, T. (eds.) KI 2010. LNCS, vol. 6359, pp. 280–290. Springer, Heidelberg (2010)
Chapter Google Scholar
Palmer, M., Gildea, D., Kingsbury, P.: The proposition bank: An annotated corpus of semantic roles. Comput. Linguist. 31, 71–106 (2005)
Article Google Scholar
Dowty, D.: Thematic proto-roles and argument selection. Language 67, 547–619 (1991)
Article Google Scholar
Bohnet, B.: Top accuracy and fast dependency parsing is not a contradiction. In: Proc. International Conference on Computational Linguistics, Beijing, China (2010)
Google Scholar
Björkelund, A., Hafdell, L., Nugues, P.: In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, CoNLL 2009, pp. 43–48. Association for Computational Linguistics, Stroudsburg (2009)
Chapter Google Scholar
Deschacht, K., Moens, M.F.: Semi-supervised semantic role labeling using the latent words language model. In: EMNLP, pp. 21–29 (2009)
Google Scholar
van der Plas, L., Merlo, P., Henderson, J.: Scaling up automatic cross-lingual semantic role annotation. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers, HLT 2011, vol. 2, pp. 299–304. Association for Computational Linguistics (2011)
Google Scholar
Surdeanu, M., Johansson, R., Meyers, A., Màrquez, L., Nivre, J.: The conll-2008 shared task on joint parsing of syntactic and semantic dependencies. In: Proceedings of the Twelfth Conference on Computational Natural Language Learning, CoNLL 2008, pp. 159–177. Association for Computational Linguistics, Stroudsburg (2008)
Chapter Google Scholar
Zhu, X.: Semi-Supervised Learning Literature Survey. Technical report, Computer Sciences, University of Wisconsin-Madison (2005)
Google Scholar
Pise, N.N., Kulkarni, P.: A survey of semi-supervised learning methods. In: Proceedings of the 2008 International Conference on Computational Intelligence and Security, CIS 2008, vol. 2, pp. 30–34. IEEE Computer Society, Washington, DC (2008)
Chapter Google Scholar
Fürstenau, H., Lapata, M.: Graph alignment for semi-supervised semantic role labeling. In: EMNLP, pp. 11–20 (2009)
Google Scholar
Das, D., Smith, N.A.: Semi-supervised frame-semantic parsing for unknown predicates. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT 2011, vol. 1, pp. 1435–1444. Association for Computational Linguistics, Stroudsburg (2011)
Google Scholar
Haghighi, A., Klein, D.: Prototype-driven learning for sequence models. In: Proceedings of the Main Conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, HLT-NAACL 2006, pp. 320–327. Association for Computational Linguistics, Stroudsburg (2006)
Chapter Google Scholar
Haghighi, A., Klein, D.: Prototype-driven grammar induction. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics. ACL-44, pp. 881–888. Association for Computational Linguistics, Stroudsburg (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

LORIA / UMR 7503, Vandoeuvre-les-Nancy, France
Alejandra Lorenzo & Christophe Cerisara

Authors

Alejandra Lorenzo
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Cerisara
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Computing Research, National Polytechnic Institute, Av. Juan Dios Bátiz, Col. Nueva Industrial Vallejo, 07738, Mexico D.F., Mexico
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lorenzo, A., Cerisara, C. (2014). Semi-supervised SRL System with Bayesian Inference. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2014. Lecture Notes in Computer Science, vol 8403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54906-9_35

Download citation

DOI: https://doi.org/10.1007/978-3-642-54906-9_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54905-2
Online ISBN: 978-3-642-54906-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Semi-supervised SRL System with Bayesian Inference