Skip to main content

Semi-supervised SRL System with Bayesian Inference

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2014)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8403))

  • 2036 Accesses

Abstract

We propose a new approach to perform semi-supervised training of Semantic Role Labeling models with very few amount of initial labeled data. The proposed approach combines in a novel way supervised and unsupervised training, by forcing the supervised classifier to overgenerate potential semantic candidates, and then letting unsupervised inference choose the best ones. Hence, the supervised classifier can be trained on a very small corpus and with coarse-grain features, because its precision does not need to be high: its role is mainly to constrain Bayesian inference to explore only a limited part of the full search space. This approach is evaluated on French and English. In both cases, it achieves very good performance and outperforms a strong supervised baseline when only a small number of annotated sentences is available and even without using any previously trained syntactic parser.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Màrquez, L., Carreras, X., Litkowski, K.C., Stevenson, S.: Semantic role labeling: An introduction to the special issue. Comput. Linguist. 34, 145–159 (2008)

    Article  Google Scholar 

  2. Pradhan, S.S., Ward, W., Martin, J.H.: Towards robust semantic role labeling. Comput. Linguist. 34, 289–310 (2008)

    Article  Google Scholar 

  3. He, S., Gildea, H.: Self-training and Cotraining for Semantic Role Labeling: Primary Report. Technical report, TR 891, University of Colorado at Boulder (2006)

    Google Scholar 

  4. Lee, J.Y., Song, Y.I., Rim, H.C.: Investigation of weakly supervised learning for semantic role labeling. In: ALPIT, pp. 165–170 (2007)

    Google Scholar 

  5. Daumé III, H.: Semi-supervised or semi-unsupervised? In: Proc. NAACL Wokshop on Semi-supervised Learning for NLP (2009)

    Google Scholar 

  6. Titov, I., Klementiev, A.: Semi-supervised semantic role labeling: Approaching from an unsupervised perspective. In: Proceedings of the International Conference on Computational Linguistics (COLING), Bombay, India (2012)

    Google Scholar 

  7. Jain, D., Beetz, M.: Soft evidential update via markov chain monte carlo inference. In: Dillmann, R., Beyerer, J., Hanebeck, U.D., Schultz, T. (eds.) KI 2010. LNCS, vol. 6359, pp. 280–290. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  8. Palmer, M., Gildea, D., Kingsbury, P.: The proposition bank: An annotated corpus of semantic roles. Comput. Linguist. 31, 71–106 (2005)

    Article  Google Scholar 

  9. Dowty, D.: Thematic proto-roles and argument selection. Language 67, 547–619 (1991)

    Article  Google Scholar 

  10. Bohnet, B.: Top accuracy and fast dependency parsing is not a contradiction. In: Proc. International Conference on Computational Linguistics, Beijing, China (2010)

    Google Scholar 

  11. Björkelund, A., Hafdell, L., Nugues, P.: In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, CoNLL 2009, pp. 43–48. Association for Computational Linguistics, Stroudsburg (2009)

    Chapter  Google Scholar 

  12. Deschacht, K., Moens, M.F.: Semi-supervised semantic role labeling using the latent words language model. In: EMNLP, pp. 21–29 (2009)

    Google Scholar 

  13. van der Plas, L., Merlo, P., Henderson, J.: Scaling up automatic cross-lingual semantic role annotation. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers, HLT 2011, vol. 2, pp. 299–304. Association for Computational Linguistics (2011)

    Google Scholar 

  14. Surdeanu, M., Johansson, R., Meyers, A., Màrquez, L., Nivre, J.: The conll-2008 shared task on joint parsing of syntactic and semantic dependencies. In: Proceedings of the Twelfth Conference on Computational Natural Language Learning, CoNLL 2008, pp. 159–177. Association for Computational Linguistics, Stroudsburg (2008)

    Chapter  Google Scholar 

  15. Zhu, X.: Semi-Supervised Learning Literature Survey. Technical report, Computer Sciences, University of Wisconsin-Madison (2005)

    Google Scholar 

  16. Pise, N.N., Kulkarni, P.: A survey of semi-supervised learning methods. In: Proceedings of the 2008 International Conference on Computational Intelligence and Security, CIS 2008, vol. 2, pp. 30–34. IEEE Computer Society, Washington, DC (2008)

    Chapter  Google Scholar 

  17. Fürstenau, H., Lapata, M.: Graph alignment for semi-supervised semantic role labeling. In: EMNLP, pp. 11–20 (2009)

    Google Scholar 

  18. Das, D., Smith, N.A.: Semi-supervised frame-semantic parsing for unknown predicates. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT 2011, vol. 1, pp. 1435–1444. Association for Computational Linguistics, Stroudsburg (2011)

    Google Scholar 

  19. Haghighi, A., Klein, D.: Prototype-driven learning for sequence models. In: Proceedings of the Main Conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, HLT-NAACL 2006, pp. 320–327. Association for Computational Linguistics, Stroudsburg (2006)

    Chapter  Google Scholar 

  20. Haghighi, A., Klein, D.: Prototype-driven grammar induction. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics. ACL-44, pp. 881–888. Association for Computational Linguistics, Stroudsburg (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lorenzo, A., Cerisara, C. (2014). Semi-supervised SRL System with Bayesian Inference. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2014. Lecture Notes in Computer Science, vol 8403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54906-9_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-54906-9_35

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-54905-2

  • Online ISBN: 978-3-642-54906-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics