Semantic Role Labeling of Speech Transcripts Without Sentence Boundaries

Shrestha, Niraj; Moens, Marie-Francine

doi:10.1007/978-3-030-00794-2_41

Niraj Shrestha¹⁹ &
Marie-Francine Moens¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11107))

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

1356 Accesses

Abstract

Speech data is an extremely rich and important source of information. However, we lack suitable methods for the semantic annotation of speech data. For instance, semantic role labeling (SRL) of speech that has been transcribed by an automated speech recognition (ASR) system is still an unsolved problem. SRL of ASR data is difficult and complex due to the absence of sentence boundaries, punctuation, grammar errors, words that are wrongly transcribed, and word deletions and insertions. In this paper we propose a novel approach to SRL of ASR data based on the following idea: (1) train the SRL system on data segmented into frames, where each frame consists of a predicate and its semantic roles without considering sentence boundaries; (2) label it with the semantics of PropBank roles; and to assist the above (3) train a part-of-speech (POS) tagger to work on noisy and error prone ASR data. Experiments with the OntoNotes corpus show improvements compared to the state-of-the-art SRL applied on ASR data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The transcribed corpus is provided by [8] with the consent of SRI (http://www.sri.com).
2.
http://www.cnts.ua.ac.be/conll2000/chunking/.
3.
We use the subset from 00 to 21 of WSJ.
4.
The subsets of the corpus that we use in this work are: bc/cnn, bc/msnbc, bn/abc, bn/cnn, bn/mnb, bn/nbc, bn/pri, bn/voa as done in [8].

References

Punyakanok, V., Roth, D., Yih, W.: The importance of syntactic parsing and inference in semantic role labeling. Comput. Linguist. 34(2), 257–287 (2008)
Article Google Scholar
Johansson, R., Nugues, P.: Dependency-based semantic role labeling of PropBank. In: Proceedings of the EMNLP, Stroudsburg, PA, USA. ACL, pp. 69–78 (2008)
Google Scholar
Zhao, H., Chen, W., Kit, C., Zhou, G.: Multilingual dependency learning: a huge feature engineering method to semantic dependency parsing. In: Proceedings of the Thirteenth CoNLL 2009, Boulder, Colorado, USA, pp. 55–60 (2009)
Google Scholar
Stenchikova, S., Hakkani-Tür, D., Tür, G.: QASR: question answering using semantic roles for speech interface. In: Proceeding of INTERSPEECH, ISCA (2006)
Google Scholar
Kolomiyets, O., Moens, M.F.: A survey on question answering technology from an information retrieval perspective. Inf. Sci. 181(24), 5412–5434 (2011)
Article MathSciNet Google Scholar
Hüwel, S., Wrede, B.: Situated speech understanding for robust multi-modal human-robot communication. In: Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pp. 391–398. ACL (2006)
Google Scholar
Huang, X., Baker, J., Reddy, R.: A historical perspective of speech recognition. Commun. ACM 57(1), 94–103 (2014)
Article Google Scholar
Favre, B., Bohnet, B., Hakkani-Tür, D.: Evaluation of semantic role labeling and dependency parsing of automatic speech recognition output. In: Proceedings of ICASSP 2010, pp. 5342–5345, March 2010
Google Scholar
Hovy, E., Marcus, M., Palmer, M., Ramshaw, L., Weischedel, R.: OntoNotes: the 90% solution. In: Proceedings of NAACL HLT, Stroudsburg, PA, USA, pp. 57–60. ACL (2006)
Google Scholar
Mohammad, S., Zhu, X., Martin, J.: Semantic role labeling of emotions in Tweets. In: Proceedings of the 5th Workshop on WASSA, Maryland, pp. 32–41. ACL June 2014
Google Scholar
Stolcke, A.: SRILM - an extensible language modeling toolkit. In: Proceedings of the 7th ICSLP 2002, pp. 901–904 (2002)
Google Scholar
Fonseca, E., Rosa, J.: A two-step convolutional neural network approach for semantic role labeling. In: Proceedings of IJCNN 2013, pp. 1–7, August 2013
Google Scholar
Shrestha, N., Moens, M.F.: Semi-automatically alignment of predicates between speech and ontonotes data. In: Proceedings of the 10th edition of LREC 2016 (2016)
Google Scholar
Manning, C.D.: Part-of-speech tagging from 97% to 100%: is it time for some linguistics? In: Gelbukh, A.F. (ed.) CICLing 2011. LNCS, vol. 6608, pp. 171–189. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-19400-9_14
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, KU Leuven, Leuven, Belgium
Niraj Shrestha & Marie-Francine Moens

Authors

Niraj Shrestha
View author publications
You can also search for this author in PubMed Google Scholar
Marie-Francine Moens
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Niraj Shrestha .

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Aleš Horák
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Ivan Kopeček
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Karel Pala

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shrestha, N., Moens, MF. (2018). Semantic Role Labeling of Speech Transcripts Without Sentence Boundaries. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech, and Dialogue. TSD 2018. Lecture Notes in Computer Science(), vol 11107. Springer, Cham. https://doi.org/10.1007/978-3-030-00794-2_41

Download citation

DOI: https://doi.org/10.1007/978-3-030-00794-2_41
Published: 08 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00793-5
Online ISBN: 978-3-030-00794-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics