Abstract
In this paper, we address the modeling of topic and role information in multiparty meetings via a nonparametric Bayesian model, the hierarchical Dirichlet process. This model provides a powerful solution to topic modeling and a flexible framework for incorporating additional cues such as speaker role information. We present our modeling framework for topic and role information using the AMI Meeting Corpus, and illustrate the effectiveness of the approach by adapting a baseline language model (LM) in a large-vocabulary automatic speech recognition system for multiparty meetings. The adapted LM produces significant improvements in terms of both perplexity and word error rate.
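The abstract describes adapting a baseline LM with topic information. As a rough illustration of the general idea (not the authors' exact method), the sketch below scales background unigram probabilities toward topic-specific marginals and renormalizes, in the style of dynamic-marginal LM adaptation; all probabilities and the `beta` exponent are made-up values for demonstration.

```python
# Hedged sketch of topic-based LM adaptation via unigram marginal scaling.
# This is a simplified, assumed illustration: the paper's actual adaptation
# uses HDP-derived topic information, which is not reproduced here.

def adapt_unigram(background, topic_marginal, beta=0.5):
    """Scale each background probability by (topic / background) ** beta,
    then renormalize so the adapted distribution sums to one."""
    scaled = {
        w: p * (topic_marginal.get(w, p) / p) ** beta
        for w, p in background.items()
    }
    z = sum(scaled.values())
    return {w: s / z for w, s in scaled.items()}

# Toy background LM and a topic marginal favoring meeting vocabulary.
background = {"meeting": 0.2, "agenda": 0.1, "cat": 0.7}
topic = {"meeting": 0.5, "agenda": 0.3, "cat": 0.2}
adapted = adapt_unigram(background, topic)
# Topic-relevant words ("meeting", "agenda") gain probability mass,
# while off-topic words ("cat") lose mass.
```

The exponent `beta` controls how strongly the topic marginal pulls the adapted distribution away from the background; `beta = 0` leaves the background LM unchanged.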
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
Cite this paper
Huang, S., Renals, S. (2008). Modeling Topic and Role Information in Meetings Using the Hierarchical Dirichlet Process. In: Popescu-Belis, A., Stiefelhagen, R. (eds) Machine Learning for Multimodal Interaction. MLMI 2008. Lecture Notes in Computer Science, vol 5237. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85853-9_20
Print ISBN: 978-3-540-85852-2
Online ISBN: 978-3-540-85853-9