Learning the Dialog POMDP Model Components

Chinaei, Hamidreza; Chaib-draa, Brahim

doi:10.1007/978-3-319-26200-0_4

Learning the Dialog POMDP Model Components

Hamidreza Chinaei³ &
Brahim Chaib-draa⁴

Chapter
First Online: 09 February 2016

490 Accesses

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

Abstract

In this chapter, we propose methods for learning the model components of intent-based dialog POMDPs from unannotated and noisy dialogs.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Atrash, A., & Pineau, J. (2010). A Bayesian method for learning POMDP observation parameters for robot interaction management systems. In The POMDP Practitioners Workshop.
Google Scholar
Bishop, C. M. (2006). Pattern recognition and machine learning. Secaucus, New York: Springer.
MATH Google Scholar
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.
MATH Google Scholar
Cassandra, A., Kaelbling, L., & Littman, M. (1995). Acting optimally in partially observable stochastic domains. In Proceedings of the 12th National Conference on Artificial Intelligence (AAAI’95), Seattle, Washington.
Google Scholar
Chinaei, H. R., & Chaib-Draa, B. (2011). Learning dialogue POMDP models from data. In Proceedings of the 24th Canadian Conference on Advances in Artificial Intelligence (Canadian AI’11), St. John’s, Newfoundland.
Google Scholar
Chinaei, H. R., & Chaib-Draa, B. (2014b). Dialogue POMDP components (part i): Learning states and observations. International Journal of Speech Technology, 17(4), 309–323.
Google Scholar
Chinaei, H. R., Chaib-Draa, B., & Lamontagne, L. (2009). Learning user intentions in spoken dialogue systems. In Proceedings of the 1st International Conference on Agents and Artificial Intelligence (ICAART’09), Porto.
Google Scholar
Chinaei, H. R., Chaib-draa, B., & Lamontagne, L. (2012). Learning observation models for dialogue POMDPs. In Proceedings of the 24th Canadian Conference on Advances in Artificial Intelligence (Canadian AI’12), Toronto.
Google Scholar
Doshi, F., & Roy, N. (2007). Efficient model learning for dialog management. In Proceedings of the 2nd ACM SIGCHI/SIGART Conference on Human-Robot Interaction (HRI’07), Arlington, VA.
Google Scholar
Doshi, F., & Roy, N. (2008). Spoken language interaction with model uncertainty: An adaptive human-robot interaction system. Connection Science, 20(4), 299–318.
Article Google Scholar
Griffiths, T., & Steyvers, J. (2004). Finding scientific topics. Proceedings of the National Academy of Science, 101, 5228–5235.
Google Scholar
Gruber, A., & Popat, A. (2007). Notes regarding computations in open htmm. http://openhtmm.googlecode.com/files/htmm_computations.pdf
Google Scholar
Gruber, A., Rosen-Zvi, M., & Weiss, Y. (2007). Hidden topic Markov models. In Artificial Intelligence and Statistics (AISTATS’07), San Juan, PR.
Google Scholar
Kim, D., Kim, J., & Kim, K. (2011). Robust performance evaluation of POMDP-based dialogue systems. IEEE Transactions on Audio, Speech, and Language Processing, 19(4), 1029–1040.
Article Google Scholar
Lison, P. (2013). Model-based bayesian reinforcement learning for dialogue management. In Proceedings of 14th Annual Conference of the International Speech Communication Association (INTERSPEECH’13), Lyon.
Google Scholar
Matsubara, S., Kimura, S., Kawaguchi, N., Yamaguchi, Y., & Inagaki, Y. (2002). Example-based speech intention understanding and its application to in-car spoken dialogue system. In Proceedings of the 19th International Conference on Computational linguistics - Volume 1, Taipei.
Google Scholar
Ortiz, L. E., & Kaelbling, L. P. (1999). Accelerating EM: An empirical study. In Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence (UAI’99), Stockholm.
Google Scholar
Png, S., & Pineau, J. (2011). Bayesian reinforcement learning for POMDP-based dialogue systems. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP’11), Prague.
Google Scholar
Rabiner, L. R. (1990). A tutorial on hidden Markov models and selected applications in speech recognition. In Readings in speech recognition (pp. 267–296). San Francisco: Morgan Kaufmann Publishers.
Google Scholar
Ross, S., Chaib-draa, B., & Pineau, J. (2007). Bayes-adaptive POMDPs. In Proceedings of the 21st Annual Conference on Neural Information Processing Systems (NIPS’07), Vancouver, BC.
Google Scholar
Ross, S., Pineau, J., Chaib-draa, B., & Kreitmann, P. (2011). A Bayesian approach for learning and planning in partially observable Markov decision processes. Journal of Machine Learning Research, 12, 1729–1770.
MathSciNet MATH Google Scholar
Roy, N., Pineau, J., & Thrun, S. (2000). Spoken dialogue management using probabilistic reasoning. In Proceedings of the 38th Annual Meeting on Association for Computational Linguistics (ACL’00), Hong Kong.
Google Scholar
Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. Cambridge: MIT Press.
Google Scholar
Weilhammer, K., Williams, J. D., & Young, S. (2004). The SACTI-2 corpus: Guide for research users. Cambridge University. Technical Report.
Google Scholar
Williams, J. D. (2006). Partially observable Markov decision processes for spoken dialogue management. Ph.D. thesis, Department of Engineering, University of Cambridge.
Google Scholar
Williams, J. D., & Young, S. (2005). The SACTI-1 corpus: Guide for research users. Department of Engineering, University of Cambridge. Technical Report.
Google Scholar
Williams, J. D., & Young, S. (2007). Partially observable Markov decision processes for spoken dialog systems. Computer Speech and Language, 21, 393–422.
Article Google Scholar
Zhang, B., Cai, Q., Mao, J., & Guo, B. (2001b). Planning and acting under uncertainty: A new model for spoken dialogue system. In Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence (UAI’01), Seattle, Washington.
Google Scholar

Download references

Author information

Authors and Affiliations

University of Toronto, Toronto, ON, Canada
Hamidreza Chinaei
Université Laval, Quebec, QC, Canada
Brahim Chaib-draa

Authors

Hamidreza Chinaei
View author publications
You can also search for this author in PubMed Google Scholar
Brahim Chaib-draa
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Chinaei, H., Chaib-draa, B. (2016). Learning the Dialog POMDP Model Components. In: Building Dialogue POMDPs from Expert Dialogues. SpringerBriefs in Electrical and Computer Engineering(). Springer, Cham. https://doi.org/10.1007/978-3-319-26200-0_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-26200-0_4
Published: 09 February 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26198-0
Online ISBN: 978-3-319-26200-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics