Abstract
Processing of speech and language inputs has been studied under a statistical point of view for quite some time. Data-driven methods pioneered by speech recognition researchers such as Rabiner [8] and Jelinek [2] in the late 1970s were applied to natural language understanding only in the early 1990s [5]. It is only a decade ago that dialogue management has benefited from statistical modeling and data-driven methods [3]. Following this trend, this book described the recent advances in statistical data-driven methods for spoken dialogue systems, especially within the European CLASSiC project funded under the seventh framework program. The aim of this project, as reflected by this book, was to produce generic methods for statistical optimization from end to end of a spoken dialogue system, starting with speech recognition and ending with speech synthesis. Machine learning techniques, such as reinforcement learning, were expected to provide useful approaches to this problem because of their ability to solve sequential decision-making problems but also because they rely on a strong mathematical background and on interpretable optimization criteria. Reinforcement learning has thus been applied for optimizing dialogue management and natural language generation (NLG) but also, to some extent, to produce user simulation techniques. Other machine learning methods, such as support vector machines (SVM) or Bayesian networks, were also integrated to make a fully operational system. This book summarized the technical and practical results obtained during this project.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Gasic, M., Jurcicek, F., Thomson, B., Yu, K., Young, S.: On-line policy optimisation of spoken dialogue systems via live interaction with human subjects”. In: Proc. of ASRU 2011, Hawaii (USA) (2011)
Jelinek, F.: Continuous speech recognition by statisical methods. IEEE Proceedings 64(4), 532–536 (1976)
Levin, E., Pieraccini, R., Eckert, W.: A stochastic model of human-machine interaction for learning dialog strategies. IEEE Trans. Speech. Audio. Process. 8(1), 11–23 (2000)
Li, L., Balakrishnan, S., Williams, J.: Reinforcement Learning for Dialog Management using Least-Squares Policy Iteration and Fast Feature Selection. In: Proc. of InterSpeech’09, Brighton (UK) (2009)
Pieraccini, R., Levin, E.: Stochastic representation of semantic structure for speech understanding. Speech Commun. 11(2-3), 238–288 (1992)
Pietquin, O., Geist, M., Chandramohan, S., Frezza-Buet, H.: Sample-Efficient Batch Reinforcement Learning for Dialogue Management Optimization. ACM Trans. Speech. Lang. Process. 7(3), 7:1–7:21 (2011)
Pietquin, O., Geist, M., Chandramohan, S.: Sample Efficient On-line Learning of Optimal Dialogue Policies with Kalman Temporal Differences. In: Proc. of IJCAI 2011, Barcelona, Spain (2011)
Rabiner, L.R., Levinson, S.E., Rosenberg, A.E., Wilpon, J.G.: Speaker independent recognition of isolated words using clustering techniques. IEEE Transactions on Acoustics, Speech and Signal Processing AASP- 27(4), 336–349 (1979)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media New York
About this chapter
Cite this chapter
Pietquin, O. (2012). Conclusion and Future Research Directions. In: Lemon, O., Pietquin, O. (eds) Data-Driven Methods for Adaptive Spoken Dialogue Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-4803-7_9
Download citation
DOI: https://doi.org/10.1007/978-1-4614-4803-7_9
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-4802-0
Online ISBN: 978-1-4614-4803-7
eBook Packages: Computer ScienceComputer Science (R0)