Integration of Statistical Dialog Management Techniques to Implement Commercial Dialog Systems

  • David Griol
  • Zoraida Callejas
  • Ramón López-Cózar
Conference paper


In this paper we present a proposal for developing commercial dialog systems that avoids the effort of manually defining the dialog strategy for the dialog manager while retaining the benefits of standards such as VoiceXML. In our proposal, the dialog manager is trained on a labeled dialog corpus and selects the next system response through a neural-network classification process that takes the complete dialog history into account. System developers therefore only need to define a set of VoiceXML files, each containing a system prompt and the associated grammar for recognizing user responses; the statistical dialog model then selects the next system prompt automatically. We have applied this technique to develop a VoiceXML dialog system that provides railway information in Spanish.
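The selection step described above can be illustrated with a minimal sketch. It is not the authors' implementation: a single-layer multiclass perceptron stands in for the neural network classifier, the dialog history is summarized as three binary slot flags (origin, destination, date), and the VoiceXML file names and the four-sample corpus are hypothetical, chosen only to make the example self-contained.

```python
"""Toy sketch: choose the next VoiceXML prompt from a summary of the dialog history."""

# Hypothetical system actions, each backed by one VoiceXML file (prompt + grammar).
VXML_FILES = [
    "ask_origin.vxml",
    "ask_destination.vxml",
    "ask_date.vxml",
    "give_timetable.vxml",
]

# Hypothetical labeled corpus: (origin known, destination known, date known)
# paired with the index of the system prompt an annotator chose next.
CORPUS = [
    ((0, 0, 0), 0),
    ((1, 0, 0), 1),
    ((1, 1, 0), 2),
    ((1, 1, 1), 3),
]


def with_bias(state):
    """Append a constant 1 so each class also learns a bias term."""
    return list(state) + [1]


def predict(weights, state):
    """Return the index of the highest-scoring class (first one on ties)."""
    x = with_bias(state)
    scores = [sum(w * v for w, v in zip(row, x)) for row in weights]
    return scores.index(max(scores))


def train(corpus, n_classes, max_epochs=100):
    """Multiclass perceptron: on each mistake, move the target class toward
    the sample and the wrongly predicted class away from it."""
    n_features = len(corpus[0][0]) + 1
    weights = [[0] * n_features for _ in range(n_classes)]
    for _ in range(max_epochs):
        mistakes = 0
        for state, target in corpus:
            guess = predict(weights, state)
            if guess != target:
                for i, v in enumerate(with_bias(state)):
                    weights[target][i] += v
                    weights[guess][i] -= v
                mistakes += 1
        if mistakes == 0:  # every corpus sample is classified correctly
            break
    return weights


def next_prompt(weights, state):
    """Map the classifier decision to the VoiceXML file to serve next."""
    return VXML_FILES[predict(weights, state)]


WEIGHTS = train(CORPUS, len(VXML_FILES))

if __name__ == "__main__":
    for state, _ in CORPUS:
        print(state, "->", next_prompt(WEIGHTS, state))
```

In this sketch the developer's work is limited to writing the files listed in `VXML_FILES`; the mapping from dialog state to prompt is learned from the corpus rather than hand-coded. A real system would need a richer history encoding and a far larger corpus than the toy data assumed here.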


Interactive Voice Response; Training Corpus; Partially Observable Markov Decision Process; Dialog System; Speak Dialogue System
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





This research has been funded by Spanish project ASIES TIN2010-17344.



Copyright information

© Springer Science+Business Media, LLC 2011

Authors and Affiliations

  • David Griol (1)
  • Zoraida Callejas (2)
  • Ramón López-Cózar (2)
  1. Dept. of Computer Science, Carlos III University of Madrid, Madrid, Spain
  2. Dept. of Languages and Computer Systems, CITIC-UGR, University of Granada, Granada, Spain
