Detecting Multiple Domains from User’s Utterance in Spoken Dialog System

Ryu, Seonghan; Song, Jaiyoun; Koo, Sangjun; Kwon, Soonchoul; Lee, G. G.

doi:10.1007/978-3-319-19291-8_10

Seonghan Ryu⁵,
Jaiyoun Song⁵,
Sangjun Koo⁵,
Soonchoul Kwon⁵ &
…
G. G. Lee⁵

1092 Accesses
1 Citations

Abstract

Multi-domain spoken dialog system should be able to detect more than one domain from a user’s utterance. However, it is difficult to train an accurate binary classifier of a domain based on only positive and unlabeled examples. This paper improves hierarchical clustering algorithm to automatically identify reliable negative examples among unlabeled examples. This paper also verifies three linkage criteria that measure the distance between two clusters. In experiments, the proposed method resulted in the highest gain of F ₁ score compared to the existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Chang C, Lin C (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):27:1–27:27
Google Scholar
Dempster AP, Laird NM, Rubin DB (1997) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B Methodol 39(1):1–38
Google Scholar
Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning, 2nd edn. Springer, New York, pp 520–528
Book MATH Google Scholar
Lane I, Kawahara T, Matsui T, Nakamura S (2007) Out-of-domain utterance detection using classification confidences of multiple topics. IEEE Trans Audio Speech Lang Process 15(1):150–161
Article Google Scholar
Li X, Liu B (2003) Learning to classify texts using positive and unlabeled data. In: Proceedings of the 18th international joint conference on artificial intelligence, Acapulco, Mexico, August 2003
Google Scholar
Li X, Roth D (2002) Learning question classifiers. In: Proceedings of the 19th international conference on computational linguistics, Taipei, Taiwan, September 2002
Google Scholar
Liu B, Lee WS, Yu PS, Li X (2002) Partially supervised classification of text documents. In: Proceedings of the 19th international conference on machine learning, New South Wales, Sydney, July 2002
Google Scholar
Liu B, Dai Y, Li X, Lee WS, Yu PS (2003) Building text classifiers using positive and unlabeled examples. In: Proceedings of the 3rd IEEE international conference on data mining, Melbourne, Florida, USA, November 2003
Google Scholar
McCallum A, Nigam K (1998) A comparison of event models for Naive Bayes text classification. In: Proceedings of the 15th natural conference on artificial intelligence: workshop on learning from text categorization, Madison, Wisconsin, USA, July 1998
Google Scholar
Rocchio J (1971) Relevance feedback in information retrieval. In: The smart retrieval system: experiments in automatic document processing, Englewood Cliffs, New Jersey, USA, 1971
Google Scholar
Ryu S, Lee D, Lee I, Han S, Lee GG, Kim M, Kim K (2012) A hierarchical domain model-based multi-domain selection framework for multi-domain dialog systems. In: Proceedings of the 24th international conference on computational linguistics, Mumbai, India, December 2012
Google Scholar
Schölkopf B, Platt JC, Shawe-Taylor J, Smola AJ, Williamson RC (2001) Estimating the support of a high-dimensional distribution. Neural Comput 13(7):1443–1471
Article MATH Google Scholar
Yu H, Han J, Chang KC (2002) PEBL: positive example based learning for web page classification using SVM. In: Proceedings of the 8th ACM SIGKDD international conference of knowledge discovery and data mining, Edmonton, Alberta, Canada, July 2002
Google Scholar

Download references

Acknowledgments

This work was supported by ICT R&D program of MSIP/IITP [14-824-09-014, Basic Software Research in Human-level Lifelong Machine Learning (Machine Learning Center)]. This work was supported by National Research Foundation of Korean (NRF) [NRF-2014R1A2A1A01003041, Development of Multi-party Anticipatory Knowledge-Intensive Natural Language Dialog System].

Author information

Authors and Affiliations

Pohang University of Science and Technology, Pohang, Republic of Korea
Seonghan Ryu, Jaiyoun Song, Sangjun Koo, Soonchoul Kwon & G. G. Lee

Authors

Seonghan Ryu
View author publications
You can also search for this author in PubMed Google Scholar
Jaiyoun Song
View author publications
You can also search for this author in PubMed Google Scholar
Sangjun Koo
View author publications
You can also search for this author in PubMed Google Scholar
Soonchoul Kwon
View author publications
You can also search for this author in PubMed Google Scholar
G. G. Lee
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Seonghan Ryu .

Editor information

Editors and Affiliations

Department of Computer Science and Engin, Pohang University of Science & Tech, Namgu, Pohang, Korea (Republic of)
G.G. Lee
School of Information and Communications, Gwangju Institute of Science and Tech, Buk-gu, Gwangju, Korea (Republic of)
H.K. Kim
Microsoft Corporation, Redmond, Washington, USA
M. Jeong
Dept of Computer Science and Engineering, Sogang University, Mapo-gu, Seoul, Korea (Republic of)
J.-H. Kim

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ryu, S., Song, J., Koo, S., Kwon, S., Lee, G.G. (2015). Detecting Multiple Domains from User’s Utterance in Spoken Dialog System. In: Lee, G., Kim, H., Jeong, M., Kim, JH. (eds) Natural Language Dialog Systems and Intelligent Assistants. Springer, Cham. https://doi.org/10.1007/978-3-319-19291-8_10

Download citation

DOI: https://doi.org/10.1007/978-3-319-19291-8_10
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19290-1
Online ISBN: 978-3-319-19291-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics