Abstract
Given the typical length of queries submitted to Information Retrieval systems, it is easy to imagine that the effects of word recognition errors (WRE) in spoken queries must be severely destructive on the system’s effectiveness. The effects of word recognition errors in Spoken Document Retrieval have been well studied and well reported in recent Information Retrieval literature, but much less experimental work has been devoted to studying the effects of word recognition errors in Spoken Query Processing (SQP). The experimental work reported in this paper shows that the use of classical IR techniques for SQP is quite robust to considerably high levels of WRE, in particular for long queries. Moreover, in the case of short queries, both standard relevance feedback and pseudo relevance feedback can be effectively employed to improve the effectiveness of SQP.
Most of the work reported in this paper was carried out while the author was at the International Computer Science Institute, Berkeley, CA, USA.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barnett, J., Anderson, S., Broglio, J., Singh, M., Hudson, R., and Kuo, S. (1997). Experiments in spoken queries for document retrieval. In Eurospeech 97, volume 3, pages 1323–1326, Rodhes, Greece.
Crestani, F. (1999a). An experimental study of the effects of word recognition errors in spoken queries on the effectiveness of an information retrieval system. Technical Report TR-99–016, International Computer Science Institute, Berkeley, CA, USA.
Crestani, F. (1999b). Vocal access to a newspaper archive: design issues and preliminary investigation. In Proceedings of ACM Digital Libraries, pages 59–68, Berkeley, CA, USA.
Crestani, 2000) Crestani, F. (2000). Combination of semantic and phonetic term similarity for spoken document retrieval and spoken query processing. In Proceedings of the 8th Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU),Madrid, Spain. In press.
Frakes, W. and Baeza-Yates, R., editors (1992). Information Retrieval: data structures and algorithms. Prentice Hall, Englewood Cliffs, New Jersey, USA.
Garofolo, J., Voorhees, E., Auzanne, C., Stanford, V., and Lund, B. (1998). 1998 TREC-7 spoken document retrieval track overview and results. In Proceedings of the TREC Conference, pages 79–90, Gaithersburg, MD, USA.
Harman, D. (1992a). Ranking algorithms. In Frakes, W. and Baeza-Yates, R., editors, Information Retrieval: data structures and algorithms, chapter 14. Prentice Hall, Englewood Cliffs, New Jersey, USA.
Harman, D. (1992b). Relevance feedback and other query modification techniques. In Frakes, W. and Baeza-Yates, R., editors, Information Retrieval: data structures and algorithms, chapter 11. Prentice Hall, Englewood Cliffs, New Jersey, USA.
Keller, E., editor (1994). Fundamentals of Speech Synthesis and Speech Recognition. John Wiley and Sons, Chichester, UK.
Porter, M. (1980). An algorithm for suffix stripping. Program, 14 (3): 130–137.
Robertson, S. and Spark Jones, K. (1997). Simple, proven approaches to text retrieval. Technical Report TR356, Computer Laboratory, University of Cambridge, UK.
Sanderson, M. (1996). Word Sense Disambiguation and Information Retrieval. PhD Thesis, Department of Computing Science, University of Glasgow, Glasgow, Scotland, UK.
Singhal, A., Choi, J., Hindle, D., Lewis, D., and Pereira, F. (1998). ATandT at TREC-7. In Proceedings of the TREC Conference, pages 239253, Washington DC, USA.
Singhal, A. and Pereira, F. (1999). Document expansion for speech retrieval. In Proceedings of ACM SIGIR, pages 34–41, Berkeley, CA, USA.
Voorhees, E. (1993). On expanding query vectors with lexically related words. In Proceedings of the TREC Conference, pages 223–232, Gaithersburg, MD, USA.
Voorhees, E. and harman, D. (1998). Overview of the seventh text retrieval conference (TREC-7). In Proceedings of the TREC Conference, pages 1–24, Gaithersburg, MD, USA.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Crestani, F. (2001). Word Recognition Errors and Relevance Feedback in Spoken Query Processing. In: Larsen, H.L., Andreasen, T., Christiansen, H., Kacprzyk, J., Zadrożny, S. (eds) Flexible Query Answering Systems. Advances in Soft Computing, vol 7. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1834-5_25
Download citation
DOI: https://doi.org/10.1007/978-3-7908-1834-5_25
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-1347-0
Online ISBN: 978-3-7908-1834-5
eBook Packages: Springer Book Archive