Advertisement

Word Recognition Errors and Relevance Feedback in Spoken Query Processing

  • Fabio Crestani
Part of the Advances in Soft Computing book series (AINSC, volume 7)

Abstract

Given the typical length of queries submitted to Information Retrieval systems, it is easy to imagine that the effects of word recognition errors (WRE) in spoken queries must be severely destructive on the system’s effectiveness. The effects of word recognition errors in Spoken Document Retrieval have been well studied and well reported in recent Information Retrieval literature, but much less experimental work has been devoted to studying the effects of word recognition errors in Spoken Query Processing (SQP). The experimental work reported in this paper shows that the use of classical IR techniques for SQP is quite robust to considerably high levels of WRE, in particular for long queries. Moreover, in the case of short queries, both standard relevance feedback and pseudo relevance feedback can be effectively employed to improve the effectiveness of SQP.

Keywords

Average Precision Relevance Feedback Query Expansion Information Retrieval System Original Query 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [Barnett et al., 1997]
    Barnett, J., Anderson, S., Broglio, J., Singh, M., Hudson, R., and Kuo, S. (1997). Experiments in spoken queries for document retrieval. In Eurospeech 97, volume 3, pages 1323–1326, Rodhes, Greece.Google Scholar
  2. [Crestani, 1999a]
    Crestani, F. (1999a). An experimental study of the effects of word recognition errors in spoken queries on the effectiveness of an information retrieval system. Technical Report TR-99–016, International Computer Science Institute, Berkeley, CA, USA.Google Scholar
  3. [Crestani, 1999b]
    Crestani, F. (1999b). Vocal access to a newspaper archive: design issues and preliminary investigation. In Proceedings of ACM Digital Libraries, pages 59–68, Berkeley, CA, USA.Google Scholar
  4. Crestani, 2000) Crestani, F. (2000). Combination of semantic and phonetic term similarity for spoken document retrieval and spoken query processing. In Proceedings of the 8th Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU),Madrid, Spain. In press.Google Scholar
  5. [Frakes and Baeza-Yates, 1992]
    Frakes, W. and Baeza-Yates, R., editors (1992). Information Retrieval: data structures and algorithms. Prentice Hall, Englewood Cliffs, New Jersey, USA.Google Scholar
  6. [Garofolo et al., 1998]
    Garofolo, J., Voorhees, E., Auzanne, C., Stanford, V., and Lund, B. (1998). 1998 TREC-7 spoken document retrieval track overview and results. In Proceedings of the TREC Conference, pages 79–90, Gaithersburg, MD, USA.Google Scholar
  7. [Harman, 1992a]
    Harman, D. (1992a). Ranking algorithms. In Frakes, W. and Baeza-Yates, R., editors, Information Retrieval: data structures and algorithms, chapter 14. Prentice Hall, Englewood Cliffs, New Jersey, USA.Google Scholar
  8. [Harman, 1992b]
    Harman, D. (1992b). Relevance feedback and other query modification techniques. In Frakes, W. and Baeza-Yates, R., editors, Information Retrieval: data structures and algorithms, chapter 11. Prentice Hall, Englewood Cliffs, New Jersey, USA.Google Scholar
  9. [Keller, 1994]
    Keller, E., editor (1994). Fundamentals of Speech Synthesis and Speech Recognition. John Wiley and Sons, Chichester, UK.MATHGoogle Scholar
  10. [Porter, 1980]
    Porter, M. (1980). An algorithm for suffix stripping. Program, 14 (3): 130–137.CrossRefGoogle Scholar
  11. [Robertson and Spark Jones, 1997]
    Robertson, S. and Spark Jones, K. (1997). Simple, proven approaches to text retrieval. Technical Report TR356, Computer Laboratory, University of Cambridge, UK.Google Scholar
  12. [Sanderson, 1996]
    Sanderson, M. (1996). Word Sense Disambiguation and Information Retrieval. PhD Thesis, Department of Computing Science, University of Glasgow, Glasgow, Scotland, UK.Google Scholar
  13. [Singhal et al., 1998]
    Singhal, A., Choi, J., Hindle, D., Lewis, D., and Pereira, F. (1998). ATandT at TREC-7. In Proceedings of the TREC Conference, pages 239253, Washington DC, USA.Google Scholar
  14. [Singhal and Pereira, 1999]
    Singhal, A. and Pereira, F. (1999). Document expansion for speech retrieval. In Proceedings of ACM SIGIR, pages 34–41, Berkeley, CA, USA.Google Scholar
  15. [Voorhees, 1993]
    Voorhees, E. (1993). On expanding query vectors with lexically related words. In Proceedings of the TREC Conference, pages 223–232, Gaithersburg, MD, USA.Google Scholar
  16. [Voorhees and harman, 1998]
    Voorhees, E. and harman, D. (1998). Overview of the seventh text retrieval conference (TREC-7). In Proceedings of the TREC Conference, pages 1–24, Gaithersburg, MD, USA.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • Fabio Crestani
    • 1
  1. 1.Department of Computer ScienceUniversity of StrathclydeGlasgowScotland, UK

Personalised recommendations