Improving FAQ Retrieval Using Query Log Clustering in Latent Semantic Space

  • Conference paper
Information Retrieval Technology (AIRS 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3689))

Abstract

Lexical disagreement problems often occur in FAQ retrieval because FAQs, unlike general documents, consist of just one or two sentences. To resolve these problems, we propose a high-performance FAQ retrieval system that uses query log clustering. At indexing time, the proposed system uses latent semantic analysis techniques to classify and group the logs of users’ queries into predefined FAQ categories. At retrieval time, it uses the query log clusters as a form of FAQ smoothing. In our experiment, we found that the proposed system could resolve some lexical disagreement problems between queries and FAQs.
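The retrieval-time idea can be illustrated with a minimal sketch. The abstract does not give the scoring formula, so the code below assumes a simple linear interpolation between a unigram model of the FAQ and a unigram model of its query log cluster. The cluster assignments, which the paper obtains with latent semantic analysis, are taken as given here. The function names, the toy data, and the parameters `lam` and `eps` are all hypothetical.

```python
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption; the abstract names none)."""
    return text.lower().split()


def score(query, faq_text, cluster_queries, lam=0.5, eps=1e-6):
    """Query likelihood under a FAQ model smoothed by its query log cluster.

    lam weights the cluster model; eps keeps unseen terms from zeroing the product.
    """
    faq_counts = Counter(tokenize(faq_text))
    cluster_counts = Counter(t for q in cluster_queries for t in tokenize(q))
    faq_total = sum(faq_counts.values())
    cluster_total = sum(cluster_counts.values())
    prob = 1.0
    for term in tokenize(query):
        p_faq = faq_counts[term] / faq_total if faq_total else 0.0
        p_cluster = cluster_counts[term] / cluster_total if cluster_total else 0.0
        prob *= (1 - lam) * p_faq + lam * p_cluster + eps
    return prob


def rank(query, faqs, clusters, lam=0.5):
    """Return FAQ ids ordered from best to worst score."""
    scored = {
        fid: score(query, text, clusters.get(fid, []), lam)
        for fid, text in faqs.items()
    }
    return sorted(scored, key=scored.get, reverse=True)


# Toy data: each FAQ is one short sentence, so a query can miss all of its words.
faqs = {
    "refund": "how do i get a refund",
    "password": "how do i reset my password",
}
# Past user queries already grouped under each FAQ category.
clusters = {
    "refund": ["money back for returned item", "want my money back"],
    "password": ["forgot login credentials", "cannot sign in to account"],
}

query = "money back"
unsmoothed = {fid: score(query, text, clusters[fid], lam=0.0) for fid, text in faqs.items()}
print(unsmoothed)                      # both FAQs tie: no query term occurs in either
print(rank(query, faqs, clusters))     # the cluster terms break the tie
```

With `lam=0.0` the query shares no words with either FAQ, so the two scores are identical. Once the cluster model is mixed in, the words that earlier users typed for the refund FAQ lift it above the password FAQ.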

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kim, H., Lee, H., Seo, J. (2005). Improving FAQ Retrieval Using Query Log Clustering in Latent Semantic Space. In: Lee, G.G., Yamada, A., Meng, H., Myaeng, S.H. (eds) Information Retrieval Technology. AIRS 2005. Lecture Notes in Computer Science, vol 3689. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562382_18

  • DOI: https://doi.org/10.1007/11562382_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29186-2

  • Online ISBN: 978-3-540-32001-2

  • eBook Packages: Computer Science, Computer Science (R0)
