Abstract
Query clustering helps users frame an optimal query to obtain relevant documents. The content-based approach to query clustering has been criticized because queries are usually very short and draw on a wide variety of keywords, which makes the method ineffective at finding clusters. Clustering based on similar search result URLs has also performed inadequately because of the large number of distinct URLs. Our previous work demonstrated that a hybrid approach combining the two is effective in generating good clusters. This study extends that work by using lexical knowledge from WordNet and examining its effect on the quality of query clusters. Surprisingly, our results show that the use of lexical knowledge does not produce any significant improvement in quality, which demonstrates the robustness of the hybrid clustering approach.
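As a rough illustration of what combining the two similarity signals might look like, the following sketch scores a pair of queries by a weighted sum of keyword overlap and result-URL overlap. It is not the authors' implementation: the Jaccard measure, the function names, and the `alpha` weight are all assumptions made for this example, and the abstract does not specify the actual measures or weights used.

```python
def jaccard(a, b):
    """Jaccard overlap of two collections; 0.0 when both are empty."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def content_similarity(query1, query2):
    """Keyword overlap between two queries (assumed measure: Jaccard on lowercased tokens)."""
    return jaccard(query1.lower().split(), query2.lower().split())


def result_similarity(urls1, urls2):
    """Overlap between the result URLs returned for two queries (assumed measure: Jaccard)."""
    return jaccard(urls1, urls2)


def hybrid_similarity(query1, urls1, query2, urls2, alpha=0.5):
    """Weighted combination of content and result similarity.

    alpha is a hypothetical weight in [0, 1]: alpha=1 uses keywords only,
    alpha=0 uses result URLs only.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0 and 1")
    return (alpha * content_similarity(query1, query2)
            + (1.0 - alpha) * result_similarity(urls1, urls2))


if __name__ == "__main__":
    # Hypothetical queries and result URLs, for illustration only.
    score = hybrid_similarity(
        "jaguar car", ["http://a.example", "http://b.example"],
        "jaguar price", ["http://b.example", "http://c.example"],
    )
    print(round(score, 3))
```

Lexical knowledge such as WordNet synonyms could, under the same assumptions, be introduced by expanding each query's token set before computing `content_similarity`; the abstract reports that this kind of extension did not significantly change cluster quality.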
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ray, C.S., Goh, D.HL., Foo, S. (2006). The Effect of Lexical Relationships on the Quality of Query Clusters. In: Sugimoto, S., Hunter, J., Rauber, A., Morishima, A. (eds) Digital Libraries: Achievements, Challenges and Opportunities. ICADL 2006. Lecture Notes in Computer Science, vol 4312. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11931584_25
DOI: https://doi.org/10.1007/11931584_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49375-4
Online ISBN: 978-3-540-49377-8
eBook Packages: Computer Science (R0)