Abstract
Question categorization, which automatically suggests a few categories to host a user’s question, is a useful technique in Web-based question answering systems. In this paper, we propose a question categorization method which makes use of user feedback to the system’s automatic suggestions to improve question categorization. We initialize the categorization model by training a set of accumulated questions. When a user asks a question, the system automatically suggests a few categories for the question using the current categorization model. The user can then select one of these suggestions or another category and such feedback information is used to revise the categorization model. The revised model is used to categorize new questions. Experimental results show that our method is effective to take the advantages of both positive and negative feedback to improve the precision of question categorization but finish the revision of the categorization model in real time.
Chapter PDF
Similar content being viewed by others
References
Li, X., Roth, D.: Learning Question Classifiers. In: Proceedings of the 19th International Conference on Computational Linguistics (2002)
Zhang, D., Lee, W.S.: Question Classification using Support Vector Machine. In: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval (2003)
Voorhees, E.: Overview of the TREC 2001 Question Answering Track. In: Proceedings of the 10th Text Retrieval Conference (TREC10), pp. 157–165. NIST, Gaithersburg (2001)
McCallum, A., Nigam, K.: A Comparison of Event Models for naïve Bayes text classification. In: AAAI 1998 Workshop on Learning for Text Categorization. Tech. Rep. WS-98-05. AAAI Press, Menlo Park (1998)
Joachims, T.: A Probabilistic Analysis of the Rochhio Algorithm with TFIDF for Text Categorization. In: Proceedings of the Fourteenth International Conference on Machine Learning (1997)
Joachims, T.: Text Categorization with Support Vector Machines: Learning with many Relevant Features. In: Proceedings of the Tenth European Conference on Machine Learning (1998)
Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1–47 (2002)
Raghavan, H., Madani, O., Jones, R.: Active Learning with Feedback on Features and Instances. Journal of Machine Learning Research 7, 1655–1686 (2006)
Luan, H., Neo, S., Goh, H., Zhang, Y., Lin, S., Chua, T.: Segregated feedback with performance-based adaptive sampling for interactive news video retrieval. In: Proceedings of the 15th international conference on Multimedia, pp. 293–296 (2007)
Huang, Y., Mitchell, T.M.: Text clustering with extended user feedback. In: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2006), pp. 413–420 (2006)
Rada, R., Mili, H., Bichnell, E., Blettner, M.: Development and Application of Metric on Semantic Nets. IEEE Transactions on Systems, Man, and Cybernetics 9(1), 17–30 (1989)
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Williamson, E.R.: Does relevance feedback improve document retrieval performance? In: Proceedings of the 1st annual international ACM SIGIR conference on Information storage and retrieval (SIGIR 1978), pp. 151–170 (1978)
Buyans, http://www.buyans.com
Yahoo Answers, http://answers.yahoo.com/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Song, W., Wenyin, L., Gu, N., Quan, X. (2009). User Feedback for Improving Question Categorization in Web-Based Question Answering Systems. In: Chen, L., et al. Advances in Web and Network Technologies, and Information Management. APWeb WAIM 2009 2009. Lecture Notes in Computer Science, vol 5731. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03996-6_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-03996-6_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03995-9
Online ISBN: 978-3-642-03996-6
eBook Packages: Computer ScienceComputer Science (R0)