Abstract
Information filtering has become an important component of modern information systems due to significant increase in its applica- tions. The objective of an information filtering is to classify/categorize documents as they arrive into the system. In this paper, we investigate an information filtering method based on steepest descent induction algorithm combined with a two-level preference relation on user ranking. The performance of the proposed algorithm is experimentally evaluated. The experiments are conducted using Reuters-21578 data collection. A micro-average breakeven effectiveness measure is used for performance evaluation. The best size of negative data employed in the training set is empirically determined and the effect of R norm factor on the learning process is evaluated. Finally, we demonstrate effectiveness of proposed method by comparing experimental results to other inductive methods.
This work is supported by the Army Research Office of USA, Grant No. DAAH04- 96-1-0325, under DEPSCoR program of Advanced Research Projects Agency, De- partment of Defense, and by University of Bahrain.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Apt C., Damerau F., and Weiss S. M. Automated learning of decision rules for text categorization. ACM Transactions on Information Systems, 12(3):233–251, July 1994.
Bollman, P. and Wong, S. K. M. Adaptive linear information retrieval models. In Proc. of the 10 th Int. ACM SIGIR Conference on Research and Development in Information Retrieval, pages 157–163, New Orleans, LA., June 1987.
Dumais, S., Platt, J., Heckerman, D., and Sahami, M. Inductive learning algorithms and representations for text categorization. In Proceedings of ACM-CIKM98, Nov. 1998.
Lewis, David D. Evaluating text categorization. In Proceedings of Speech and Natural Language Workshop, pages 312–318, February, 1991.
Raghavan, V. V., and Sever H. On the reuse of past optimal queries. In Proc. of the 18 th Int. ACM-SIGIR Conference on Research and Development of Information Retrieval, pages 344–350, 1995.
Salton, G, Automatic Text Processing, The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesely, 1988.
Schütze, H., Hull, D. A., and Pederson, J. O. A comparison of classiffers and document representations for the routing problem. In Proc. of the 18th Annual Int. ACM-SIGIR Conference on Research and Development of Information Retrieval (Seattle, Washington, 1995) 229–237.
Wong, S. K. M., and Yao, Y. Y. Query formulation in linear retrieval models. Journal of the American Society for Information Science, 41, 5 (1990), 334–341.
Wong, S. K. M. Measuring retrieval effectiveness based on user preference of documents. Journal of the American Society for Information Science, 46(2):133–145, 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Alsaffar, A.H., Deogun, J., Sever, H. (2000). Optimal Queries in Information Filtering. In: RaÅ›, Z.W., Ohsuga, S. (eds) Foundations of Intelligent Systems. ISMIS 2000. Lecture Notes in Computer Science(), vol 1932. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-39963-1_46
Download citation
DOI: https://doi.org/10.1007/3-540-39963-1_46
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41094-2
Online ISBN: 978-3-540-39963-6
eBook Packages: Computer ScienceComputer Science (R0)