Advertisement

An Integrated Approach to Information Retrieval with Fuzzy Clustering and Fuzzy Inferencing

  • Jianhua Chen
  • Andreja Mikulcic
  • Donald H. Kraft
Part of the Studies in Fuzziness and Soft Computing book series (STUDFUZZ, volume 39)

Abstract

We present an integrated approach to fuzzy information retrieval which combines techniques in fuzzy set theory with methodologies in textual retrieval in order to achieve optimal retrieval performance. To capture the relationships among index terms, fuzzy logic rules (with truth value assignment in the 0–1 interval) are used. We adapt several fuzzy clustering methods (such as fuzzy c-means and fuzzy hierarchical clustering) to the task of clustering documents with respect to the terms. The clusters generated provide a basis for building the fuzzy logic rules. The clusters can also be used to form hyperlinks between documents. A previously developed fuzzy logic system, found to be sound and complete, is applied for fuzzy inferencing to derive useful modifications of the initial query, which will guide the search for relevant documents. Thus, this method combines fuzzy inference with traditional relevance feedback approach for retrieval. The advantage of this method is in the emphasis on semantic information (embodied in the rules and the inference mechanisms) which should lead to superior performance. A series of experiments conducted in order to validate this approach are presented, along with results and conclusions.

Keywords

Fuzzy Logic Fuzzy Logic System Fuzzy Logic Rule Percent Recall Engineer Data Compendium 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Bibliography

  1. [1]
    J.C. Bezdek, A convergence theorem for the fuzzy ISODATA clustering algorithms, IEEE Transactions on Pattern Analysis and Machine Intelligence (2), 1980, pp. 1–8.CrossRefGoogle Scholar
  2. [2]
    J.C. Bezdek, R.J. Hathaway, M.J. Sabin, and W.T. Tucker, Convergence theory for fuzzy c-Means: counterexamples and repairs, IEEE Transactions on Systems, Man, and Cybernetics (17), 1987, pp. 873–877.CrossRefGoogle Scholar
  3. [3]
    K.R. Boff, J.E. Lincoln (Eds), Engineering Data Compendium: Human Perception and Performance y I, II, and III (Wright-Patterson Air Force Base, OH: Human Engineering Division, Harry G. Armstrong Medical Research Laboratory ), 1988.Google Scholar
  4. [4]
    K.R. Boff, D.L. Monk, W.J. Cody, Computer Aided Systems Human Engineering: A Hypermedia Tool, Space Operation Applications and Research (SOAR) 1991, July Houston: NASA.Google Scholar
  5. [5]
    K.R. Boff, D.L. Monk, S.J. Swiereitga, C.E. Brown, W.J. Cody, Computer-Aided Human Factors for Systems Designers, July 1991 ( San Francisco: Human Factors Society annual meeting).Google Scholar
  6. [6]
    J. Chen, S. Kundu, A sound and complete fuzzy logic system using Zadeh’s implication operator, Foundations of Intelligent Systems: Lecture Notes in Computer Science 1079, 1996, pp. 233–242.CrossRefGoogle Scholar
  7. [7]
    Department of Defense, Human Engineering Design Criteria for Military Systems, Equipment, and Facilities (MIL-STD-1472D), Notice 3, Washington, DC, 1994.Google Scholar
  8. [8]
    D. Dubois, H. Prade, R.R. Yager (Eds), Fuzzy Information Engineering: A Guided Tour of Applications, New York, NY: Wiley, 1997.Google Scholar
  9. [9]
    W. B. Frakes, Stemming algorithms, In: W. B. Frakes, R. Baeza-Yates (Eds), Information Retrieval: Data Structures & Algorithms, Prentice Hall, 1992.Google Scholar
  10. [10]
    G.J. Klir, T.A. Folger, Fuzzy Sets, Uncertainty, and Information, Englewood Cliffs, NJ: Prentice-Hall, 1988.Google Scholar
  11. [11]
    G.J. Klir, B. Yuan, Fuzzy Sets and Fuzzy Logic: Theory and Applications, Upper Saddle Rive, NJ: Prentice-Hall, 1995.Google Scholar
  12. [12]
    D.H. Kraft, An exploratory study of weighted fuzzy keyword retrieval With hypertext links for the CASHE:PVS system, Final report for the Summer faculty research associate program, Wright-Patterson AFB, OH, 1994.Google Scholar
  13. [13]
    D.H. Kraft, C. Barry, Relevance in textual Retrieval,“ American Association for Artificial Intelligence (AAAI), AAAI-94 Fall Symposium - Relevance, New Orleans, LA, November, 1994, Working Notes.Google Scholar
  14. [14]
    Kraft, D. H., Bordogna, G., and Pasi, G., An Extended Fuzzy Linguistic Approach to Generalize Boolean Information Retrieval, Information Sciences, (2), November, 1995, pp. 119–134.Google Scholar
  15. [15]
    D.H. Kraft, B.R. Boyce, Approaches to Intelligent Information Retrieval, in F.E. Petry, M.L. Delcambre (Eds), Advances in Databases and Artificial Intelligence, volume 1: Intelligent Database Technology: Approaches and Applications, Greenwich, CT: JAI Press, 1995, pp. 243–261.Google Scholar
  16. [16]
    D.H. Kraft and D.A. Buell, Fuzzy Sets and Generalized Boolean Retrieval Systems, International Journal of Man-Machine Studies, v. 19, 1983, pp. 45–56; reprinted in D. Dubois, H. Prade, and R. Yager, (Eds), Readings in Fuzzy Sets for Intelligent Systems, San Mateo, CA: Morgan Kaufmann Publishers, 1992.Google Scholar
  17. [17]
    D.H. Kraft, D. Monk, Applications of Fuzzy Computation - Information Retrieval: A Case Study with the CASHE:PVS System, In E. Ruspini, R Bonissone, W. Pedrycz (Eds), Handbook of Fuzzy Computation,Information Science, New York, NY: Oxford University Press and Institute of Physics Publishing, in press, 1997.Google Scholar
  18. [18]
    S. Kundu, J. Chen, Fuzzy linear invariant clustering for applications in fuzzy control, Proceedings of NAFIPS/IFIS/NASA’94, San Antonio, TX, 1994.Google Scholar
  19. [19]
    J. Lincoln, D. Monk, private communications, 1997.Google Scholar
  20. [20]
    F. Martin, E. Thro, Fuzzy Logic: A Practical Approach, Chestnut Hill, MA: Academic Press, 1994.Google Scholar
  21. [21]
    A. Mikulcic, Automatic Document Indexing and Classification, Report, Department of Computer Science, Louisiana State University, Baton Rouge, LA, May, 1995.Google Scholar
  22. [22]
    A. Mikulcic, J. Chen, Experiments on using fuzzy linear clustering from fuzzy control system design, Proceedings of IEEE/FUZZ’96, New Orleans, September 1996.Google Scholar
  23. [23]
    S. Miyamoto, Fuzzy Sets in Information Retrieval and Cluster Analysis, Boston, MA: Kluwer Academic Publishers, 1990.CrossRefGoogle Scholar
  24. [24]
    F.E. Petry, Fuzzy Databases: Principles and Applications,Norwell, MA: Kluwer Academic Publishers, 1996, with contribution by P. Bosc.Google Scholar
  25. [25]
    E. Rasmussen, Clustering Algorithms, In W.B. Frakes, R. Baeza-Yates (Eds), Information Retrieval: Data Structures & Algorithms, Englewood Cliffs, NJ: Prentice Hall, 1992.Google Scholar
  26. [26]
    G. Salton, Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer, Reading, MA: Addison-Wesley, 1989Google Scholar
  27. [27]
    http://www.dtic.dla.mil/iac/cseriac/cashe/cashe.htm#edc
  28. [28]
    L.A. Zadeh, Fuzzy sets, Information and Control (8), 1965, pp. 338–353.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • Jianhua Chen
    • 1
  • Andreja Mikulcic
    • 1
  • Donald H. Kraft
    • 1
  1. 1.Department of Computer ScienceLouisiana State UniversityBaton RougeUSA

Personalised recommendations