Advertisement

Granular Information Retrieval

  • S. K. Michael Wong
  • Y. Y. Yao
  • Cory J. Butz
Part of the Studies in Fuzziness and Soft Computing book series (STUDFUZZ, volume 50)

Abstract

There are three main problems when designing an information retrieval (IR) system, namely, uncertainty in the representation of documents and queries, computational complexity, and the diversity of users. An IR system may be designed to be adaptive by allowing the modification of document and query representation. As well, different retrieval methods can be used for different users. The combina-tion of multi-representation of documents and multi-strategy retrieval may provide a Solution for the diversity of users. A widely used Solution for reducing computational costs is cluster-based retrieval. However, the use of document clustering only reduces the dimensionality of documents. The same number of terms is used for the representation of the Clusters. One may reduce the dimensionality of terms by constructing a term hierarchy in parallel to the construction of a document hierarchy. The proposed framework of granular IR enables us to incorporate multi-representation of documents and multi-strategy retrieval. Hence, granular IR may provide a method for developing knowledge based intelligent IR Systems.

Keywords

Equivalence Relation Information Retrieval Information Retrieval System Index Term Document Cluster 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Chen, H., Ng, T., Martinez, J., Schatz, B. (1997): A Concept Space Approach to Addressing the Vokabular Problem in Scientific Information Retrieval: An Experiment on the Worm Community System. Journal of the American Society for Information Science. 48(1), 17–31CrossRefGoogle Scholar
  2. 2.
    Deerwester, S., Dumais, S., Furnas, G., Landauer, T., Harshman, R. (1990): Indexing by Latent Semantic Analysis. Journal of the American Society for Information Science. 41(6), 391–407CrossRefGoogle Scholar
  3. 3.
    Han, J.W., Cai, Y., Cercone, N. (1993): Data-Driven Discovery of Quantitative Rules in Relational Databases. IEEE Transactions on Knowledge and Data Engineering. 5, 29–40CrossRefGoogle Scholar
  4. 4.
    Jardine, N., Sibson, R. (1971): Mathematical Taxonomy. Wiley, New York.MATHGoogle Scholar
  5. 5.
    Spark Jones, K. (1971): Automatic Keyword Classification for Information Retrieval. Butterworths, London, UKGoogle Scholar
  6. 6.
    Lin, T.Y., Hadjimichael, M. (1996) Non-Classificatory Generation in Data Mining. In: Tsumoto, S., Kobayashi, S., Yokomori, T., Tanaka, H., Nakamura, A. (Eds.): Proceedings of the Fourth International Workshop on Rough Sets f Fuzzy Sets, and Machine Discovery. Japanese Research Group on Rough Sets, 404–411Google Scholar
  7. 7.
    Qiu, Y., Frei, H. (1993) Concept Based Query Expansion. In: Proceedings of the Sixteenth ACM International Conference on Research and Development in Information Retrieval. 160–169Google Scholar
  8. 8.
    Rasmussen, E. (1992) Clustering Algorithms. In: Frakes, W., Baeza-Yates, R. (Eds.): Information Retrieval: Data Structures and Algorithms. Prentice Hall, Englewood ClifFs, USA, 419–442Google Scholar
  9. 9.
    Salton, G., McGill, M. (1983): Introduction to Modern Information Retrieval. McGraw Hill, New York, USAMATHGoogle Scholar
  10. 10.
    Schäuble, P., Knaus, D. (1993) The Various Roles of Information Structures. In: Opitz, O., Lausen, B., Klar, R., (Eds.): Proceedings of the Sixteenth Annual Conference of the Gesellschaft für Klassifikation. Springer Verlag, Heidelberg, DE, 282–290Google Scholar
  11. 11.
    van Rijsbergen, C.J. (1979): Information Retrieval Butterworths, London, UKGoogle Scholar
  12. 12.
    Willett. (1988) Recent Trends in Hierarchie Document Clustering: A Critical Review. Information Processing and Management, 24(5):577–597CrossRefGoogle Scholar
  13. 13.
    Wong, S.K.M., Butz, C.J. (1999) Contextual Weak Independence in Bayesian Networks. To Appear In: Laskey, K., Prade, H. (Eds.): Proceedings of the Fif-teenth Conference on Uncertainty in Artificial Intelligence. Probook.Google Scholar
  14. 14.
    Yang, Y., Pederson, J. (1997) A Comparative Study on Feature Selection in Text Categorization. In: Proceedings of the Fourteenth Conference on Machine Learning. Morgan Kaufmann, 412–420Google Scholar
  15. 15.
    Yao, Y. (1999) Stratified Rough Sets and Granulär Computing. In: Dave, R.N., Sudkamp, T. (Eds.): Proceedings of the 18th International Conference of the North American Fuzzy Information Processing Society. IEEE Press, 800–804Google Scholar
  16. 16.
    Yao, Y. (1999) Rough Sets, Neighborhood Systems, and Granulär Computing. In: Meng, M. (Ed.): Proceedings of the 1999 IEEE Canadian Conference on Electrica! and Computer Engineering. IEEE Press, 1553–1558Google Scholar
  17. 17.
    Zadeh, L.A. (1997): Towards a Theory of Fuzzy Information Granulation and Its Centrality in Human Reasoning and Fuzzy Logic. Fuzzy Sets and Systems. 19, 111–127MathSciNetCrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • S. K. Michael Wong
    • 1
  • Y. Y. Yao
    • 1
  • Cory J. Butz
    • 1
  1. 1.Department of Computer ScienceUniversity of ReginaReginaCanada

Personalised recommendations