Abstract
In this paper a flexible query language for expressing soft selection conditions on structured documents is presented and formalized within fuzzy set theory. Documents are represented as entities structured into logical sections in which the index terms play a distinct role. Users can indicate the preferred sections of documents, i.e., those which they estimate bearing the most interesting information, as well as quantify the number of sections which determine the global potential interest of the documents. A linguistic quantifier that specifies the approximate number of the sections in which the query terms should appear in the relevant documents expresses this last information.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bookstein A. (1980) Fuzzy requests: an approach to weighted Boolean searches. J. of the American Society for Information Science 31, 240–247.
Bordogna, G., Pasi, G. A. (1993) Fuzzy linguistic approach generalizing Boolean IR: a model and its evaluation. J. of the American Society for Information Science, 44 (2), 70–82.
Bordogna G., Pasi G. (1995) Controlling retrieval through a user adaptive representation of documents, Int. J. of approximate reasoning 12, 317–339.
Bordogna G., and Pasi G. (1995) Linguistic aggregation operators of selection criteria in fuzzy information retrieval, International Journal of Intelligent Systems, 10, 233–248.
Buell D.A., and Kraft D.H. (1981) Threshold values and Boolean retrieval systems. Information Processing & Management 17, 127–136.
Kim H., Cho S., (2000), Structured storage and retrieval of SGML documents using GROVE, Information Processing and Management, 36, 643–657.
Krovetz R., Croft W.B., (1992) Lexical ambiguity and information retrieval. ACM Trans. on Information System, (10)2, 115–141.
Klir G.J., Folger T.A. (1988) Fuzzy sets, uncertainty and information, Prentice Hall PTR Englewood Cliffs.
Kraft, D. H., Bordogna, G. and Pasi, G. (1995) An extended fuzzy linguistic approach to generalize Boolean information retrieval, Journal of Information Sciences, Applications., 2 (3), 119–134.
Lalmas M., Ruthven I., (1998), Representing and retrieving structured documents using the Dempster-Shafer theory of evidence: Modelling and Evaluation, Journal of Documentation, 54 (5), 529–565.
Macleod I. (1990), Storage and retrieval of structured documents, Information Processing and Management, 26 (2), 197–208.
Molinari, A., G. Pasi G. (1996) A fuzzy representation of HTML documents for information retrieval systems, in proc. of IEEE International Conference on Fuzzy Systems, New Orleans, 8–12 September, 1996.
Negoita, C. V. (1973) On the notion of relevance in information retrieval. Kybernetes, 2 (3), 161–165.
Paice, C. D. (1984) Soft evaluation of Boolean search queries in information retrieval systems. Information Technology: Research Development Applications, 3 (1), 33–41.
Perez-Carballo, J., Strzalkowski, T., (2000) Natural language information retrieval: Progress Report, Information Processing and Management, 36, 155–178.
Rao A., et al. (2000) Query processing in TREC-6, Information Processing and Management, 36, 179–186.
Sager N., (1981) Natural language information processing, Addison Wesley.
Salton, G., Fox, E., Wu, H. (1983) Extended Boolean information retrieval. Communications of the ACM, 26 (12), 1022–1036.
Salton G., and McGill M.J. (1984) Introduction to modern information retrieval. McGraw-Hill Int. Book Co.
Sparck Jones, K. A. (1971) Automatic keyword classification for information retrieval. London, England: Butterworths.
Sparck, K. A. (1972) A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28 (1), 11–20.
van Rijsbergen, C. J. (1979) Information retrieval. London, England, Butterworths & Co., Ltd.
Yager R. R. (1988) On Ordered Weighted Averaging aggregation operators in multi criteria decision making, IEEE Trans. on Systems, Man and Cybernetics 18 (1), 183–190.
The Ordered Weighted Averaging operators: theory and applications, R.R Yager and J. Kacprzyk eds., Kluwer Academic Publishers (1997).
Zadeh, L.A. (1965) Fuzzy sets. Information and control, 8, 338–353.
Zadeh L.A. (1983) A computational approach to fuzzy quantifiers in natural languages, Computing and Mathematics with Applications. 9, 149–184.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bordogna, G., Pasi, G. (2001). Flexible Querying of Structured Documents. In: Larsen, H.L., Andreasen, T., Christiansen, H., Kacprzyk, J., Zadrożny, S. (eds) Flexible Query Answering Systems. Advances in Soft Computing, vol 7. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1834-5_32
Download citation
DOI: https://doi.org/10.1007/978-3-7908-1834-5_32
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-1347-0
Online ISBN: 978-3-7908-1834-5
eBook Packages: Springer Book Archive