Flexible Querying of Structured Documents

Bordogna, Gloria; Pasi, Gabriella

doi:10.1007/978-3-7908-1834-5_32

Gloria Bordogna⁴ &
Gabriella Pasi⁴

Part of the book series: Advances in Soft Computing ((AINSC,volume 7))

142 Accesses
3 Citations

Abstract

In this paper a flexible query language for expressing soft selection conditions on structured documents is presented and formalized within fuzzy set theory. Documents are represented as entities structured into logical sections in which the index terms play a distinct role. Users can indicate the preferred sections of documents, i.e., those which they estimate bearing the most interesting information, as well as quantify the number of sections which determine the global potential interest of the documents. A linguistic quantifier that specifies the approximate number of the sections in which the query terms should appear in the relevant documents expresses this last information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bookstein A. (1980) Fuzzy requests: an approach to weighted Boolean searches. J. of the American Society for Information Science 31, 240–247.
Article Google Scholar
Bordogna, G., Pasi, G. A. (1993) Fuzzy linguistic approach generalizing Boolean IR: a model and its evaluation. J. of the American Society for Information Science, 44 (2), 70–82.
Article Google Scholar
Bordogna G., Pasi G. (1995) Controlling retrieval through a user adaptive representation of documents, Int. J. of approximate reasoning 12, 317–339.
Article MathSciNet MATH Google Scholar
Bordogna G., and Pasi G. (1995) Linguistic aggregation operators of selection criteria in fuzzy information retrieval, International Journal of Intelligent Systems, 10, 233–248.
Article Google Scholar
Buell D.A., and Kraft D.H. (1981) Threshold values and Boolean retrieval systems. Information Processing & Management 17, 127–136.
Article MATH Google Scholar
Kim H., Cho S., (2000), Structured storage and retrieval of SGML documents using GROVE, Information Processing and Management, 36, 643–657.
Article Google Scholar
Krovetz R., Croft W.B., (1992) Lexical ambiguity and information retrieval. ACM Trans. on Information System, (10)2, 115–141.
Article Google Scholar
Klir G.J., Folger T.A. (1988) Fuzzy sets, uncertainty and information, Prentice Hall PTR Englewood Cliffs.
Google Scholar
Kraft, D. H., Bordogna, G. and Pasi, G. (1995) An extended fuzzy linguistic approach to generalize Boolean information retrieval, Journal of Information Sciences, Applications., 2 (3), 119–134.
Google Scholar
Lalmas M., Ruthven I., (1998), Representing and retrieving structured documents using the Dempster-Shafer theory of evidence: Modelling and Evaluation, Journal of Documentation, 54 (5), 529–565.
Google Scholar
Macleod I. (1990), Storage and retrieval of structured documents, Information Processing and Management, 26 (2), 197–208.
Article Google Scholar
Molinari, A., G. Pasi G. (1996) A fuzzy representation of HTML documents for information retrieval systems, in proc. of IEEE International Conference on Fuzzy Systems, New Orleans, 8–12 September, 1996.
Google Scholar
Negoita, C. V. (1973) On the notion of relevance in information retrieval. Kybernetes, 2 (3), 161–165.
Article MathSciNet MATH Google Scholar
Paice, C. D. (1984) Soft evaluation of Boolean search queries in information retrieval systems. Information Technology: Research Development Applications, 3 (1), 33–41.
Google Scholar
Perez-Carballo, J., Strzalkowski, T., (2000) Natural language information retrieval: Progress Report, Information Processing and Management, 36, 155–178.
Article Google Scholar
Rao A., et al. (2000) Query processing in TREC-6, Information Processing and Management, 36, 179–186.
Article Google Scholar
Sager N., (1981) Natural language information processing, Addison Wesley.
Google Scholar
Salton, G., Fox, E., Wu, H. (1983) Extended Boolean information retrieval. Communications of the ACM, 26 (12), 1022–1036.
Article MathSciNet MATH Google Scholar
Salton G., and McGill M.J. (1984) Introduction to modern information retrieval. McGraw-Hill Int. Book Co.
Google Scholar
Sparck Jones, K. A. (1971) Automatic keyword classification for information retrieval. London, England: Butterworths.
Google Scholar
Sparck, K. A. (1972) A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28 (1), 11–20.
Article Google Scholar
van Rijsbergen, C. J. (1979) Information retrieval. London, England, Butterworths & Co., Ltd.
Google Scholar
Yager R. R. (1988) On Ordered Weighted Averaging aggregation operators in multi criteria decision making, IEEE Trans. on Systems, Man and Cybernetics 18 (1), 183–190.
Article MathSciNet MATH Google Scholar
The Ordered Weighted Averaging operators: theory and applications, R.R Yager and J. Kacprzyk eds., Kluwer Academic Publishers (1997).
Google Scholar
Zadeh, L.A. (1965) Fuzzy sets. Information and control, 8, 338–353.
MathSciNet MATH Google Scholar
Zadeh L.A. (1983) A computational approach to fuzzy quantifiers in natural languages, Computing and Mathematics with Applications. 9, 149–184.
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Istituto per le Tecnologie Informatiche Multimediali CNR — Milano, Italy
Gloria Bordogna & Gabriella Pasi

Authors

Gloria Bordogna
View author publications
You can also search for this author in PubMed Google Scholar
Gabriella Pasi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science Department, Roskilde University, P.O. Box 260, 4000, Roskilde, Denmark
Henrik L. Larsen , Troels Andreasen & Henning Christiansen , &
Systems Research Institute, Polish Academy of Sciences, ul. Newelska 6, 01-447, Warsaw, Poland
Janusz Kacprzyk & Sławomir Zadrożny &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bordogna, G., Pasi, G. (2001). Flexible Querying of Structured Documents. In: Larsen, H.L., Andreasen, T., Christiansen, H., Kacprzyk, J., Zadrożny, S. (eds) Flexible Query Answering Systems. Advances in Soft Computing, vol 7. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1834-5_32

Download citation

DOI: https://doi.org/10.1007/978-3-7908-1834-5_32
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-1347-0
Online ISBN: 978-3-7908-1834-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics