Skip to main content

An Algebra for Structured Queries in Bayesian Networks

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3493))

Abstract

We present a system based on a Bayesian Network formalism for structured documents retrieval. The parameters of this model are learned from the document collection (documents, queries and assessments). The focus of the paper is on an algebra which has been designed for the interpretation of structured information queries and can be used within our Bayesian Network framework. With this algebra, the representation of the information demand is independent from the structured query language. It allows us to answer both vague and strict structured queries.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abiteboul, S., Quass, D., McHugh, J., Widom, J., Wiener, J.L.: The lorel query language for semistructured data. International Journal on Digital Libraries 1, 68–88 (1997)

    Article  Google Scholar 

  2. Callan, J., Croft, W.B., Harding, S.M.: The INQUERY Retrieval System. In: Tjoa, A.M., Ramos, I. (eds.) Database and Expert Systems Applications, Proceedings of the International Conference, Valencia, Spain, pp. 78–83. Springer, Heidelberg (1992)

    Google Scholar 

  3. Crestani, F., de Campos, L.M., Fernandez-Luna, J.M., Huete, J.F.: A multi-layered Bayesian network model for structured document retrieval. In: Nielsen, T.D., Zhang, N.L. (eds.) ECSQARU 2003. LNCS (LNAI), vol. 2711, pp. 74–86. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  4. Crestani, F., de Campos, L.M., Fernandez-Luna, J.M., Huete, J.F.: Ranking Structured Documents Using Utility Theory in the Bayesian Network Retrieval Model. In: Nascimento, M.A., de Moura, E.S., Oliveira, A.L. (eds.) SPIRE 2003. LNCS, vol. 2857, pp. 168–182. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  5. Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from incomplete data via de EM algorithm. The Journal of Royal Statistical Society 39, 1–37 (1977)

    MATH  MathSciNet  Google Scholar 

  6. Fuhr, N., Grossjohann, K.: XIRQL: A query language for information retrieval in XML documents. In: Croft, W.B., Harper, D.J., Kraft, D.H., Zobel, J. (eds.) The 24th International Conference on Research and Developmenent in Information Retrieval, New Orleans, Louisiana, USA. ACM, New York (2001)

    Google Scholar 

  7. List, J., Mihajlovic, V., de Vries, A.P., Ramirez, G.: The TIJAX XML-IR system at INEX 2003. In: Fuhr, N., Lalmas, M., Malik, S. (eds.) Proceedings of the Second INEX Workshop INitiative for the Evaluation of XML Retrieval (INEX), Dagstuhl, Germany (2003)

    Google Scholar 

  8. Myaeng, S.H., Jang, D.H., Kim, M.H., Zhoo, Z.C.: A Flexible Model for Retrieval of SGML documents. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, pp. 138–140. ACM Press, New York (1998)

    Chapter  Google Scholar 

  9. Navarro, G., Baeza-Yates, R.: Proximal nodes: A model to query document databases by content and structure. ACM TOIS 15, 401–435 (1997)

    Article  Google Scholar 

  10. Piwowarski, B., Faure, G.E., Gallinari, P.: Bayesian networks and INEX. In: Proceedings of the First Annual Workshop of the Initiative for the Evaluation of XML retrieval (INEX), DELOS workshop, Dagstuhl, Germany. ERCIM (2002)

    Google Scholar 

  11. Piwowarski, B., Gallinari, P.: A Bayesian Network for XML Information Retrieval: Searching and Learning with the INEX Collection. In: Information Retrieval (2004)

    Google Scholar 

  12. Piwowarski, B., Vu, H.T., Gallinari, P.: Bayesian Networks and INEX 2003. In: Proceedings of the Second INEX Workshop INitiative for the Evaluation of XML Retrieval (2003)

    Google Scholar 

  13. Piwowarski, B., Gallinari, P.: An algebra for probabilistic XML Retrieval. In: The First Twente Data Management Workshop (2004)

    Google Scholar 

  14. Robertson, S.: Threshold setting and performance optimization in adaptive Filtering. Information Retrieval 5, 239–256 (2002)

    Article  Google Scholar 

  15. Walker, S., Robertson, S.E.: Okapi/Keenbow at TREC-8. In: Voorhees, E.M., Harman, D.K. (eds.) NIST Special Publication 500-246: The Eighth Text REtrieval Conference (TREC-8), Gaithersburg, Maryland, USA (1999)

    Google Scholar 

  16. Zadeh, L.A.: Fuzzy sets (1965)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Vittaut, JN., Piwowarski, B., Gallinari, P. (2005). An Algebra for Structured Queries in Bayesian Networks. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds) Advances in XML Information Retrieval. INEX 2004. Lecture Notes in Computer Science, vol 3493. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424550_9

Download citation

  • DOI: https://doi.org/10.1007/11424550_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26166-7

  • Online ISBN: 978-3-540-32053-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics