CADIAL Search Engine at INEX

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5631)

Abstract

Semi-structured document retrieval is becoming more popular with the increasing quantity of data available in XML format. In this paper, we describe a search engine model that exploits the structure of the document and uses language modelling and smoothing at the document and collection levels for calculating the relevance of each element from all the documents in the collection to a user query. Element priors, CAS query constraint filtering, and the +/- operators are also used in the ranking procedure. We also present the results of our participation in the INEX 2008 Ad Hoc Track.
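
The abstract does not give the scoring formula, so the following is only a minimal sketch of one common way to score an element with a language model smoothed at the document and collection levels: a linear interpolation of element, document, and collection term frequencies. The function name `score_element`, the whitespace tokenisation, and the weights `lam_e`, `lam_d`, `lam_c` are illustrative assumptions, not the authors' model or settings. Element priors, CAS filtering, and the +/- operators are left out.

```python
import math
from collections import Counter


def score_element(query, element, document, collection,
                  lam_e=0.5, lam_d=0.3, lam_c=0.2):
    """Log-likelihood of the query under an interpolated language model.

    All arguments except the weights are lists of tokens. The weights are
    hypothetical values chosen for illustration and should sum to 1.
    """
    e, d, c = Counter(element), Counter(document), Counter(collection)
    ne, nd, nc = len(element), len(document), len(collection)
    log_p = 0.0
    for term in query:
        # Mix the maximum-likelihood estimates from the three levels.
        p = (lam_e * (e[term] / ne if ne else 0.0)
             + lam_d * (d[term] / nd if nd else 0.0)
             + lam_c * (c[term] / nc if nc else 0.0))
        if p == 0.0:
            # The term occurs nowhere in the collection.
            return float("-inf")
        log_p += math.log(p)
    return log_p


# Toy data: one document with two elements, plus extra collection text.
doc = "xml retrieval ranks elements of xml documents".split()
elem_a = "xml retrieval".split()
elem_b = "ranks elements".split()
collection = doc + "language models for search".split()

score_a = score_element(["xml"], elem_a, doc, collection)
score_b = score_element(["xml"], elem_b, doc, collection)
```

In this toy setup `elem_a` scores higher than `elem_b` for the query term "xml" because it contains the term itself, while `elem_b` still receives a non-zero probability from the document and collection levels.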

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mijić, J., Moens, MF., Dalbelo Bašić, B. (2009). CADIAL Search Engine at INEX. In: Geva, S., Kamps, J., Trotman, A. (eds) Advances in Focused Retrieval. INEX 2008. Lecture Notes in Computer Science, vol 5631. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03761-0_8

  • DOI: https://doi.org/10.1007/978-3-642-03761-0_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03760-3

  • Online ISBN: 978-3-642-03761-0

  • eBook Packages: Computer Science, Computer Science (R0)
