Skip to main content

A Multi-dependency Language Modeling Approach to Information Retrieval

  • Conference paper
Emerging Technologies in Knowledge Discovery and Data Mining (PAKDD 2007)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4819))

Included in the following conference series:

  • 1485 Accesses

Abstract

This paper presents a multi-dependency language modeling approach to information retrieval. The approach extends the basic KL-divergence retrieval approach by introducing the hybrid dependency structure, which includes syntactic dependency, syntactic proximity dependency and co-occurrence dependency, to describe dependencies between terms. Term and dependency language models are constructed for both document and query. The relevant between a document and a query is then evaluated by using the KL-divergence between their corresponding models. The new dependency retrieval model has been compared with other traditional retrieval models. Experiment results indicate that it produces significant improvements in retrieval effectiveness.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cao, G., Nie, J.-Y., Bai, J.: Integrating word relationships into language models. In: SIGIR, pp. 298–305 (2005)

    Google Scholar 

  2. Lafferty, J., Zhai, C.: Document Language Models, Query Models, and Risk Minimization for Information Retrieval. In: Proc. of SIGIR (2001)

    Google Scholar 

  3. Ponte, J.M., Croft, W.B.: A Language Modeling Approach to Information Retrieval. In: Proc. of the 21st Intl. Conf. on Research and Development in Information Retrieval (1998)

    Google Scholar 

  4. Croft, W.B., Turtle, H.R., Lewis, D.D.: The Use of Phrases and Structured Queries in Information Retrieval. In: Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval, Chicago, pp. 32–45 (October 1991)

    Google Scholar 

  5. Fagan, L.: Automatic phrase indexing for document retrieval: an examination of syntactic and non-syntactic methods. In: Proc of SIGIR 1987, pp. 91–101 (1987)

    Google Scholar 

  6. Van Rijsbergen, C J.: A Theoretical Basis for the Use of Co-occurrence Data in Information Retrieval. Journal of Documentation 33, 106–119 (1977)

    Google Scholar 

  7. Song, F., Croft, W.: A general language model for information retrieval. In: Proc. of Eighth Intl. Conf. on Information and Knowledge Management (1999)

    Google Scholar 

  8. Srikanth, M., Srihari, R.: Biterm Language Models for Document Retrieval. In: Proceedings of SIGIR, New York, pp. 425–426 (2002)

    Google Scholar 

  9. Nallapati, R., Allan, J.: Capturing term dependencies using a sentence tree based language model. In: CIKM (2002)

    Google Scholar 

  10. Gao, J., Nie, J.-Y., Wu, G., Cao, G.: Dependence language model for information retrieval. In: SIGIR, pp. 170–177 (2004)

    Google Scholar 

  11. Nallapati, R., Allan, J.: An Adaptive Local Dependency language Model: Relaxing the Naïve Bayes Assumption. In: Workshop on Mathematical and Formal Models in IR, ACM Special Interest Group in Information Retrieval (2003)

    Google Scholar 

  12. Hays, D G.: Dependency theory: A formalism and some observations. Language 40, 511–525 (1964)

    Article  Google Scholar 

  13. Lin, D.: Principar—an efficient, broadcoverage, principle-based parser. In: Proceedings of COLING–94, Kyoto, Japan, pp. 482–488 (1994)

    Google Scholar 

  14. Zhai, C., Lafferty, J.: A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval. In: Proc. of SIGIR (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Takashi Washio Zhi-Hua Zhou Joshua Zhexue Huang Xiaohua Hu Jinyan Li Chao Xie Jieyue He Deqing Zou Kuan-Ching Li Mário M. Freire

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cai, K., Chen, C., Bu, J., Qiu, G., Huang, P. (2007). A Multi-dependency Language Modeling Approach to Information Retrieval. In: Washio, T., et al. Emerging Technologies in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science(), vol 4819. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77018-3_48

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77018-3_48

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77016-9

  • Online ISBN: 978-3-540-77018-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics