Skip to main content

Bengali, Hindi and Telugu to English Ad-Hoc Bilingual Task at CLEF 2007

  • Conference paper
Book cover Advances in Multilingual and Multimodal Information Retrieval (CLEF 2007)

Abstract

This paper presents the experiments carried out at Jadavpur University as part of the participation in the CLEF 2007 ad-hoc bilingual task. This is our first participation in the CLEF evaluation task and we have considered Bengali, Hindi and Telugu as query languages for the retrieval from English document collection. We have discussed our Bengali, Hindi and Telugu to English CLIR system as part of the ad-hoc bilingual task, the English IR system for the ad-hoc monolingual task and the associated experiments at CLEF. Query construction was manual for Telugu-English ad-hoc bilingual task, while it was automatic for all other tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Oard, D.: Alternative Approaches for Cross Language Text Retrieval. In: AAAI Symposium on Cross Language Text and Speech Retrieval, USA (1997)

    Google Scholar 

  2. Lavrenko, V., Choquette, M., Croft, W.: Cross-Lingual Relevance Models. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM 2002, pp. 175–182. ACM Press, New York (2002)

    Chapter  Google Scholar 

  3. Oard, D.: The Surprise Language Exercises. ACM Transactions on Asian Language Information Processing 2(2), 79–84 (2003)

    Article  Google Scholar 

  4. Dorr, B., Zajic, D., Schwartz, R.: Cross-language Headline Generation for Hindi. ACM Transactions on Asian Language Information Processing (TALIP) 2(3), 270–289 (2003)

    Article  Google Scholar 

  5. Sekine, S., Grishman, R.: Hindi-English Cross-Lingual Question-Answering System. ACM Transactions on Asian Language Information Processing (TALIP) 2(3), 181–192 (2003)

    Article  Google Scholar 

  6. Pingali, P., Jagarlamudi, J., Varma, V.: Webkhoj: Indian Language IR from Multiple Character Encodings. In: WWW 2006: Proceedings of the 15th International Conference on World Wide Web, pp. 801–809 (2006)

    Google Scholar 

  7. Pingali, P., Varma, V.: Hindi and Telugu to English Cross Language Information Retrieval at CLEF 2006. In: Working Notes for the CLEF 2006 Wokshop (Cross Language Adhoc Task), Alicante, Spain, 20-22 September (2006)

    Google Scholar 

  8. Bharati, A., Sangal, R., Sharma, D.M., Kulkarni, A.P.: Machine Translation Activities in India: A Survey. In: The Proceedings of Workshop on Survey on Research and Development of Machine Translation in Asian Countries (2002)

    Google Scholar 

  9. Naskar, S., Bandyopadhyay, S.: Use of Machine Translation in India: Current Status. In: Proceedings of MT SUMMIT-X, Phuket, Thailand, pp. 465–470 (2005)

    Google Scholar 

  10. CLIA Consortium: Cross Lingual Information Access System for Indian Languages. In: Demo/Exhibition of the 3rd International Joint Conference on Natural Language Processing, Hyderabad, India, pp. 973–975 (2008)

    Google Scholar 

  11. Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval, ch. 6. Cambridge University Press, Cambridge (2000)

    Google Scholar 

  12. Porter, M.F.: An Algorithm for Suffix Stripping. Program 14(3), 130–137 (1980)

    Google Scholar 

  13. Mayfield, J., McNamee, P.: Converting On-line Bilingual Dictionaries from Human-readable form to Machine-readable form. In: Proceedings of 25th Annual International ACM SIGIR Conference on Research and Development in Informational Retrieval, pp. 405–406. ACM Press, New York (2002)

    Chapter  Google Scholar 

  14. Ekbal, A., Naskar, S., Bandyopadhyay, S.: A Modified Joint Source-Channel Model for Transliteration. In: Proceedings of the COLING/ACL, Sydney, Australia, pp. 191–198 (2006)

    Google Scholar 

  15. Nunzio, G.M.D., Ferro, N., Mandi, T., Peters, C.: CLEF 2007: Ad HOC Track Overview. In: Peters, C., et al. (eds.) CLEF 2007. LNCS, vol. 5152, pp. 13–32. Springer, Heidelberg (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Carol Peters Valentin Jijkoun Thomas Mandl Henning Müller Douglas W. Oard Anselmo Peñas Vivien Petras Diana Santos

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bandyopadhyay, S., Mondal, T., Naskar, S.K., Ekbal, A., Haque, R., Godhavarthy, S.R. (2008). Bengali, Hindi and Telugu to English Ad-Hoc Bilingual Task at CLEF 2007. In: Peters, C., et al. Advances in Multilingual and Multimodal Information Retrieval. CLEF 2007. Lecture Notes in Computer Science, vol 5152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85760-0_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85760-0_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85759-4

  • Online ISBN: 978-3-540-85760-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics