Hindi, Telugu, Oromo, English CLIR Evaluation

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4730))

Abstract

This paper presents the Cross Language Information Retrieval (CLIR) experiments of the Language Technologies Research Centre (LTRC, IIIT-Hyderabad), conducted as part of our participation in the ad-hoc track of CLEF 2006. This is our first participation in the CLEF evaluation campaign, and we focused on Afaan Oromo, Hindi and Telugu as source (query) languages for retrieving documents from an English text collection. We used a dictionary-based approach for CLIR. After a brief description of our CLIR system, we discuss the evaluation results of the various experiments we conducted using the CLEF 2006 dataset.

Editor information

Carol Peters, Paul Clough, Fredric C. Gey, Jussi Karlgren, Bernardo Magnini, Douglas W. Oard, Maarten de Rijke, Maximilian Stempfhuber

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pingali, P., Tune, K.K., Varma, V. (2007). Hindi, Telugu, Oromo, English CLIR Evaluation. In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_4

  • DOI: https://doi.org/10.1007/978-3-540-74999-8_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74998-1

  • Online ISBN: 978-3-540-74999-8

  • eBook Packages: Computer Science, Computer Science (R0)
