Abstract
This paper presents the Cross Language Information Retrieval (CLIR) experiments of Language Technologies Research Centre (LTRC, IIIT-Hyderabad) as part of our participation in the ad-hoc track of CLEF 2006. This is our first participation in the CLEF evaluation campaign and we focused on Afaan Oromo, Hindi and Telugu as source (query) languages for retrieval of documents from English text collection. We have used a dictionary based approach for CLIR. After a brief description of our CLIR system we discuss the evaluation results of various experiments we conducted using CLEF 2006 dataset.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Grefenstette, G., Grefenstette, G.: Cross-Language Information Retrieval. Kluwer Academic Publishers, Norwell (1998)
Oard, D.: Alternative approaches for cross language text retrieval. In: AAAI Symposium on Cross Language Text and Speeck Retrieval, USA (1997)
Oard, D.W.: The surprise language exercises. ACM Transactions on Asian Language Information Processing (TALIP) 2(2), 79–84 (2003)
Dorr, B., Zajic, D., Schwartz, R.: Cross-language headline generation for hindi. ACM Transactions on Asian Language Information Processing (TALIP) 2(3), 270–289 (2003)
Sekine, S., Grishman, R.: Hindi-English cross-lingual question-answering system. ACM Transactions on Asian Language Information Processing (TALIP) 2(3), 181–192 (2003)
Pingali, P., Jagarlamudi, J., Varma, V.: Webkhoj: Indian language ir from multiple character encodings. In: WWW 2006: Proceedings of the 15th international conference on World Wide Web, Edinburgh, Scotland, pp. 801–809. ACM Press, New York (2006)
Cosijn, E., Pirkola, A., Bothma, T., Jrvelin, K.: Information access in indigenous languages: a case study in Zulu. In: Proceedings of the fourth International Conference on Conceptions of Library and Information Science (CoLIS 4), Seattle, USA (2002)
Cosijn, E., Keskustalo, H., Pirkola, A.: Afrikaans - English Cross-language Information Retrieval. In: Proceedings of the 3rd biennial DISSAnet Conference, Pretoria (2004)
Alemu, A., Asker, L., Coster, R., Karlgen, J.: Dictionary Based Amharic English Information Retrieval. In: Peters, C., Clough, P.D., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds.) CLEF 2004. LNCS, vol. 3491, Springer, Heidelberg (2005)
Alemu, A., Asker, L., Coster, R., Karlgen, J.: Dictionary Based Amharic French Information Retrieval. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, Springer, Heidelberg (2006)
Alemu, A., Asker, L., Coster, R., Karlgen, J.: Dictionary Based Amharic English Information Retrieval. In: CLEF 2006, Bilingual Task (2006)
Bharati, A., Sangal, R., Sharma, D.M., Kulkarni, A.P.: Machine translation activities in India: A survey. In: the Proceedings of workshop on survey on Research and Development of Machine Translation in Asian Countries (2002)
Salton, G., Buckley, C.: Term-weighting Approaches in Automatic Text Retrieval. Information Process. Management 24(5), 513–523 (1988)
Larkey, L.S., Connell, M.E., Abduljaleel, N.: Hindi CLIR in thirty days. ACM Transactions on Asian Language Information Processing (TALIP) 2(2), 130–142 (2003)
Pingali, P., Jagarlamudi, J., Varma, V.: Experiments in Cross Language Query Focused Multi-Document Summarization. In: IJCAI 2007 Workshop on CLIA, Hyderabad, India (2007)
Pingali, P., Varma, V., Tune, K.K.: Evaluation of Oromo-English Cross-Language Information Retrieval. In: IJCAI 2007 Workshop on CLIA, Hyderabad, India (2007)
Philips, L.: The Double-Metaphone Search Algorithm. C/C++ User’s Journal 18(6) (2000)
Ponte, J.M., Croft, W.B.: A language modeling approach to information retrieval. In: SIGIR 1998: Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, pp. 275–281. ACM Press, New York (1998)
Lund, K., Burgess, C.: Producing high-dimensional semantic spaces from lexical co-occurrence. In: Behavior Research Methods, Instrumentation, and Computers, pp. 203–208 (1996)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pingali, P., Tune, K.K., Varma, V. (2007). Hindi, Telugu, Oromo, English CLIR Evaluation. In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_4
Download citation
DOI: https://doi.org/10.1007/978-3-540-74999-8_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74998-1
Online ISBN: 978-3-540-74999-8
eBook Packages: Computer ScienceComputer Science (R0)