Advertisement

Two-Stage Refinement of Query Translation in a Pivot Language Approach to Cross-Lingual Information Retrieval: An Experiment at CLEF 2003

  • Kazuaki Kishida
  • Noriko Kando
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3237)

Abstract

This paper reports experimental results of cross-lingual information retrieval from German to Italian. The authors are concerned with CLIR in cases where available language resources are very limited. Thus transitive translation of queries using English as a pivot language was used to search Italian document collections for German queries without any direct bilingual dictionary or MT system for these two languages. In order to remove irrelevant translations produced by the transitive translation, we propose a new disambiguation technique, in which two stages of refinement of query translation are executed. Basically, this refinement is based on the idea of pseudo relevance feedback. Our experimental results show that the two-stage refinement method is able to significantly improve search performance of bilingual IR using a pivot language.

Keywords

Information Retrieval Query Expansion Bilingual Dictionary Pseudo Relevance Feedback Query Translation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Ballesteros, L.A., Croft, W.B.: Phrasal translation and query expansion techniques for cross-language information retrieval. In: Proceedings of the 20th ACM SIGIR conference on Research and Development in Information Retrieval, pp. 84–91 (1977)Google Scholar
  2. 2.
    Ballesteros, L., Croft, W.B.: Resolving ambiguity for cross-language retrieval. In: Proceedings of the 21st ACM SIGIR conference on Research and Development in Information Retrieval, pp. 64–71 (1988)Google Scholar
  3. 3.
    Yamabana, K., Muraki, K., Doi, S., Kamei, S.: A language conversion front-end for cross-language information retrieval. In: Grefenstette, G. (ed.) Cross-Language Information Retrieval, pp. 93–104. Kluwer, Boston (1998)Google Scholar
  4. 4.
    Gao, J., Nie, J.Y., Xun, E.X., Zhang, J., Zhou, M., Huang, C.: Improving query translation for cross-language information retrieval using statistical models. In: Proceedings of 24th ACM SIGIR conference on Research and Development in Information Retrieval, pp. 96–104 (2001)Google Scholar
  5. 5.
    Lin, C.J., Lin, W.C., Bian, G.W., Chen, H.H.: Description of the NTU Japanese-English cross-lingual information retrieval system used for NTCIR workshop. In: Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, National Institute of Informatics, Tokyo (1999), http://research.nii.ac.jp/ntcir/workshop/
  6. 6.
    Sadat, F., Maeda, A., Yoshikawa, M., Uemura, S.: Query expansion techniques for the CLEF Bilingual track. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 177–184. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  7. 7.
    Adriani, M.: English-Dutch CLIR using query translation techniques. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 219–225. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  8. 8.
    Qu, Y., Grefenstette, G., Evans, D.A.: Resolving translation ambiguity using monolingual corpora: a report on Clairvoyance CLEF 2002 experiments. In: Working Notes for the CLEF 2002 Workshop, pp. 115–126 (2002)Google Scholar
  9. 9.
    Seo, H.C., Kim, S.B., Kim, B.I., Rim, H.C., Lee, S.Z.: KUNLP system for NTCIR-3 English-Korean cross-language information retrieval. In: Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering, National Institute of Informatics, Tokyo (2003), http://research.nii.ac.jp/ntcir/workshop/
  10. 10.
    Ballesteros, L.A.: Cross-language retrieval via transitive translation. In: Croft, W.B. (ed.) Advances in Information Retrieval: Recent Research from the Center for Intelligent Information Retrieval, pp. 203–234. Kluwer, Boston (2000)Google Scholar
  11. 11.
    Franz, M., McCarley, J.S., Roukos, S.: Ad hoc and multilingual information retrieval at IBM. In: Proceedings of the TREC-7, National Institute of Standards and Technology, Gaithersburg (1999), http://trec.nist.gov/pubs/
  12. 12.
    Gey, F.C., Jiang, H., Chen, A., Larson, R.R.: Manual queries and machine translation in cross-language retrieval at TREC-7. In: Proceedings of the TREC-7, National Institute of Standards and Technology, Gaithersburg (1999), http://trec.nist.gov/pubs/
  13. 13.
    Hiemstra, D., Kraaij, W.: Twenty-one at TREC-7: ad-hoc and cross-language track. In: Proceedings of the TREC-7, National Institute of Standards and Technology, Gaithersburg (1999), http://trec.nist.gov/pubs/
  14. 14.
    Chen, A., Gey, F.C.: Experiments on cross-language and patent retrieval at NTCIR-3 workshop. In: Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering, National Institute of Informatics, Tokyo (2003), http://research.nii.ac.jp/ntcir/workshop/ Google Scholar
  15. 15.
    Lin, W.C., Chen, H.H.: Description of NTU approach to Multilingual Information retrieval. In: Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering, National Institute of Informatics, Tokyo (2003), http://research.nii.ac.jp/ntcir/workshop/ Google Scholar
  16. 16.
    Gollins, T., Sanderson, M.: Improving cross language information retrieval with triangulated translation. In: Proceedings of the 24th ACM SIGIR conference on Research and Development in Information Retrieval, pp. 90–95 (2001)Google Scholar
  17. 17.
    Porter, M.F.: An algorithm for suffix stripping. Program 14, 130–137 (1980)Google Scholar
  18. 18.
    Robertson, S.E., Walker, S., Jones, S., Hancock-Beaulieu, M.M., Gatford, M.: Okapi at TREC-3. In: Proceedings of TREC-3, National Institute of Standards and Technology, Gaithersburg (1995), http://trec.nist.gov/pubs/ Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Kazuaki Kishida
    • 1
  • Noriko Kando
    • 2
  1. 1.Surugadai UniversityHanno, SaitamaJapan
  2. 2.National Institute of Informatics (NII)TokyoJapan

Personalised recommendations