Abstract
This paper reports experimental results for cross-language information retrieval (CLIR) from German to French using a hybrid approach that combines the results of query translation (German to French) and document translation (French to German). To reduce the computational cost of translating a large volume of text, documents were pseudo-translated, i.e., terms were simply replaced using a bilingual dictionary, whereas queries were translated by a machine translation system. Because English served as the intermediary (pivot) language for both translation directions between German and French, the English translations produced at the intermediate stage were used as document representations in order to reduce the number of translation steps. Omitting the English-to-German translation step improved performance. However, the hybrid approach did not outperform simple query translation. This may be due to the low performance of document translation, which was carried out by a simple replacement of terms using a bilingual dictionary with no term disambiguation.
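The two ideas in the abstract, dictionary-based pseudo-translation and the combination of two retrieval runs, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The toy dictionary `FR_EN`, the function names, the choice to keep out-of-dictionary terms unchanged, and the linear interpolation used to merge scores are all assumptions made for illustration; the abstract does not state how the two result lists were combined.

```python
from collections import defaultdict

# Hypothetical toy French-to-English dictionary; entries are illustrative only.
FR_EN = {
    "maison": ["house", "home"],
    "verte": ["green"],
}


def pseudo_translate(tokens, dictionary):
    """Replace each term with all of its dictionary translations.

    No term disambiguation is attempted. Terms missing from the
    dictionary are kept unchanged (an assumption of this sketch).
    """
    translated = []
    for token in tokens:
        translated.extend(dictionary.get(token, [token]))
    return translated


def merge_scores(run_a, run_b, weight=0.5):
    """Merge two {doc_id: score} runs by linear interpolation.

    The fusion rule is assumed, and the scores of both runs are
    assumed to be on a comparable scale.
    """
    merged = defaultdict(float)
    for doc_id, score in run_a.items():
        merged[doc_id] += weight * score
    for doc_id, score in run_b.items():
        merged[doc_id] += (1.0 - weight) * score
    # Highest combined score first.
    return sorted(merged.items(), key=lambda item: item[1], reverse=True)
```

Because every candidate translation is kept, an ambiguous term such as "maison" contributes both "house" and "home" to the document representation, which is the kind of noise the abstract suggests as a possible cause of weak document translation.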
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kishida, K., Kando, N. (2006). A Hybrid Approach to Query and Document Translation Using a Pivot Language for Cross-Language Information Retrieval. In: Peters, C., et al. Accessing Multilingual Information Repositories. CLEF 2005. Lecture Notes in Computer Science, vol 4022. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11878773_10
DOI: https://doi.org/10.1007/11878773_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45697-1
Online ISBN: 978-3-540-45700-8
eBook Packages: Computer Science, Computer Science (R0)