Skip to main content

GIRT and the Use of Subject Metadata for Retrieval

  • Conference paper
Multilingual Information Access for Text, Speech and Images (CLEF 2004)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3491))

Included in the following conference series:

Abstract

The use of domain-specific metadata (subject keywords) is tested for monolingual and bilingual retrieval on the GIRT social science collection. A new technique, Entry Vocabulary Modules, which adds subject keywords selected from the controlled vocabulary to the query, has been tested. As in previous years, we compare our techniques of thesaurus matching and Entry Vocabulary Modules to simple machine translation techniques in bilingual retrieval. A combination of machine translation and thesaurus matching achieves better results, whereas the introduction of Entry Vocabulary Modules has negligent impact on the retrieval results. Retrieval results for the German and English GIRT collection for monolingual as well as bilingual retrieval (with English and German as query languages) will be represented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chen, A., Cooper, W., Gey, F.: Full text retrieval based on probabilistic equations with coefficients fitted by logistic regression. In: Harman, D.K. (ed.) The Second Text Retrieval Conference (TREC-2), March 1994, pp. 57–66 (1994)

    Google Scholar 

  2. Plaunt, C., Norgard, B.A.: An Association-Based Method for Automatic Indexing with Controlled Vocabulary. Journal of the American Society for Information Science 49(10), 888–902 (1998)

    Google Scholar 

  3. Gey, F., et al.: Advanced Search Technology for Unfamiliar Metadata. In: Proceedings of the Third IEEE Metadata Conference, Bethesda, Maryland (April 1999)

    Google Scholar 

  4. Petras, V., Perelman, N., Gey, F.: Using Thesauri in Cross-Language Retrieval of German and French Indexed Collections. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 349–362. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  5. Petras, V., Perelman, N., Gey, F.: UC Berkeley at CLEF-2003 – Russian Language Experiments and Domain-Specific Retrieval. In: Peters, C., Braschler, M., Gonzalo, J. (eds.) CLEF 2002. LNCS, vol. 2785, pp. 401–411. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  6. Schott, H.: Thesaurus for the Social Sciences, vol. 1 (German-English), vol. 2 (English-German). Informations-Zentrum Sozialwissenschaften, Bonn (2000)

    Google Scholar 

  7. Chen, A., Gey, F.: Multilingual Information Retrieval Using Machine Translation, Relevance Feedback and Decompounding. Information Retrieval 7(1-2), 149–182 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Petras, V. (2005). GIRT and the Use of Subject Metadata for Retrieval. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_31

Download citation

  • DOI: https://doi.org/10.1007/11519645_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27420-9

  • Online ISBN: 978-3-540-32051-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics