Meta-data Extraction and Query Translation. Treatment of Semantic Heterogeneity

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2458)

Abstract

The project CARMEN¹ (“Content Analysis, Retrieval and Metadata: Effective Networking”) aimed, among other goals, to improve the expansion of searches in bibliographic databases into Internet searches. We pursued several approaches to the treatment of semantic heterogeneity (meta-data extraction, and query translation using statistical relations and cross-concordances). This paper describes the concepts and implementation of these approaches and evaluates their impact on the retrieval results.

¹ Funded by the German Federal Ministry of Education and Research in the context of the programme “Global Info”, FKZ 08SFC08 3.

http://dublincore.org/

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Strötgen, R. (2002). Meta-data Extraction and Query Translation. Treatment of Semantic Heterogeneity. In: Agosti, M., Thanos, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2002. Lecture Notes in Computer Science, vol 2458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45747-X_27

  • DOI: https://doi.org/10.1007/3-540-45747-X_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44178-6

  • Online ISBN: 978-3-540-45747-3

  • eBook Packages: Springer Book Archive
