Abstract
The CARMEN project (“Content Analysis, Retrieval and Metadata: Effective Networking”) aimed, among other goals, to improve the expansion of searches in bibliographic databases into Internet searches. We pursued several approaches to the treatment of semantic heterogeneity (meta-data extraction, and query translation using statistical relations and cross-concordances). This paper describes the concepts and implementation of these approaches and evaluates their impact on retrieval results.
CARMEN was funded by the German Federal Ministry of Education and Research in the context of the programme “Global Info”, FKZ 08SFC08 3.
© 2002 Springer-Verlag Berlin Heidelberg
Cite this paper
Strötgen, R. (2002). Meta-data Extraction and Query Translation. Treatment of Semantic Heterogeneity. In: Agosti, M., Thanos, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2002. Lecture Notes in Computer Science, vol 2458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45747-X_27
DOI: https://doi.org/10.1007/3-540-45747-X_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44178-6
Online ISBN: 978-3-540-45747-3
eBook Packages: Springer Book Archive