Abstract
TopiCA is an architecture for a distributed information navigation infrastructure. It aims to organize the information space across interconnected digital libraries around subject areas, in order to provide a sound basis for semantic information discovery and retrieval. TopiCA links document bases according to topic similarity, so that clusters of document combinations are formed. The system provides topic-level browsing support and couples the search process with a lexicographic facility that uses a controlled vocabulary to find variant forms of terms and to support term suggestion.
The paper describes the fundamentals of the TopiCA topology and explains how the system imposes a semantic organization on the distributed information space. This organization supports a range of activities, from a well-defined search for a specific document to a non-specific desire to understand what information is available in a federation of digital libraries.
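As a rough illustration of linking document bases by topic similarity, the following Python sketch groups document bases whose topic descriptors overlap. It is not the TopiCA algorithm: the Jaccard measure, the single-link grouping, the threshold value, and all names (`jaccard`, `cluster_document_bases`, the sample document bases) are assumptions chosen only to make the idea concrete.

```python
from itertools import combinations


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard overlap between two sets of topic terms (assumed similarity measure)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_document_bases(
    topics: dict[str, set[str]], threshold: float = 0.3
) -> list[set[str]]:
    """Group document bases whose topic descriptors overlap at or above `threshold`.

    Uses single-link grouping via union-find: two document bases end up in the
    same cluster if a chain of sufficiently similar pairs connects them.
    """
    parent = {name: name for name in topics}

    def find(x: str) -> str:
        # Follow parent links to the cluster representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(topics, 2):
        if jaccard(topics[a], topics[b]) >= threshold:
            parent[find(a)] = find(b)

    clusters: dict[str, set[str]] = {}
    for name in topics:
        clusters.setdefault(find(name), set()).add(name)
    # Sort clusters by their member names so the output order is deterministic.
    return sorted(clusters.values(), key=lambda c: sorted(c))


if __name__ == "__main__":
    # Hypothetical document bases with hand-picked topic terms.
    sample = {
        "medline_db": {"medicine", "disease", "treatment", "drug"},
        "pharma_db": {"drug", "treatment", "chemistry", "medicine"},
        "law_db": {"law", "court", "contract"},
    }
    for cluster in cluster_document_bases(sample):
        print(sorted(cluster))
```

In this sketch the two medical document bases share three of five distinct terms and fall into one cluster, while the legal document base stays on its own. A real federation would derive the topic descriptors from the document bases themselves and would likely normalize terms against a controlled vocabulary before comparing them.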
© 1998 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Papazoglou, M.P., Weigand, H., Milliner, S. (1998). TopiCA: A Semantic Framework for Landscaping the Information Space in Federated Digital Libraries. In: Spaccapietra, S., Maryanski, F. (eds) Data Mining and Reverse Engineering. IFIP — The International Federation for Information Processing. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-35300-5_13
DOI: https://doi.org/10.1007/978-0-387-35300-5_13
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4757-4910-6
Online ISBN: 978-0-387-35300-5
eBook Packages: Springer Book Archive