Abstract
An important technology in knowledge discovery is accessing desired information from the large amount of data stored on the WWW. At present, such information can be accessed by browsing or by using a keyword search function. However, browsing is time consuming because a user must access individual pages one by one. Furthermore, it is generally hard for users to supply reasonable keywords for discovering their desired pages. This paper outlines an approach that integrates information visualization and retrieval to improve the effectiveness of WWW information access. In this approach, the link structure of the WWW is displayed in a 3-D hyperbolic tree in which the height of a node indicates its “interestingness” to the user. Interestingness is calculated by a fitting function between a page and user-supplied keywords, and this measure can be used to filter irrelevant pages, reducing the size of the link structure. These functions are incorporated within our browser, allowing users to discover desired pages from a large web site incrementally. Relatively large web sites were selected to show the performance of the proposed method, with improved accuracy and efficiency in WWW information access.
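The filtering idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the fitting function, so the code below assumes a cosine similarity between a page's term counts and a binary keyword vector. The names `interestingness`, `prune`, `tree`, `pages`, and `threshold` are hypothetical and are not taken from the paper. The link structure is assumed to be a tree without cycles.

```python
from collections import Counter
import math
import re


def interestingness(page_text, keywords):
    """Assumed fitting function: cosine similarity between the page's
    term counts and a binary vector over the user-supplied keywords."""
    words = re.findall(r"[a-z0-9]+", page_text.lower())
    keys = {k.lower() for k in keywords}
    if not words or not keys:
        return 0.0
    counts = Counter(words)
    dot = sum(counts[k] for k in keys)
    page_norm = math.sqrt(sum(c * c for c in counts.values()))
    key_norm = math.sqrt(len(keys))
    return dot / (page_norm * key_norm)


def prune(tree, pages, root, keywords, threshold):
    """Keep a node if its score reaches the threshold or if any of its
    descendants is kept, so that relevant pages stay reachable.

    tree:  dict mapping a URL to the list of its child URLs (hypothetical)
    pages: dict mapping a URL to its page text (hypothetical)
    Returns a dict mapping each kept URL to (score, kept_children).
    """
    kept = {}

    def visit(url):
        score = interestingness(pages.get(url, ""), keywords)
        children = [c for c in tree.get(url, []) if visit(c)]
        if score >= threshold or children:
            kept[url] = (score, children)
            return True
        return False

    visit(root)
    return kept
```

In a visualization, the score stored for each kept node could serve as the node's height, and raising `threshold` would shrink the displayed link structure step by step.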
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ohwada, H., Mizoguchi, F. (2000). Integrating Information Visualization and Retrieval for Discovering Internet Sources. In: Arikawa, S., Morishita, S. (eds) Discovery Science. DS 2000. Lecture Notes in Computer Science, vol 1967. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44418-1_4
DOI: https://doi.org/10.1007/3-540-44418-1_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41352-3
Online ISBN: 978-3-540-44418-3
eBook Packages: Springer Book Archive