Abstract
Due to the popularity of link-based applications like Wikipedia, one of the most important issues in online research is how to alleviate information overload on the World Wide Web (WWW) and facilitate effective information-seeking. To address the problem, we propose a semantically-based navigation application that is based on the theories and techniques of link mining, semantic relatedness analysis and text summarization. Our goal is to develop an application that assists users in efficiently finding the related subtopics for a seed query and then quickly checking the content of articles. We establish a topic network by analyzing the internal links of Wikipedia and applying the Normalized Google Distance algorithm in order to quantify the strength of the semantic relationships between articles via key terms. To help users explore and read topic-related articles, we propose a SNA-based summarization approach to summarize articles. To visualize the topic network more efficiently, we develop a semantically-based WikiMap to help users navigate Wikipedia effectively.
Chapter PDF
References
Almpanidisa, G., Kotropoulos, C., Pitas, I.: Combining Text and Link Analysis for Focused Crawling—An Application for Vertical Search Engines. Information Systems 32(6), 886–908 (2007)
Brin, S., Page, L.: The Anatomy of a Large-scale Hypertextual Web Search Engine. Computer Network 30(1-7), 107–117 (1998)
Cilibrasi, R., Vitányi, P.: The Google Similarity Distance. In: IEEE Transactions on Knowledge and Data Engineering, pp. 370–383. IEEE Press, New York (2007)
Evangelista, A., Kjos-Hanssen, B.: Google Distance between Words. Frontiers in Undergraduate Research. University of Connecticut (2006)
Finkelstein, L., Gabrilovich, Y.M., Rivlin, E., Solan, Z., Wolfman, G., Ruppin, E.: Placing search in context: the concept revisited. ACM Transactions on Information Systems (TOIS) 20(1), 406–414 (2001)
Forsyth, R., Rada, R.: Adding an Edge in Machine Learning: Applications in Expert Systems and Information Retrieval. Ellis Horwood Ltd., pp.198-212 (1986)
Hahn, U., Mani, I.: The Challenges of Automatic Summarization. Journal of IEEE Computer 33(11), 29–36 (2000)
Hovy, E., Lin, C.Y.: Identifying Topic by Position. In: Proceedings of the 5th Conference on Applied Natural Language Processing (ANLP), Washington, DC, pp. 283–290 (1997)
Liu (ed.): Web Data Mining: Exploring Hyperlinks, Contents and Usage Data. Springer, New York (2007)
Radev, D.R., Jing, H., Stys, M., Tam, D.: Centroid-based Summarization of Multiple Documents. Information Processing and Management 40(6), 919–938 (2004)
Teufel, S., Moens, M.: Sentence extraction as a classification task. In: Proceedings of the Workshop on Intelligent Scalable Summarization. ACL/EACL Conference, Madrid, Spain, pp. 58–65 (1999)
Wasserman, S., Faust, K. (eds.): Social Network Analysis: Methods and Applications. Cambridge University Press, UK (1994)
Wu, I.-C., Wu, C.-Y.: Using Internal Link and Social Network Analysis to Support Searches in Wikipedia: A Model and Its Evaluation. Journal of Information Science 37(2), 189–207 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, IC., Lin, YS., Liu, CH. (2011). An Exploratory Study of Navigating Wikipedia Semantically: Model and Application. In: Ozok, A.A., Zaphiris, P. (eds) Online Communities and Social Computing. OCSC 2011. Lecture Notes in Computer Science, vol 6778. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21796-8_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-21796-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21795-1
Online ISBN: 978-3-642-21796-8
eBook Packages: Computer ScienceComputer Science (R0)