A Study on Web Clustering with Respect to XiangShan Science Conference

Zhang, Wen; Tang, Xijin

doi:10.1007/978-3-540-78733-4_7

Wen Zhang¹⁰ &
Xijin Tang¹⁰

Part of the book series: Studies in Computational Intelligence ((SCI,volume 123))

434 Accesses
1 Citations

Summary

This paper has presented two clustering results using two different methods to cluster the same Boolean vectors represented the Web documents of XiangShan Science Conference (XSSC). Then, average co-occurrence and average difference are introduced to evaluate the effectiveness of theses two different clustering methods. With these two indicators, the evaluation of experimental results from these two clustering methods is presented. Also, an extended research on Web clustering is presented in this paper, that is, the automatic concepts generation. At last, the reliability of the automatic concept generation is discussed in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Han, J.W., Kamber, M.: Data Mining Concepts and Techniques. Mogran Kaufmann Publishers, 2001. 335
Google Scholar
Macqueen, J.: Some methods for classification and analysis of multivariate observations. In: LeCam, L. M. and Neyman, J. (eds.): Proceedings of the 5th Berkeley Symposium on Mathematics and Statistics, Berkeley: University of California Press, 1967. 281-297.
Google Scholar
Zhang, T., Ramakrishnan, R., and Livny, M.: BIRCH: An effective data clustering method for very large database. In Proceedings of the 1996 ACM-SIGMOD conference International Conference on Management of Data, Montreal, Canada, 1996. 103-114.
Google Scholar
Ester, M., Kriegel, H. P. and Sander, J.: Spatial data mining: A database approach. In Proceedings of Symposium on Large Spatial Databases (SSD’97), Berlin, Germany, July 1997. 47-66.
Google Scholar
Xia, H.X. et al.: Ant-based text clustering using semantic similarity measure: progress report and first stage experiment. In: Gu, J.F. and Chroust, G. (eds.): Proceedings of the First World Congress of the International Federation for Systems Research, Kobe, Japan, 2005. 428
Google Scholar
Fisher, D.: Improving inference through conceptual clustering. In proceeding of 1987 AAAI Conference, Seattle, Washington, July 1987. 461–465.
Google Scholar
Liu, Y. J.: Studies on Creativity Support System. Doctoral dissertation, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, 2006. (in Chinese)
Google Scholar
Tang, X.J., Liu, Y.J., Zhang, W.: Computerized Support for Idea Generation during Knowledge Creating Process. R. In: Khosla, R. J. Howlett, and L. C. Jain (eds.): Knowledge-Based Intelligent Information & Engineering Systems (proceedings of KES’2005, Part IV), Lecture Notes on Artificial Intelligence, Vol.3684, Springer-Verlag, Berlin Heidelberg, 2005. 437–443.
Google Scholar
Chen, X. and Womersley, R.S.: A parallel inexact Newton method for stochastic programs with recourse. Annals of Operations Research. 64(1996) 113-141. online: http://citeseer.ist.psu.edu/article/chen96parallel.html
Google Scholar
Lawrence S., Giles C.L.: Searching the World Wide Web. Science, 280 (1998) 98–100.
Article Google Scholar
Liu, N.K., Luo, W.D., Chan, M.C.: Design and Implement a Web News Retrieval System. In: R. Khosla, R. J. Howlett and L. C. Jain (eds.): Knowledge-Based Intelligent Information & Engineering Systems (proceedings of KES’2005, Part III), LNAI 3683, Springer, 2005, 149-156.
Google Scholar
Zhang, W.: Information support tool based on Web Text mining and its application. Master thesis, 2006. Academy of Mathematics and Systems Science, Chinese Academy of Sciences. (in Chinese)
Google Scholar
Van Rijsbergen, C.J.: Information Retrieval, 2nd Edition, Butterworths, London, UK, 1979, 213, 214.
Google Scholar
Miller, G.: WordNet: a lexical database for English. Communication of the ACM, 38(11), 1995. 39–41.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Systems Science, Academy of Mathematics and Systems Science Chinese Academy of Sciences, Beijing, 100080, P.R. China
Wen Zhang & Xijin Tang

Authors

Wen Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xijin Tang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
Shuichi Iwata
Department of Systems Innovation School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
Yukio Ohsawa
Department of Medical Informatics School of Medicine, Shimane University, Enya-cho Izumo City Shimane, 693-8501, Japan
Shusaku Tsumoto
Department of Information Engineering, Maebashi Institute of Technology, 460-1, Kamisadori-Cho, Maebashi-City, 371-0816, Japan
Ning Zhong (Director) (Director)
WICI/BJUT, China
Ning Zhong (Director) (Director)
Research Center on Data Technology and Knowledge Economy, Chinese Academy of Sciences, Beijing, 100080, PR China
Yong Shi (Director) (Director)
College of Information Science and Technology, University of Nebraska, Omaha, NE, 68182, USA
Yong Shi (Director) (Director)
Department of Philosophy, University of Pavia, Piazza Botta 6, 27100, Pavia, Italy
Lorenzo Magnani

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zhang, W., Tang, X. (2008). A Study on Web Clustering with Respect to XiangShan Science Conference. In: Iwata, S., Ohsawa, Y., Tsumoto, S., Zhong, N., Shi, Y., Magnani, L. (eds) Communications and Discoveries from Multidisciplinary Data. Studies in Computational Intelligence, vol 123. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78733-4_7

Download citation

DOI: https://doi.org/10.1007/978-3-540-78733-4_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78732-7
Online ISBN: 978-3-540-78733-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics