Summary
This paper presents two clustering results obtained by applying two different methods to the same Boolean vectors representing the Web documents of the XiangShan Science Conference (XSSC). Average co-occurrence and average difference are then introduced to evaluate the effectiveness of these two clustering methods, and the experimental results of both methods are evaluated with these two indicators. The paper also presents an extension of the Web clustering work, namely automatic concept generation. Finally, the reliability of the automatic concept generation is discussed.
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Zhang, W., Tang, X. (2008). A Study on Web Clustering with Respect to XiangShan Science Conference. In: Iwata, S., Ohsawa, Y., Tsumoto, S., Zhong, N., Shi, Y., Magnani, L. (eds) Communications and Discoveries from Multidisciplinary Data. Studies in Computational Intelligence, vol 123. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78733-4_7
DOI: https://doi.org/10.1007/978-3-540-78733-4_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78732-7
Online ISBN: 978-3-540-78733-4
eBook Packages: Engineering, Engineering (R0)