Abstract
This paper develops a new method for hierarchical clustering based on a generative dendritic cluster model. The objects are viewed as being generated through a tree structured refinement process. In certain problems, this generative model naturally captures the physical mechanisms responsible for relationships among objects, for example, in genetic studies and network topology identification. The networking problem is examined in some detail, to illustrate the new clustering method. In general, the generative model is not representative of actual physical mechanisms, but it nonetheless provides a means for dealing with errors in the similarity matrix, simultaneously promoting two desirable features in clustering: intra-class similarity and inter-class dissimilarity.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Fasulo, D.: An analysis of recent work on clustering algorithms. Technical Report # 01-03-02, Department of Computer Science and Engineering, University of Washington (1999)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys 31, 264–323 (1999)
Fisher, D.: Iterative optimization and simplification of hierarchical clusterings. Journal of Artificial Intelligence Research 4, 147–180 (1996)
Murtagh, F.: A survey of recent advances in hierarchical clustering algorithms. The Computer Journal 26 (1983)
Banfield, J.D., Raftery, A.E.: Model-based gaussian and non-gaussian clustering. Biometrics 49, 803–821 (1993)
Willet, P.: Recent trends in hierarchical document clustering: a critical review. Information Processing and Management 24, 577–597 (1988)
Voorhees, E.M.: Implementing agglomerative hierarchical clustering algorithms for use in document retrieval. Information Processing and Management 22, 465–476 (1986)
El-Hamdouchi, A., Willet, P.: Hierarchic document clustering using ward’s method. In: proceedings of the Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (1986)
Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. In: ACM SIGMOD International Conference on Management of Data, Montreal, Canada, pp. 103–114 (1996)
Hofmann, T.: Unsupervised learning by probabilistic latent semantic analysis. Machine Learning 42, 177–196 (2001)
Fraley, C.: Algorithms for model-based gaussian hierarchical clustering. SIAM Journal on Scientific Computing 20, 270–281 (1998)
Vaithyanathan, S., Dom, B.: Model-based hierarchical clustering. In: Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence. Stanford University, Stanford (2000)
Kamvar, S.D., Klein, D., Manning, C.D.: Interpreting and extending classical agglomerative clustering algorithms using a model-based approach. In: International Conference on Machine Learning, ICML (2002)
Berger, J.O., Liseo, B., Wolpert, R.L.: Integrated likelihood methods for eliminating nuisance parameters. Statistical Science 14, 1–28 (1999)
Tanner, M.A.: Tools for Statistical Inference - Methods for the Exploration of Posterior Distributions and Likelihood Functions. Statistics. Springer, Heidelberg (1996)
Hastings, W.: Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57, 97–109 (1970)
Ratnasamy, S., McCanne, S.: Inference of multicast routing trees and bottleneck bandwidths using end-to-end measurements. In: Proceedings of IEEE INFOCOM 1999, New York, NY (1999)
Duffield, N., Horowitz, J., Presti, F.L., Towsley, D.: Multicast topology inference from end-to-end measurements. In: ITC Seminar on IP Traffic, Measurement and Modelling, Monterey, CA (2000)
Castro, R., Coates, M., Gadhiok, M., King, R., Nowak, R., Rombokas, E., Tsang, Y.: Maximum likelihood network topology identification from edge-based unicast measurements. Technical Report TREE0107, Department of Electrical and Computer Engineering, Rice University (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Castro, R., Nowak, R. (2003). Likelihood Based Hierarchical Clustering and Network Topology Identification. In: Rangarajan, A., Figueiredo, M., Zerubia, J. (eds) Energy Minimization Methods in Computer Vision and Pattern Recognition. EMMCVPR 2003. Lecture Notes in Computer Science, vol 2683. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45063-4_8
Download citation
DOI: https://doi.org/10.1007/978-3-540-45063-4_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40498-9
Online ISBN: 978-3-540-45063-4
eBook Packages: Springer Book Archive