Summary
Cluster Analysis is at the crossroad of many disciplines, and has numerous and diverse methods and applications. Mathematical Programming (together with graph theory, complexity theory and data structures) can be used to give a coherent view of the field. This leads to define clustering paradigms, specify problems within these paradigms as mathematical programs, study their complexity, design polynomial algorithms for easy problems, non polynomial ones or heuristics for hard ones and evaluate results obtained. An overview of this line of research is given, stressing exact algorithms.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Aggarwal, A., Imai, H., Katoh, N., and Suri, S. (1991): Finding k Points with Minimum Diameter and Related Problems. Journal of Algorithms, 12, 38–56.
Arthanari, T.S., and Dodge, Y. (1981): Mathematical Programming in Statistics. Wiley, New York.
Balas, E., and Xue, J. (1993): Weighted and Unweighted Maximum Clique Algorithms with Upper Bounds from Fractional Coloring. Research Report [num] MSSR-590, Carnegie Mellon.
Bandelt, H.J., and Dress, A.W.M. (1989): Weak Hierarchies Associated with Similarity Measures: an Additive Clustering Technique. Bulletin of Mathematical Biology, 51, 133–166.
Beasley, J.E. (1985): A Note on Solving Large p-median Problems. European Journal of Operational Research, 21, 270–273.
Benzecri, J.P. (1982): Construction d’une classification ascendante hiérarchique par la recherche en chaîne des voisins réciproques. Les Cahiers de FAnalyse des Données, 7, 209–218.
Bezdek, J.C. (1981): Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York.
Boros, E., and Hammer, P.L. (1989): On Clustering Problems with Connected Optima in Euclidean Spaces. Discrete Mathematics, 75, 81–88.
Brucker, P. (1978): On the Complexity of Clustering Problem. In: M. Beckmann and H.P. Kiinzi (eds.): Optimization and Operations Research. Springer, Heidelberg, 45–54.
Bruynooghe, M. (1978): Classification ascendante hiérarchique des grands ensembles de données: un algorithme rapide fondé sur 1a. construction des voisinages réductibles. Les Cahiers de F Analyse des Données, 3, 7–33.
Carraghan, R., and Pardalos, P.M. (1990): An Exact Algorithm for the Maximum Clique Problem. Operations Research Letters, 9, 375–382.
Chaillou, P., Hansen, P., and Mahieu, Y. (1989): Best Network Flow Bounds for the Quadratic Knapsack Problem. In: B. Simeone (ed.): Combinatorial Optimization. Lecture Notes in Mathematics, 1403, 225–235.
Chandon, J.L., Lemaire, J., and Pouget, J. (1980): Construction de l’ultramétrique la plus proche d’une dissimilarité au sens des moindres carrés. RAIRO-Recherche Opérationnelle, 14, 157–170.
Christofides, N., and Beasley, J.E. (1982): A Tree Search Algorithm for the p-median Problem. European Journal of Operational Research, 10, 196–204.
Cooper, L. (1964): Heuristic Methods for Location-Allocation Problems. SI AM Review, 6, 37–53.
Day, W.H.E., and Edelsbrunner, H. (1984): Efficient Algorithms for Ag- glomerative Hierarchical Clustering Methods. Journal of Classification, 1, 7–24.
Diehr,G. (1985): Evaluation of a Branch and Bound Algorithm for Clustering. SI AM Journal on Scientific and Statistical Computing, 6, 268–284.
Dorndorf, U., and Pesch, E. (1994): Fast Clustering Algorithms. ORSA Journal on Computing 6, 141–153.
Delattre, M., and Hansen, P. (1980): Bicriterion Cluster Analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI 2, 277–291.
Erlenkotter, D. (1978): A Dual-based Procedure for Uncapacitated Facility Location. Operations Research, 26, 1590–1602.
Gallo, G., Hammer, P.L., and Simeone, B. (1980): Quadratic Knapsack Problem. Mathematical Programming Study, 12, 132–149.
Gelinas, S., Hansen, P., and Jaumard, B. (1995): A Labelling Algorithm for Minimum Sum of Diameters Partitioning of Graphs. In: I. Cox, P. Hansen and B. Julesz (eds): Paiiitioning Data Sets. American Mathematical Society, Providence, 89–96.
Gordon, A. (1981): Classification. Chapman and Hall, London.
Gordon, A. (1994): Clustering Algorithms and Cluster Validation. In: P. Dir- schedl and R. Osterman (eds.): Computational Statistics. Physica-Verlag, Heidelberg, 497–512.
Gower, J.C., and Ross, G.J.S. (1969): Minimum Spanning Trees and Single Linkage Cluster Analysis. Applied Statistics, 18, 54–64.
Guenoche, A. (1989): Partitions with Minimum Diameter. Paper presented at the second Conference of the International Federation of Classification Societies (IFCS-89). Charlottesville.
Guenoche, A., Hansen, P., and Jaumard, B. (1991): Efficient Algorithms for Divisive Hierarchical Clustering with the Diameter Criterion. Journal of Classification, 8, 5–30.
Grotschel, M., and Wakabayashi, Y. (1989): A Cutting Plane Algorithm for a Clustering Problem. Mathematical Programming, 45, 59–96.
Grotschel, M., and Wakabayashi, Y. (1990): Facets of the Clique Partitioning Polytope. Matheinatical Programming, 47, 367–387.
Hansen, P., and Delattre, M. (1978): Complete-Link Cluster Analysis by Graph Coloring. Journal of the American Statistical Association, 73, 397–403.
Hansen, P., and Jaumard, B. (1987): Minimum Sum of Diameters Clustering. Journal of Classification, 4, 215–226.
Hansen, P., Jaumard, B., and Frank, O. (1989): Maximum Sum-of-Splits Clustering. Journal of Classification, 6, 177–193.
Hansen, P., Jaumard, B., and Krau, S. (1995): A Column Generation Algorithm for the Multisource Weber Problem. In preparation.
Hansen, P., Jaumard, B., and Mladenovic, N. (1995a): How to Choose k Entities among N. In: I. Cox, P. Hansen and B. Julesz (eds): Partitioning Data Sets. American Mathematical Society, Providence, 105–116.
Hansen, P., Jaumard, B., and Mladenovic, N. (1995b): Sequential Cluster Analysis with Split and Radius Criteria. In preparation.
Hansen, P., Jaumard, B., and Sanlaville, E. (1993): Weight Constrained Minimum Sum-of-Stars Clustering. Les Cahiers du GERAD, G-93-38, Montreal, Canada. To appear in Journal of Classification.
Hansen, P., Jaumard, B., and da Silva, E. (1991): Average-Linkage Divisive Hierarchical Clustering. Les Cahiers du GERAD, G-91-55, Montreal, Canada. To appear in Journal of Classification.
Hansen, P., Jaumard, B., Simeone, B., and Döring, V. (1993): Maximum Split Clustering under Connectivity Constraints. Les Cahiers du GERAD, G-93-06, Montreal, Canada.
Hansen, P., Minoux, M., and Labbe, M. (1987): Extension de la programmation linéaire généralisée au cas des programmes mixtes. Comptes Rendus de VAcadémie des Sciences, Paris, 305, 569–572.
Hansen, P., Labbe, M., and Minoux, M. (1995): Thep-center Sum Location Problem. Les Cahiers du GERAD, G-94-30, Montréal, Canada.
Hansen, P., and Mladenovic, N. (1992): Two Algorithms for Maximum Cliques in Dense Graphs. Les Cahiers du GERAD, G-92-18, Montréal, Canada. To appear in European Journal of Operational Research.
Hartigan, J.A. (1975): Clustering Algorithms. Wiley, New York.
Hubert, L.J. (1974): Some Applications of Graph Theory to Clustering. Psy- cliometrika 39, 283–309.
Hwang, F.K., Rothblum, U.G., and Yao, Y.-C. (1995): Localizing Combinatorial Properties of Partitions ATamp;T Bell Labs Report.
Jambu, M. (1976): Classification automatique pour Vanalyse des données. Tome 1. Dunod, Paris.
Jambu, M. (1991): Exploratory and Multivariate Data Analysis. Academic Press, New York.
Janowitz, M.F. (1978): An Order Theoretic Model for Cluster Analysis. SIAM Journal on Applied Mathematics 34, 55–72.
Jensen, R.E. (1969): A Dynamic Programming Algorithm for Cluster Analysis. Operations Research 17, 1034–1057.
Johnson, E.L., Mehrotra, A., and Nemhauser, G.L. (1993): Min-cut Clustering. Mathematical Programming, 62, 133–151.
Kariv, and Hakimi, S.L. (1979): An Algorithmic Approach to Network Location Problems. The p-Medians. SIAM Journal on Applied Mathematics, 37, 539–560.
Kaufman, L., and Rousseeuw, P.J. (1990): Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York.
Klein, G., and Aronson, J.E. (1991): Optimal Clustering: A Model and Method. Naval Research Logistics, 38, 311–323.
Korkel, M. (1986): On the Exact Solution of Large Scale Simple Plant Location with Single Source Constraints. Journal of the Operational Research Society, 37, 495–500.
Krivanek, M. (1986): On the Computational Complexity of Clustering. In: E. Diday etal. (eds): Data Analysis and Informatics, 4. North-Holland, Amsterdam, 89–96.
Krivanek, M., and Moravek, J. (1986): NP-Hard Problems in Hierarchical- Tree Clustering. Acta Informatica, 23, 311–323.
Lance, G.N., and Williams, W.T. (1967): A General Theory of Classificatory Sorting Strategies. 1. Hierarchical Systems. The Computer Journal, 9, 373–380.
Leclerc, B. (1994): The Residuation Model for the Ordinal Construction of Dissimilarities and other Valued Objects. In: B. Van Cutsem (ed.): Classification and Dissimilarity Analysis. Lecture Notes in Statistics, no. 93, Springer-Verlag, New York, 149–172.
Leclerc, B. (1981): Description Combinatoire des Ultramétriques. Mathématiques et Sciences Humaines 73, 5–31.
Marcotorchino, J.F., and Michaud, P. (1979): Optimisation en analyse ordinale des données. Paris, Masson.
Martello, S., Soumis, F., and Toth, P. (1992): An Exact Algorithm for Makespan Minimization on Unrelated Parallel Machines. In: E. Balas, G. Cornue- jols and R. Kannan (eds): Proceedings Second IPCO Conference. Carnegie-Mellon University, 181–200.
Massart, D.L., Plastria, F., and Kaufman, L. (1983): Non-Hierarchical Clustering with MASLOC. Pattern Recognition, 16, 507–516.
Massart, D.L., and Kaufman, L. (1983): The Interpretation of Analytical Chemical Data by the use of Cluster Analysis. Wiley, New York.
Minieka, E. (1970): The M-Center Problem. SI AM Review, 12, 138–139.
Mirkin, B. (1987): Additive Clustering and Qualitative Factor Analysis Methods for Similarity matrices. Journal of Classification, 4, 7–31.
Monma, C., and Suri, S. (1991): Partitioning Points and Graphs to Minimize the Maximum or the Sum of Diameters. In: Y. Alari, G. Chartrand, O.R. 01- lerman and A.J. Schwenk (eds): Proceedings of the 6 th Quadrennial International Conference on the Theory and Applications of Graphs. Wiley, New York.
Mulvey, J.M., and Beck, M.P. (1984): Solving Capacitated Clustering Problems. European Journal of Operational Research, 18, 339–398.
Murtagh, F. (1983): A Survey of Recent Advances in Hierarchical Clustering Algorithms. The Computer Journal, 26, 329–340.
Rao, M.R. (1971): Cluster Analysis and Mathematical Programming. Journal of the American Statistical Association, 66, 622–626.
Reeves, C.R. (ed.) (1993): Modem Heuristic Techniques for Combinatorial Problems. Black well, London,
Rosenstiehl, P. (1967): L’arbre minimum d’un graphe. In: P. Rosenstielil (ed.): Theorie des Graphes. Rome, I.C.C., Paris, Dunod, 357–368.
Rosing, K. (1982): An Optimal Method for Solving the (generalized) Multi- Weber Problem. European Journal of Operational Research, 58, 414–426.
Shepard, R.N., and Arabie, P. (1979): Additive Clustering Representation of Similarities as Combinations of Discrete Overlapping Properties. Psychological Review, 86, 87–123.
Späth, H. (1980): Cluster Analysis Algorithms for Data Reduction and Classification of Objects. Ellis Horwood, Chichester.
Vinod, H.D. (1969): Integer Programming and the Theory of Grouping. Journal of the American Statistical Association, 64, 506–519.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1996 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Hansen, P., Jaumard, B. (1996). Computational Methods in Clustering from a Mathematical Programming Viewpoint. In: Bock, HH., Polasek, W. (eds) Data Analysis and Information Systems. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-80098-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-80098-6_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60774-8
Online ISBN: 978-3-642-80098-6
eBook Packages: Springer Book Archive