Computational Methods in Clustering from a Mathematical Programming Viewpoint

Hansen, Pierre; Jaumard, Brigitte

doi:10.1007/978-3-642-80098-6_3

Computational Methods in Clustering from a Mathematical Programming Viewpoint

Pierre Hansen⁶ &
Brigitte Jaumard⁷

Conference paper

403 Accesses

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

Summary

Cluster Analysis is at the crossroad of many disciplines, and has numerous and diverse methods and applications. Mathematical Programming (together with graph theory, complexity theory and data structures) can be used to give a coherent view of the field. This leads to define clustering paradigms, specify problems within these paradigms as mathematical programs, study their complexity, design polynomial algorithms for easy problems, non polynomial ones or heuristics for hard ones and evaluate results obtained. An overview of this line of research is given, stressing exact algorithms.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aggarwal, A., Imai, H., Katoh, N., and Suri, S. (1991): Finding k Points with Minimum Diameter and Related Problems. Journal of Algorithms, 12, 38–56.
Article Google Scholar
Arthanari, T.S., and Dodge, Y. (1981): Mathematical Programming in Statistics. Wiley, New York.
Google Scholar
Balas, E., and Xue, J. (1993): Weighted and Unweighted Maximum Clique Algorithms with Upper Bounds from Fractional Coloring. Research Report [num] MSSR-590, Carnegie Mellon.
Google Scholar
Bandelt, H.J., and Dress, A.W.M. (1989): Weak Hierarchies Associated with Similarity Measures: an Additive Clustering Technique. Bulletin of Mathematical Biology, 51, 133–166.
Google Scholar
Beasley, J.E. (1985): A Note on Solving Large p-median Problems. European Journal of Operational Research, 21, 270–273.
Article Google Scholar
Benzecri, J.P. (1982): Construction d’une classification ascendante hiérarchique par la recherche en chaîne des voisins réciproques. Les Cahiers de FAnalyse des Données, 7, 209–218.
Google Scholar
Bezdek, J.C. (1981): Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York.
Google Scholar
Boros, E., and Hammer, P.L. (1989): On Clustering Problems with Connected Optima in Euclidean Spaces. Discrete Mathematics, 75, 81–88.
Article Google Scholar
Brucker, P. (1978): On the Complexity of Clustering Problem. In: M. Beckmann and H.P. Kiinzi (eds.): Optimization and Operations Research. Springer, Heidelberg, 45–54.
Google Scholar
Bruynooghe, M. (1978): Classification ascendante hiérarchique des grands ensembles de données: un algorithme rapide fondé sur 1a. construction des voisinages réductibles. Les Cahiers de F Analyse des Données, 3, 7–33.
Google Scholar
Carraghan, R., and Pardalos, P.M. (1990): An Exact Algorithm for the Maximum Clique Problem. Operations Research Letters, 9, 375–382.
Article Google Scholar
Chaillou, P., Hansen, P., and Mahieu, Y. (1989): Best Network Flow Bounds for the Quadratic Knapsack Problem. In: B. Simeone (ed.): Combinatorial Optimization. Lecture Notes in Mathematics, 1403, 225–235.
Google Scholar
Chandon, J.L., Lemaire, J., and Pouget, J. (1980): Construction de l’ultramétrique la plus proche d’une dissimilarité au sens des moindres carrés. RAIRO-Recherche Opérationnelle, 14, 157–170.
Google Scholar
Christofides, N., and Beasley, J.E. (1982): A Tree Search Algorithm for the p-median Problem. European Journal of Operational Research, 10, 196–204.
Article Google Scholar
Cooper, L. (1964): Heuristic Methods for Location-Allocation Problems. SI AM Review, 6, 37–53.
Article Google Scholar
Day, W.H.E., and Edelsbrunner, H. (1984): Efficient Algorithms for Ag- glomerative Hierarchical Clustering Methods. Journal of Classification, 1, 7–24.
Article Google Scholar
Diehr,G. (1985): Evaluation of a Branch and Bound Algorithm for Clustering. SI AM Journal on Scientific and Statistical Computing, 6, 268–284.
Article Google Scholar
Dorndorf, U., and Pesch, E. (1994): Fast Clustering Algorithms. ORSA Journal on Computing 6, 141–153.
Google Scholar
Delattre, M., and Hansen, P. (1980): Bicriterion Cluster Analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI 2, 277–291.
Article Google Scholar
Erlenkotter, D. (1978): A Dual-based Procedure for Uncapacitated Facility Location. Operations Research, 26, 1590–1602.
Article Google Scholar
Gallo, G., Hammer, P.L., and Simeone, B. (1980): Quadratic Knapsack Problem. Mathematical Programming Study, 12, 132–149.
Google Scholar
Gelinas, S., Hansen, P., and Jaumard, B. (1995): A Labelling Algorithm for Minimum Sum of Diameters Partitioning of Graphs. In: I. Cox, P. Hansen and B. Julesz (eds): Paiiitioning Data Sets. American Mathematical Society, Providence, 89–96.
Google Scholar
Gordon, A. (1981): Classification. Chapman and Hall, London.
Google Scholar
Gordon, A. (1994): Clustering Algorithms and Cluster Validation. In: P. Dir- schedl and R. Osterman (eds.): Computational Statistics. Physica-Verlag, Heidelberg, 497–512.
Chapter Google Scholar
Gower, J.C., and Ross, G.J.S. (1969): Minimum Spanning Trees and Single Linkage Cluster Analysis. Applied Statistics, 18, 54–64.
Article Google Scholar
Guenoche, A. (1989): Partitions with Minimum Diameter. Paper presented at the second Conference of the International Federation of Classification Societies (IFCS-89). Charlottesville.
Google Scholar
Guenoche, A., Hansen, P., and Jaumard, B. (1991): Efficient Algorithms for Divisive Hierarchical Clustering with the Diameter Criterion. Journal of Classification, 8, 5–30.
Article Google Scholar
Grotschel, M., and Wakabayashi, Y. (1989): A Cutting Plane Algorithm for a Clustering Problem. Mathematical Programming, 45, 59–96.
Article Google Scholar
Grotschel, M., and Wakabayashi, Y. (1990): Facets of the Clique Partitioning Polytope. Matheinatical Programming, 47, 367–387.
Article Google Scholar
Hansen, P., and Delattre, M. (1978): Complete-Link Cluster Analysis by Graph Coloring. Journal of the American Statistical Association, 73, 397–403.
Article Google Scholar
Hansen, P., and Jaumard, B. (1987): Minimum Sum of Diameters Clustering. Journal of Classification, 4, 215–226.
Article Google Scholar
Hansen, P., Jaumard, B., and Frank, O. (1989): Maximum Sum-of-Splits Clustering. Journal of Classification, 6, 177–193.
Article Google Scholar
Hansen, P., Jaumard, B., and Krau, S. (1995): A Column Generation Algorithm for the Multisource Weber Problem. In preparation.
Google Scholar
Hansen, P., Jaumard, B., and Mladenovic, N. (1995a): How to Choose k Entities among N. In: I. Cox, P. Hansen and B. Julesz (eds): Partitioning Data Sets. American Mathematical Society, Providence, 105–116.
Google Scholar
Hansen, P., Jaumard, B., and Mladenovic, N. (1995b): Sequential Cluster Analysis with Split and Radius Criteria. In preparation.
Google Scholar
Hansen, P., Jaumard, B., and Sanlaville, E. (1993): Weight Constrained Minimum Sum-of-Stars Clustering. Les Cahiers du GERAD, G-93-38, Montreal, Canada. To appear in Journal of Classification.
Google Scholar
Hansen, P., Jaumard, B., and da Silva, E. (1991): Average-Linkage Divisive Hierarchical Clustering. Les Cahiers du GERAD, G-91-55, Montreal, Canada. To appear in Journal of Classification.
Google Scholar
Hansen, P., Jaumard, B., Simeone, B., and Döring, V. (1993): Maximum Split Clustering under Connectivity Constraints. Les Cahiers du GERAD, G-93-06, Montreal, Canada.
Google Scholar
Hansen, P., Minoux, M., and Labbe, M. (1987): Extension de la programmation linéaire généralisée au cas des programmes mixtes. Comptes Rendus de VAcadémie des Sciences, Paris, 305, 569–572.
Google Scholar
Hansen, P., Labbe, M., and Minoux, M. (1995): Thep-center Sum Location Problem. Les Cahiers du GERAD, G-94-30, Montréal, Canada.
Google Scholar
Hansen, P., and Mladenovic, N. (1992): Two Algorithms for Maximum Cliques in Dense Graphs. Les Cahiers du GERAD, G-92-18, Montréal, Canada. To appear in European Journal of Operational Research.
Google Scholar
Hartigan, J.A. (1975): Clustering Algorithms. Wiley, New York.
Google Scholar
Hubert, L.J. (1974): Some Applications of Graph Theory to Clustering. Psy- cliometrika 39, 283–309.
Google Scholar
Hwang, F.K., Rothblum, U.G., and Yao, Y.-C. (1995): Localizing Combinatorial Properties of Partitions ATamp;T Bell Labs Report.
Google Scholar
Jambu, M. (1976): Classification automatique pour Vanalyse des données. Tome 1. Dunod, Paris.
Google Scholar
Jambu, M. (1991): Exploratory and Multivariate Data Analysis. Academic Press, New York.
Google Scholar
Janowitz, M.F. (1978): An Order Theoretic Model for Cluster Analysis. SIAM Journal on Applied Mathematics 34, 55–72.
Google Scholar
Jensen, R.E. (1969): A Dynamic Programming Algorithm for Cluster Analysis. Operations Research 17, 1034–1057.
Article Google Scholar
Johnson, E.L., Mehrotra, A., and Nemhauser, G.L. (1993): Min-cut Clustering. Mathematical Programming, 62, 133–151.
Article Google Scholar
Kariv, and Hakimi, S.L. (1979): An Algorithmic Approach to Network Location Problems. The p-Medians. SIAM Journal on Applied Mathematics, 37, 539–560.
Article Google Scholar
Kaufman, L., and Rousseeuw, P.J. (1990): Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York.
Google Scholar
Klein, G., and Aronson, J.E. (1991): Optimal Clustering: A Model and Method. Naval Research Logistics, 38, 311–323.
Article Google Scholar
Korkel, M. (1986): On the Exact Solution of Large Scale Simple Plant Location with Single Source Constraints. Journal of the Operational Research Society, 37, 495–500.
Google Scholar
Krivanek, M. (1986): On the Computational Complexity of Clustering. In: E. Diday etal. (eds): Data Analysis and Informatics, 4. North-Holland, Amsterdam, 89–96.
Google Scholar
Krivanek, M., and Moravek, J. (1986): NP-Hard Problems in Hierarchical- Tree Clustering. Acta Informatica, 23, 311–323.
Article Google Scholar
Lance, G.N., and Williams, W.T. (1967): A General Theory of Classificatory Sorting Strategies. 1. Hierarchical Systems. The Computer Journal, 9, 373–380.
Google Scholar
Leclerc, B. (1994): The Residuation Model for the Ordinal Construction of Dissimilarities and other Valued Objects. In: B. Van Cutsem (ed.): Classification and Dissimilarity Analysis. Lecture Notes in Statistics, no. 93, Springer-Verlag, New York, 149–172.
Google Scholar
Leclerc, B. (1981): Description Combinatoire des Ultramétriques. Mathématiques et Sciences Humaines 73, 5–31.
Google Scholar
Marcotorchino, J.F., and Michaud, P. (1979): Optimisation en analyse ordinale des données. Paris, Masson.
Google Scholar
Martello, S., Soumis, F., and Toth, P. (1992): An Exact Algorithm for Makespan Minimization on Unrelated Parallel Machines. In: E. Balas, G. Cornue- jols and R. Kannan (eds): Proceedings Second IPCO Conference. Carnegie-Mellon University, 181–200.
Google Scholar
Massart, D.L., Plastria, F., and Kaufman, L. (1983): Non-Hierarchical Clustering with MASLOC. Pattern Recognition, 16, 507–516.
Article Google Scholar
Massart, D.L., and Kaufman, L. (1983): The Interpretation of Analytical Chemical Data by the use of Cluster Analysis. Wiley, New York.
Google Scholar
Minieka, E. (1970): The M-Center Problem. SI AM Review, 12, 138–139.
Article Google Scholar
Mirkin, B. (1987): Additive Clustering and Qualitative Factor Analysis Methods for Similarity matrices. Journal of Classification, 4, 7–31.
Article Google Scholar
Monma, C., and Suri, S. (1991): Partitioning Points and Graphs to Minimize the Maximum or the Sum of Diameters. In: Y. Alari, G. Chartrand, O.R. 01- lerman and A.J. Schwenk (eds): Proceedings of the 6 th Quadrennial International Conference on the Theory and Applications of Graphs. Wiley, New York.
Google Scholar
Mulvey, J.M., and Beck, M.P. (1984): Solving Capacitated Clustering Problems. European Journal of Operational Research, 18, 339–398.
Article Google Scholar
Murtagh, F. (1983): A Survey of Recent Advances in Hierarchical Clustering Algorithms. The Computer Journal, 26, 329–340.
Google Scholar
Rao, M.R. (1971): Cluster Analysis and Mathematical Programming. Journal of the American Statistical Association, 66, 622–626.
Article Google Scholar
Reeves, C.R. (ed.) (1993): Modem Heuristic Techniques for Combinatorial Problems. Black well, London,
Google Scholar
Rosenstiehl, P. (1967): L’arbre minimum d’un graphe. In: P. Rosenstielil (ed.): Theorie des Graphes. Rome, I.C.C., Paris, Dunod, 357–368.
Google Scholar
Rosing, K. (1982): An Optimal Method for Solving the (generalized) Multi- Weber Problem. European Journal of Operational Research, 58, 414–426.
Article Google Scholar
Shepard, R.N., and Arabie, P. (1979): Additive Clustering Representation of Similarities as Combinations of Discrete Overlapping Properties. Psychological Review, 86, 87–123.
Article Google Scholar
Späth, H. (1980): Cluster Analysis Algorithms for Data Reduction and Classification of Objects. Ellis Horwood, Chichester.
Google Scholar
Vinod, H.D. (1969): Integer Programming and the Theory of Grouping. Journal of the American Statistical Association, 64, 506–519.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dept. MQ, École des Hautes Études Commerciales, 5255 avenue Decelles, Montréal, Québec, H3T 1V6, Canada
Pierre Hansen
École Polytechnique de Montréal, Montréal, Canada
Brigitte Jaumard

Authors

Pierre Hansen
View author publications
You can also search for this author in PubMed Google Scholar
Brigitte Jaumard
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institut für Statistik und Wirtschaftsmathematik, Rheinisch-Westfälische Technische Hochschule Aachen (RWTH), Wüllnerstr. 3, D-52056, Aachen, Germany
Hans-Hermann Bock
Institut für Statistik und Ökonometrie, Universität Basel, Holbeinstr. 12, CH-4051, Basel, Switzerland
Wolfgang Polasek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hansen, P., Jaumard, B. (1996). Computational Methods in Clustering from a Mathematical Programming Viewpoint. In: Bock, HH., Polasek, W. (eds) Data Analysis and Information Systems. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-80098-6_3

Download citation

DOI: https://doi.org/10.1007/978-3-642-80098-6_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60774-8
Online ISBN: 978-3-642-80098-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics