Skip to main content

Computational Methods in Clustering from a Mathematical Programming Viewpoint

  • Conference paper
  • 403 Accesses

Summary

Cluster Analysis is at the crossroad of many disciplines, and has numerous and diverse methods and applications. Mathematical Programming (together with graph theory, complexity theory and data structures) can be used to give a coherent view of the field. This leads to define clustering paradigms, specify problems within these paradigms as mathematical programs, study their complexity, design polynomial algorithms for easy problems, non polynomial ones or heuristics for hard ones and evaluate results obtained. An overview of this line of research is given, stressing exact algorithms.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Aggarwal, A., Imai, H., Katoh, N., and Suri, S. (1991): Finding k Points with Minimum Diameter and Related Problems. Journal of Algorithms, 12, 38–56.

    Article  Google Scholar 

  • Arthanari, T.S., and Dodge, Y. (1981): Mathematical Programming in Statistics. Wiley, New York.

    Google Scholar 

  • Balas, E., and Xue, J. (1993): Weighted and Unweighted Maximum Clique Algorithms with Upper Bounds from Fractional Coloring. Research Report [num] MSSR-590, Carnegie Mellon.

    Google Scholar 

  • Bandelt, H.J., and Dress, A.W.M. (1989): Weak Hierarchies Associated with Similarity Measures: an Additive Clustering Technique. Bulletin of Mathematical Biology, 51, 133–166.

    Google Scholar 

  • Beasley, J.E. (1985): A Note on Solving Large p-median Problems. European Journal of Operational Research, 21, 270–273.

    Article  Google Scholar 

  • Benzecri, J.P. (1982): Construction d’une classification ascendante hiérarchique par la recherche en chaîne des voisins réciproques. Les Cahiers de FAnalyse des Données, 7, 209–218.

    Google Scholar 

  • Bezdek, J.C. (1981): Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York.

    Google Scholar 

  • Boros, E., and Hammer, P.L. (1989): On Clustering Problems with Connected Optima in Euclidean Spaces. Discrete Mathematics, 75, 81–88.

    Article  Google Scholar 

  • Brucker, P. (1978): On the Complexity of Clustering Problem. In: M. Beckmann and H.P. Kiinzi (eds.): Optimization and Operations Research. Springer, Heidelberg, 45–54.

    Google Scholar 

  • Bruynooghe, M. (1978): Classification ascendante hiérarchique des grands ensembles de données: un algorithme rapide fondé sur 1a. construction des voisinages réductibles. Les Cahiers de F Analyse des Données, 3, 7–33.

    Google Scholar 

  • Carraghan, R., and Pardalos, P.M. (1990): An Exact Algorithm for the Maximum Clique Problem. Operations Research Letters, 9, 375–382.

    Article  Google Scholar 

  • Chaillou, P., Hansen, P., and Mahieu, Y. (1989): Best Network Flow Bounds for the Quadratic Knapsack Problem. In: B. Simeone (ed.): Combinatorial Optimization. Lecture Notes in Mathematics, 1403, 225–235.

    Google Scholar 

  • Chandon, J.L., Lemaire, J., and Pouget, J. (1980): Construction de l’ultramétrique la plus proche d’une dissimilarité au sens des moindres carrés. RAIRO-Recherche Opérationnelle, 14, 157–170.

    Google Scholar 

  • Christofides, N., and Beasley, J.E. (1982): A Tree Search Algorithm for the p-median Problem. European Journal of Operational Research, 10, 196–204.

    Article  Google Scholar 

  • Cooper, L. (1964): Heuristic Methods for Location-Allocation Problems. SI AM Review, 6, 37–53.

    Article  Google Scholar 

  • Day, W.H.E., and Edelsbrunner, H. (1984): Efficient Algorithms for Ag- glomerative Hierarchical Clustering Methods. Journal of Classification, 1, 7–24.

    Article  Google Scholar 

  • Diehr,G. (1985): Evaluation of a Branch and Bound Algorithm for Clustering. SI AM Journal on Scientific and Statistical Computing, 6, 268–284.

    Article  Google Scholar 

  • Dorndorf, U., and Pesch, E. (1994): Fast Clustering Algorithms. ORSA Journal on Computing 6, 141–153.

    Google Scholar 

  • Delattre, M., and Hansen, P. (1980): Bicriterion Cluster Analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI 2, 277–291.

    Article  Google Scholar 

  • Erlenkotter, D. (1978): A Dual-based Procedure for Uncapacitated Facility Location. Operations Research, 26, 1590–1602.

    Article  Google Scholar 

  • Gallo, G., Hammer, P.L., and Simeone, B. (1980): Quadratic Knapsack Problem. Mathematical Programming Study, 12, 132–149.

    Google Scholar 

  • Gelinas, S., Hansen, P., and Jaumard, B. (1995): A Labelling Algorithm for Minimum Sum of Diameters Partitioning of Graphs. In: I. Cox, P. Hansen and B. Julesz (eds): Paiiitioning Data Sets. American Mathematical Society, Providence, 89–96.

    Google Scholar 

  • Gordon, A. (1981): Classification. Chapman and Hall, London.

    Google Scholar 

  • Gordon, A. (1994): Clustering Algorithms and Cluster Validation. In: P. Dir- schedl and R. Osterman (eds.): Computational Statistics. Physica-Verlag, Heidelberg, 497–512.

    Chapter  Google Scholar 

  • Gower, J.C., and Ross, G.J.S. (1969): Minimum Spanning Trees and Single Linkage Cluster Analysis. Applied Statistics, 18, 54–64.

    Article  Google Scholar 

  • Guenoche, A. (1989): Partitions with Minimum Diameter. Paper presented at the second Conference of the International Federation of Classification Societies (IFCS-89). Charlottesville.

    Google Scholar 

  • Guenoche, A., Hansen, P., and Jaumard, B. (1991): Efficient Algorithms for Divisive Hierarchical Clustering with the Diameter Criterion. Journal of Classification, 8, 5–30.

    Article  Google Scholar 

  • Grotschel, M., and Wakabayashi, Y. (1989): A Cutting Plane Algorithm for a Clustering Problem. Mathematical Programming, 45, 59–96.

    Article  Google Scholar 

  • Grotschel, M., and Wakabayashi, Y. (1990): Facets of the Clique Partitioning Polytope. Matheinatical Programming, 47, 367–387.

    Article  Google Scholar 

  • Hansen, P., and Delattre, M. (1978): Complete-Link Cluster Analysis by Graph Coloring. Journal of the American Statistical Association, 73, 397–403.

    Article  Google Scholar 

  • Hansen, P., and Jaumard, B. (1987): Minimum Sum of Diameters Clustering. Journal of Classification, 4, 215–226.

    Article  Google Scholar 

  • Hansen, P., Jaumard, B., and Frank, O. (1989): Maximum Sum-of-Splits Clustering. Journal of Classification, 6, 177–193.

    Article  Google Scholar 

  • Hansen, P., Jaumard, B., and Krau, S. (1995): A Column Generation Algorithm for the Multisource Weber Problem. In preparation.

    Google Scholar 

  • Hansen, P., Jaumard, B., and Mladenovic, N. (1995a): How to Choose k Entities among N. In: I. Cox, P. Hansen and B. Julesz (eds): Partitioning Data Sets. American Mathematical Society, Providence, 105–116.

    Google Scholar 

  • Hansen, P., Jaumard, B., and Mladenovic, N. (1995b): Sequential Cluster Analysis with Split and Radius Criteria. In preparation.

    Google Scholar 

  • Hansen, P., Jaumard, B., and Sanlaville, E. (1993): Weight Constrained Minimum Sum-of-Stars Clustering. Les Cahiers du GERAD, G-93-38, Montreal, Canada. To appear in Journal of Classification.

    Google Scholar 

  • Hansen, P., Jaumard, B., and da Silva, E. (1991): Average-Linkage Divisive Hierarchical Clustering. Les Cahiers du GERAD, G-91-55, Montreal, Canada. To appear in Journal of Classification.

    Google Scholar 

  • Hansen, P., Jaumard, B., Simeone, B., and Döring, V. (1993): Maximum Split Clustering under Connectivity Constraints. Les Cahiers du GERAD, G-93-06, Montreal, Canada.

    Google Scholar 

  • Hansen, P., Minoux, M., and Labbe, M. (1987): Extension de la programmation linéaire généralisée au cas des programmes mixtes. Comptes Rendus de VAcadémie des Sciences, Paris, 305, 569–572.

    Google Scholar 

  • Hansen, P., Labbe, M., and Minoux, M. (1995): Thep-center Sum Location Problem. Les Cahiers du GERAD, G-94-30, Montréal, Canada.

    Google Scholar 

  • Hansen, P., and Mladenovic, N. (1992): Two Algorithms for Maximum Cliques in Dense Graphs. Les Cahiers du GERAD, G-92-18, Montréal, Canada. To appear in European Journal of Operational Research.

    Google Scholar 

  • Hartigan, J.A. (1975): Clustering Algorithms. Wiley, New York.

    Google Scholar 

  • Hubert, L.J. (1974): Some Applications of Graph Theory to Clustering. Psy- cliometrika 39, 283–309.

    Google Scholar 

  • Hwang, F.K., Rothblum, U.G., and Yao, Y.-C. (1995): Localizing Combinatorial Properties of Partitions ATamp;T Bell Labs Report.

    Google Scholar 

  • Jambu, M. (1976): Classification automatique pour Vanalyse des données. Tome 1. Dunod, Paris.

    Google Scholar 

  • Jambu, M. (1991): Exploratory and Multivariate Data Analysis. Academic Press, New York.

    Google Scholar 

  • Janowitz, M.F. (1978): An Order Theoretic Model for Cluster Analysis. SIAM Journal on Applied Mathematics 34, 55–72.

    Google Scholar 

  • Jensen, R.E. (1969): A Dynamic Programming Algorithm for Cluster Analysis. Operations Research 17, 1034–1057.

    Article  Google Scholar 

  • Johnson, E.L., Mehrotra, A., and Nemhauser, G.L. (1993): Min-cut Clustering. Mathematical Programming, 62, 133–151.

    Article  Google Scholar 

  • Kariv, and Hakimi, S.L. (1979): An Algorithmic Approach to Network Location Problems. The p-Medians. SIAM Journal on Applied Mathematics, 37, 539–560.

    Article  Google Scholar 

  • Kaufman, L., and Rousseeuw, P.J. (1990): Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York.

    Google Scholar 

  • Klein, G., and Aronson, J.E. (1991): Optimal Clustering: A Model and Method. Naval Research Logistics, 38, 311–323.

    Article  Google Scholar 

  • Korkel, M. (1986): On the Exact Solution of Large Scale Simple Plant Location with Single Source Constraints. Journal of the Operational Research Society, 37, 495–500.

    Google Scholar 

  • Krivanek, M. (1986): On the Computational Complexity of Clustering. In: E. Diday etal. (eds): Data Analysis and Informatics, 4. North-Holland, Amsterdam, 89–96.

    Google Scholar 

  • Krivanek, M., and Moravek, J. (1986): NP-Hard Problems in Hierarchical- Tree Clustering. Acta Informatica, 23, 311–323.

    Article  Google Scholar 

  • Lance, G.N., and Williams, W.T. (1967): A General Theory of Classificatory Sorting Strategies. 1. Hierarchical Systems. The Computer Journal, 9, 373–380.

    Google Scholar 

  • Leclerc, B. (1994): The Residuation Model for the Ordinal Construction of Dissimilarities and other Valued Objects. In: B. Van Cutsem (ed.): Classification and Dissimilarity Analysis. Lecture Notes in Statistics, no. 93, Springer-Verlag, New York, 149–172.

    Google Scholar 

  • Leclerc, B. (1981): Description Combinatoire des Ultramétriques. Mathématiques et Sciences Humaines 73, 5–31.

    Google Scholar 

  • Marcotorchino, J.F., and Michaud, P. (1979): Optimisation en analyse ordinale des données. Paris, Masson.

    Google Scholar 

  • Martello, S., Soumis, F., and Toth, P. (1992): An Exact Algorithm for Makespan Minimization on Unrelated Parallel Machines. In: E. Balas, G. Cornue- jols and R. Kannan (eds): Proceedings Second IPCO Conference. Carnegie-Mellon University, 181–200.

    Google Scholar 

  • Massart, D.L., Plastria, F., and Kaufman, L. (1983): Non-Hierarchical Clustering with MASLOC. Pattern Recognition, 16, 507–516.

    Article  Google Scholar 

  • Massart, D.L., and Kaufman, L. (1983): The Interpretation of Analytical Chemical Data by the use of Cluster Analysis. Wiley, New York.

    Google Scholar 

  • Minieka, E. (1970): The M-Center Problem. SI AM Review, 12, 138–139.

    Article  Google Scholar 

  • Mirkin, B. (1987): Additive Clustering and Qualitative Factor Analysis Methods for Similarity matrices. Journal of Classification, 4, 7–31.

    Article  Google Scholar 

  • Monma, C., and Suri, S. (1991): Partitioning Points and Graphs to Minimize the Maximum or the Sum of Diameters. In: Y. Alari, G. Chartrand, O.R. 01- lerman and A.J. Schwenk (eds): Proceedings of the 6 th Quadrennial International Conference on the Theory and Applications of Graphs. Wiley, New York.

    Google Scholar 

  • Mulvey, J.M., and Beck, M.P. (1984): Solving Capacitated Clustering Problems. European Journal of Operational Research, 18, 339–398.

    Article  Google Scholar 

  • Murtagh, F. (1983): A Survey of Recent Advances in Hierarchical Clustering Algorithms. The Computer Journal, 26, 329–340.

    Google Scholar 

  • Rao, M.R. (1971): Cluster Analysis and Mathematical Programming. Journal of the American Statistical Association, 66, 622–626.

    Article  Google Scholar 

  • Reeves, C.R. (ed.) (1993): Modem Heuristic Techniques for Combinatorial Problems. Black well, London,

    Google Scholar 

  • Rosenstiehl, P. (1967): L’arbre minimum d’un graphe. In: P. Rosenstielil (ed.): Theorie des Graphes. Rome, I.C.C., Paris, Dunod, 357–368.

    Google Scholar 

  • Rosing, K. (1982): An Optimal Method for Solving the (generalized) Multi- Weber Problem. European Journal of Operational Research, 58, 414–426.

    Article  Google Scholar 

  • Shepard, R.N., and Arabie, P. (1979): Additive Clustering Representation of Similarities as Combinations of Discrete Overlapping Properties. Psychological Review, 86, 87–123.

    Article  Google Scholar 

  • Späth, H. (1980): Cluster Analysis Algorithms for Data Reduction and Classification of Objects. Ellis Horwood, Chichester.

    Google Scholar 

  • Vinod, H.D. (1969): Integer Programming and the Theory of Grouping. Journal of the American Statistical Association, 64, 506–519.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Springer-Verlag Berlin · Heidelberg

About this paper

Cite this paper

Hansen, P., Jaumard, B. (1996). Computational Methods in Clustering from a Mathematical Programming Viewpoint. In: Bock, HH., Polasek, W. (eds) Data Analysis and Information Systems. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-80098-6_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-80098-6_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-60774-8

  • Online ISBN: 978-3-642-80098-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics