Abstract
This paper addresses the Cluster Editing problem. The objective of this problem is to transform a graph into a disjoint union of cliques using a minimum number of edge modifications. This problem has been considered in the context of bioinformatics, document clustering, image segmentation, consensus clustering, qualitative data clustering among others. Here, we focus on the Integer Linear Programming (ILP) formulation of this problem. The ILP creates models with a large number of constraints. This limits the size of the problems that can be optimally solved. In order to overcome this limitation, this paper proposes a novel preprocessing technique to construct a reduced model that feasibly maintains the optimal solution set. In comparison to the original model, the reduced model preserves the optimal solution and achieves considerable computational time speed-up in the experiments performed on different datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ailon, N., Charikar, M., Newman, A.: Aggregating inconsistent information. J. ACM 55, 1–27 (2008). https://doi.org/10.1145/1411509.1411513
Bansal, N., Blum, A., Chawla, S.: Correlation clustering. Mach. Learn. 56, 89–113 (2004). https://doi.org/10.1023/B:MACH.0000033116.57574.95
Bastos, L., Ochi, L.S., Protti, F., Subramanian, A., Martins, I.C., Pinheiro, R.G.S.: Efficient algorithms for cluster editing. J. Comb. Optim. 31(1), 347–371 (2016). https://doi.org/10.1007/s10878-014-9756-7
Ben-Dor, A., Shamir, R., Yakhini, Z.: Clustering gene expression patterns. J. Comput. Biol. 6(3–4), 281–297 (1999). https://doi.org/10.1089/106652799318274
Böcker, S., Baumbach, J.: Cluster editing. In: Bonizzoni, P., Brattka, V., Löwe, B. (eds.) CiE 2013. LNCS, vol. 7921, pp. 33–44. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39053-1_5
Böcker, S., Briesemeister, S., Klau, G.W.: Exact algorithms for cluster editing: evaluation and experiments. Algorithmica 60, 316–334 (2011). https://doi.org/10.1007/s00453-009-9339-7
Charikar, M., Guruswami, V., Wirth, A.: Clustering with qualitative information. J. Comput. Syst. Sci. 71, 360–383 (2005). https://doi.org/10.1016/j.jcss.2004.10.012
Dinh, T.N., Thai, M.T.: Toward optimal community detection: from trees to general weighted networks. Internet Math. 11, 181–200 (2014). https://doi.org/10.1080/15427951.2014.950875
Fellows, M., Langston, M., Rosamond, F., Shaw, P.: Efficient parameterized preprocessing for cluster editing. In: Csuhaj-Varjú, E., Ésik, Z. (eds.) FCT 2007. LNCS, vol. 4639, pp. 312–321. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-74240-1_27
Gionis, A., Mannila, H., Tsaparas, P.: Clustering aggregation. ACM Trans. Knowl. Discov. Data (TKDD) 1(1), 4 (2007). https://doi.org/10.1145/1217299.1217303
Grötschel, M., Wakabayashi, Y.: A cutting plane algorithm for a clustering problem. Math. Program. 45, 59–96 (1989). https://doi.org/10.1007/bf01589097
Guo, J.: A more effective linear kernelization for cluster editing. Theor. Comput. Sci. 410, 718–726 (2009). https://doi.org/10.1016/j.tcs.2008.10.021
IBM: IBM ILOG CPLEX 12.7.1 (1987–2017)
Kim, S., Yoo, C.D., Nowozin, S., Kohli, P.: Image segmentation using higher-order correlation clustering. IEEE Trans. Pattern Anal. Mach. Intell. 36, 1761–1774 (2014). https://doi.org/10.1109/tpami.2014.2303095
Lancichinetti, A., Fortunato, S., Radicchi, F.: Benchmark graphs for testing community detection algorithms. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 78, 046110 (2008). https://doi.org/10.1103/physreve.78.046110
Miyauchi, A., Sukegawa, N.: Redundant constraints in the standard formulation for the clique partitioning problem. Optim. Lett. 9, 199–207 (2014). https://doi.org/10.1007/s11590-014-0754-6
Nguyen, D.P., Minoux, M., Nguyen, V.H., Nguyen, T.H., Sirdey, R.: Improved compact formulations for a wide class of graph partitioning problems in sparse graphs. Discrete Optim. 25, 175–188 (2017). https://doi.org/10.1016/j.disopt.2016.05.003
Protti, F., da Silva, M.D., Szwarcfiter, J.L.: Applying modular decomposition to parameterized cluster editing problems. Theory Comput. Syst. 44, 91–104 (2007). https://doi.org/10.1007/s00224-007-9032-7
Shamir, R., Sharan, R., Tsur, D.: Cluster graph modification problems. In: Goos, G., Hartmanis, J., van Leeuwen, J., Kučera, L. (eds.) WG 2002. LNCS, vol. 2573, pp. 379–390. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-36379-3_33
Acknowledgements
The authors thanks FAPESP (Grant No. 2011/18496-7), CNPq (Grant No. 310908/2015-9 and 301836/2014-0), CAPES and IBM for support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Lorena, L.H.N., Quiles, M.G., de Carvalho, A.C.P.d.L.F., Lorena, L.A.N. (2018). Preprocessing Technique for Cluster Editing via Integer Linear Programming. In: Huang, DS., Bevilacqua, V., Premaratne, P., Gupta, P. (eds) Intelligent Computing Theories and Application. ICIC 2018. Lecture Notes in Computer Science(), vol 10954. Springer, Cham. https://doi.org/10.1007/978-3-319-95930-6_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-95930-6_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95929-0
Online ISBN: 978-3-319-95930-6
eBook Packages: Computer ScienceComputer Science (R0)