Clique Editing to Support Case Versus Control Discrimination

  • Riccardo DondiEmail author
  • Giancarlo Mauri
  • Italo Zoppis
Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 56)


We present a graph-based approach to support case vs control discrimination problems. The goal is to partition a given input graph in two sets, a clique and an independent set, such that there is no edge connecting a vertex of the clique with a vertex of the independent set. Following a parsimonious principle, we consider the problem that aims to modify the input graph into a most similar output graph that consists of a clique and an independent set (with no edge between the two sets). First, we present a theoretical result showing that the problem admits a polynomial-time approximation scheme. Then, motivated by the complexity of such an algorithm, we propose a genetic algorithm and we present an experimental analysis on simulated data.


  1. 1.
    Arora, S., Frieze, A.M., Kaplan, H.: A new rounding procedure for the assignment problem with applications to dense graph arrangement problems. In: 37th Annual Symposium on Foundations of Computer Science, FOCS ’96, Burlington, Vermont, USA, 14–16 October, 1996, pp. 21–30 (1996)Google Scholar
  2. 2.
    Baluja, S., Caruana, R.: Removing the genetics from the standard genetic algorithm. In: Prieditis, A., Russell, S.J. (eds.) Machine Learning, Proceedings of the Twelfth International Conference on Machine Learning, Tahoe City, California, USA, July 9–12, 1995, pp. 38–46. Morgan Kaufmann (1995)Google Scholar
  3. 3.
    Bansal, N., Blum, A., Chawla, S.: Correlation clustering. Mach. Learn. 56(1–3), 89–113 (2004)MathSciNetCrossRefzbMATHGoogle Scholar
  4. 4.
    Bollobas, B.: Random Graphs. Cambridge University Press (2001)Google Scholar
  5. 5.
    Bonizzoni, P., Della Vedova, G., Dondi, R., Jiang, T.: On the approximation of correlation clustering and consensus clustering. J. Comput. Syst. Sci. 74(5), 671–696 (2008)MathSciNetCrossRefzbMATHGoogle Scholar
  6. 6.
    Cava, C., Zoppis, I., Gariboldi, M., Castiglioni, I., Mauri, G., Antoniotti, M.: Copy–number alterations for tumor progression inference. Artif. Intell. Med. 104–109 (2013)Google Scholar
  7. 7.
    Cava, C., Zoppis, I., Gariboldi, M., Castiglioni, I., Mauri, G., Antoniotti, M.: Combined analysis of chromosomal instabilities and gene expression for colon cancer progression inference. J. Clin. Bioinform. 4, 2 (2014)CrossRefGoogle Scholar
  8. 8.
    Giotis, I., Guruswami, V.: Correlation clustering with a fixed number of clusters. Theory Comput. 2(13), 249–266 (2006).
  9. 9.
    Hüffner, F., Komusiewicz, C., Nichterlein, A.: Editing graphs into few cliques: complexity, approximation, and kernelization schemes. In: Algorithms and Data Structures - 14th International Symposium, WADS 2015, Victoria, BC, Canada, August 5–7, 2015. Proceedings, pp. 410–421 (2015)Google Scholar
  10. 10.
    Jiang, T., Kearney, P.E., Li, M.: A polynomial time approximation scheme for inferring evolutionary trees from quartet topologies and its application. SIAM J. Comput. 30(6), 1942–1961 (2000)MathSciNetCrossRefzbMATHGoogle Scholar
  11. 11.
    Kolaczyk, E.D.: Statistical Analysis of Network Data: Methods and Models. Springer (2009)Google Scholar
  12. 12.
    Long, B., Zhang, Z., Yu, P.S.: Relational Data Clustering: Models, Algorithms, and Applications. Chapman and Hall/CRC (2010)Google Scholar
  13. 13.
    Mitchell, M.: An Introduction to Genetic Algorithms. Complex Adaptive Systems. MIT Press, Cambridge (Mass.) (1996)Google Scholar
  14. 14.
    Scrucca, L.: GA: a package for genetic algorithms in R. J. Stat. Softw. 53(4), 1–37 (2013)CrossRefGoogle Scholar
  15. 15.
    Wirth, A.: Correlation clustering. In: Sammut, C., Webb, G.I. (eds.) Encyclopedia of Machine Learning, pp. 227–231. Springer (2010)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. 1.Dipartimento di Scienze Umane e SocialiUniversità Degli Studi di BergamoBergamoItaly
  2. 2.Dipartimento di Informatica, Sistemistica e ComunicazioneUniversità Degli Studi di Milano-BicoccaMilanoItaly

Personalised recommendations