Simultaneous Clustering: A Survey
Conference paper
Abstract
Although most of the clustering literature focuses on one-sided clustering algorithms, simultaneous clustering has recently gained attention as a powerful tool that allows to circumvent some limitations of classical clustering approach. Simultaneous clustering methods perform clustering in the two dimensions simultaneously. In this paper, we introduce a large number of existing simultaneous clustering approaches applied in bioinformatics as well as in text mining, web mining and information retrieval and classify them in accordance with the methods used to perform the clustering and the target applications.
Keywords
Simultaneous clustering Biclusters Block clustering Download
to read the full conference paper text
References
- 1.Ahmad, W., Khokhar, A.: cHawk: an efficient biclustering algorithm based on bipartite graph crossing minimization. VLDB. ACM, New York (2007)Google Scholar
- 2.Balbi, S., Miele, R., Scepi, G.: Clustering of documents from a two-way viewpoint. In: 10th Int. Conf. on Statistical Analysis of Textual Data (2010)Google Scholar
- 3.Bichot, C.E.: Co-clustering documents and words by minimizing the normalized cut objective function. JMMA 9, 131–147 (2010)MathSciNetMATHGoogle Scholar
- 4.Ben-Dor, A., Chor, B., Karp, R.: Discovering local structure in gene expression data: The order–preserving submatrix problem. J. of Comput. Biol. 10, 373–384 (2003)CrossRefGoogle Scholar
- 5.Busygin, S., Jacobsen, G., Kramer, E.: Double conjugated clustering applied to leukemia microarray data. In: 2nd SIAM Int. Conf. on Data Mining (2002)Google Scholar
- 6.Caldas, J., Kaski, S.: Bayesian biclustering with the plaid model. In: IEEE Intern. Workshop on Machine Learning for Signal Processing, pp. 291–296 (2008)Google Scholar
- 7.Califano, A., Stolovitzky, G., Tu, Y.: Analysis of gene expression microarays for phenotype classification. In: Int. Conf. on Computational Molecular Biology (2000)Google Scholar
- 8.Charrad, M., Lechevallier, Y., Ahmed, M.b., Saporta, G.: Block Clustering for Web Pages Categorization. In: Corchado, E., Yin, H. (eds.) IDEAL 2009. LNCS, vol. 5788, pp. 260–267. Springer, Heidelberg (2009)CrossRefGoogle Scholar
- 9.Charrad, M.: une approche generique pour l’analyse croisant usage et contenu de sites Web par des methodes de bipartitionnement. PhD Thesis, Paris (2010)Google Scholar
- 10.Cheng, Y., Church, G.M.: Biclustering of expression data. In: 8th Int. Conf. on Intelligent Systems for Molecular Biology, pp. 93–103 (2000)Google Scholar
- 11.Costa, G., Manco, G., Ortale, R.: A hierarchical model-based approach to co-clustering high-dimensional data. In: ACM sym. on App. comput., pp. 886–890 (2008)Google Scholar
- 12.Dhillon, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: 7th ACM SIGKDD 2001, California, pp. 269–274 (2001)Google Scholar
- 13.Dhillon, I.S., Mallela, S., Modha, D.S.: Information-theoretic co-clustering. In: ACM SIGKDD, pp. 89–98. ACM, Washington DC (2003)Google Scholar
- 14.Getoor, L., Friedman, N., Koller, D., Taskar, B.: Learning probabilistic models of link structure. J. Mach. Learn. Res. 3, 679–707 (2002)MathSciNetMATHGoogle Scholar
- 15.Getz, G., Levine, E., Domany, E.: Coupled two-way clustering analysis of gene microarray data. Proc. of the Natural Academy of Sciences USA (2000)Google Scholar
- 16.Govaert, G., Nadif, M.: Clustering with block mixture models. J. of the Pattern Recognition, 463–473 (2003)Google Scholar
- 17.Govaert, G.: Classification croisee. Th. de doctorat d’Etat, Paris (1983)Google Scholar
- 18.Grimal, C., Bisson, G.: Classification a partir d’une collection de matrices. CAp2010 (2010)Google Scholar
- 19.Gu, J.: Bayesian biclustering of gene expression data. BMC Genomics (2008).Google Scholar
- 20.Hartigan, J.A.: Direct clustering of a data matrix. J. of American Statistical Association 67(337), 123–129 (1972)CrossRefGoogle Scholar
- 21.Hochreiter, S., Bodenhofer, U., Heusel, M., Mayr, A.: FABIA: factor analysis for bicluster acquisition. Bioinformatics journal 26(12), 1520–1527 (2010)CrossRefGoogle Scholar
- 22.Lazzeroni, L., Owen, A.: Plaid models for gene expression data. Technical report, Stanford University (2002)Google Scholar
- 23.Madeira, S.C., Oliveira, A.L.: Biclustering Algorithms for Biological Data Analysis: A Survey. IEEE/ACM Trans. on Comp. Biol. and Bioinfor., 24–45 (2004)Google Scholar
- 24.Madeira, S.C., Teixeira, M.C.: Identification of regulatory modules in time series gene expression data using a linear time biclustering algorithm. IEEE ACM (2010)Google Scholar
- 25.Murali, T.M., Kasif, S.: Extracting conserved gene expression motifs from gene expression data. In: Pacific Sym. on Biocomputing, Hawaii, USA, pp. 77–88 (2003)Google Scholar
- 26.Nadif, M., Govaert, G.: Block clustering of contingency table and mixture model. In: Famili, A.F., Kok, J.N., Peña, J.M., Siebes, A., Feelders, A. (eds.) IDA 2005. LNCS, vol. 3646, pp. 249–259. Springer, Heidelberg (2005)CrossRefGoogle Scholar
- 27.Prelic, A., Bleuler, S., Zimmermann, P.: A systematic comparison and evaluation of biclustering methods for gene expression data. Bioinformatics, 122–129 (2006)Google Scholar
- 28.Rege, M., Dong, M.: Co-clustering Documents and Words Using Bipartite Isoperimetric Graph Partitioning. In: 6th IEEE Int. Conf. on Data Mining, pp. 532–541 (2006)Google Scholar
- 29.Reiss, D.J.: Integrated biclustering of heterogeneous genome-wide datasets for the inference of global regulatory networks. BMC Bioinfor., 280–302 (2006)Google Scholar
- 30.Robardet, C.: Contribution à la classification non supervisee : proposition d’une methode de bi-partitionnement, PhD Thesis, Claude Bernard University (2002).Google Scholar
- 31.Tanay, A., Sharan, R., Shamir, R.: Biclustering Algorithms: A Survey. In: Aluru, S. (ed.) Handbook of Comp. Molecular Biology, Chapman, Boca Raton (2004)Google Scholar
- 32.Tang, C., Zhang, L.A.: Interrelated two-way clustering: an unsupervised approach for gene expression data analysis. In: IEEE Int. Sym. on Bioinfo. and Bioeng. (2001)Google Scholar
- 33.Tibshirani, R., Hastie, T., Eisen, M.: Clustering methods for the analysis of DNA microarray data. Technical report, Stanford University (1999)Google Scholar
- 34.Van den, B.T.: Robust Algorithms for Inferring Regulatory Networks Based on Gene Expression Measurements. PhD Thesis (2009)Google Scholar
- 35.Wang, H., Wang, W., Yang, J., Yu, P.S.: Clustering by pattern similarity in large data sets. In: ACM SIGMOD Int. Conf. on Management of Data, pp. 394–405 (2002)Google Scholar
- 36.Yang, J., Wang, H., Wang, W., Yu, P.S.: An improved biclustering method for analyzing gene expression profiles. Int. J. on Art. Int. Tools, 771–790 (2005)Google Scholar
- 37.Klugar, Y., Basri, R., Chang, J.T., Gerstein, M.: Spectral biclustering of microarray data: coclustering genes and conditions. Genome Research 13, 703–716 (2003)CrossRefGoogle Scholar
- 38.Zha, H., He, X., Ding, C., Simon, H., Gu, M.: Bipartite Graph Partitioning and Data Clustering. In: ACM Conf. on Inf. and Knowledge Management, pp. 25–32 (2001)Google Scholar
Copyright information
© Springer-Verlag Berlin Heidelberg 2011