Abstract
Let matrix A represent a data set of m features and n samples. Each element of the matrix, aij, corresponds to the expression of the i-th feature in the j-th sample. Biclustering is a classification of the samples as well as features into k classes. In other words, we need to classify columns and rows of the matrix A. Doing so, let S 1, S 2,..., S k and F 1, F 2,..., F k denote the classes of the samples (columns) and features (rows), respectively. Formally biclustering can be defined as follows.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ben-Dor A., Bruhn L., Nachman I., Schummer M., Yakhini Z.: Tissue Classification with Gene Expression Profiles. Journal of Computational Bilology, 7, 559–584 (2000)
Ben-Dor A., Ffiedman N., Yakhin Z.: Class Descovery in Gene Expression Data. Procidings of Fifth Annual International Conveference on Computational Malecular Biology, (2001)
Bryan K., Cunningham P., Bolshakova N.: Biclustering of Expression Data Using Simulated Annealing. Proceedings of the 18th IEEE Symposium on Computer-Based Medical Systems, 383–388 (2005)
Busygin S., Prokopyev O., Pardalos P.: Feature Selection for Consistent Biclustering via Fractional 0-1 Programming. Journal of Combinatorial Optimization, 10, 7–21 (2005)
Cheng Y., Church G. M.: Biclustering of Expression Data. Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, 93–103 (2000)
Dhillon I. S.: Co-Clustering Documents and Words Using Bipartite Spectral Graph Partitioning. Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Francisco, 26–29 (2001)
Dhillon I. S., Mallela S., Modha D. S.: Information-Theoretic Coclustering. Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 89–98 (2003)
Golub T. R., Slonim D. K., Tamayo P., Huard C., Gaasenbeek M., Mesirov J. P., Coller H., Loh M. L., Downing J. R., Caligiuri M. A., Bloomfield C. D., Lander E. S.: Molecular Classification of Cancer: Class Discovery and Class Prodiction by Gene Expression Monitoring. Science, 286, 531–537 (1999)
Hartigan J. A.: Direct Clustering of a Data Matrix. Journal of American Statistical Association, 67, 123–129 (1972)
HeatMap Builder Software, Quertermous Laboratory, Stanford University, http://quertermous.stanford.edu/heatmap.htm.
Kluger Y., Basri R., Chang J. T., Gerstein M.: Spectral Biclustering of Microarray Data: Coclustering Genes and Conditions. Genome Research, 703–716 (2003)
Sheng Q., Moreau Y., De Moor B.: Biclustering Microarray Data by Gibbs Sampling. Bioinformatics, 19, ii196–ii205 (2003)
Weston J., Mukherjee S., Chapelle O., Pontil M., Poggio T., Vapnik V.: Feature Selection for SVMs. NIPS, (2001)
Xing E.P., Karp R.M.: CLIFF: Clustering of High-Dimensional Microarray Data Via Iterative Feature Filtering Using Normilized Cuts. Bioinfomatics Discovery Note, 1, 1–9 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Nahapetyan, A., Busygin, S., Pardalos, P. (2008). The Improved Heuristic for Consistent Biclustering Problems. In: Mondaini, R.P., Pardalos, P.M. (eds) Mathematical Modelling of Biosystems. Applied Optimization, vol 102. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76784-8_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-76784-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76783-1
Online ISBN: 978-3-540-76784-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)