Abstract
A novel graph-theoretic clustering (GTC) is presented. The method relies on a weighted graph arrangement of the genes, and the iterative partitioning of the respective minimum spanning tree of the graph. The final result is the hierarchical clustering of the genes. GTC utilizes information about the functional classification of genes to knowledgeably guide the clustering process and achieve more informative clustering results. The method was applied and tested on an indicative real-world domain producing satisfactory and biologically valid results. Future R&D directions are also posted.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Lin, K., Sawhney, H.S., Shim, K.: Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases. In: Proceddings of the 21st VLDB Conf., Zurich, Switzerland, pp. 490–501 (1995)
Bassett, D.E., Eisen, M.B., Boguski, M.S.: Gene expression informatics: it’ s. all in your mine. Nat. Genet. 21, 51–55 (1999)
Fisher, D.: Knowledge Acquisition via Incremental Conceptual Clustering. Machine Learning 2, 139–172 (1987)
Mora López, L., Fortes Ruiz, I., Morales Bueno, R., Triguero Ruiz, F.: Dynamic discretization of continuous values from time series. In: Lopez de Mantaras, R., Plaza, E. (eds.) ECML 2000. LNCS (LNAI), vol. 1810, pp. 280–291. Springer, Heidelberg (2000)
Pielou, E.C.: The Measurement of Diversity in Different Types of Biological Collections. Journ. Theoret. Biology 13, 131–144 (1966)
Potamias, G.: Utilizing Genes Functional Classification in Microarray Data Analysis: a Hybrid Clustering Approach. In: Margaritis, K., Pitas, I. (eds.) Proceedings of the 9th Panhellenic Conference in Informatics, Thessaliniki, Greece, pp. 415–430 (2003)
Potamias, G., Dermon, C.R.: Protein Synthesis Profiling in the Developing Brain: A Graph Theoretic Clustering Approach. Computer Methods and Programs in Biomedicine (2004) (in press)
Prim, P.: Shortest connection networks and some generalizations. Bell System Technical Journal 36, 1389–1401 (1957)
Scena, M., Schena, M.: Microarray Biochip Technology, 1st edn. Eaton Pub. Co. (2000)
Schena, M., Heller, R.A., Theriault, T.P., Konrad, K., Lachenmeier, E., Davis, R.W.: Microassays: biotechnology’s discovery platform for functional genomics. Trends Biotechnol. 16(7), 301–306 (1998)
Wen, X., Fuhrman, S., Michaels, G.S., Carr, D.B., Smith, S., Barker, J.L., Somogyi, R.: Large-scale temporal gene expression mapping of central nervous system development. Procs. Natl. Acad. Sci. (PNAS) 95, 334–339 (1999)
Wilson, D., Martinez, T.: Improved heterogeneous distance functions. Journal of Artificial Intelligence Research 6, 1–34 (1997)
Zweiger, G.: Knowledge discovery in gene-expression-microarray data: mining the information output of the genome. Trends Biotechnol. 17(11), 429–436 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Potamias, G. (2004). Knowledgeable Clustering of Microarray Data. In: Barreiro, J.M., Martín-Sánchez, F., Maojo, V., Sanz, F. (eds) Biological and Medical Data Analysis. ISBMDA 2004. Lecture Notes in Computer Science, vol 3337. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30547-7_49
Download citation
DOI: https://doi.org/10.1007/978-3-540-30547-7_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23964-2
Online ISBN: 978-3-540-30547-7
eBook Packages: Springer Book Archive