Gene Selection Using Random Voronoi Ensembles

Masulli, Francesco; Rovetta, Stefano

doi:10.1007/978-3-540-45216-4_34

Francesco Masulli^7,8 &
Stefano Rovetta^7,9

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2859))

Included in the following conference series:

Italian Workshop on Neural Nets

381 Accesses

Abstract

In this paper we propose a flexible method for analyzing the relevance of input variables in high dimensional problems with respect to a given dichotomic classification problem. Both linear and non-linear cases are considered. In the linear case, the application of derivative-based saliency yields a commonly adopted ranking criterion. In the non-linear case, the method is extended by introducing a resampling technique and by clustering the obtained results for stability of the estimate. The method was preliminarly validated on the data published by T.R. Golub et al. on a study, at the molecular level, of two kinds of leukemia: Acute Myeloid Leukemia and Acute Lymphoblastic Leukemia (Science 5439-286, 531-537, 1999). Our technique indicates that, among the top 20 genes found by the final cluster analysis, 8 of the 50 genes listed in the original work feature a stronger discriminating power.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aurenhammer, F.: Voronoi diagrams-a survey of a fundamental geometric data structure. ACM Computing Surveys 3(23), 345–405 (1991)
Article Google Scholar
Bezdek, J.C.: Pattern recognition with fuzzy objective function algorithms. Plenum, NewYork (1981)
MATH Google Scholar
Bilban, M., Buehler, L.K., Head, S., Desoye, G., Quaranta, V.: Normalizing DNA microarray data. Curr. Issues Mol. Biol. 4(2), 57–64 (2002)
Google Scholar
Brank, J., Grobelnik, M., Milic-Frayling, N., Mladenic, D.: Feature selection using linear support vector machines, Tech. Rep. MSR-TR-2002-63, Microsoft Research (June 2002)
Google Scholar
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. Cambridge Univ. Press, Cambridge (2000)
Google Scholar
Dietterich, T.G.: Machine-learning research: Four current directions. The AI Magazine 4(18), 97–136 (1998)
Google Scholar
Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis. John Wiley and Sons, New York (1973)
MATH Google Scholar
Golub, T.R., et al.: Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science 5439(286), 531–537 (1999)
Article Google Scholar
Ho, T.K., Basu, M.: Complexity measures of supervised classification problems. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(3), 289–300 (2002)
Article Google Scholar
Krishnapuram, R., Keller, J.M.: A possibilistic approach to clustering. IEEE Trans. on Fuzzy Systems 2(1), 98–110 (1993)
Article Google Scholar
Krishnapuram, R., Keller, J.M.: The possibilistic c-Means algorithm: insights and recommendations. IEEE Trans. on Fuzzy Systems 3(4), 385–393 (1996)
Article Google Scholar
Masulli, F., Rovetta, S.: Soft transition from probabilistic to possibilistic fuzzy clustering, DISI Technical Report DISI-TR-03-02, Department of Computer and Information Sciences, University of Genoa, Italy (April 2002), http://www.disi.unige.it/person/RovettaS/research/techrep/DISI-TR-02-03.ps.gz
Moneta, C., Parodi, G., Rovetta, S., Zunino, R.: Automated diagnosis and disease characterization using neural network analysis. In: Proc. of the 1992 IEEE Int. Conf. on Systems, Man and Cybernetics, Chicago USA, October 1992, pp. 123–128 (1992)
Google Scholar
Ripley, B.D.: Pattern recognition and neural networks. Cambridge Univ. Press, Cambridge (1996)
MATH Google Scholar
Rose, K.: Deterministic annealing for clustering, compression, classification, regression, and related optimization problems. Proceedings of IEEE 11(86), 2210–2239 (1998)
Article Google Scholar
Sindhwani, V., Bhattacharya, P., Rakshit, S.: Information theoretic feature crediting in multiclass support vector machines. In: 1st SIAM Int. Conf. on Data Mining, Chicago, USA, SIAM, Philadelphia (2001)
Google Scholar
Weller, F.: Stability ofVoronoi neighborhood under perturbations of the sites. In: Proc. of Ninth Canadian Conf. on Computational Geometry, Kingston, Ontario, Canada (August 1997)
Google Scholar

Download references

Author information

Authors and Affiliations

INFM-Istituto Nazionale per la Fisica della Materia, Via Dodecaneso 33, I-16146, Genova, Italy
Francesco Masulli & Stefano Rovetta
DI-Dipartimento di Informatica, Università di Pisa, Via F. Buonarroti 2, 56127, Pisa, Italy
Francesco Masulli
DISI-Dipartimento di Informatica e Scienze dell’Informazione, Università di Genova, Via Dodecaneso 35, 16146, Genova, Italy
Stefano Rovetta

Authors

Francesco Masulli
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Rovetta
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimentimento di Scienze dell’Informazione, via Comelico 39/41, 20135, Milano, Italy
Bruno Apolloni
Dipartimento di Fisica “E.R. Caianiello”, Università degli Studi di Salerno, Via S. Allende, 84081, Baronissi, (SA), Italy
Maria Marinaro
Department of Mathematics and Informatics, University of Salerno, Via Ponte Don Melillo, 84084, Fisciano, (SA), Italy
Roberto Tagliaferri

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Masulli, F., Rovetta, S. (2003). Gene Selection Using Random Voronoi Ensembles. In: Apolloni, B., Marinaro, M., Tagliaferri, R. (eds) Neural Nets. WIRN 2003. Lecture Notes in Computer Science, vol 2859. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45216-4_34

Download citation

DOI: https://doi.org/10.1007/978-3-540-45216-4_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20227-1
Online ISBN: 978-3-540-45216-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics