Abstract
In this paper we discuss a hybrid feature selection algorithm for the Quantitative Structure Activity Relationship (QSAR) modelling. This is one of the goals in Predictive Toxicology domain, aiming to describe the relations between the chemical structure of a molecule and its biological or toxicological effects, in order to predict the behaviour of new, unknown chemical compounds. We propose a hybridization of the ReliefF algorithm based on a simple fuzzy extension of the value difference metric. The experimental results both on benchmark and real world applications suggest more stability in dealing with noisy data and our preliminary tests give a promising starting point for future research.
Chapter PDF
Similar content being viewed by others
Keywords
- Feature Selection
- Bayesian Network
- Quantitative Structure Activity Relationship
- Feature Selection Method
- Feature Selection Technique
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Hansch, C., Hoekman, D., Leo, A., Zhang, L., Li, P.: The expanding role of quantitative structure-activity relationship (QSAR) in toxicology. Toxicology Letters 79, 45–53 (1995)
Woo, Y.-T.: A Toxicologist’s View and Evaluation. Predictive Toxicology Challenge (PTC) (2000-2001)
Fayad, U.M., Piatesky-Shapiro, G., Smyth, P., Uthurusamy, K. (eds.): Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press, Menlo Park (1996)
Kira, K., Rendell, L.A.: A practical approach to feature selection. In: International Conference on machine learning, pp. 249–256. Morgan Kaufmann, San Francisco (1992)
Kononenko, I.: Estimating attributes: Analysis and Extension of Relief. In: Bergadano, F., De Raedt, L. (eds.) ECML 1994. LNCS, vol. 784, pp. 171–182. Springer, Heidelberg (1994)
Robnik Sikonja, M., Kononenko, I.: An adaptation of Relief for attribute estimation in regression. In: Fisher, D. (ed.) Machine Learning: Proceedings of the Fourteenth International Conference (ICML 1997), pp. 296–304. Morgan Kaufmann Publishers, San Francisco (1997)
Zadeh, L.A.: Fuzzy Logic and Approximate Reasoning. Synthese 30, 407–428 (1975)
Pal, S.K., Shiu Simon, C.K.: Foundations of Soft Case-Based Reasoning. John Wiley and Sons, Chichester (2004)
Fuller, R.: Introduction to Neuro-Fuzzy Systems. Advances in Soft Computing Series. Springer, Berlin (1999)
Wilson, D.R., Martinez, T.R.: Improved Heterogeneous Distance Functions. Journal of Artificial Intelligence Research (JAIR) 6-1, 1–34 (1997)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann Publishers, San Francisco (1999)
Fisher, R.A.: Iris Plant Database
Domingos, P., Pazzani, M.: Beyond Independence: Conditions for the optimality of the simple bayeisan classifier. In: Proceeding of the Thirteenth International Conference on Machine Learning (1996)
Charles, E.: Naïve Bayesian Learning, Technical Report, University of California (1997)
Myllymäki, P., Silander, T., Tirri, H., Uronen, P.: B-Course: A Web-Based Tool for Bayesian and Causal Data Analysis. International Journal on Artificial Intelligence Tools 11(3), 369–387 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Crăciun, M.V., Cocu, A., Dumitriu, L., Segal, C. (2006). A Hybrid Feature Selection Algorithm for the QSAR Problem. In: Alexandrov, V.N., van Albada, G.D., Sloot, P.M.A., Dongarra, J. (eds) Computational Science – ICCS 2006. ICCS 2006. Lecture Notes in Computer Science, vol 3991. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11758501_27
Download citation
DOI: https://doi.org/10.1007/11758501_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34379-0
Online ISBN: 978-3-540-34380-6
eBook Packages: Computer ScienceComputer Science (R0)