A Statistical-Genetic Algorithm to Select the Most Significant Features in Mammograms
An automatic classification system into either malignant or benign microcalcification from mammograms is a helpful tool in breast cancer diagnosis. From a set of extracted features, a classifying method using neural networks can provide a probability estimation that can help the radiologist in his diagnosis. With this objective in mind, this paper proposes a feature selection algorithm from a massive number of features based on a statistical distance method in conjunction with a genetic algorithm (GA). The use of a statistical distance as optimality criterion was improved with genetic algorithms for selecting an appropriate subset of features, thus making this algorithm capable of performing feature selection from a massive set of initial features. Additionally, it provides a criterion to select an appropriate number of features to be employed. Experimental work was performed using Generalized Softmax Perceptrons (GSP), trained with a Strict Sense Bayesian cost function for direct probability estimation, as microcalcification classifiers. A Posterior Probability Model Selection (PPMS) algorithm was employed to determine the network complexity. Results showed that this algorithm converges into a subset of features which has a good classification rate and Area Under Curve (AUC) of the Receiver Operating Curve (ROC).
KeywordsBreast cancer microcalcification classification feature selection medical diagnosis genetic algorithms neural network classifiers
Unable to display preview. Download preview PDF.
- 1.Zhou, X., Gordon, R.: Detection of early breast cancer: an overview and future prospects. Crit. Rev. Biomed. Eng. 17(3), 203–255 (1989)Google Scholar
- 2.Jiang, Y., Nishikawa, R.M., Wolverton, D.E., Metz, C.E., Giger, M.L., Schmidt, R.A., Vyborny, C.J., Doi, K.: Malignant and bening clustered microcalcifications: Automated feature analysis and classification. Radiology 198(3), 671–678 (1996)Google Scholar
- 7.Langley, P.: Selection of Relevant Features in Machine Learning. In: Proc. AAAI Fall Symp. Relevance, pp. 140–144 (1994)Google Scholar
- 10.Suckling, J., et al.: The mammographic image analysis society digital mammogram database. Exerpta Medica. International Congress Series 1069, 375–378 (1994)Google Scholar