Skip to main content

A Statistical-Genetic Algorithm to Select the Most Significant Features in Mammograms

  • Conference paper
Computer Analysis of Images and Patterns (CAIP 2007)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4673))

Included in the following conference series:

Abstract

An automatic classification system into either malignant or benign microcalcification from mammograms is a helpful tool in breast cancer diagnosis. From a set of extracted features, a classifying method using neural networks can provide a probability estimation that can help the radiologist in his diagnosis. With this objective in mind, this paper proposes a feature selection algorithm from a massive number of features based on a statistical distance method in conjunction with a genetic algorithm (GA). The use of a statistical distance as optimality criterion was improved with genetic algorithms for selecting an appropriate subset of features, thus making this algorithm capable of performing feature selection from a massive set of initial features. Additionally, it provides a criterion to select an appropriate number of features to be employed. Experimental work was performed using Generalized Softmax Perceptrons (GSP), trained with a Strict Sense Bayesian cost function for direct probability estimation, as microcalcification classifiers. A Posterior Probability Model Selection (PPMS) algorithm was employed to determine the network complexity. Results showed that this algorithm converges into a subset of features which has a good classification rate and Area Under Curve (AUC) of the Receiver Operating Curve (ROC).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhou, X., Gordon, R.: Detection of early breast cancer: an overview and future prospects. Crit. Rev. Biomed. Eng. 17(3), 203–255 (1989)

    Google Scholar 

  2. Jiang, Y., Nishikawa, R.M., Wolverton, D.E., Metz, C.E., Giger, M.L., Schmidt, R.A., Vyborny, C.J., Doi, K.: Malignant and bening clustered microcalcifications: Automated feature analysis and classification. Radiology 198(3), 671–678 (1996)

    Google Scholar 

  3. Haralick, R.M.: Statistical and structural approaches to texture. Proceedings of the IEEE 67(5), 786–804 (1979)

    Article  Google Scholar 

  4. Clausi, D.A., Jernigan, M.E.: A fast method to determine co-occurrence texture features. IEEE trans. geosci. remote sens. 36(1), 298–300 (1998)

    Article  Google Scholar 

  5. Verma, B., Zhang, P.: A novel neural-genetic algorithm to find the most significant combination of features in digital mammograms. Appl. Soft Comput. 7(2), 612–625 (2007)

    Article  Google Scholar 

  6. Liu, H., Yu, L.: Toward integrating feature selection algorithms for classification and clustering. IEEE Transactions on Knowledge and Data Engineering 17(4), 491–502 (2005)

    Article  Google Scholar 

  7. Langley, P.: Selection of Relevant Features in Machine Learning. In: Proc. AAAI Fall Symp. Relevance, pp. 140–144 (1994)

    Google Scholar 

  8. Liu, H., Motoda, H.: Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers, Boston, MA (1998)

    MATH  Google Scholar 

  9. Arribas, J.I., Cid-Sueiro, J.: A Model Algorithm for a Posteriori Probability Estimation With Neural Networks. IEEE Trans. on Neural Netw. 16(4), 799–809 (2005)

    Article  Google Scholar 

  10. Suckling, J., et al.: The mammographic image analysis society digital mammogram database. Exerpta Medica. International Congress Series 1069, 375–378 (1994)

    Google Scholar 

  11. Jain, A.K., Duin, R., Mao, J.: Statistical Pattern Recognition: A Review. IEEE Trans. on Pattern Analysis and Machine Intelligence 22(1), 4–37 (2000)

    Article  Google Scholar 

  12. Zahn, C.T., Roskies, R.Z.: Fourier descriptors for plane closed curves. IEEE Trans. on Computing C-21(3), 269–281 (1972)

    Article  MathSciNet  Google Scholar 

  13. Chuang, G.C.-H., Kuo, C.-C.J.: Wavelet descriptor of planar curves: theory and applications. IEEE Trans. on Image Proc. 5(1), 56–70 (1996)

    Article  Google Scholar 

  14. Jain, A.K.: Fundamentals of digital image processing. Prentice-Hall, Englewood Cliffs (1989)

    MATH  Google Scholar 

  15. Ghosal, S., Mehrotra, R.: A moment-based unified approach to image feature detection. IEEE Trans. on Image Processing 6(6), 781–793 (1997)

    Article  Google Scholar 

  16. Pierre, A.D., Kittler, J.: Pattern recognition: a statistical approach. Prentice/Hall International, Englewood Cliffs (1982)

    MATH  Google Scholar 

  17. Hong, J., Cho, S.: Efficient huge-scale feature selection with speciated genetic algorithm. Pattern Recogn. Lett. 27(2), 143–150 (2006)

    Article  Google Scholar 

  18. Tang, K.S., Man, K.F., Kwong, S., He, Q.: Genetic algorithms and their applications. IEEE Signal Processing Magazine 13(6), 22–37 (1996)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Walter G. Kropatsch Martin Kampel Allan Hanbury

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Sánchez-Ferrero, G.V., Arribas, J.I. (2007). A Statistical-Genetic Algorithm to Select the Most Significant Features in Mammograms. In: Kropatsch, W.G., Kampel, M., Hanbury, A. (eds) Computer Analysis of Images and Patterns. CAIP 2007. Lecture Notes in Computer Science, vol 4673. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74272-2_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74272-2_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74271-5

  • Online ISBN: 978-3-540-74272-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics