Skip to main content

A Genetic Algorithm with Self–sizing Genomes for Data Clustering in Dermatological Semeiotics

  • Conference paper
Book cover Applied Soft Computing Technologies: The Challenge of Complexity

Part of the book series: Advances in Soft Computing ((AINSC,volume 34))

  • 1196 Accesses

Abstract

Medical semeiotics often deals with patient databases and would greatly benefit from efficient clustering techniques. In this paper a new evolutionary algorithm for data clustering, the Self–sizing Genome Genetic Algorithm, is introduced. It does not use a priori information about the number of clusters. Recombination takes place through a brand–new operator, i.e., gene–pooling, and fitness is based on simultaneously maximizing intra–cluster homogeneity and inter–cluster separability. This algorithm is applied to clustering in dermatological semeiotics. Moreover, a Pathology Addressing Index is defined to quantify utility of the clusters making up a proposed solution in unambiguously addressing towards pathologies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Babu GP, Murty MN (1993) A near–optimal initial seed value selection in k-means algorithm using a genetic algorithm, Pattern Recogn. Lett. 14(10):763–769

    Article  MATH  Google Scholar 

  2. Bhuyan J, Raghavan V, Venkatesh K (1991) Genetic algorithm for clustering with an ordered representation. In: Belew R.K, Booker LB (eds) Proc. of the Fourth Int. Conf. on Genetic Algorithms (1991), pp. 408–415. Morgan–Kaufmann San Mateo

    Google Scholar 

  3. Blake CL, Merz CJ (1998) UCI Repository of machine learning databases: University of California, Irvine [http://www.ics.uci.edu/~mlearn/MLR,epository.html]

    Google Scholar 

  4. Burdsal B, Giraud–Carrier C (1997) Evolving fuzzy prototypes for efficient data clustering, www.cs.bris.ac.uk/Tools/Reports/Ps/1997-burdsall.ps

    Google Scholar 

  5. Goldberg DE (1989) Genetic algorithms in search optimization and machine learning. Addison–Wesley, Reading, Mass

    MATH  Google Scholar 

  6. Goldberg DE, Korb B, Deb K (1989) Messy genetic algorithms: motivation, analysis, and first results, Complex Systems 3:493–530

    MATH  MathSciNet  Google Scholar 

  7. Guvenir HA, Demiroz G, Ilter N (1998) Learning differential diagnosis of erythemato–squamous diseases using voting feature intervals, Artificial Intelligence in Medicine 13:147–165

    Article  Google Scholar 

  8. Han J, Kamber M (2001) Data mining: concept and techniques. Morgan Kaufman

    Google Scholar 

  9. Hand DJ, Mannila H, Smyth P (2001) Principles of data mining. MIT Press

    Google Scholar 

  10. Holland JH (1975) Adaptation in natural and artificial systems, 2nd edition. The University of Michigan Press

    Google Scholar 

  11. Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review, ACM Computing Surveys 31(3):264–323

    Article  Google Scholar 

  12. Kaufman L, Rousseeuw PJ (1990) Finding groups in data. An introduction to cluster analysis. Wiley and Sons, New York

    Google Scholar 

  13. Luo F, Khan L, Yen IL, Bastani F (2003), A dynamical growing self–organizing tree (DGSOT) for hierarchical clustering. Submitted to IEEE Transactions on Knowledge and Data Engineering, July, 2003. utdallas.edu/ luofeng/DGSOT.doc

    Google Scholar 

  14. Yip AM (2002) A scale dependent data clustering model by direct maximization of homogeneity and separation. In: Proc. Mathematical Challenges in Scientific Data Mining IPAM. 14–18 January (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer

About this paper

Cite this paper

De Falco, I., Tarantino, E., Cioppa, A.D., Gagliardi, F. (2006). A Genetic Algorithm with Self–sizing Genomes for Data Clustering in Dermatological Semeiotics. In: Abraham, A., de Baets, B., Köppen, M., Nickolay, B. (eds) Applied Soft Computing Technologies: The Challenge of Complexity. Advances in Soft Computing, vol 34. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31662-0_34

Download citation

  • DOI: https://doi.org/10.1007/3-540-31662-0_34

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31649-7

  • Online ISBN: 978-3-540-31662-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics