Part of the book series: Studies in Computational Intelligence (SCI, volume 372)

Abstract

The primary objective of clustering is to discover structure in data by forming a number of clusters or groups. To achieve optimal clustering results with current soft computing approaches, two fundamental questions should be considered: (i) how many clusters are actually present in the given data, and (ii) how real or good the clustering itself is. Because of these two questions, almost every clustering method needs to determine the number of clusters. Yet it is difficult to determine the optimal number of clusters for each data set. Hence, DNA-based clustering algorithms were proposed to solve the clustering problem without requiring preliminary parameters such as the number of clusters or the number of iterations.

Because DNA-based solutions and silicon-based solutions process data in fundamentally different ways, it is critical to evaluate the results obtained from DNA-based clustering, to ensure that they can be accepted on the same footing as those of other soft computing techniques. This study therefore proposes two different techniques for evaluating DNA-based clustering algorithms, to determine whether their results can be accepted like those of other soft computing techniques or are not reliable enough to be used.
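The abstract does not name the two evaluation techniques, so the following is only an illustrative sketch of how question (ii), how good a clustering is, is commonly quantified. It computes the mean silhouette width, a generic internal validity index that is not stated to be the chapter's method. The function names, the Euclidean distance, and the toy data are all assumptions made for this example.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length coordinate tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean_silhouette(points, labels):
    """Mean silhouette width in [-1, 1]; higher suggests tighter, better-separated clusters.

    Illustrative validity index only; not the evaluation technique of the chapter.
    """
    # Group point indices by cluster label.
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    total = 0.0
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # a singleton cluster contributes a silhouette of 0
        # a: mean distance to the other members of the point's own cluster
        a = sum(euclidean(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(euclidean(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        if denom > 0:
            total += (b - a) / denom
    return total / len(points)


# Hypothetical toy data: two visually obvious groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
good_score = mean_silhouette(points, [0, 0, 0, 1, 1, 1])  # matches the groups
bad_score = mean_silhouette(points, [0, 1, 0, 1, 0, 1])   # mixes the groups
```

Under these assumptions, a labeling produced by any clustering procedure, DNA-based or silicon-based, can be scored with the same index, which is one way results from different computing paradigms could be compared on equal terms.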


Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Bakar, R.A., Yu-Yi, C., Watada, J. (2011). Robustness of DNA-Based Clustering. In: Ruano, A.E., Várkonyi-Kóczy, A.R. (eds) New Advances in Intelligent Signal Processing. Studies in Computational Intelligence, vol 372. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11739-8_4

  • DOI: https://doi.org/10.1007/978-3-642-11739-8_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-11738-1

  • Online ISBN: 978-3-642-11739-8

  • eBook Packages: Engineering, Engineering (R0)
