Comparison of Efficient and Rand Index Fitness Function for Clustering Gene Expression Data

Patheja, P. S.; Waoo, Akhilesh A.; Sharma, Ragini

doi:10.1007/978-3-642-27317-9_17

Conference paper

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (LNICST, volume 86)

Abstract

This paper presents a comparative study of the Efficient Fitness Function and the Rand Index Fitness Function, showing how the Efficient Fitness Function can give better results when used to cluster gene expression data. Variance, which is the main limitation of the Rand Index, can be improved by using the Efficient Fitness Function. The results are evaluated by computing the precision values (i.e. sensitivity and specificity) for the dataset. The Genetic Weighted K-Mean Algorithm (GWKMA) used here is a hybrid of the Weighted K-Mean Algorithm (WKMA) and a Genetic Algorithm. WKMA is used to find an optimal partition of the data. The Genetic Algorithm is then applied to obtain the best-fit gene from the clusters through the fitness function; genetic operators such as selection, crossover and mutation are performed on it.
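
For orientation, the Rand Index and the two evaluation measures named in the abstract can be sketched with the standard pair-counting definitions. This is a minimal illustration only: the function names, the toy labels, and the choice to compute sensitivity and specificity over pairs of genes are assumptions, not the authors' implementation, and the Efficient Fitness Function is not shown because the abstract does not define it.

```python
from itertools import combinations


def pair_counts(truth, pred):
    """Classify every pair of items by whether the reference labels and the
    predicted clusters agree on putting the pair together or apart."""
    tp = tn = fp = fn = 0
    for i, j in combinations(range(len(truth)), 2):
        same_truth = truth[i] == truth[j]
        same_pred = pred[i] == pred[j]
        if same_truth and same_pred:
            tp += 1  # together in both
        elif not same_truth and not same_pred:
            tn += 1  # apart in both
        elif same_pred:
            fp += 1  # clustered together, but different reference classes
        else:
            fn += 1  # same reference class, but split across clusters
    return tp, tn, fp, fn


def rand_index(truth, pred):
    """Fraction of pairs on which the two partitions agree."""
    tp, tn, fp, fn = pair_counts(truth, pred)
    total = tp + tn + fp + fn
    return (tp + tn) / total if total else 1.0


def sensitivity(truth, pred):
    """Pairwise sensitivity: TP / (TP + FN)."""
    tp, _, _, fn = pair_counts(truth, pred)
    return tp / (tp + fn) if (tp + fn) else 0.0


def specificity(truth, pred):
    """Pairwise specificity: TN / (TN + FP)."""
    _, tn, fp, _ = pair_counts(truth, pred)
    return tn / (tn + fp) if (tn + fp) else 0.0


# Hypothetical toy data: six genes, two reference classes, one gene misplaced.
reference = [0, 0, 0, 1, 1, 1]
clusters = [0, 0, 1, 1, 1, 1]
print(rand_index(reference, clusters))
print(sensitivity(reference, clusters))
print(specificity(reference, clusters))
```

In a genetic algorithm such a score could serve as the fitness of a candidate partition, with higher values indicating closer agreement with the reference classes.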

Author information

Authors and Affiliations

BIST, Bhopal, India
P. S. Patheja, Akhilesh A. Waoo & Ragini Sharma

Editor information

Editors and Affiliations

Jackson State University, Jackson, MS, USA
Natarajan Meghanathan
University of Calcutta, Calcutta, India
Nabendu Chaki
Wireilla Net Solutions PTY Ltd., Melbourne, VIC, Australia
Dhinaharan Nagamalai

About this paper

Cite this paper

Patheja, P.S., Waoo, A.A., Sharma, R. (2012). Comparison of Efficient and Rand Index Fitness Function for Clustering Gene Expression Data. In: Meghanathan, N., Chaki, N., Nagamalai, D. (eds) Advances in Computer Science and Information Technology. Computer Science and Information Technology. CCSIT 2012. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 86. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27317-9_17

DOI: https://doi.org/10.1007/978-3-642-27317-9_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27316-2
Online ISBN: 978-3-642-27317-9
eBook Packages: Computer Science, Computer Science (R0)
