Abstract
In the medical field, successful classification of microarray gene expression data is of major importance for cancer diagnosis. However, due to the profusion of genes number, the performance of classifying DNA microarray gene expression data using statistical algorithms is often limited. Recently, there has been an important increase in the studies on the utilization of artificial intelligence methods, for the purpose of classifying large-scale data. In this context, a hybrid approach based on the adaptive neuro-fuzzy inference system (ANFIS), the fuzzy c-means clustering (FCM), and the simulated annealing (SA) algorithm is proposed in this study. The proposed method is applied to classify five different cancer datasets (i.e., lung cancer, central nervous system cancer, brain cancer, endometrial cancer, and prostate cancer). The backpropagation algorithm, hybrid algorithm, genetic algorithm, and the other statistical methods such as Bayesian network, support vector machine, and J48 decision tree are used to compare the proposed approach’s performance to other algorithms. The results show that the performance of training FCM-based ANFIS using SA algorithm for classifying all the cancer datasets becomes more successful with the average accuracy rate of 96.28% and the results of the other methods are also satisfactory. The proposed method gives more effective results than the others for classifying DNA microarray cancer gene expression data.
Graphical abstract

Basic structure of proposed method
This is a preview of subscription content, access via your institution.




References
- 1.
Schulze A, Downward J (2001) Navigating gene expression using microarrays -- a technology review. Nat Cell Biol 3(8):190–195. https://doi.org/10.1038/35087138
- 2.
Sharma A, Imoto S, Miyano S, Sharma V (2012) Null space based feature selection method for gene expression data. Int J Mach Learn Cybern 3(4):269–276. https://doi.org/10.1007/s13042-011-0061-9
- 3.
Saeys Y, Inza I, Larranaga P (2007) A review of feature selection techniques in bioinformatics. BIOINFORMATICS 23(19):2507–2517. https://doi.org/10.1093/bioinformatics/btm344
- 4.
Guo S, Guo D, Chen L, Jiang Q (2016) A centroid-based gene selection method for microarray data classification. J Theor Biol 400:32–41. https://doi.org/10.1016/j.jtbi.2016.03.034
- 5.
Dagliyan O, Yuksektepe F, Kavakli H, Turkay M (2011) Optimization based tumor classification from microarray gene expression data. PLoS One 6(2):e14579. https://doi.org/10.1371/journal.pone.0014579
- 6.
Tan AC, Gilbert D (2003) Ensemble machine learning on gene expression data for cancer classification. Appl Bioinforma 2(3):75–83
- 7.
Pirooznia M, Yang JY, Yang MQM, Deng Y (2008) A comparative study of different machine learning methods on microarray gene expression data. BMC Genomics 9(1):1–13. https://doi.org/10.1186/1471-2164-9-S1-S13
- 8.
Sarhan AM (2009) Cancer classification based on microarray gene expression data using DCT and ANN. J Theor Appl Inform Technol 6(2):208–216
- 9.
Loganathan C, Girija KV (2013) Cancer classification using adaptive neuro fuzzy inference system with runge kutta learning. Int J Comput Appl 79(4):46–50
- 10.
AnandaKumar K, Punithavalli M (2011) Efficient cancer classification using fast adaptive neuro-fuzzy inference system (FANFIS) based on statistical techniques. Int J Adv Comput Sci Appl Spec Issue Artif Intell:132–137. https://doi.org/10.14569/SpecialIssue.2011.010321
- 11.
Haznedar B, Arslan MT, Kalinli A (2017) Training ANFIS structure using genetic algorithm for liver cancer classification based on microarray gene expression data. Sakarya Univ J Sci 21:54–62. https://doi.org/10.12739/NWSA.2018.13.4.2A0159
- 12.
Simon D (2002) Training fuzzy systems with the extended Kalman filter. Fuzzy Sets Syst 132:189–199. https://doi.org/10.1016/S0165-0114(01)00241-X
- 13.
Canayaz M (2019) Training ANFIS system with moth-flame optimization algorithm. Int J Intell Syst Appl Eng 7(3):133–144
- 14.
Jinthanasatian P, Auephanwiriyakul S, Theera-Umpon N (2017) "Microarray data classification using neuro-fuzzy classifier with firefly algorithm," 2017 IEEE Symposium Series on Computational Intelligence (SSCI), Honolulu, HI, pp. 1-6, doi https://doi.org/10.1109/SSCI.2017.8280967
- 15.
Thangavel K, Kaja Mohideen A (2016) "Mammogram classification using ANFIS with ant colony optimization based learning," Annual Convention of the Computer Society of India. Springer, Singapore
- 16.
Karaboga D, Kaya E (2020) Estimation of number of foreign visitors with ANFIS by using ABC algorithm. Soft Comput 24:7579–7591
- 17.
Gordon GJ, Jensen RV, Hsiao LL, Gullans SR, Blumenstock JE, Ramaswamy S, Richards WG, Sugarbaker DJ, Bueno R (2002) Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. Cancer Res 62:4963–4967
- 18.
Pomeroy SL, Tamayo P, Gaasenbeek M, Sturla LM, Angelo M, McLaughlin ME, Kim JYH, Goumnerova LC, Black PM, Lau C, Allen JC, Zagzag D, Olson JM, Curran T, Wetmore C, Biegel JA, Poggio T, Mukherjee S, Rifkin R, Califano A, Stolovitzky G, Louis DN, Mesirov JP, Lander ES, Golub TR (2002) Gene expression-based classification and outcome prediction of central nervous system embryonal tumors. Nature 415:436–442
- 19.
Nutt CL, Mani DR, Betensky RA, Tamayo P, Cairncross JG, Ladd C, Pohl U, Hartmann C, McLaughlin M, Batchelor TT, Black PM, von Deimling A, Pomeroy SL, Golub TR, Louis DN (2003) Gene expression-based classification of malignant gliomas correlates better with survival than histological classification. Cancer Res 63:1602–1607
- 20.
Risinger JI, Maxwell GL, Chandramouli GV, Jazaeri A, Aprelikova O, Patterson T, Berchuck A, Barrett JC (2003) Advances in brief microarray analysis reveals distinct gene expression profiles among different histologic types of endometrial cancer. Cancer Res 63:6–11
- 21.
Singh D, Febbo PG, Ross K, Jackson DG, Manola J, Ladd C, Tamayo P, Renshaw AA, D'Amico AV, Richie JP, Lander ES, Loda M, Kantoff PW, Golub TR, Sellers WR (2002) Gene expression correlates of clinical prostate cancer behavior. Cancer Cell 1:203–209. https://doi.org/10.1016/s1535-6108(02)00030-2
- 22.
Kumar V, Minz S (2014) Feature selection: a literature review. Smart Comput Rev 4(3):211–229. https://doi.org/10.6029/smartcr.2014.03.007
- 23.
Bontempi G, Meyer PE (2010) “Causal filter selection in microarray data,” In Proc. of the 27th international conference on machine learning, pp. 95-102
- 24.
Rui Y, Huang TS, Chang S (1999) Image retrieval: Current techniques, promising directions and open issues. Visual Commun Image Represent 10(4):39–62. https://doi.org/10.1006/jvci.1999.0413
- 25.
Sharma A, Imoto S, Miyano S (2012) A filter based feature selection algorithm using null space of covariance matrix for DNA microarray gene expression data. Curr Bioinforma 7(3). https://doi.org/10.2174/157489312802460802
- 26.
Queiros CE, Gelsema ES(1984) “On feature selection,” In Proc. of the Seventh International Conference on Pattern Recognition, pp. 128-130
- 27.
Yu L, Liu H (2003) “Feature selection for high-dimensional data: a fast correlation-based filter solution,” Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003), Washington DC
- 28.
Wanderley MF, Gardeux V, Natowicz R, Braga A (2013) “GA-KDE-Bayes: an evolutionary wrapper method based on non-parametric density estimation applied to bioinformatics problems”, ESANN 2013 proceedings, European Symposium on Artificial Neural Networks. Computational Intelligence and Machine Learning, Bruges (Belgium)
- 29.
Hall MA, Smith LA (1998) “Practical feature subset selection for machine learning,” In Proceedings of the 21st Australasian Computer Science Conference ACSC’98, pp. 181-191
- 30.
Jang JSR (1993) ANFIS: adaptive-network-based fuzzy inference system. IEEE Trans Syst Man Cybern 23:665–685. https://doi.org/10.1109/21.256541
- 31.
Franklin GF, Powell JD, Workman ML (1997) Digital control of dynamic systems. Addison-Wesley Longman, United States
- 32.
Haznedar B, Kalinli A (2016) Detection of the relationship between thrombophilia disease with genetic disorders by adaptive neuro-fuzzy inference system (ANFIS). Sakarya Univ J Sci 20:13–21. https://doi.org/10.16984/saufenbilder.40786
- 33.
Hímer Z, Kovács J, Benyó I, Kortela U (2004) “Neuro- fuzzy modelling and genetic algorithms optimization for flue gas oxygen control”, In 2nd IFAC Workshop on Advanced Fuzzy/Neural Control. University of Oulu. https://doi.org/10.1016/S1474-6670(17)30861-3
- 34.
Jang JSR, Sun CT (1995) Neuro-Fuzzy modeling and control. Proc IEEE 83(3):378–406. https://doi.org/10.1109/5.364486
- 35.
Haznedar B (2010) Determine the presence of genetic anomaly in male infertile patients by using artificial intelligence techniques. Thesis, Erciyes University, Kayseri, Turkey, M.S.c
- 36.
Haznedar B, Kalinli A (2018) Training ANFIS structure using simulated annealing algorithm for dynamic systems identification. NEUROCOMPUTING 302:66–74. https://doi.org/10.1016/j.neucom.2018.04.006
- 37.
Haznedar B, Kalinli A (2016) Training ANFIS Using genetic algorithm for dynamic systems identification. Int J Intell Syst Appl Eng 4:44–47. https://doi.org/10.18201/ijisae.266053
- 38.
Haznedar B, Arslan MT, Kalinli A (2018) Using adaptive neuro-fuzzy inference system for classification of microarray gene expression cancer profiles. Tamap J Eng 2018(29):1–13. https://doi.org/10.29371/2018.3.29
- 39.
Dunn JC (2008) A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J Cybern 3:32–57. https://doi.org/10.1080/01969727308546046
- 40.
Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Springer, US, New York. https://doi.org/10.1007/978-1-4757-0450-1
- 41.
Park H, et al. (2005) “Comparison of recognition rates between BP and ANFIS with FCM clustering method on off-line PD diagnosis of defect models of traction motor stator coil,” In: Proceedings of 2005 International Symposium on Electrical Insulating Materials, (ISEIM 2005), pp. 849–852. https://doi.org/10.1109/ISEIM.2005.193512
- 42.
Abdulshahed AM, Longstaff AP, Fletcher S, Myers A (2015) Thermal error modelling of machine tools based on ANFIS with fuzzy c-means clustering using a thermal imaging camera. Appl Math Model 39:1837–1852. https://doi.org/10.1016/j.apm.2014.10.016
- 43.
Kirkpatrick S, Gelatt CD, Vecchi MP (1983) Optimization by simulated annealing. Sci New Series 220:671–680. https://doi.org/10.1126/science.220.4598.671
- 44.
Dowsland KA, Thompson JM (2012) Simulated Annealing. Handbook of Natural Computing. Springer, Berlin, pp 1623–1655
- 45.
Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E (1953) Equation of state calculations by fast computing machines. J Chem Phys 21:1087–1092. https://doi.org/10.1063/1.1699114
- 46.
Johnson DS, Aragon CR, McGeoch LA, Schevon C (1989) Optimization by simulated annealing: An experimental evaluation; Part I, Graph Partitioning. Oper Res 37:865–892. https://doi.org/10.1287/opre.37.6.865
- 47.
Kalinli A (2012) Simulated annealing algorithm-based Elman network for dynamic system identification. Turk J Electr Eng Comput Sci 20:569–582. https://doi.org/10.3906/elk-1012-942
Author information
Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Haznedar, B., Arslan, M.T. & Kalinli, A. Optimizing ANFIS using simulated annealing algorithm for classification of microarray gene expression cancer data. Med Biol Eng Comput 59, 497–509 (2021). https://doi.org/10.1007/s11517-021-02331-z
Received:
Accepted:
Published:
Issue Date:
Keywords
- Fuzzy neural networks
- Simulated annealing
- Machine learning
- Optimization
- Gene expression