Abstract
High dimensional data with limited number of samples is a challenging task in microarray data classification. Unsupervised gene selection methods handle such data of existing methods. Various methods are available to handle the data with class labels whereas some data are mislabeled and unreliable. We propose an unsupervised filter based method known as dynamic MBPSO (D-MBPSO) which integrates MBPSO into filter approach by defining new fitness function and it is independent of any learning model. The main aim of the filter approach is to quantify the relevance based on the intrinsic properties of the data. The proposed method is applied on benchmark microarray datasets and the results are compared with well known unsupervised gene selection methods using different classifiers. The proposed method has a remarkable ability to obtain reduced feature subset with good classification accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th ed. Elsevier Science, Amsterdam (2008)
Lazar, C., Taminau, J., Meganck, S., Steenhoff, D., Coletta, A., Molter, C., Schaetzen, Vd, Duque, R., Bersini, H., Nowe, A.: A survey on filter techniques for feature selection in gene expression microarray analysis. IEEE/ACM Trans. Comput. Biol. Bioinf. 9, 1106–1119 (2012)
Bolón-Canedo, V., Sánchez-Maroño, N., Alonso-Betanzos, A., BenÃtez, J.M., Herrera, F.: A review of microarray datasets and applied feature selection methods. Inf. Sci. 282 (2014). http://dx.doi.org/10.1016/j.ins.2014.05.042
Saeys, Y., Inza, I., Larrañaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23, 2507–2517 (2007)
Lai, C., Reinders, M., van’t Veer, L., Wessels, L.: A comparison of univariate and multivariate gene selection techniques for classification of cancer datasets. BMC Bioinf. 7, 235 (2006)
He, X., Cai, D., Niyogi, P.: Laplacian score for feature selection. Adv. Neural Inf. Process. Syst. 18 (2005)
Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)
Cai, R., Hao, Z., Yang, X., Wen, W.: An efficient gene selection algorithm based on mutual information. Neurocomputing 72, 991–999 (2009)
Raileanu, L.E., Stoffel, K.: Theoretical comparison between the gini index and information gain criteria. Ann. Math. Artif. Intell. 41, 77–93 (2004)
Ding, C., Peng, H.: Minimum redundancy feature selection from microarray gene expression data. J. Bioinf. Comput. Biol. 03, 185–205 (2005)
Yu, L., Liu, H.: Feature selection for high-dimensional data: a fast correlation-based filter solution. In: 20th International Conference on Machine Learning, pp. 856–863 (2003)
Lai, C., Reinders, M.J.T., Wessels, L.: Random subspace method for multivariate feature selection. Pattern Recogn. Lett. 27, 1067–1076 (2006)
Haindl, M., Somol, P., Ververidis, D., Kotropoulos, C.: Feature selection based on mutual correlation. In: Recognition, Pattern (ed.) Image Analysis and Applications, pp. 569–577. Springer, Berlin (2006)
Kanan, H.R., Faez, K.: An improved feature selection method based on ant colony optimization evaluated on face recognition system. Appl. Math. Comput. 205, 716–725 (2008)
Aghdam, M.H., Ghasem-Aghaee, N., Basiri, M.E.: Text feature selection using ant colony optimization. Expert Syst. Appl. 36, 6843–6853 (2009)
Marinakis, Y., Marinaki, M., Doumpos, M., Zopounidis, C.: Ant colony and particle swarm optimization for financial classification problems. Expert Syst. Appl. 36, 10604–10611 (2009)
Niijima, S., Okuno, Y.: Laplacian linear discriminant analysis approach to unsupervised feature selection. IEEE/ACM Trans. Comput. Biol. Bioinf. 6(4), 605–614 (2009)
Kennedy, J., Eberhart, R.: Particle swarm optimization. In: Proceedings of the IEEE International Conference on Neural Networks, vol. 4, pp. 1942–1948. IEEE (1995)
Brandstatter, B., Baumgartner, U.: Particle swarm optimizationmass-spring system analogon. IEEE Trans. Magn. 38(2), 997–1000 (2002)
Yuan, L., Zhao, Z.-D.: A modified binary particle swarm optimization algorithm for permutation flow shop problem. In: Proceedings of the International Conference on Machine Learning and Cybernetics, vol. 2, 2007, pp. 902–907
Marandi, A., Afshinmanesh, F., Shahabadi, M., Bahrami, F.: Boolean particle swarm optimization and its application to the design of a dual-band dual-polarized planar antenna. In: Proceedings of IEEE Congress on Evolutionary Computation, pp. 3212–3218 (2006)
Alba, E., Garcia-Nieto, J., Jourdan, L., Talbi, E.-G.: Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms. In: Proceedings of the IEEE Congress on Evolutionary Computation, pp. 284–290 (2007)
Wahde, M.: Biologically Inspired Optimization Methods, 1st edn. WIT Press, Southampton (2008)
Chuang, L.-Y., Chang, H.-W., Tu, C.-J., Yang, C.-H.: Improved binary PSO for feature selection using gene expression data. Computat. Biol. Chemist. 32, 29–38 (2008)
Reynolds, C.W.: Flocks, herds and schools: a distributed behavioral model. SIGGRAPH Comput. Graph. 21, 25–34 (1987)
Lee, S., Soak, S., Oh, S., Pedrycz, W., Jeon, M.: Modified binary particle swarm optimization. Prog. Nat. Sci. 18(9), 1161–1166 (2008)
Shen, Qi, Jiang, Jian-Hui, Jiao, Chen-Xu, Shen, Guo-li, Ru-Qin, Yu.: Modified particle swarm optimization algorithm for variable selection in MLR and PLS modeling: QSAR studies of antagonism of angiotensin II antagonists. Eur. J. Pharm. Sci. 22, 145–152 (2004)
Susana, M. Vieira, Mendonc¸ L.F., Farinha, G.J., Sousa, J.M.C.: Modified binary PSO for feature selection using SVM applied to mortality prediction of septic patients. Appl. Soft Comput. 13, 3494–3504 (2013)
Dataset Repository, Bioinformatics Research Group.: http://www.upo.es/eps/bigs/datasets.html (2014)
Statnikov, A., Aliferis, C.F. Tsamardinos, I.: Gems: gene expression model selector. http://www.gems-system.org/ (2005)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The WEKA data mining software. http://www.cs.waikato.ac.nz/ml/weka
Tabakhi, Sina, Moradi, Parham, Akhlaghian, Fardin: An unsupervised feature selection algorithm based on ant colony optimization. Eng. Appl. Artif. Intell. 32, 112–123 (2014)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Umamaheswari, K., Dhivya, M. (2016). D-MBPSO: An Unsupervised Feature Selection Algorithm Based on PSO. In: Snášel, V., Abraham, A., Krömer, P., Pant, M., Muda, A. (eds) Innovations in Bio-Inspired Computing and Applications. Advances in Intelligent Systems and Computing, vol 424. Springer, Cham. https://doi.org/10.1007/978-3-319-28031-8_31
Download citation
DOI: https://doi.org/10.1007/978-3-319-28031-8_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28030-1
Online ISBN: 978-3-319-28031-8
eBook Packages: EngineeringEngineering (R0)