Comparative Analogy on Classification and Clustering of Genomic Signal by a Novel Factor Analysis and F-Score Method

  • R. C. BarikEmail author
  • R. Mishra
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 394)


Data is the centroid in any dimension of scientific research. Most of the research arena is problem specific with respect to its data. In machine learning more and more data samples are used for training a machine for efficient cluster formation, prediction, recognition, classification. Variation between data sample to data sample and data sample from its centroid using Euclidean distance, Mahanalabolis distance for better cluster formation has better focused among computational researchers. In this paper, we proposed a novel loss less feature extraction and selection using feedforward cascaded Factor Analysis and F-score method. Comparative analysis of classification and clustering after efficient feature extraction is applied to life killer cancer disease. Tumor classification is based on multilayer perceptron and clustering is based on k-means clustering method.


Factor analysis F-score method k-means clustering Artificial neural network Multilayer perceptron 


  1. 1.
    Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, Coller H, Loh ML, Downing JR, Caligiuri MA, Bloomfield CD, Lander ES. Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science. 1999;286(5439):531–7.CrossRefGoogle Scholar
  2. 2.
    Barik RC, Naik B. A novel feature extraction and classification technique for machine learning using time series and statistical approach. In: Proceeding in Springer “Smart, Innovation and Technologies” Computational Intelligence in Data Mining, vol-3, 2014.Google Scholar
  3. 3.
    Huang M. Sch. of Biomed. Eng., Southern Med. Univ., Guangzhou, China; Yang W, Wu Y, Jiang J. Brain tumor segmentation based on local independent projection-based classification. IEEE Trans Biomed Eng 2014;61 10:2633–45.Google Scholar
  4. 4.
    Maji P, Paul S. Rough set based maximum relevance-maximum significance criterion and Gene selection from microarray data. Int J Approximate Reasoning Elsiver. 2011;52(3):408–26.CrossRefGoogle Scholar
  5. 5.
    Wu C-H, Kuo B-C, Tzeng G-H Factor analysis as the feature selection method in an Emotion Norm Database. In: Springer Proceeding Intelligent Information and Database Systems, vol. 8398; 2014 p. 332–341.Google Scholar
  6. 6.
    Sahu SS, Panda G, Barik RC. Cancer classification using microarray gene expression data: approached using wavelet transform and f-score method. In: International conference on electronic systems (ICES-2011), Orissa, India: NIT Rourkela, 07–09 Jan 2011.Google Scholar
  7. 7.
    Yang X, Jia L, Qingmao H, Zhijun C. F-score feature selection method may improve texture-based liver segmentation strategies. In: IEEE International Symposium on IT in Medicine and Education 2008.Google Scholar
  8. 8.
    Haykin S. Neural networks: a comprehensive foundation, 2nd edn. New York: Pearson Education Asia; 2002.Google Scholar
  9. 9.
    Hagan MT, Demuth HB, Beale M. Neural Network Design. Singapore: Thomson Asia Pte. Ltd; 2002.Google Scholar
  10. 10.
    Tatiraju S, Mehta A. Image segmentation using k-means clustering, EM and normalized cuts. Irvine: University of California.Google Scholar
  11. 11.
    Kanungo T, Mount DM, Netanyahu NS, Piatko CD, Silverman R, Wu AY. An efficient k-means clustering algorithm: analysis and implementation. IEEE Trans Pattern Anal Mach Intell 2002;24 7:881–92.Google Scholar

Copyright information

© Springer India 2016

Authors and Affiliations

  1. 1.Department of Computer Science and EngineeringVikash Institute of TechnologyBargarhIndia

Personalised recommendations