A Novel Gene Selection Method for Multi-catalog Cancer Data Classification

  • Xuejiao Lei
  • Yuehui Chen
  • Yaou Zhao
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7389)


In this paper, a novel gene selection method which was merging the relevance score (BW ratio) and the Flexible Neural Tree (FNT) together was proposed for the multi-class cancer data classification. Firstly, the BW ratio method was adopted to select some informative genes, and then the FNT method was used to extract more characteristic genes from the gene subsets. FNT is a tree-structured neural network with input variables selection, over-layer connections and different activation functions for different nodes. Based on the pre-defined instruction/operator sets, a flexible neural tree model can be created and evolved. The FNT structure is developed by using probabilistic incremental program evolution (PIPE) algorithm, and the free parameters embedded in neural trees are optimized by particle swarm optimization (PSO) algorithm. Experiment on two well-known cancer datasets shows that the proposed method achieved better results compared with other methods.


gene selection BW ratio Flexible Neural Tree Probabilistic Incremental Program Evolution Particle Swarm Optimization 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Saeys, Y., Abeel, T., Van de Peer, Y.: Robust Feature Selection Using Ensemble Feature Selection Techniques. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part II. LNCS (LNAI), vol. 5212, pp. 313–325. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  2. 2.
    Sandrine, D., Jane, F., Terence, P.S.: Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data. The American Statistical Association 97(457) (2002)Google Scholar
  3. 3.
    Hong, C., Carlotta, D.: An Evaluation of Gene Selection Methods for Multi-class Microarray Data Classification. In: Proceedings of the Second European Workshop on Data Mining and Text Mining in BioinformaticsGoogle Scholar
  4. 4.
    Yang, C.S., Chuang, L.Y., Li, J.C., Yang, C.H.: A Novel BPSO Approach for Gene Selection and Classification of Microarray Data. IEEE (2008)Google Scholar
  5. 5.
    Hrishikesh, M., Nitya, S., Krishna, M., Tapobrata, L.: An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data. Bioinformation 6(6), 240–243 (2011)CrossRefGoogle Scholar
  6. 6.
    Kohavi, R., John, G.: Wrappers for feature subset selection. Artif. Intell. 97(1-2), 273–324 (1997)zbMATHCrossRefGoogle Scholar
  7. 7.
    Liu, K.H., Xu, C.G.: A genetic programming-based approach to the classification of multiclass microarray datasets. Original Paper, Bioinformatics/btn644 25(3), 331–337 (2009)Google Scholar
  8. 8.
    Chen, Y., Peng, L., Abraham, A.: Gene Expression Profiling Using Flexible Neural Trees. In: Corchado, E., Yin, H., Botti, V., Fyfe, C. (eds.) IDEAL 2006. LNCS, vol. 4224, pp. 1121–1128. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  9. 9.
    Saeys, Y., Inza, I., Larranaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23(19), 2507–2517 (2007)CrossRefGoogle Scholar
  10. 10.
    Salustowicz, R., Schmidhuber, J.: Probabilistic incremental program evolution. Evolutionary Computation 5(2), 123–141 (1997)CrossRefGoogle Scholar
  11. 11.
    Yang, K., Cai, Z., Li, J., Lin, G.: A stable gene selection in microarray data analysis. BMC Bioinformatics 7(228) (2006)Google Scholar
  12. 12.
    Armstrong, S.A., Staunton, J.E., Silverman, L.B., Pieters, R., den Boer, M.L., Minden, M.D., Sallan, S.E., Lander, E.S., Golub, T.R., Korsmeyer, S.J.: MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia. Nature Genetics 30, 41–47 (2002)CrossRefGoogle Scholar
  13. 13.
    Nutt, C.L., Mani, D.R., Betensky, R.A., Tamayo, P., Cairncross, J.G., Ladd, C., Pohl, U., Hartmann, C., McLaughlin, M.E., Batchelor, T.T., Black, P.M., von Deimling, A., Pomeroy, S.L., Golub, T.R., Louis, D.N.: Gene Expression-based Classification of Malignant Gliomas Correlates Better with Survival than Histological Classification. Cancer Research 63, 1602–1607 (2003)Google Scholar
  14. 14.
    Zhang, B.-L.: Cancer Classification by Kernel Principal Component Self-regression. In: Sattar, A., Kang, B.-H. (eds.) AI 2006. LNCS (LNAI), vol. 4304, pp. 719–728. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  15. 15.
    Li, G.Z., Meng, H.H., Ni, J.: Embedded Gene Selection for Imbalanced Microarray Data Analysis. IEEE (2008)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Xuejiao Lei
    • 1
  • Yuehui Chen
    • 1
  • Yaou Zhao
    • 1
  1. 1.School of Information Science and EngineeringUniversity of JinanPR China

Personalised recommendations