Prediction of Apoptosis Protein’s Subcellular Localization by Fusing Two Different Descriptors Based on Evolutionary Information
- 30 Downloads
The apoptosis protein has a central role in the development and the homeostasis of an organism. Obtaining information about the subcellular localization of apoptosis protein is very helpful to understand the apoptosis mechanism and the function of this protein. Prediction of apoptosis protein’s subcellular localization is a challenging task, and currently the existing feature extraction methods mainly rely on the protein’s primary sequence. In this paper we develop a feature extraction model based on two different descriptors of evolutionary information, which contains the 192 frequencies of triplet codons (FTC) in the RNA sequence derived from the protein’s primary sequence and the 190 features from a detrended forward moving-average cross-correlation analysis (DFMCA) based on a position-specific scoring matrix (PSSM) generated by the PSI-BLAST program. Hence, this model is called FTC-DFMCA-PSSM. A 382-dimensional (382D) feature vector is constructed on the ZD98, ZW225 and CL317 datasets. Then a support vector machine is adopted as classifier, and the jackknife cross-validation test method is used for evaluating the accuracy. The overall prediction accuracies are further improved by an objective and rigorous jackknife test. Our model not only broadens the source of the feature information, but also provides a more accurate and reliable automated calculation method for the prediction of apoptosis protein’s subcellular localization.
KeywordsSubcellular localization Triplet codon Position-specific scoring matrix Detrended moving-average cross-correlation Support vector machine
This work was supported by the National Natural Science Foundation of China (No. 11601407), the Fundamental Research Funds for the Central Universities (Nos. JB160711 and JBG160703) and Doctoral Scientific Research Foundation of Xi’an Polytechnic University (No. BS1710).
Compliance with Ethical Standards
Conflict of interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
- Carbone A (2009) Detrending moving average algorithm: a brief review, Science and Technology for Humanity (TIC-STH), pp 691–696Google Scholar
- Chang CC, Lin CJ (2001) LIBSVM: a library for support vector machinesGoogle Scholar
- Chen YL, Li QZ (2004) Prediction of the subcellular location apoptosis proteins using the algorithm of measure of diversity. Acta Sci Nat Univ NeiMongol 25:413–417Google Scholar
- Huang J, Shi F, Zhou HB (2005) Support vector machine for predicting apoptosis proteins types by incorporating protein instability index. China J Bioinform 3:121–123Google Scholar
- Wang X, Li H, Wang R, Zhang QW, Zhang WW, Gan Y (2017) MultiP-Apo: a multilabel predictor for identifying subcellular locations of apoptosis proteins. Comput Intell Neurosci 2017:9183796.1–9183796.10Google Scholar