iPhosY-PseAAC: identify phosphotyrosine sites by incorporating sequence statistical moments into PseAAC
Protein phosphorylation is one of the most fundamental types of post-translational modification, and it plays a vital role in various cellular processes of eukaryotes. Of the three types of phosphorylation (serine, threonine and tyrosine), tyrosine phosphorylation is one of the most frequent, and it is important for mediating signal transduction in eukaryotic cells. Site-directed mutagenesis and mass spectrometry help in the experimental determination of cellular signalling networks; however, these techniques are costly, time-consuming and labour-intensive. Efficient and accurate prediction of these sites through computational approaches can therefore reduce both cost and time. Here, we present a more accurate and efficient sequence-based computational method for the prediction of phosphotyrosine (PhosY) sites that incorporates statistical moments into PseAAC. The study follows Chou’s 5-step rule, and various position- and composition-relative features are used to train a neural network for prediction. The proposed method is validated through jackknife testing, which yields an overall accuracy of 93.9%. These results suggest that the proposed model can play a fundamental role in predicting PhosY sites accurately and efficiently.
Keywords: Phosphotyrosine, Prediction, PseAAC, Statistical moments, Neural network
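To make the idea of "statistical moments" of a sequence concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a simple alphabetical integer encoding of the 20 standard amino acids and a one-dimensional, position-weighted definition of raw and central moments; the names `encode`, `raw_moment`, `central_moment` and `moment_features` are hypothetical, and the feature set actually used in the paper may differ.

```python
# Assumption: residues are encoded 1..20 in alphabetical one-letter order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
CODE = {aa: i + 1 for i, aa in enumerate(AMINO_ACIDS)}


def encode(peptide):
    """Map a peptide string to a list of integer residue codes."""
    return [CODE[aa] for aa in peptide.upper()]


def raw_moment(values, order):
    """Raw moment: sum over 1-based positions i of i**order * value_i."""
    return sum((i ** order) * v for i, v in enumerate(values, start=1))


def central_moment(values, order):
    """Central moment: like the raw moment, but positions are measured
    from the value-weighted centroid, so the result does not depend on
    where the window starts."""
    total = sum(values)
    centroid = raw_moment(values, 1) / total
    return sum(((i - centroid) ** order) * v
               for i, v in enumerate(values, start=1))


def moment_features(peptide, max_order=3):
    """Concatenate raw moments (orders 0..max_order) and central moments
    (orders 2..max_order) into one fixed-length feature vector."""
    values = encode(peptide)
    raw = [raw_moment(values, k) for k in range(max_order + 1)]
    central = [central_moment(values, k) for k in range(2, max_order + 1)]
    return raw + central


if __name__ == "__main__":
    # Hypothetical window centred on a tyrosine (Y) residue.
    print(moment_features("AGSYKLT"))
```

Because the vector length depends only on `max_order`, windows of any length map to inputs of the same size, which is convenient when the features are fed to a neural network.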
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.