Improved Prediction of Protein Secondary Structures Using Adaptively Weighted Profiles

Pok, Gouchol; Ryu, Keun Ho; Chung, Yong J.

doi:10.1007/978-3-540-72524-4_12

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4505))

Abstract

Predicting protein secondary structure from the amino acid sequence is a useful intermediate step toward elucidating the native three-dimensional conformation of a protein. Currently, most predictors are based on machine learning approaches in which a short, fixed-size input window scans over the amino acid sequence. The center of the window is the prediction site, and the prediction is made from the properties of the neighboring amino acid residues. By nature, most machine learning approaches treat the components of a feature vector as position-independent. As a result, most existing approaches to secondary structure prediction do not take into account the distance of each residue from the center residue. We have studied how prediction performance is affected by weighting the features according to the distance of each residue from the center residue, and in this work we propose an adaptive weighting scheme to improve prediction accuracy.
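To make the idea of distance-dependent weighting concrete, the following is a minimal sketch, not the authors' scheme (the abstract does not specify it). It assumes a per-residue profile given as rows of scores, a symmetric window, and a Gaussian decay with a tunable `sigma`; the function names `distance_weights` and `weighted_window` and the toy profile are hypothetical.

```python
import math


def distance_weights(half_width, sigma):
    """Gaussian weights for offsets -half_width..half_width (assumed form).

    The center residue (offset 0) gets weight 1.0; weights shrink with distance.
    """
    return [math.exp(-(d * d) / (2.0 * sigma * sigma))
            for d in range(-half_width, half_width + 1)]


def weighted_window(profile, center, half_width, sigma):
    """Build the feature vector for the window centered at `center`.

    `profile` is a list of per-residue score rows (for example, 20 values per
    residue). Each row in the window is scaled by its distance weight, and
    positions that fall outside the sequence are zero-padded.
    """
    n_cols = len(profile[0])
    weights = distance_weights(half_width, sigma)
    features = []
    for w, offset in zip(weights, range(-half_width, half_width + 1)):
        pos = center + offset
        row = profile[pos] if 0 <= pos < len(profile) else [0.0] * n_cols
        features.extend(w * v for v in row)
    return features


# Toy profile: 5 residues with 2 score columns each (illustrative values only).
profile = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5], [0.0, 2.0]]
features = weighted_window(profile, center=2, half_width=1, sigma=1.0)
print(features)
```

In this sketch the weighted vector would be passed to a classifier in place of the unweighted window. One simple way to make the weighting adaptive, again an assumption rather than the paper's method, is to choose `sigma` (or a separate weight per offset) by accuracy on held-out data.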

Author information

Authors and Affiliations

Yanbian University of Science and Technology, Department of Computer Science, Yanji, Jilin Province, China
Gouchol Pok
Chungbuk National University, Department of Computer Science, Cheongju, Chungbuk, Korea
Keun Ho Ryu
Chungbuk National University, Department of Biochemistry, Cheongju, Chungbuk, Korea
Yong J. Chung

Editor information

Guozhu Dong Xuemin Lin Wei Wang Yun Yang Jeffrey Xu Yu

About this paper

Cite this paper

Pok, G., Ryu, K.H., Chung, Y.J. (2007). Improved Prediction of Protein Secondary Structures Using Adaptively Weighted Profiles. In: Dong, G., Lin, X., Wang, W., Yang, Y., Yu, J.X. (eds) Advances in Data and Web Management. APWeb/WAIM 2007. Lecture Notes in Computer Science, vol 4505. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72524-4_12

DOI: https://doi.org/10.1007/978-3-540-72524-4_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72483-4
Online ISBN: 978-3-540-72524-4
eBook Packages: Computer Science, Computer Science (R0)
