Skip to main content
Log in

Using pseudo amino acid composition to predict protein subcellular location: approached with amino acid composition distribution

  • Published:
Amino Acids Aims and scope Submit manuscript

Summary.

In the Post Genome Age, there is an urgent need to develop the reliable and effective computational methods to predict the subcellular localization for the explosion of newly found proteins. Here, a novel method of pseudo amino acid (PseAA) composition, the so-called “amino acid composition distribution” (AACD), is introduced. First, a protein sequence is divided equally into multiple segments. Then, amino acid composition of each segment is calculated in series. After that, each protein sequence can be represented by a feature vector. Finally, the feature vectors of all sequences thus obtained are further input into the multi-class support vector machines to predict the subcellular localization. The results show that AACD is quite effective in representing protein sequences for the purpose of predicting protein subcellular localization.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Abbreviations

AAC:

amino acid composition

AACD:

amino acid composition distribution

5CV:

5-fold cross validation

DAG:

directed acyclic graph

DPC:

dipeptide composition

KNN:

k-nearest neighbor

OVO:

one-versus-one

OVR:

one-versus-rest

PPC:

polypeptide composition

PseAA:

pseudo amino acid composition

RBF:

radial basis function

SVM:

support vector machines

References

  • M Bhasin GPS Raghava (2004) ArticleTitleESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST Nucleic Acids Res 32 W414–W419 Occurrence Handle15215421 Occurrence Handle10.1093/nar/gkh350 Occurrence Handle1:CAS:528:DC%2BD2cXlvFKnurk%3D

    Article  PubMed  CAS  Google Scholar 

  • Cao Y, Liu S, Zhang L, Qin J, Wang J, Tang K (2006) Prediction of protein structural class with Rough Sets. BMC Bioinformatics 7: doi:10.1186/1471-2105-7-20

  • C Chen YX Tian XY Zou PX Cai JY Mo (2006a) ArticleTitleUsing pseudo-amino acid composition and support vector machine to predict protein structural class J Theor Biol 243 444–448 Occurrence Handle10.1016/j.jtbi.2006.06.025 Occurrence Handle1:CAS:528:DC%2BD28XhtFKlsL3N

    Article  CAS  Google Scholar 

  • C Chen X Zhou Y Tian X Zou P Cai (2006b) ArticleTitlePredicting protein structural class with pseudo-amino acid composition and support vector machine fusion network Anal Biochem 357 116–121 Occurrence Handle10.1016/j.ab.2006.07.022 Occurrence Handle1:CAS:528:DC%2BD28XpsVOgs78%3D

    Article  CAS  Google Scholar 

  • J Chen H Liu J Yang KC Chou (2007) ArticleTitlePrediction of linear B-cell epitopes using amino acid pair antigenicity scale Amino Acids 33 423–428 Occurrence Handle17252308 Occurrence Handle10.1007/s00726-006-0485-9 Occurrence Handle1:CAS:528:DC%2BD2sXpvVagsrc%3D

    Article  PubMed  CAS  Google Scholar 

  • Chen YL, Li QZ (2007) Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo amino acid composition. J Theor Biol doi: 10.1016/j.jtbi.2007.05.019

  • KC Chou (2001) ArticleTitlePrediction of protein cellular attributes using pseudo-amino acid composition Proteins Struct Funct Genet 43 246–255 Occurrence Handle11288174 Occurrence Handle10.1002/prot.1035 Occurrence Handle1:CAS:528:DC%2BD3MXjtFOls74%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou (2005) ArticleTitleUsing amphiphilic pseudo amino acid composition to predict enzyme subfamily classes Bioinformatics 21 10–19 Occurrence Handle15308540 Occurrence Handle10.1093/bioinformatics/bth466 Occurrence Handle1:CAS:528:DC%2BD2MXisVWitw%3D%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou YD Cai (2002) ArticleTitleUsing functional domain composition and support vector machines for prediction of protein subcellular location J Biol Chem 277 45765–45769 Occurrence Handle12186861 Occurrence Handle10.1074/jbc.M204161200 Occurrence Handle1:CAS:528:DC%2BD38XovFKjurg%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou YD Cai (2005) ArticleTitlePrediction of membrane protein types by incorporating amphipathic effects J Chem Inf Model 45 407–413 Occurrence Handle15807506 Occurrence Handle10.1021/ci049686v Occurrence Handle1:CAS:528:DC%2BD2MXht1aqtLs%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou D Elrod (1999) ArticleTitleProtein subcellular localization prediction Protein Eng 12 107–118 Occurrence Handle10195282 Occurrence Handle10.1093/protein/12.2.107 Occurrence Handle1:CAS:528:DyaK1MXhvFehs7g%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou HB Shen (2006a) ArticleTitleHum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization Biochem Biophys Res Commun 347 150–157 Occurrence Handle10.1016/j.bbrc.2006.06.059 Occurrence Handle1:CAS:528:DC%2BD28Xmslyrsbc%3D

    Article  CAS  Google Scholar 

  • KC Chou HB Shen (2006b) ArticleTitleLarge-scale predictions of Gram-negative bacterial protein subcellular locations J Proteome Res 5 3420–3428 Occurrence Handle10.1021/pr060404b Occurrence Handle1:CAS:528:DC%2BD28XhtFehurjJ

    Article  CAS  Google Scholar 

  • KC Chou HB Shen (2006c) ArticleTitlePredicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-nearest neighbor classifiers J Proteome Res 5 1888–1897 Occurrence Handle10.1021/pr060167c Occurrence Handle1:CAS:528:DC%2BD28XmvVeitr0%3D

    Article  CAS  Google Scholar 

  • KC Chou HB Shen (2006d) ArticleTitlePredicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-nearest neighbor classifiers J Proteome Res 5 1888–1897 Occurrence Handle10.1021/pr060167c Occurrence Handle1:CAS:528:DC%2BD28XmvVeitr0%3D

    Article  CAS  Google Scholar 

  • KC Chou HB Shen (2007a) ArticleTitleEuk-mPLoc: a fusion classifier for large-scale eukaryotic protein subcellular location prediction by incorporating multiple sites J Proteome Res 6 1728–1734 Occurrence Handle1:CAS:528:DC%2BD2sXjs1SrsbY%3D Occurrence Handle10.1021/pr060635i

    Article  CAS  Google Scholar 

  • KC Chou HB Shen (2007b) ArticleTitleLarge-scale plant protein subcellular location prediction J Cell Biochem 100 665–678 Occurrence Handle10.1002/jcb.21096 Occurrence Handle1:CAS:528:DC%2BD2sXht1Slu7c%3D

    Article  CAS  Google Scholar 

  • KC Chou HB Shen (2007c) ArticleTitleLarge-scale plant protein subcellular location prediction J Cell Biochem 100 665–678 Occurrence Handle10.1002/jcb.21096 Occurrence Handle1:CAS:528:DC%2BD2sXht1Slu7c%3D

    Article  CAS  Google Scholar 

  • KC Chou HB Shen (2007d) ArticleTitleMemType-2L: a Web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM Biochem Biophys Res Commun 360 339–345 Occurrence Handle10.1016/j.bbrc.2007.06.027 Occurrence Handle1:CAS:528:DC%2BD2sXnslSqtLw%3D

    Article  CAS  Google Scholar 

  • KC Chou HB Shen (2007e) ArticleTitleReview: recent progresses in protein subcellular location prediction Anal Biochem 370 1–16 Occurrence Handle10.1016/j.ab.2007.07.006 Occurrence Handle1:CAS:528:DC%2BD2sXhtVOmur%2FF

    Article  CAS  Google Scholar 

  • KC Chou HB Shen (2007f) ArticleTitleSignal-CF: a subsite-coupled and window-fusing approach for predicting signal peptides Biochem Biophys Res Commun 357 633–640 Occurrence Handle10.1016/j.bbrc.2007.03.162 Occurrence Handle1:CAS:528:DC%2BD2sXkslCju78%3D

    Article  CAS  Google Scholar 

  • KC Chou CT Zhang (1995) ArticleTitlePrediction of protein structural classes Crit Rev Biochem Mol Biol 30 275–349 Occurrence Handle7587280 Occurrence Handle10.3109/10409239509083488 Occurrence Handle1:CAS:528:DyaK2MXosFentb8%3D

    Article  PubMed  CAS  Google Scholar 

  • K Crammer Y Singer (2001) ArticleTitleOn the algorithmic implementation of multiclass kernel-based vector machines J Machine Learning Res 2 265–292 Occurrence Handle10.1162/15324430260185628

    Article  Google Scholar 

  • Q Cui T Jiang B Liu S Ma (2004) ArticleTitleEsub8: A novel tool to predict protein subcellular localizations in eukaryotic organisms BMC Bioinformatics 5 66–72 Occurrence Handle15163352 Occurrence Handle10.1186/1471-2105-5-66

    Article  PubMed  Google Scholar 

  • YS Ding TL Zhang KC Chou (2007) ArticleTitlePrediction of protein structure classes with pseudo amino acid composition and fuzzy support vector machine network Protein Peptide Lett 14 811–815 Occurrence Handle10.2174/092986607781483778 Occurrence Handle1:CAS:528:DC%2BD2sXhtlWiur7J

    Article  CAS  Google Scholar 

  • P Du Y Li (2006) ArticleTitlePrediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence BMC Bioinformatics 7 518 Occurrence Handle17134515 Occurrence Handle10.1186/1471-2105-7-518 Occurrence Handle1:CAS:528:DC%2BD28XhtlWlsb7E

    Article  PubMed  CAS  Google Scholar 

  • QS Du ZQ Jiang WZ He DP Li KC Chou (2006) ArticleTitleAmino acid principal component analysis (AAPCA) and its applications in protein structural class prediction J Biomol Struct Dyn 23 635–640 Occurrence Handle16615809 Occurrence Handle1:CAS:528:DC%2BD28XkvVCntLw%3D

    PubMed  CAS  Google Scholar 

  • QS Du DQ Wei KC Chou (2003) ArticleTitleCorrelation of amino acids in proteins Peptides 24 1863–1869 Occurrence Handle15127938 Occurrence Handle10.1016/j.peptides.2003.10.012 Occurrence Handle1:CAS:528:DC%2BD2cXhtV2ktLw%3D

    Article  PubMed  CAS  Google Scholar 

  • QB Gao ZZ Wang (2006) ArticleTitleClassification of G-protein coupled receptors at four levels Protein Eng Des Sel 19 511–516 Occurrence Handle17032692 Occurrence Handle10.1093/protein/gzl038 Occurrence Handle1:CAS:528:DC%2BD28XhtFeisbzM

    Article  PubMed  CAS  Google Scholar 

  • QB Gao ZZ Wang C Yan YH Du (2005a) ArticleTitlePrediction of protein subcellular location using a combined feature of sequence FEBS Lett 579 3444–3448 Occurrence Handle10.1016/j.febslet.2005.05.021 Occurrence Handle1:CAS:528:DC%2BD2MXlt1KjsL0%3D

    Article  CAS  Google Scholar 

  • Y Gao SH Shao X Xiao YS Ding YS Huang ZD Huang KC Chou (2005b) ArticleTitleUsing pseudo amino acid composition to predict protein subcellular location: approached with Lyapunov index, Bessel function, and Chebyshev filter Amino Acids 28 373–376 Occurrence Handle10.1007/s00726-005-0206-9 Occurrence Handle1:CAS:528:DC%2BD2MXlt1Kmurw%3D

    Article  CAS  Google Scholar 

  • C Guda S Subramaniam (2005) ArticleTitlepTARGET: a new method for predicting protein subcellular localization in eukaryotes Bioinformatics 21 3963–3969 Occurrence Handle16144808 Occurrence Handle10.1093/bioinformatics/bti650 Occurrence Handle1:CAS:528:DC%2BD2MXhtFCjt7bM

    Article  PubMed  CAS  Google Scholar 

  • J Guo Y Lin X Liu (2006a) ArticleTitleGNBSL: a new integrative system to predict the subcellular location for Gram-negative bacteria proteins Proteomics 6 5099–5105 Occurrence Handle10.1002/pmic.200600064 Occurrence Handle1:CAS:528:DC%2BD28XhtFarsbzO

    Article  CAS  Google Scholar 

  • YZ Guo M Li M Lu Z Wen K Wang G Li J Wu (2006b) ArticleTitleClassifying G protein-coupled receptors and nuclear receptors based on protein power spectrum from fast Fourier transform Amino Acids 30 397–402 Occurrence Handle10.1007/s00726-006-0332-z Occurrence Handle1:CAS:528:DC%2BD28Xls1egs7o%3D

    Article  CAS  Google Scholar 

  • A Höglund P Dönnes T Blum H-W Adolph O Kohlbacher (2006) ArticleTitleMultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition Bioinformatics 22 1158–1165 Occurrence Handle16428265 Occurrence Handle10.1093/bioinformatics/btl002 Occurrence Handle1:CAS:528:DC%2BD28Xktlaku78%3D

    Article  PubMed  CAS  Google Scholar 

  • C Hsu CJ Lin (2002) ArticleTitleA comparison of methods for multi-class support vector machines IEEE T Neural Networ 13 415–425 Occurrence Handle10.1109/72.991427

    Article  Google Scholar 

  • SJ Hua ZR Sun (2001) ArticleTitleSupport vector machine approach for protein subcellular localization prediction Bioinformatics 17 721–728 Occurrence Handle11524373 Occurrence Handle10.1093/bioinformatics/17.8.721 Occurrence Handle1:CAS:528:DC%2BD3MXntFKjsb0%3D

    Article  PubMed  CAS  Google Scholar 

  • Y Huang YD Li (2004) ArticleTitlePrediction of protein subcellular locations using fuzzy k-NN method Bioinformatics 20 21–28 Occurrence Handle14693804 Occurrence Handle10.1093/bioinformatics/btg366 Occurrence Handle1:CAS:528:DC%2BD2cXns1Ol

    Article  PubMed  CAS  Google Scholar 

  • S Jahandideh P Abdolmaleki M Jahandideh EB Asadabadi (2007) ArticleTitleNovel two-stage hybrid neural discriminant model for predicting proteins structural classes Biophys Chem 128 87–93 Occurrence Handle17467878 Occurrence Handle10.1016/j.bpc.2007.03.006 Occurrence Handle1:CAS:528:DC%2BD2sXkvFGku7g%3D

    Article  PubMed  CAS  Google Scholar 

  • AK Jain RPW Duin J Mao (2000) ArticleTitleStatistical pattern recognition: a review IEEE T Pattern Anal 22 4–37 Occurrence Handle10.1109/34.824819

    Article  Google Scholar 

  • KD Kedarisetti LA Kurgan S Dick (2006) ArticleTitleClassifier ensembles for protein structural class prediction with varying homology Biochem Biophys Res Commun 348 981–988 Occurrence Handle16904630 Occurrence Handle10.1016/j.bbrc.2006.07.141 Occurrence Handle1:CAS:528:DC%2BD28XosVOitL4%3D

    Article  PubMed  CAS  Google Scholar 

  • UH Kreßel (1999) Pairwise classification and support vector machines B Schölkopf CJ Burges AJ Smola (Eds) Advances in Kernel methods: support vector learning MIT Press Cambridge, MA

    Google Scholar 

  • H Lin QZ Li (2007a) ArticleTitlePredicting conotoxin superfamily and family by using pseudo amino acid composition and modified Mahalanobis discriminant Biochem Biophys Res Commun 354 548–551 Occurrence Handle10.1016/j.bbrc.2007.01.011 Occurrence Handle1:CAS:528:DC%2BD2sXhtlOgtLo%3D

    Article  CAS  Google Scholar 

  • H Lin QZ Li (2007b) ArticleTitleUsing pseudo amino acid composition to predict protein structural class: approached by incorporating 400 dipeptide components J Comput Chem 28 1463–1466 Occurrence Handle10.1002/jcc.20554 Occurrence Handle1:CAS:528:DC%2BD2sXlslSgs7w%3D

    Article  CAS  Google Scholar 

  • DQ Liu H Liu HB Shen J Yang KC Chou (2007) ArticleTitlePredicting secretory protein signal sequence cleavage sites by fusing the marks of global alignments Amino Acids 32 493–496 Occurrence Handle17103116 Occurrence Handle10.1007/s00726-006-0466-z Occurrence Handle1:CAS:528:DC%2BD2sXlsVGnsL8%3D

    Article  PubMed  CAS  Google Scholar 

  • H Liu M Wang KC Chou (2005a) ArticleTitleLow-frequency Fourier spectrum for predicting membrane protein types Biochem Biophys Res Commun 336 737–739 Occurrence Handle10.1016/j.bbrc.2005.08.160 Occurrence Handle1:CAS:528:DC%2BD2MXhtVegtLfP

    Article  CAS  Google Scholar 

  • H Liu J Yang M Wang L Xue KC Chou (2005b) ArticleTitleUsing Fourier spectrum analysis and pseudo amino acid composition for prediction of membrane protein types Protein J 24 385–389 Occurrence Handle10.1007/s10930-005-7592-4 Occurrence Handle1:CAS:528:DC%2BD2MXht1OqsLjL

    Article  CAS  Google Scholar 

  • EM Marcotte I Xenarios A van Der Bliek D Eisenberg (2000) ArticleTitleLocalizing proteins in the cell from their phylogenetic profiles Proc Natl Acad Sci USA 97 12115–12120 Occurrence Handle11035803 Occurrence Handle10.1073/pnas.220399497 Occurrence Handle1:CAS:528:DC%2BD3cXnvVSgtLg%3D

    Article  PubMed  CAS  Google Scholar 

  • S Mondal R Bhavna R Mohan Babu S Ramakumar (2006) ArticleTitlePseudo amino acid composition and multi-class support vector machines approach for conotoxin superfamily classification J Theor Biol 243 252–260 Occurrence Handle16890961 Occurrence Handle10.1016/j.jtbi.2006.06.014 Occurrence Handle1:CAS:528:DC%2BD28XhtVygtbzM

    Article  PubMed  CAS  Google Scholar 

  • P Mundra M Kumar KK Kumar VK Jayaraman BD Kulkarni (2007) ArticleTitleUsing pseudo amino acid composition to predict protein subnuclear localization: approached with PSSM Pattern Recogn Lett 28 1610–1615 Occurrence Handle10.1016/j.patrec.2007.04.001

    Article  Google Scholar 

  • R Mott J Schultz P Bork CP Ponting (2002) ArticleTitlePredicting protein cellular localization using a domain projection method Genome Res 12 1168–1174 Occurrence Handle12176924 Occurrence Handle10.1101/gr.96802 Occurrence Handle1:CAS:528:DC%2BD38Xmtl2rtb0%3D

    Article  PubMed  CAS  Google Scholar 

  • R Nair B Rost (2002) ArticleTitleInferring sub-cellular localization through automated lexical analysis Bioinformatics 18 S78–S86 Occurrence Handle12169534

    PubMed  Google Scholar 

  • K Nakai P Horton (1999) ArticleTitlePSORT: a program for detecting the sorting signals of proteins and predicting their subcellular localization Trends Biochem Sci 24 34–36 Occurrence Handle10087920 Occurrence Handle10.1016/S0968-0004(98)01336-X Occurrence Handle1:CAS:528:DyaK1MXks12qtLk%3D

    Article  PubMed  CAS  Google Scholar 

  • H Nakashima K Nishikawa (1994) ArticleTitleDiscrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies J Mol Biol 238 54–61 Occurrence Handle8145256 Occurrence Handle10.1006/jmbi.1994.1267 Occurrence Handle1:CAS:528:DyaK2cXivFemtrw%3D

    Article  PubMed  CAS  Google Scholar 

  • B Niu YD Cai WC Lu GY Zheng KC Chou (2006) ArticleTitlePredicting protein structural class with AdaBoost learner Protein Peptide Lett 13 489–492 Occurrence Handle10.2174/092986606776819619 Occurrence Handle1:CAS:528:DC%2BD28XlsVGqs7o%3D

    Article  CAS  Google Scholar 

  • YX Pan ZZ Zhang ZM Guo GY Feng Z Huang L He (2003) ArticleTitleApplication of pseudo amino acid composition for predicting protein subcellular location: Stochastic signal processing approach J Protein Chem 22 395–402 Occurrence Handle13678304 Occurrence Handle10.1023/A:1025350409648 Occurrence Handle1:CAS:528:DC%2BD3sXmsFejs7s%3D

    Article  PubMed  CAS  Google Scholar 

  • KJ Park M Kanehisa (2003) ArticleTitlePrediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs Bioinformatics 19 1656–1663 Occurrence Handle12967962 Occurrence Handle10.1093/bioinformatics/btg222 Occurrence Handle1:CAS:528:DC%2BD3sXnt1Gqu78%3D

    Article  PubMed  CAS  Google Scholar 

  • Platt J, Cristianini N, Shawe-Taylor J (2000) Large margin DAGs for multiclass classification. In: Solla SA, Leen TK, Müller KR (eds) Adv Neural Inform Proc Syst 12: 547–555

  • R Rifin A Klautau (2004) ArticleTitleIn defense of one-vs-all classification J Machine Learn Res 5 101–141

    Google Scholar 

  • HB Shen KC Chou (2005a) ArticleTitlePredicting protein subnuclear location with optimized evidence-theoretic K-nearest classifier and pseudo amino acid composition Biochem Biophys Res Commun 337 752–756 Occurrence Handle1:CAS:528:DC%2BD2MXhtFCjs7%2FI Occurrence Handle10.1016/j.bbrc.2005.09.117

    Article  CAS  Google Scholar 

  • HB Shen KC Chou (2005b) ArticleTitleUsing optimized evidence-theoretic K-nearest neighbor classifier and pseudo amino acid composition to predict membrane protein types Biochem Biophys Res Comm 334 288–292 Occurrence Handle10.1016/j.bbrc.2005.06.087 Occurrence Handle1:CAS:528:DC%2BD2MXmt1aqsLw%3D

    Article  CAS  Google Scholar 

  • HB Shen KC Chou (2006) ArticleTitleEnsemble classifier for protein fold pattern recognition Bioinformatics 22 1717–1722 Occurrence Handle16672258 Occurrence Handle10.1093/bioinformatics/btl170 Occurrence Handle1:CAS:528:DC%2BD28Xotl2rsLY%3D

    Article  PubMed  CAS  Google Scholar 

  • HB Shen KC Chou (2007a) ArticleTitleGpos-PLoc: an ensemble classifier for predicting subcellular localization of Gram-positive bacterial proteins Protein Eng Design and Selection 20 39–46 Occurrence Handle10.1093/protein/gzl053 Occurrence Handle1:CAS:528:DC%2BD2sXhvFWmtr8%3D

    Article  CAS  Google Scholar 

  • HB Shen KC Chou (2007b) ArticleTitleHum-mPLoc: an ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites Biochem Biophys Res Commun 355 1006–1011 Occurrence Handle10.1016/j.bbrc.2007.02.071 Occurrence Handle1:CAS:528:DC%2BD2sXivVahur0%3D

    Article  CAS  Google Scholar 

  • Shen HB, Chou KC (2007c) PseAC: a flexible web-server for generating various kinds of protein pseudo amino acid composition. Anal Biochem doi: 10.10.1016/j.ab.2007.10.012

  • HB Shen KC Chou (2007d) ArticleTitleUsing ensemble classifier to identify membrane protein types Amino Acids 32 483–488 Occurrence Handle10.1007/s00726-006-0439-2 Occurrence Handle1:CAS:528:DC%2BD2sXlsVGnsLY%3D

    Article  CAS  Google Scholar 

  • HB Shen KC Chou (2007e) ArticleTitleVirus-PLoc: a fusion classifier for predicting the subcellular localization of viral proteins within host and virus-infected cells Biopolymers 85 233–240 Occurrence Handle10.1002/bip.20640 Occurrence Handle1:CAS:528:DC%2BD2sXhvFWhs70%3D

    Article  CAS  Google Scholar 

  • HB Shen J Yang KC Chou (2006) ArticleTitleFuzzy KNN for predicting membrane protein types from pseudo amino acid composition J Theor Biol 240 9–13 Occurrence Handle16197963 Occurrence Handle10.1016/j.jtbi.2005.08.016 Occurrence Handle1:CAS:528:DC%2BD28Xjs1Knt70%3D

    Article  PubMed  CAS  Google Scholar 

  • HB Shen J Yang KC Chou (2007) ArticleTitleEuk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction Amino Acids 33 57–67 Occurrence Handle17235453 Occurrence Handle10.1007/s00726-006-0478-8 Occurrence Handle1:CAS:528:DC%2BD2sXotVWru7Y%3D

    Article  PubMed  CAS  Google Scholar 

  • JY Shi SW Zhang Y Liang Q Pan (2006) Prediction of protein subcellular localizations using moment descriptors and support vector machine JC Ragapakse L Wong R Acharya (Eds) PRIB, Hong Kong, China Springer Berlin, Heidelberg

    Google Scholar 

  • JY Shi SW Zhang Q Pan YM Cheng J Xie (2007) ArticleTitleSVM-based method for subcellular localization of protein using multi-scale energy and pseudo amino acid composition Amino Acids 33 69–74 Occurrence Handle17235454 Occurrence Handle10.1007/s00726-006-0475-y Occurrence Handle1:CAS:528:DC%2BD2sXotVWru7g%3D

    Article  PubMed  CAS  Google Scholar 

  • XD Sun RB Huang (2006) ArticleTitlePrediction of protein structural classes using support vector machines Amino Acids 30 469–475 Occurrence Handle16622605 Occurrence Handle10.1007/s00726-005-0239-0 Occurrence Handle1:CAS:528:DC%2BD28Xls1ehu7c%3D

    Article  PubMed  CAS  Google Scholar 

  • V Vapnik (1998) Statistical learning theory Wiley New York

    Google Scholar 

  • M Wang J Yang KC Chou (2005) ArticleTitleUsing string kernel to predict signal peptide cleavage site based on subsite coupling model Amino Acids 28 395–402 Occurrence Handle15838592 Occurrence Handle10.1007/s00726-005-0189-6 Occurrence Handle1:CAS:528:DC%2BD2MXlt1KmtbY%3D

    Article  PubMed  CAS  Google Scholar 

  • M Wang J Yang GP Liu ZJ Xu KC Chou (2004) ArticleTitleWeighted-support vector machines for predicting membrane protein types based on pseudo amino acid composition Protein Eng Des Sel 17 509–516 Occurrence Handle15314209 Occurrence Handle10.1093/protein/gzh061 Occurrence Handle1:CAS:528:DC%2BD2cXos1GisLY%3D

    Article  PubMed  CAS  Google Scholar 

  • SQ Wang J Yang KC Chou (2006) ArticleTitleUsing stacked generalization to predict membrane protein types based on pseudo amino acid composition J Theor Biol 242 941–946 Occurrence Handle16806277 Occurrence Handle10.1016/j.jtbi.2006.05.006 Occurrence Handle1:CAS:528:DC%2BD28Xps1Oku70%3D

    Article  PubMed  CAS  Google Scholar 

  • Z Wen M Li Y Li Y Guo K Wang (2006) ArticleTitleDelaunay triangulation with partial least squares projection to latent structures: a model for G-protein coupled receptors classification and fast structure recognition Amino Acids 32 277–283 Occurrence Handle16729188 Occurrence Handle10.1007/s00726-006-0341-y Occurrence Handle1:CAS:528:DC%2BD2sXhtFyhtLY%3D

    Article  PubMed  CAS  Google Scholar 

  • Xiao X, Chou KC (2007) Digital coding of amino acids based on hydrophobic index. Protein Peptide Lett 14: doi: 0929-8665/07

  • X Xiao SH Shao YS Ding ZD Huang Y Huang KC Chou (2005a) ArticleTitleUsing complexity measure factor to predict protein subcellular location Amino Acids 28 57–61 Occurrence Handle10.1007/s00726-004-0148-7 Occurrence Handle1:CAS:528:DC%2BD2MXhsVKqsro%3D

    Article  CAS  Google Scholar 

  • X Xiao S Shao Y Ding Z Huang X Chen KC Chou (2005b) ArticleTitleUsing cellular automata to generate Image representation for biological sequences Amino Acids 28 29–35 Occurrence Handle10.1007/s00726-004-0154-9 Occurrence Handle1:CAS:528:DC%2BD2MXhsVKqs70%3D

    Article  CAS  Google Scholar 

  • X Xiao SH Shao YS Ding ZD Huang KC Chou (2006a) ArticleTitleUsing cellular automata images and pseudo amino acid composition to predict protein subcellular location Amino Acids 30 49–54 Occurrence Handle10.1007/s00726-005-0225-6 Occurrence Handle1:CAS:528:DC%2BD28XhsFCksrk%3D

    Article  CAS  Google Scholar 

  • X Xiao SH Shao ZD Huang KC Chou (2006b) ArticleTitleUsing pseudo amino acid composition to predict protein structural classes: approached with complexity measure factor J Comput Chem 27 478–482 Occurrence Handle10.1002/jcc.20354 Occurrence Handle1:CAS:528:DC%2BD28XitFyqsr4%3D

    Article  CAS  Google Scholar 

  • SW Zhang Q Pan HC Zhang ZC Shao JY Shi (2006a) ArticleTitlePrediction protein homo-oligomer types by pseudo amino acid composition: approached with an improved feature extraction and naive bayes feature fusion Amino Acids 30 461–468 Occurrence Handle10.1007/s00726-006-0263-8 Occurrence Handle1:CAS:528:DC%2BD28Xls1egsr0%3D

    Article  CAS  Google Scholar 

  • T Zhang Y Ding KC Chou (2006b) ArticleTitlePrediction of protein subcellular location using hydrophobic patterns of amino acid sequence Comput Biol Chem 30 367–371 Occurrence Handle10.1016/j.compbiolchem.2006.08.003 Occurrence Handle1:CAS:528:DC%2BD28XhtVSisrzN

    Article  CAS  Google Scholar 

  • Zhang TL, Ding YS (2007) Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes. Amino Acids 10.1007/s00726-007-0496-1

  • ZH Zhang ZH Wang ZR Zhang YX Wang (2006c) ArticleTitleA novel method for apoptosis protein subcellular localization prediction combining encoding based on grouped weight and support vector machine FEBS Lett 580 6169–6174 Occurrence Handle10.1016/j.febslet.2006.10.017 Occurrence Handle1:CAS:528:DC%2BD28XhtFOhsrzN

    Article  CAS  Google Scholar 

  • GP Zhou (1998) ArticleTitleAn intriguing controversy over protein structural class prediction J Protein Chem 17 729–738 Occurrence Handle9988519 Occurrence Handle10.1023/A:1020713915365 Occurrence Handle1:CAS:528:DyaK1MXnslaltw%3D%3D

    Article  PubMed  CAS  Google Scholar 

  • GP Zhou N Assa-Munt (2001) ArticleTitleSome insights into protein structural class prediction Proteins 44 57–59 Occurrence Handle11354006 Occurrence Handle10.1002/prot.1071 Occurrence Handle1:CAS:528:DC%2BD3MXktlSnsbk%3D

    Article  PubMed  CAS  Google Scholar 

  • GP Zhou K Doctor (2003) ArticleTitleSubcellular location prediction of apoptosis proteins Proteins 50 44–48 Occurrence Handle12471598 Occurrence Handle10.1002/prot.10251 Occurrence Handle1:CAS:528:DC%2BD3sXlsVKmug%3D%3D

    Article  PubMed  CAS  Google Scholar 

  • XB Zhou C Chen ZC Li XY Zou (2007) ArticleTitleUsing Chou’s amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes J Theor Biol 248 546–551 Occurrence Handle17628605 Occurrence Handle10.1016/j.jtbi.2007.06.001 Occurrence Handle1:CAS:528:DC%2BD2sXhtVWhsrnI

    Article  PubMed  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Additional information

Authors’ address: Shao-Wu Zhang, College of Automation control, Northwestern Polytechnical University, 127 YouYi West Rd., Xi’an 710072, Shaanxi, China

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shi, JY., Zhang, SW., Pan, Q. et al. Using pseudo amino acid composition to predict protein subcellular location: approached with amino acid composition distribution. Amino Acids 35, 321–327 (2008). https://doi.org/10.1007/s00726-007-0623-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00726-007-0623-z

Navigation