Abstract
This paper presents a web platform for peptidase detection and motif search in the MEROPS database. The peptidase detection methodology combines text mining techniques with Support Vector Machines (SVMs). Preliminary results with two types of SVM, C-Support Vector Classification (C-SVC) and the One-class SVM, show that the methodology is feasible. Although C-SVC gave the best results, the One-class SVM can be an alternative when only positive examples are available for training.
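As a rough illustration of the text mining step, the sketch below treats each protein sequence as a document, splits it into overlapping n-grams, and builds TF-IDF vectors that an SVM could take as input. The abstract does not state the tokenization or weighting scheme used by the platform, so the n-gram length, the TF-IDF formula, and the function names `ngrams` and `tfidf_vectors` are assumptions made for this example, and the sequences are made up.

```python
import math
from collections import Counter


def ngrams(sequence, n=3):
    """Split a sequence into overlapping n-grams, used here as 'words'."""
    return [sequence[i:i + n] for i in range(len(sequence) - n + 1)]


def tfidf_vectors(sequences, n=3):
    """Return (vocabulary, vectors): one TF-IDF vector per input sequence.

    Assumed weighting: tf = count / total n-grams in the sequence,
    idf = log(number of sequences / number of sequences containing the n-gram).
    """
    docs = [Counter(ngrams(s, n)) for s in sequences]
    vocab = sorted(set().union(*docs))
    n_docs = len(docs)
    # Document frequency: in how many sequences each n-gram occurs.
    df = Counter(term for doc in docs for term in doc)
    vectors = []
    for doc in docs:
        total = sum(doc.values())
        vectors.append([
            (doc[t] / total) * math.log(n_docs / df[t]) if total else 0.0
            for t in vocab
        ])
    return vocab, vectors


if __name__ == "__main__":
    # Toy sequences, invented for illustration only.
    toy = ["MKTAYIAK", "MKTAYLLK", "GGSGGSGG"]
    vocab, vectors = tfidf_vectors(toy)
    print(len(vocab), "n-grams;", len(vectors), "vectors")
```

Vectors of this kind could then be passed to a C-SVC trained on positive and negative examples, or to a One-class SVM trained on positive examples only.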
Acknowledgments
This work was supported by FCT (Foundation for Science and Technology) and FEDER through Program COMPETE (QREN) under the project FCOMP-01-0124-FEDER-010160 (PTDC/EIA/71770/2006), designated BIOINK – Incremental Kernel Learning for Biological Data Analysis.
Copyright information
© 2013 Springer Science+Business Media Dordrecht
About this paper
Cite this paper
Correia, D., Pereira, C., Veríssimo, P., Dourado, A. (2013). A Platform for Peptidase Detection Based on Text Mining Techniques and Support Vector Machines. In: Madureira, A., Reis, C., Marques, V. (eds) Computational Intelligence and Decision Making. Intelligent Systems, Control and Automation: Science and Engineering, vol 61. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-4722-7_42
DOI: https://doi.org/10.1007/978-94-007-4722-7_42
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-4721-0
Online ISBN: 978-94-007-4722-7
eBook Packages: Engineering, Engineering (R0)