Abstract
Biological research is becoming increasingly database driven, motivated, in part, by the advent of large-scale functional genomics and proteomics experiments such as those comprehensively measuring gene expression. Consequently, a challenge in bioinformatics is integrating databases to connect this disparate information as well as performing large-scale studies to collectively analyze many different data sets. These composite data sets are conducive to extensive computational analysis and present new opportunities for data mining. Both supervised and unsupervised approaches can often be used to analyze the same kinds of data, depending on the desired result and the range of features available. Large-scale experiments, such as those performed with microarrays, yield large homogenous data sets that are well suited for computational analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000)
Kohonen, T.: Self-organization and Associative Memory. Springer, Berlin (1984)
Baldi, P., Brunak, S.: Bioinformatics: the Machine Learning Approach, 2nd edn. MIT Press, Cambridge (1998)
Lones, D.T.: Gent READER: an efficient and reliable protein fold recognition method for genomic sequences. J. Mol. Biol. 287, 797–815 (1999)
Ding, C.H.Q., Dubchak, I.: Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics 17(4), 349–358 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dash, A., Swarnkar, T., Nayak, M. (2011). Advanced Bioinformatics Approach in Machine Learning for Analyzing Genome Wide Expression Profiles and Proteomic Data Sets. In: Das, V.V., Stephen, J., Chaba, Y. (eds) Computer Networks and Information Technologies. CNC 2011. Communications in Computer and Information Science, vol 142. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19542-6_54
Download citation
DOI: https://doi.org/10.1007/978-3-642-19542-6_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19541-9
Online ISBN: 978-3-642-19542-6
eBook Packages: Computer ScienceComputer Science (R0)