While biomedical data acquired from the latest spectroscopic modalities yield important information relevant to many diagnostic or prognostic procedures, they also present significant challenges for analysis, classification and interpretation. These challenges include sample sparsity, high-dimensional feature spaces, and noise/artifact signatures. Since a dataindependent ‘universal’ classifier does not exist, a classification strategy is needed, possessing five key components acting in concert: data visualization, preprocessing, feature space dimensionality reduction, reliable/robust classifier development, and classifier aggregation/fusion. These components, which should be flexible, data-driven, extensible, and computationally efficient, must provide accurate, reliable diagnosis/prognosis with the fewest maximally discriminatory, yet medically interpretable, features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2004 Springer
About this chapter
Cite this chapter
Somorjai, R.L. et al. (2004). A Data-Driven, Flexible Machine Learning Strategy for the Classification of Biomedical Data. In: Dubitzky, W., Azuaje, F. (eds) Artificial Intelligence Methods And Tools For Systems Biology. Computational Biology, vol 5. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-5811-0_5
Download citation
DOI: https://doi.org/10.1007/978-1-4020-5811-0_5
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-2859-5
Online ISBN: 978-1-4020-2865-6
eBook Packages: Springer Book Archive