Abstract
The application of machine learning algorithms on real world problems rarely encounters ideal conditions. Often either the available data are imperfect or insufficient, or the learning situation requires a rather complicated combination of different approaches. In this article I describe an application, which – in an ideal world – would be solvable by a conventional supervised classification algorithm. Unfortunately, the available data are neither reliably classified nor could a manual correct reclassification be derived under restricted available resources. Since we had a large domain thesaurus available, we were able to develop a new approach, skipping the learning step and deriving a classification model directly from this thesaurus. The evaluation showed that for the intended use the quality of the classification model is more than sufficient.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
http://www.berlin.de/sen/wirtschaft/politik/innovationsstrategie.en.html. Accessed 19 July 2014.
- 2.
This number still excludes all variants derivable by stemming, which cannot be estimated.
- 3.
Only 600 of these test cases could be classified into the innovation clusters.
References
Hoppe, T.: Modellierung des Sprachraums von Unternehmen – Was man nicht beschreiben kann, das kann man auch nicht finden. In: Humm, B., Reibold, A., Ege, B. (eds.) Corporate Semantic Web – Wie semantische Anwendungen in Unternehmen Nutzen stiften. X.media.press, Springer, Heidelberg (2015)
Hoppe, T., Junghans, H.: Messung des Nutzens semantischer Suche. In: Humm, B., Reibold, A., Ege, B. (eds.) Corporate Semantic Web – Wie semantische Anwendungen in Unternehmen Nutzen Stiften. X.media.press, Springer, Heidelberg (2015)
Efron, B.: Bootstrap methods: another look at the jackknife. Ann. Stat. 7(1), 1–26 (1979)
Hesterberg, T., Monaghan, S., Moore, D.S., Clipson, A., Epstein, R.: Bootstrap Methods and Permutation Tests, Companion Chapter 18 to The Practice of Business Statistics. W.H. Freemann and Company, New York (2003). http://bcs.whfreeman.com/pbs/cat_160/PBS18.pdf. Accessed 2 Mar 2015
Tukey, J.W.: Exploratory data analysis. Addison-Wesley, Boston (1977)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Hoppe, T. (2016). Ontology-Based Classification – Application of Machine Learning Concepts Without Learning. In: Michaelis, S., Piatkowski, N., Stolpe, M. (eds) Solving Large Scale Learning Tasks. Challenges and Algorithms. Lecture Notes in Computer Science(), vol 9580. Springer, Cham. https://doi.org/10.1007/978-3-319-41706-6_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-41706-6_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41705-9
Online ISBN: 978-3-319-41706-6
eBook Packages: Computer ScienceComputer Science (R0)