Abstract
We propose the ontological approach to Data Mining that is based on: (1) the analysis of subject domain ontology, (2) information in data that are interpretable in terms of ontology, and (3) interpretability of Data Mining methods and their results in ontology. Respectively concepts of Data Ontology and Data Mining Method Ontology are introduced. These concepts lead us to a new Data Mining approach—Ontological Data Mining (ODM). ODM uses the information extracted from data which is interpretable in the subject domain ontology instead of raw data. Next we present the theoretical and practical advantages of this approach and the Discovery system that implements this approach. The value of ODM is demonstrated by solutions of the tasks from the areas of financial forecasting, bioinformatics and medicine.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kovalerchuk, B., Vityaev, E. Symbolic Methodology for Numeric Data Mining. Intelligent Data Analysis. Special issue on “Philosophies and Methodologies for Knowledge Discovery and Intelligent Data Analysis” eds. Keith Rennolls, Evgenii Vityaev. v.12(2), IOS Press, 2008, pp. 165–188.
Kovalerchuk, B., Vityaev, E., Ruiz, J.F. (2001). Consistent and Complete Data and “Expert” Mining in Medicine’. In: Medical Data Mining and Knowledge Discovery, Springer, pp. 238–280.
Vityaev, E.E. Knowledge discovery. Computational cognition. Cognitive process models. Novosibirsk State University Press, Novosibirsk, 2006, p. 293. (in Russian).
Vityaev, E., Kovalerchuk, B.Y. Relational Methodology for Data Mining and Knowledge Discovery. Intelligent Data Analysis. Special issue on “Philosophies and Methodologies for Knowledge Discovery and Intelligent Data Analysis” eds. Keith Rennolls, Evgenii Vityaev. v.12(2), IOS Press, 2008, pp. 189–210.
Fishbern PC (1970): Utility Theory for Decision Making. NY-London, J.Wiley&Sons.
Krantz, D.H., Luce, R.D., Suppes, P., Tversky, A. (1971, 1989, 1990). Foundations of measurement, Vol. 1,2,3—NY, London: Acad. press, (1971) 577 p., (1989) 493 p., (1990) 356 p.
Pfanzagl J. (1971). Theory of measurement (in cooperation with V. Baumann, H. Huber) 2nd ed. Physica-Verlag.
Kovalerchuk, B., Vityaev, E. (2000), Data Mining in finance: Advances in Relational and Hybrid Methods, Kluwer Academic Publishers, 308 p.
Vityaev E. The logic of prediction. In: Mathematical Logic in Asia. Proceedings of the 9th Asian Logic Conference (August 16–19, 2005, Novosibirsk, Russia), World Scientific, Singapore, 2006, pp. 263–276.
Demin A.V., Vityaev E.E. Universal system Discovery for knowledge acquisition development and its applications // Novosibirsk State University messenger, Information technologies, v.7, issue 1, Novosibirsk, 2009, p. 73–83. (in Russian).
The Progic series of conferences: http://www.kent.ac.uk/secl/philosophy/jw/progic.htm.
Smerdov, S.O., Vityaev, E.E. Logic, probability and learning synthesis: prediction formalization // Siberian electronic mathematical news. v.6, Institute of mathematics SD RAS, 2009, p. 340–365. (in Russian).
Vityaev, E., Kovalerchuk, B., Empirical Theories Discovery based on the Measurement Theory. Mind and Machine, v.14, #4, 551–573, 2004.
Vityaev, E., Smerdov, S. On the Problem of Prediction // K.E. Wolff et al. (Eds.): KONT/KPP 2007, LNAI 6581, Springer, Heidelberg, 2011, pp. 280–296.
Scientific Discovery website http://www.math.nsc.ru/AP/ScientificDiscovery.
Kovalerchuk, B., Vityaev, E. (1998): Discovering Lawlike Regularities in Financial Time Series. Journal of Computational Intelligence in Finance 6 (3): 12–26.
Kolchanov, N.A., Pozdnyakov, M.A., Orlov, Y.L., Vishnevsky, O.V., Podkolodny, N.L. Vityaev, E.E., Kovalerchuk, B.Y. Computer System “Gene Discovery” for Promoter Structure Analysis In: Artificial Intelligence and Heuristic Methods in Bioinformatics, Eds: P. Frasconi and R. Shamir, IOS Press, 2003, pp. 173–192.
Vityaev, E.E., Orlov, Y.L., Vishnevsky, O.V., Pozdnyakov, M.A., Kolchanov, N.A. Computer system “Gene Discovery” for promoter structure analysis, In Silico Biol. 2 (2002) 257–262.
Vishnevsky, O.V., Vityaev, E. E. Analysis and Recognition of Erythroid-Specific Gene Promoters Basing on Degenerate Oligonucleotide Motifs // Molecular Biology. November 2001, Volume 35, Issue 6, pp 833–840.
Kovalerchuk, B., Vityaev, E., Ruiz, J. (2000). Consistent Knowledge Discovery in Medical Diagnosis, IEEE Engineering in Medicine and Biology Magazine. Special issue: “Medical Data Mining”, July/August 2000, pp. 26–37.
Pazzani, M., (1997), Comprehensible Knowledge Discovery: Gaining Insight from Data. First Federal Data Mining Conference and Exposition, pp. 73–82. Washington, DC.
Zagoruiko, N. G., Gulyaevskii, S. E., Kovalerchuk, B. Ya. Ontology of the Data Mining Subject Domain, Pattern Recognition and Image Analysis Journal, 2007, Vol. 17, No. 3, pp. 349–356. Pleiades Publishing, Ltd.
Acknowledgements
The work has been supported by the Russian Foundation for Basic Research (grant #15-07-03410-a) and Russian Federation grants (Scientific Schools grant of the President of the Russian Federation) #860.2014.1.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
Vityaev, E., Kovalerchuk, B. (2017). Ontological Data Mining. In: Kreinovich, V. (eds) Uncertainty Modeling. Studies in Computational Intelligence, vol 683. Springer, Cham. https://doi.org/10.1007/978-3-319-51052-1_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-51052-1_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51051-4
Online ISBN: 978-3-319-51052-1
eBook Packages: EngineeringEngineering (R0)