Abstract
We present a logic database language with elementary data mining mechanisms to model the relevant aspects of knowledge discovery and to support both the iterative and the interactive features of the knowledge discovery process. We adopt the notion of user-defined aggregate to model typical data mining tasks as operations that unveil previously unseen knowledge. We illustrate the use of aggregates to model specific data mining tasks, such as frequent pattern discovery, classification, data discretization, and clustering, and show how the resulting data mining query language allows the modeling of typical steps of the knowledge discovery process, ranging from data preparation to knowledge extraction and evaluation.
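As a rough illustration of the user-defined aggregate idea described in the abstract, the following Python sketch treats frequent pattern discovery as an aggregate that is computed one tuple at a time. It is not the chapter's language or algorithm. The class name `FrequentPairs`, the method names `single`, `multi`, and `freturn` (loosely borrowed from the rule names used for user-defined aggregates in logic database systems), the driver function `aggregate`, and the sample baskets are all assumptions made for illustration, and the pattern space is limited to item pairs to keep the example short.

```python
from collections import Counter
from itertools import combinations


class FrequentPairs:
    """Hypothetical aggregate: item pairs that occur in enough transactions."""

    def __init__(self, min_support):
        self.min_support = min_support
        self.counts = Counter()

    def single(self, transaction):
        # First tuple of the group: reset the state, then process it.
        self.counts = Counter()
        self.multi(transaction)

    def multi(self, transaction):
        # Every later tuple: count each unordered item pair once.
        for pair in combinations(sorted(set(transaction)), 2):
            self.counts[pair] += 1

    def freturn(self):
        # Final result: keep only pairs that reach the support threshold.
        return {p: c for p, c in self.counts.items() if c >= self.min_support}


def aggregate(agg, rows):
    """Illustrative driver: feed rows to the aggregate, return its result."""
    rows = iter(rows)
    try:
        first = next(rows)
    except StopIteration:
        return {}
    agg.single(first)
    for row in rows:
        agg.multi(row)
    return agg.freturn()


baskets = [
    ["bread", "milk"],
    ["bread", "milk", "eggs"],
    ["milk", "eggs"],
    ["bread", "eggs", "milk"],
]
frequent = aggregate(FrequentPairs(min_support=3), baskets)
```

In this sketch the mining task is exposed through the same interface as an ordinary aggregate such as a sum or a count, which reflects the abstract's point that mining operations can be modeled as aggregates inside a query.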
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Giannotti, F., Manco, G., Turini, F. (2004). Towards a Logic Query Language for Data Mining. In: Meo, R., Lanzi, P.L., Klemettinen, M. (eds) Database Support for Data Mining Applications. Lecture Notes in Computer Science, vol 2682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44497-8_4
DOI: https://doi.org/10.1007/978-3-540-44497-8_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22479-2
Online ISBN: 978-3-540-44497-8
eBook Packages: Springer Book Archive