Abstract
The paper shows how a logic-based database language can support the various steps of the KDD process by providing a high degree of expressiveness, and the separation of concerns between the specification level and the mapping to the underlying databases and data mining tools. In particular, the mechanism of user-defined aggregates provided in LDL++ allows to specify data mining tasks and to formalize the mining results in a uniform way. We show how the mechanism applies to the concept of Inductive Databases, proposed in [2,12]. We concentrate on bayesian classification and show how user defined aggregates allow to specify the mining evaluation functions and the returned patterns. The resulting formalism provides a flexible way to customize, tune and reason on both the evaluation functions and the extracted knowledge.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. Agrawal, S. Sarawagi, and S. Thomas. Integrating Association Rule Mining with Relational Database Systems: Alternatives and Implications. In Procs. of ACM-SIGMOD’98, 1998.
J-F. Boulicaut, M. Klemettinen, and H. Mannila. Querying Inductive Databases: A Case Study on the MINE RULE Operator. In Procs. 2nd European Conf. on Principles and Practice of Knowledge Discovery in Databases (PKDD98), volume 1510 of Lecture Notes in Computer Science, pages 194–202, 1998.
J-F. Boulicaut, P. Marcel, and C. Rigotti. Query Driven Knowledge Discovery in Multidimensional Data. In Procs. of the ACM international workshop on Data warehousing and OLAP, pages 87–93, 1999.
C. Elkan. Boosting and Naive Bayesian Learning. In Procs. of the International Conference on Knowledge Discovery and Data Mining (KDD-97), 1997.
U.M. Fayyad, G. Piatesky-Shapiro, P. Smyth, and R. Uthurusamy. Advances in Knowledge Discovery and Data Mining. AAAI Press/the MIT Press, 1996.
F. Giannotti and G. Manco. Querying Inductive Databases via Logic-Based User Defined Aggregates. In Procs. of the 3rd European Conference on Principles and Practice of Knowledge Discovery in Databases, number 1704 in Lecture Notes in Artificial Intelligence, pages 125–135, September 1999.
F. Giannotti, G. Manco, M. Nanni, and D. Pedreschi. Nondeterministic, Nonmonotonic Logic Databases. IEEE Trans. on Knowledge and Data Engineering, 2000. To appear.
F. Giannotti, G. Manco, M. Nanni, D. Pedreschi, and F. Turini. Integration of deduction and induction for mining supermarket sales data. In Proceedings of the International Conference on Practical Applications of Knowledge Discovery (PADD99), April 1999.
F. Giannotti, G. Manco, M. Nanni, D. Pedreschi, and F. Turini. Using Deduction for Intelligent Data Analysis. Technical Report B4-1999-02, CNUCE Institute of CNR, January 1999. Submitted for publication.
F. Giannotti, D. Pedreschi, and C. Zaniolo. Semantics and Expressive Power of Non Deterministic Constructs for Deductive Databases. In Journal of Logic Programming, 1999.
J. Han. Towards On-Line Analytical Mining in Large Databases. Sigmod Records, 27(1):97–107, 1998.
H. Mannila. Inductive databases and condensed representations for data mining. In International Logic Programming Symposium, pages 21–30, 1997.
D. Michie, D.J. Spiegelhalter, and C. Taylor. Machine Learning, Neural and Statistical Classification. Ellis Horwood, New York, 1994.
J. Mitchell. Machine Learning. McGraw-Hill, 1997.
S. Ceri R. Meo, G. Psaila. A New SQL-Like Operator for Mining Association Rules. In Proceedings of The Conference on Very Large Databases, pages 122–133, 1996.
S. Ruggieri. Efficient C4.5. Technical report, Department of Computer Science, University of Pisa, January 2000. Available at http://www-kdd.di.unipi.it.
W. Shen, K. Ong, B. Mitbander, and C. Zaniolo. Metaqueries for Data Mining. In Advances in Knowledge Discovery and Data Mining, pages 375–398. AAAI Press/The MIT Press, 1996.
D. Tsur et al. Query Flocks: A Generalization of Association-Rule Mining. In Proc. ACM Conf. on Management of Data (Sigmod98), pages 1–12, 1998.
H. Wang and C. Zaniolo. User defined aggregates in database languages. In Seventh International Workshop on Database Programming Languages, September 1999.
C. Zaniolo, N. Arni, and K. Ong. Negation and Aggregates in Recursive Rules: The LDL++ Approach. In Proc. 3rd Int. Conf. on Deductive and Object-Oriented Databases (DOOD93), volume 760 of Lecture Notes in Computer Science, 1993.
C. Zaniolo and H. Wang. Logic-Based User-Defined Aggregates for the Next Generation of Database Systems. The Logic Programming Paradigm: Current Trends and Future Directions. Springer Verlag, 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Giannotti, F., Manco, G. (2000). Making Knowledge Extraction and Reasoning Closer. In: Terano, T., Liu, H., Chen, A.L.P. (eds) Knowledge Discovery and Data Mining. Current Issues and New Applications. PAKDD 2000. Lecture Notes in Computer Science(), vol 1805. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45571-X_42
Download citation
DOI: https://doi.org/10.1007/3-540-45571-X_42
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67382-8
Online ISBN: 978-3-540-45571-4
eBook Packages: Springer Book Archive