Abstract
Semantic Query Optimisation makes use of the semantic knowledge of a database (rules) to perform query transformation. Rules are normally learned from former queries fired by the user. Over time, however, this can result in the rule set becoming very large thereby degrading the efficiency of the system as a whole. Such a problem is known as the utility problem. This paper seeks to provide a solution to the utility problem through the use of statistical techniques in selecting and maintaining an optimal rule set. Statistical methods have, in fact, been used widely in the field of Knowledge Discovery to identify and measure relationships between attributes. Here we extend the approach to Semantic Query Optimisation using the Chi-square statistical method which is integrated into a prototype query optimiser developed by the authors. We also present a new technique for calculating Chi-square, which is faster and more efficient than the traditional method in this situation.
This is a preview of subscription content, log in via an institution.
Preview
Unable to display preview. Download preview PDF.
References
U. S. CHAKRAVARTHY, J. GRANT and J. MINKER. Logic-based approach to semantic query optimisation. ACM Transactions on Database Systems, Vol. 15, No. 2, June 1990, 162–207.
K. C. CHAN and A. K. C. WONG. A statistical test for extracting classificatory knowledge form databases. Knowledge Discovery in Databases, Ed., The AAAI Press, 1991, 107–123.
G. GRAEFE and D. DEWITT. The EXODUS optimiser generator. Proceedings of the ACM-SIGMOD Conference on Management of Data, San Francisco, May 1987, 160–171.
J. HAN, Y. CAI and N. CERCONE. Data-driven discovery of quantitative rules in relational databases, IEEE Transactions on Knowledge and Data Engineering, Vol. 5, no. 1, February, 1993, 29–40.
I. F. IMAM, R. S. MICHALSKI and L. KERSCHBERG. Discovering attribute dependence in database by integrating symbolic learning and statistical analysis tests. Knowledge Discovery in Databases Workshop, 1993, 264–275.
B. G. T. LOWDEN and K. Y. LIM. A data driven semantic optimiser, CSM 211, Internal Publication, University of Essex.
B. G. T. LOWDEN, J. ROBINSON and K. Y. LIM, A semantic optimiser using automatic rule derivation. Proceedings of Workshop on Information Technologies and Systems, 1995, 68–76.
G. PIATETSKY-SHAPIRO, and C. MATHEUS. Measuring data dependencies in large databases. Knowledge Discovery in Databases Workshop, 1993, 162–173.
S. RUSSELL. Rationality and Intelligence. Artificial Intelligence 94, 1997, 57–77.
I. SAVNIK and P. A., FLACH Bottom-up induction of functional dependencies from relations. Knowledge Discovery in Databases Workshop, 1993, 174–185.
S. SHEKHAR, B. HAMIDZADEH and A. KOHLI. Learning transformation rules for semantic query optimisation: a data-driven approach. IEEE, 1993, 949–964.
S. T. SHENOY and Z. M. OZSOYOGLU. Design and implementation of semantic query otpimiser. IEEE Transactions on Knowledge and Data Engineering, Vol. 1, No. 3, Sept. 1989, 344–361.
M. D. SIEGEL, E. SCIORE and S. SALVETER. A method for automatic rule derivation to support semantic query optimisation, ACM Transactions on Database Systems, Vol. 17, No. 4, Dec 1992, 563–600.
W. ZIARKO. The discovery, analysis, and representation of data dependencies in databases. In Knowledge Discovery in Databases, The AAAI Press, 1991, 195–209.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lowden, B.G.T., Robinson, J. (1999). A statistical approach to rule selection in semantic query optimisation. In: Raś, Z.W., Skowron, A. (eds) Foundations of Intelligent Systems. ISMIS 1999. Lecture Notes in Computer Science, vol 1609. Springer, Berlin, Heidelberg . https://doi.org/10.1007/BFb0095119
Download citation
DOI: https://doi.org/10.1007/BFb0095119
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65965-5
Online ISBN: 978-3-540-48828-6
eBook Packages: Springer Book Archive