Abstract
A Multi-Feature Cube (MF-Cube) query is a complex-data-mining query based on data cubes, which computes the dependent complex aggregates at multiple granularities. Existing computations designed for simple data cube queries can be used to compute distributive and algebraic MF-Cubes queries. In this paper we propose an efficient computation of holistic MF-Cubes queries. This method computes holistic MF-Cubes with PDAP (Part Distributive Aggregate Property). The efficiency is gained by using dynamic subset data selection strategy (Iceberg query technique) to reduce the size of materialized data cube. Also for efficiency, this approach adopts the chunk-based caching technique to reuse the output of previous queries. We experimentally evaluate our algorithm using synthetic and real-world datasets, and demonstrate that our approach delivers up to about twice the performance of traditional computations.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This work is partially supported by Australian large ARC grants (DP0449535, DP0559536 and DP0667060), a China NSFC major research Program (60496327), a China NSFC grant (60463003) and a grant from Overseas Outstanding Talent Research Program of Chinese Academy of Sciences (06S3011S01).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ross, K.A., Srivastava, D., Chatziantoniou, D.: Complex Aggregation at Multiple Granularities. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377, pp. 263–277. Springer, Heidelberg (1998)
Ross, K., Srivastava, D.: Fast computation of sparse datacubes. In: Proc. Int. Conf. Very Large Data Bases, August 1997, Athens, Greece, pp. 116–125 (1997)
Zhao, Y., Deshpande, P.M., Naughton, J.F.: An Array-Based Algorithm for Simultaneous Multidimensional Aggregates. In: Proc. of ACM SIGMOD, pp. 159–170 (1997)
Beyer, K.S., Ramakrishnan, R.: Bottom-Up Computation of Sparse and Iceberg Cubes. In: Proc. Of ACM SIGMOD, pp. 359–370 (1999)
Agarwal, S., Agrawal, R., Deshpande, P.M., Gupta, A., Naughton, J.F., Ramakrishnan, R., Sarawagi, S.: On the Computation of Multidimensional Aggregates. In: Proc. Int’ Conf. Very Large Data Bases, pp. 506–521 (1996)
Gray, A., Bosworth, A., Layman, A., Pirahesh, H.: Datacube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals. In: Proc. of the IEEE ICDE, pp. 152–159 (1996)
Fang, M., Shivakumar, N., Garcia-Molina, H., Motwani, R., Ullman, J.D.: Computing iceberg queries efficiently. In: Proc. of 24th VLDB Conf., New York, pp. 299–310 (1998)
Han, J.P., Dong, G., Wang, K.: Efficient Computation of Iceberg Cubes with Complex Measures. In: Proc. ACM-SIGMOD Int’l Conf. Management of Data, pp. 1–12 (2001)
Deshpande, P.M., Ramasamy, K., Shukla, A.: Caching Multidimensional Queries Using Chunks. In: Proc. ACM SIGMOD, pp. 259–270 (1998)
Dong, G., Han, J., Joyce, M.W.L.: Mining Constrained Gradients in Large Databases. In: IEEE TKDE (2003)
Hahn, C., Warren, S., London, J.: Edited synoptic cloud reports from ships and land stations over the globe (1996), Available on: http://cdiac.esd.ornl.gov/cdiac/ndps/ndp026b.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, S., Wang, R., Guo, Y. (2006). Efficient Computation of Multi-feature Data Cubes. In: Lang, J., Lin, F., Wang, J. (eds) Knowledge Science, Engineering and Management. KSEM 2006. Lecture Notes in Computer Science(), vol 4092. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811220_52
Download citation
DOI: https://doi.org/10.1007/11811220_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37033-8
Online ISBN: 978-3-540-37035-2
eBook Packages: Computer ScienceComputer Science (R0)