Abstract
XML has become a widely used standard for data representation, distribution and sharing. The concept of the Sensor Web has led to web generated sensor data in many diverse applications where delivery of the sensed data takes place using the Web. In order to obtain useful knowledge from XML sensor data, data warehouse and OLAP applications aimed at providing support for decision making for operational data must be developed. In this paper, we present a pipeline design based OLAP data cube construction framework designated for real time web generated sensor data, transforming sensor data into XML streams conforming to an underlying data warehouse logical model, which constructs corresponding data cubes. As part of this work, we discuss how our cube construction and acceleration strategy improves the efficiency in managing large volumes of XML data.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Boussaid, O., Darmont, J., Bentayeb, F., Loudcher, S.: Warehousing complex data from the web. Int. J. Web Eng. Technol. 4, 408–433 (2008)
Cuzzocrea, A.: Cubing Algorithms for XML Data. In: Proceedings of DEXA Workshops, pp. 407–411. IEEE Computer Society (2009)
Park, B.-K., Han, H., Song, I.-Y.: XML-OLAP: A Multidimensional Analysis Framework for XML Warehouses. In: Tjoa, A.M., Trujillo, J. (eds.) DaWaK 2005. LNCS, vol. 3589, pp. 32–42. Springer, Heidelberg (2005)
Dittrich, J., Blunschi, L., Salles, M.: Dwarfs in the Rearview Mirror: How Big are they Really? Proceedings of VLDB Endowment 1, 1586–1597 (2008)
Gui, H., Roantree, M.: A Data Cube Model for Analysis of High Volumes of Ambient Data. Procedia Computer Science 10, 94–101 (2012)
Marks, G., Roantree, M., Murphy, J.: Classification of Index Partitions to Boost XML Query Performance. In: Parsons, J., Saeki, M., Shoval, P., Woo, C., Wand, Y. (eds.) ER 2010. LNCS, vol. 6412, pp. 405–418. Springer, Heidelberg (2010)
Marks, G., Roantree, M., Smyth, D.: Optimizing Queries for Web Generated Sensor Data. In: 22nd Australasian Database Conference (ADC 2011), Perth, Australia (2011)
Parimala, N., Pahwa, P.: From XML Schema to Cube. International Journal of Computer Theory and Engineering 1(3), 236–243 (2009)
Roantree, M., Sallinen, M.: The Sensor Web - Bridging the Physical-Digital Divide. ERCIM News 76 (2009)
Rusu, I.L., Wenny, R., David, T.: Partitioning methods for multi-version XML data warehouses. Distributed Parallel Databases 25, 47–69 (2009)
Sismanis, Y., Deligiannakis, A., Roussopoulos, N., Kotidis, Y.: Dwarf: Shrinking the PetaCube. In: Proceedings of ACM SIGMOD, pp. 464–475 (2002)
Widom, J.: Research Problems in Data Warehousing. In: Proceedings of the International Conference on Information and Knowledge Management (CIKM), pp. 25–30. ACM (1995)
Ykhlef, M.: On-Line Analytical Processing Queries for eXtensible Mark-up Language. Information Technology Journal 8(4), 521–528 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gui, H., Roantree, M. (2013). Using a Pipeline Approach to Build Data Cube for Large XML Data Streams. In: Hong, B., Meng, X., Chen, L., Winiwarter, W., Song, W. (eds) Database Systems for Advanced Applications. DASFAA 2013. Lecture Notes in Computer Science, vol 7827. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40270-8_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-40270-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40269-2
Online ISBN: 978-3-642-40270-8
eBook Packages: Computer ScienceComputer Science (R0)