Skip to main content

Using a Pipeline Approach to Build Data Cube for Large XML Data Streams

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7827))

Abstract

XML has become a widely used standard for data representation, distribution and sharing. The concept of the Sensor Web has led to web generated sensor data in many diverse applications where delivery of the sensed data takes place using the Web. In order to obtain useful knowledge from XML sensor data, data warehouse and OLAP applications aimed at providing support for decision making for operational data must be developed. In this paper, we present a pipeline design based OLAP data cube construction framework designated for real time web generated sensor data, transforming sensor data into XML streams conforming to an underlying data warehouse logical model, which constructs corresponding data cubes. As part of this work, we discuss how our cube construction and acceleration strategy improves the efficiency in managing large volumes of XML data.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Boussaid, O., Darmont, J., Bentayeb, F., Loudcher, S.: Warehousing complex data from the web. Int. J. Web Eng. Technol. 4, 408–433 (2008)

    Article  Google Scholar 

  2. Cuzzocrea, A.: Cubing Algorithms for XML Data. In: Proceedings of DEXA Workshops, pp. 407–411. IEEE Computer Society (2009)

    Google Scholar 

  3. Park, B.-K., Han, H., Song, I.-Y.: XML-OLAP: A Multidimensional Analysis Framework for XML Warehouses. In: Tjoa, A.M., Trujillo, J. (eds.) DaWaK 2005. LNCS, vol. 3589, pp. 32–42. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  4. Dittrich, J., Blunschi, L., Salles, M.: Dwarfs in the Rearview Mirror: How Big are they Really? Proceedings of VLDB Endowment 1, 1586–1597 (2008)

    Google Scholar 

  5. Gui, H., Roantree, M.: A Data Cube Model for Analysis of High Volumes of Ambient Data. Procedia Computer Science 10, 94–101 (2012)

    Article  Google Scholar 

  6. Marks, G., Roantree, M., Murphy, J.: Classification of Index Partitions to Boost XML Query Performance. In: Parsons, J., Saeki, M., Shoval, P., Woo, C., Wand, Y. (eds.) ER 2010. LNCS, vol. 6412, pp. 405–418. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  7. Marks, G., Roantree, M., Smyth, D.: Optimizing Queries for Web Generated Sensor Data. In: 22nd Australasian Database Conference (ADC 2011), Perth, Australia (2011)

    Google Scholar 

  8. Parimala, N., Pahwa, P.: From XML Schema to Cube. International Journal of Computer Theory and Engineering 1(3), 236–243 (2009)

    Google Scholar 

  9. Roantree, M., Sallinen, M.: The Sensor Web - Bridging the Physical-Digital Divide. ERCIM News 76 (2009)

    Google Scholar 

  10. Rusu, I.L., Wenny, R., David, T.: Partitioning methods for multi-version XML data warehouses. Distributed Parallel Databases 25, 47–69 (2009)

    Article  Google Scholar 

  11. Sismanis, Y., Deligiannakis, A., Roussopoulos, N., Kotidis, Y.: Dwarf: Shrinking the PetaCube. In: Proceedings of ACM SIGMOD, pp. 464–475 (2002)

    Google Scholar 

  12. Widom, J.: Research Problems in Data Warehousing. In: Proceedings of the International Conference on Information and Knowledge Management (CIKM), pp. 25–30. ACM (1995)

    Google Scholar 

  13. Ykhlef, M.: On-Line Analytical Processing Queries for eXtensible Mark-up Language. Information Technology Journal 8(4), 521–528 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gui, H., Roantree, M. (2013). Using a Pipeline Approach to Build Data Cube for Large XML Data Streams. In: Hong, B., Meng, X., Chen, L., Winiwarter, W., Song, W. (eds) Database Systems for Advanced Applications. DASFAA 2013. Lecture Notes in Computer Science, vol 7827. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40270-8_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40270-8_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40269-2

  • Online ISBN: 978-3-642-40270-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics