CHANGE DETECTION AND MAINTENANCE OF AN XML WEB WAREHOUSE

  • Conference paper
Enterprise Information Systems VII

Abstract

The World Wide Web is a popular broadcast medium that contains a huge amount of information, and a web warehouse is an efficient and effective means of making that information usable. Because XML has become the standard for semi-structured data exchange over the Web, this paper studies the XML web warehouse and proposes an approach to change detection and warehouse maintenance in such a system. The paper makes three contributions. First, we propose an object-oriented data model for XML web pages in the web warehouse, together with a system architecture for change detection and warehouse maintenance. Second, we propose a change detection method based on mobile agent technology that actively detects changes in the warehouse's data sources. Third, we propose an incremental and deferred maintenance method for the XML web pages in the warehouse. We compared our approach experimentally with a rewriting approach to storing and maintaining the XML web warehouse. The performance evaluation shows that our approach is more efficient than the rewriting approach in terms of both response time and storage space.
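
The abstract does not spell out how incremental and deferred maintenance works, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes a detected change arrives as a small record (the hypothetical `Delta` class), that changes are queued rather than applied immediately (deferred), and that on the next read only the affected nodes are modified instead of the whole page being rewritten (incremental). The class names, fields, and operation names are all illustrative assumptions.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field


@dataclass
class Delta:
    """One detected change. Field and op names are hypothetical."""
    op: str         # "update", "insert", or "delete"
    path: str       # ElementTree path: the target for "update",
                    # the parent element for "insert" and "delete"
    tag: str = ""   # child tag, used by "insert" and "delete"
    text: str = ""  # new text, used by "update" and "insert"


@dataclass
class WarehousePage:
    """A stored XML page plus the changes not yet applied to it."""
    root: ET.Element
    pending: list = field(default_factory=list)

    def record(self, delta: Delta) -> None:
        # Deferred: the change is only queued when it is detected.
        self.pending.append(delta)

    def read(self) -> ET.Element:
        # Incremental: apply each queued delta to the affected node
        # instead of rewriting the whole page from its source.
        for d in self.pending:
            target = self.root.find(d.path)
            if d.op == "update":
                target.text = d.text
            elif d.op == "insert":
                ET.SubElement(target, d.tag).text = d.text
            elif d.op == "delete":
                target.remove(target.find(d.tag))
        self.pending.clear()
        return self.root


page = WarehousePage(ET.fromstring(
    "<book><title>Old</title><price>10</price></book>"))
page.record(Delta("update", "title", text="New"))
page.record(Delta("insert", ".", tag="year", text="2007"))
page.record(Delta("delete", ".", tag="price"))
current = page.read()  # queued changes are applied here, not earlier
```

In this sketch the stored page stays untouched until `read()` is called, which is the deferred part; a rewriting approach would instead replace the stored page with a fresh copy each time the source changed.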

Copyright information

© 2007 Springer

About this paper

Cite this paper

Chao, CM. (2007). CHANGE DETECTION AND MAINTENANCE OF AN XML WEB WAREHOUSE. In: Chen, CS., Filipe, J., Seruca, I., Cordeiro, J. (eds) Enterprise Information Systems VII. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-5347-4_7

  • DOI: https://doi.org/10.1007/978-1-4020-5347-4_7

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-1-4020-5323-8

  • Online ISBN: 978-1-4020-5347-4

  • eBook Packages: Computer Science, Computer Science (R0)
