Abstract
The World Wide Web is a popular broadcast medium that contains a huge amount of information. The web warehouse is an efficient and effective means to facilitate utilization of information on the Web. XML has become the new standard for semi-structured data exchange over the Web. In this paper, therefore, we study the XML web warehouse and propose an approach to the problems of change detection and warehouse maintenance in an XML web warehouse system. This paper has three major contributions. First, we propose an object-oriented data model for XML web pages in the web warehouse as well as system architecture for change detection and warehouse maintenance. Second, we propose a change detection method based on mobile agent technology to actively detect changes of data sources of the web warehouse. Third, we propose an incremental and deferred maintenance method to maintain XML web pages in the web warehouse. We compared our approach with a rewriting approach to storage and maintenance of the XML web warehouse by experiments. Performance evaluation shows that our approach is more efficient than the rewriting approach in terms of the response time and storage space of the web warehouse.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, D., El Abbadi, A., Singh, A., Yurek, T., 1997. Efficient view maintenance at data warehouses. In Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data, pp. 417–427.
Apparao, V., 1998. Document Object Model (DOM) Level 1 Specification (Version 1.0).
Bhowmick, S. S., Ng, W. K., Madria, S. K., Lim, E. P., 2000. Detecting and representing relevant web deltas using web join. In Proceedings of the 20th IEEE International Conference on Distributed Computing Systems, pp. 255–262.
Chawathe, S. S., Abiteboul, S., Widom, J., 1999. Managing historical semistructured data. Theory and Practice of Object Systems, Vol. 5, No. 3, pp. 143–162.
Labio, W., Garcia-Molina, H., 1995. Efficient snapshot differential algorithm for data warehousing. In Proceedings of the 22nd International Conference on Very Large Data Bases, pp. 63–74.
Lim, S. J., Ng, Y. K., 2001. An automated change detection algorithm for HTML documents based on semantic hierarchies. In Proceedings of the 17th IEEE International Conference on Data Engineering, pp. 303–312.
Ng, W. K., Lin, E. P., Huang, C. T., Bhowmick, S., Qin, F. Q., 1998. Web warehousing: an algebra for web information. In Proceedings of the 1998 IEEE Forum on Research and Technology Advances in Digital Libraries, pp. 228–237.
Xyleme, L., 2001. A dynamic warehouse for XML data of the web. IEEE Data Engineering Bulletin, Vol. 24, No. 2, pp. 40–47.
Zhuge, Y., Garcia-Molina, H., Hammer, J., Widom, J., 1995. View maintenance in a warehousing environment. In Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data, pp. 316–327.
Zhuge, Y., Garcia-Molina, H., Wiener, J. L., 1996. The Strobe algorithms for multi-source warehouse consistency. In Proceedings of the 4th IEEE International Conference on Parallel and Distributed Information Systems, 146–157.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer
About this paper
Cite this paper
Chao, CM. (2007). CHANGE DETECTION AND MAINTENANCE OF AN XML WEB WAREHOUSE. In: Chen, CS., Filipe, J., Seruca, I., Cordeiro, J. (eds) Enterprise Information Systems VII. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-5347-4_7
Download citation
DOI: https://doi.org/10.1007/978-1-4020-5347-4_7
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-5323-8
Online ISBN: 978-1-4020-5347-4
eBook Packages: Computer ScienceComputer Science (R0)