Web Warehousing: Design and Issues

  • Conference paper
Advances in Database Technologies (ER 1998)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1552)

Abstract

The World Wide Web is a distributed global information resource. It contains a large amount of information that has been placed on the web independently by different organizations; as a result, related information may appear across different web sites. To manage and access heterogeneous information on the WWW, we have started a project to build a web warehouse called Whoweda (Warehouse of Web Data). So far, our work on the web warehousing system has focused on building a data model and designing a web algebra. In this paper, we discuss design and research issues in a web warehousing system. These issues include designing algebraic operators for web information access and manipulation, web data visualization, and web knowledge discovery. Addressing these issues will not only overcome the limitations of available search engines but also provide powerful and friendly query mechanisms for retrieving useful information and discovering knowledge from a web warehouse.

This work was supported in part by the Nanyang Technological University, Ministry of Education (Singapore) under Academic Research Fund #4-12034-5060, #4-12034-3012, #4-12034-6022. Any opinions, findings, and recommendations in this paper are those of the authors and do not reflect the views of the funding agencies.

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bhowmick, S.S., Madria, S.K., Ng, WK., Lim, E.P. (1999). Web Warehousing: Design and Issues. In: Kambayashi, Y., Lee, D.L., Lim, EP., Mohania, M.K., Masunaga, Y. (eds) Advances in Database Technologies. ER 1998. Lecture Notes in Computer Science, vol 1552. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-49121-7_8

  • DOI: https://doi.org/10.1007/978-3-540-49121-7_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65690-6

  • Online ISBN: 978-3-540-49121-7

  • eBook Packages: Springer Book Archive
