Skip to main content

Part of the book series: International Handbooks on Information Systems ((INFOSYS))

  • 849 Accesses

Abstract

Abstract. Data warehouse systems have become a key component of the corporate information system architecture. Data warehouses are built in the interest of business decision support and contain historical data obtained from a variety of enterprise internal and external sources. By collecting and consolidating data that was previously spread over several heterogeneous systems, data warehouses try to provide a homogenous information basis for enterprise planning and decision making.

After an intuitive introduction to the concept of a data warehouse, the initial situation starting from operational systems or decision support systems is described in Section 2. Section 3 discusses the most important aspects of the database of a data warehouse, including a global view on data sources and the data transformation process, data classification and the fundamental modelling and design concepts for a warehouse database. Section 4 deals with the data warehouse architecture and reviews design alternatives such as local databases, data marts, operational data stores and virtual data warehouses. Section 5 is devoted to data evaluation tools with a focus on data mining systems and online analytical processing, a real time access and analysis tool that allows multiple views into the same detailed data. The chapter concludes with a discussion of concepts and procedures for building a data warehouse as well as an outlook on future research directions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agosta, L., The essential guide to data warehousing: aligning technology with business imperatives, Prentice-Hall, 1999.

    Google Scholar 

  2. Anahory, S., Murray, D., Data warehousing in the real world, Addison-Wesley, 1997.

    Google Scholar 

  3. Anand, S., Foundations of data mining, Addison-Wesley, 2000.

    Google Scholar 

  4. Adamson, C., Venerable, M., Data warehouse design solutions, John Wiley Si Sons, 1998.

    Google Scholar 

  5. Adriaans, P., Zantiage, D., Data mining, Addison-Wesley, 1996.

    Google Scholar 

  6. Bischoff, J., Alexander, T. (eds.), Data warehouse: practical advice from the experts, Prentice-Hall, 1997.

    Google Scholar 

  7. Barquin, R., Edelstein, H. (eds.), Planning and designing the data warehouse, Prentice-Hall, 1996.

    Google Scholar 

  8. Barquin, R., Edelstein, H. (eds.), Building, using and managing the data warehouse, Prentice-Hall, 1997.

    Google Scholar 

  9. Bigus, J.P., Data mining with neural networks, McGraw-Hill, 1996.

    Google Scholar 

  10. Bischoff, J., Achieving warehouse success, Database Programming 8 Design 7, 1994, 27–33.

    Google Scholar 

  11. Berry, M., Linoff, G., Mastering data mining, John Wiley & Sons, 2000.

    Google Scholar 

  12. Brackett, M.H., The data warehouse challenge — taming data chaos, John Wiley Si Sons, 1996.

    Google Scholar 

  13. Brosius, G., Microsoft OLAP services, Addison-Wesley, 1999.

    Google Scholar 

  14. Bontempo, C.J., Saracco, C., Database management: principles and products, Prentice-Hall, 1996.

    Google Scholar 

  15. Berson, A., Smith, S.J., Data warehousing, data mining, and OLAP, McGraw-Hill, 1997.

    Google Scholar 

  16. Burleson, D., High performance Oracle data warehousing, Coriolis Group, 1997.

    Google Scholar 

  17. Corey, M., Abbey, M., Oracle data warehousing, McGraw-Hill, 1996.

    Google Scholar 

  18. Corey, M., Abbey, M., Abramson, I., Taub, B., Oracle8 data warehousing, McGraw-Hill, 1998.

    Google Scholar 

  19. Corey, M., Abbey, M., Abramson, I., Venkitachalam, R., Barnes, L., Taub, B., SQL Server 7 data warehousing, McGraw-Hill, 1999.

    Google Scholar 

  20. Cabena, P., Discovering datamining: from concept to implementation, Prentice-Hall, 1997.

    Google Scholar 

  21. Chamoni, P., Gluchowski, P. (eds.), Analytische Informationssysteme, Springer, Berlin, 1998.

    Google Scholar 

  22. Craig, R.S., Vivona, J.A., Bercovitch, D., Microsoft data warehousing: building distributed decision support systems, John Wiley & Sons, 1999.

    Google Scholar 

  23. Cui, Y., Widom, J., Lineage tracing in a data warehousing system, Proc. 16th International Conference on Data Engineering, 2000, 683684.

    Google Scholar 

  24. Cui, Y., Widom, J., Wiener, J.L., Tracing the lineage of view data in a data warehousing environment, Technical Report, Stanford University, 1999.

    Google Scholar 

  25. Debevoise, T., The data warehouse method, Prentice-Hall, 1998.

    Google Scholar 

  26. Devlin, B., Data warehouse: from architecture to implementation, Addison-Wesley, 1997.

    Google Scholar 

  27. Dyer, R., Forman, E., An analytic approach to marketing decisions, Prentice-Hall, 1995.

    Google Scholar 

  28. Dodge, G., Gorman, T., Essential Oracle8i data warehousing, John Wiley & Sons, 2000.

    Google Scholar 

  29. Dubes, R., Jain, A.K., Clustering methodologies in exploratory data analysis, Advances in Computers 19, 1980, 113–228.

    Article  Google Scholar 

  30. Dorndorf, U., Pesch, E., Fast clustering algorithms, ORSA Journal on Computing 6, 1994, 141–153.

    Article  MATH  Google Scholar 

  31. Dhar, V., Stein, R., Intelligent decision support methods: the science of knowledge work, Prentice-Hall, 1997.

    Google Scholar 

  32. Dyché, J., e-Data: turning data into information with data warehousing, Addison-Wesley, 2000.

    Google Scholar 

  33. Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R., Advances in knowledge discovery and data mining, MIT Press, 1995.

    Google Scholar 

  34. Franco, J.M., Le datawarehouse, Eyrolles, 1998.

    Google Scholar 

  35. Gabriel, R., Gluchowski, P., Semantische Modellierungstechniken für multidimensionale Datenstrukturen, HMD,Theorie and Praxis der Wirtschaftsinformatik 34, 1997, 18–37.

    Google Scholar 

  36. Gluchowski, P., Gabriel, R., Chamoni, P., Management Support Systeme, Computergestützte Informationssysteme für Führungskräfte and Entscheidungsträger, Springer-Verlag, Berlin, 1997.

    Google Scholar 

  37. Gupta, A., Harinarayan, V., Quass, D., Aggregate-query processing in data warehousing environments, Proc. 21st Conf. on Very Large Data Bases (VLDB), 1995, 358–369.

    Google Scholar 

  38. Gupta, H., Harinarayan, V., Rajaraman, A., Ullman, J., Index selection for OLAP, Proc.International Conference on Data Engineering, 1997, 208–219.

    Google Scholar 

  39. Giovinazzo, W., Object-oriented data warehouse design, Prentice-Hall, 2000.

    Google Scholar 

  40. Garcia-Molina, H., Labio, W.J., Wiener, J.L., Zhuge, Y., Distributed and parallel computing issues in data warehousing, Proc. ACM Principles of Distributed Computing Conference, 1999, 7–10.

    Google Scholar 

  41. Goglin, J.-F., La construction du datawarehouse, Éditions Hermes, 1998.

    Google Scholar 

  42. Groffmann, H.-D., Das Data Warehouse Konzept, HMD, Theorie und Praxis der Wirtschaftsinformatik 34, 1997, 8–17.

    Google Scholar 

  43. Groth, R., Data mining: a hands on approach for business professionals, Prentice-Hall, 1997.

    Google Scholar 

  44. Groth, R., Data mining: building competitive advantage, Prentice-Hall, 1999.

    Google Scholar 

  45. Grötschel, M., Wakabayashi, Y., A cutting-plane algorithm for a clustering problem, Mathematical Programming 45, 1989, 59–96.

    Article  MathSciNet  MATH  Google Scholar 

  46. Hackathorn, R.D., Data warehousing energizes your enterprise, Datamation 41, 1995, 38–45.

    Google Scholar 

  47. Hackney, D., Understanding and implementing successful data marts, Addison-Wesley, 1997.

    Google Scholar 

  48. Hackathorn, R.D., Web farming for the data warehouse, Morgan Kaufmann, 1999.

    Google Scholar 

  49. []Hammergren, T.C., Data warehousing: building the corporate knowledgebase, The Coriolis Group, 1997.

    Google Scholar 

  50. []Hammergren, T.C., Official sybase data warehousing on the internet, The Coriolis Group, 1997.

    Google Scholar 

  51. Hashmi, N., Business information warehouse for SAP, Prima Publishing, 2000.

    Google Scholar 

  52. Humphreys, P., Bannon. L., Migliarese, P., Pomerol, J.-C., Mc-Cosh, A., Implementing systems for supporting management decisions, Chapman & Hall, 1996.

    Google Scholar 

  53. Hillson, S., Hobbs, L., Oracle8i data warehousing, Digital Press, 1999. Humphries, M.W., Hawkins, M.W., Dy, M.C., Data warehousing: architecture and implementation, Prentice-Hall, 1999.

    Google Scholar 

  54. Han, J., Kamber, M., Data mining — concepts and techniques, Morgan Kaufmann, 2001.

    Google Scholar 

  55. []Huang, K.-T., Lee, Y.W., Wang, R.Y., Quality information and knowledge, Prentice-Hall, 1998.

    Google Scholar 

  56. Holthuis, J., Multidimensionale Datenstrukturen — Modellierung, Strukturkomponenten, Implementierungsaspekte, H. Mucksch, W. Behme (eds.), Das Data Warehouse-Konzept, Gabler, 1997, 137–186.

    Google Scholar 

  57. Harinarayan, V., Rajaraman, A., Ullman, J., Implementing data cubes efficiently, Proc. ACM SIGMOD Conference,1996, 205–216.

    Google Scholar 

  58. Huyn, N., Efficient view self-maintenance, Proc. ACM Workshop on Materialized Views: Techniques and Applications, 1996, 17–25.

    Google Scholar 

  59. Huyn, N., Multiple-view self-maintenance in data warehousing environments, Proc. 23rd Conf. on Very Large Data Bases (VLDB), 1997, 26–35.

    Google Scholar 

  60. []Inmon, W.H., Hackathorn, R.D., Using the data warehouse, John Wiley & Sons, 1994.

    Google Scholar 

  61. Inmon, W.H., Imhoff, C., Sousa, R., Corporate information factory, John Wiley & Sons, 1997.

    Google Scholar 

  62. []Inmon, W.H., Building the data warehouse, 3rd edition, John Wiley & Sons, 2002.

    Google Scholar 

  63. []Inmon, W.H., Building the operational data store, John Wiley & Sons, 1999.

    Google Scholar 

  64. Inmon, W.H., Exploration warehousing, John Wiley & Sons, 2000.

    Google Scholar 

  65. Inmon, W.H., Rudin, K., Buss, C.K., Sousa, R., Data warehouse performance, John Wiley & Sons, 1998.

    Google Scholar 

  66. Inmon, W.H., Welch, J.D., Glassey, K., Managing the data warehouse, John Wiley & Sons, 1997.

    Google Scholar 

  67. Inmon, W.H., Zachman, J., Geiger, J., Data stores, data warehousing, and the Zachman framework, McGraw-Hill, 1997.

    Google Scholar 

  68. Jarke, M., Lenzerini, M., Vassiliou, Y., Vassiliadis, P., Fundamentals of data warehouses, 2nd edition, Springer-Verlag, 2000.

    Google Scholar 

  69. Kaiser, B.-U., Corporate information with SAP-EIS, Morgan Kaufmann, 1998.

    Google Scholar 

  70. Kelly, S., Data warehousing: the route to mass customization, John Wiley & Sons, 1994.

    Google Scholar 

  71. Kelly, B.W., AS/400 data warehousing: the complete implementation guide, Midrange Computing, 1997.

    Google Scholar 

  72. Kelly, S., Data warehousing in action,John Wiley & Sons, 1997.

    Google Scholar 

  73. Kimball, R., The data warehouse toolkit, John Wiley & Sons, 1996.

    Google Scholar 

  74. Kirchner, J., Transformationsprogramme und Extraktionsprozesse entscheidungsrelevanter Basisdaten, H. Mucksch, W. Behme (eds.), Das Data Warehouse-Konzept, Gabler, 1997, 237–266.

    Google Scholar 

  75. Kawaguchi, A., Lieuwen, D., Mumick, I., Quass, D., Ross, K., Con-currency control theory for deferred materialized views, Proc. International Conference on Database Theory, 1997, 306–320.

    Google Scholar 

  76. Kimball, R., Merz, R., The data webhouse toolkit: building the webenabled data warehouse, John Wiley & Sons, 2000.

    Google Scholar 

  77. Kimball, R., Reeves, L., Ross, M., Thornwaite, W., The data warehouse lifecycle toolkit: tools and techniques for designing, developing and deploying data marts and data warehouses, John Wiley & Sons, 1998.

    Google Scholar 

  78. Laudon, K.C., Laudon, J.P., Management information systems, organization and technology, 4th edition, Prentice-Hall, New Jersey 1996.

    Google Scholar 

  79. Lusti, M., Data warehousing und Data Mining, 2nd edition, Springer- Verlag, 2002.

    Google Scholar 

  80. Labio, W.J., Yerneni, R., Garcia-Molina, H., Shrinking the warehouse update window, Proc. ACM MOD Conference, 1999, 383–394.

    Google Scholar 

  81. Labio, W.J., Zhuge, Y., Wiener, J.L., Gupta, H., Garcia-Molina, H., Widom, J., The WHIPS prototype for data warehouse creation and maintenance, Proc. ACM SIGMOD Conference,1997, 557–559.

    Google Scholar 

  82. Moss, L., Adelman, S., Data warehouse project management, Addison-Wesley, 2000.

    Google Scholar 

  83. Mallach, E., Understanding decision support systems and expert systems, McGraw-Hill, 1994.

    Google Scholar 

  84. Marakas, G., Decision support systems in the 21st century, Prentice-Hall, 1999.

    Google Scholar 

  85. Mattison, R., Data warehousing: strategies,tools and techniques, McGraw-Hill, 1996.

    Google Scholar 

  86. Mattison, R., Data warehousing and data mining for telecommunications, Artech House, 1997.

    Google Scholar 

  87. Mattison, R., Web warehousing and knowledge management, McGraw-Hill, 1999.

    Google Scholar 

  88. Mucksch, H., Behme, W. (eds.), Das Data Warehouse-Konzept, 2nd edition, Gabler, 1997.

    Google Scholar 

  89. Meyer, D., Cannon, C., Building a better data warehouse, Prentice-Hall, 1998.

    Google Scholar 

  90. Mena, J., Data mining your website, Digital Press, 1999.

    Google Scholar 

  91. Mucksch, H., Holthuis, J., Reiser, M., Das Data Warehouse-Konzept — ein Überblick, Wirtschaftsinformatik 38, 1996, 421–433.

    Google Scholar 

  92. Morse, S., Issac, D., Parallel systems in the data warehouse, Prentice- Hall, 1997.

    Google Scholar 

  93. Microsoft Press, Microsoft SQL Server 7.0 data warehousing training kit, 1999.

    Google Scholar 

  94. Mentzl, R., Ludwig, C., Das Data Warehouse als Bestandteil eines Database Marketing-Systems, H. Mucksch, W. Behme (eds.), Das Data Warehouse-Konzept,Gabler, 1997, 469–484.

    Google Scholar 

  95. Mumick, I., Quass, D., Mumick, B., Maintenance of data cubes and summary tables in a warehouse, Proc. ACM SIGMOD Conference, 1997, 100–111.

    Google Scholar 

  96. O’Neil, P., Database: principles, programming,performance, Morgan Kaufmann, 1994.

    Google Scholar 

  97. O’Neil, P., Quass, D., Improved query performance with variant indexes, Proc. ACM SIGMOD Conference, 1997, 38–49.

    Google Scholar 

  98. Poe, V., Building a data warehouse for decision support, Prentice-Hall, 1997.

    Google Scholar 

  99. Ponniah, P., Data warehousing fundamentals,John Wiley & Sons, 2001.

    Google Scholar 

  100. Peterson, T., Pinkelman, J., Darroch, R., Microsoft OLAP unleashed, SAMS, 1999.

    Google Scholar 

  101. Pyle, D., Data preparation for data mining,Morgan Kaufmann, 1998.

    Google Scholar 

  102. Quass, D., Gupta, A., Mumick, I., Widom, J., Making views self-maintainable for data warehousing, Proc. Conference on Parallel and Distributed Information Systems, 1996, 158–169.

    Google Scholar 

  103. Quass, D., Widom, J., On-line warehouse view maintenance for batch updates, Proc. ACM SIGMOD Conference, 1997, 393–404.

    Google Scholar 

  104. Ramalho, J., Data warehousing with MS SQL 7.0,Wordware, 2000.

    Google Scholar 

  105. Reed, D., Managing the Oracle data warehouse,Prentice-Hall, 2000.

    Google Scholar 

  106. Ryan, C., Evaluating and selecting data warehousing tools, Prentice-Hall, 2000.

    Google Scholar 

  107. Sanchez, A., Data warehousing with Informix: best practices, Prentice-Hall, 1998.

    Google Scholar 

  108. Sauter, V.L., Decision support systems, John Wiley & Sons, 1996.

    Google Scholar 

  109. Schreier, U., Verarbeitungsprinzipien in Data-Warehousing-Systemen, HMD,Theorie and Praxis der Wirtschaftsinformatik 33, 1996, 78–93.

    Google Scholar 

  110. Silverston, L., Inmon, W.H., Graziano, K., The data model resource book: a library of logical data models and data warehouse designs, John Wiley & Sons, 1997.

    Google Scholar 

  111. Simon, A.R., 90 days to the data mart, John Wiley & Sons, 1998.

    Google Scholar 

  112. Singh, H.S., Data warehousing: concepts, technology, and applications, Prentice-Hall, 1997.

    Google Scholar 

  113. Singh, H.S., Interactive data warehousing via the web, Prentice-Hall, 1998.

    Google Scholar 

  114. Sperley, E., The enterprise data warehouse,vol. 1, Planning, building and implementation, Prentice-Hall, 9.

    Google Scholar 

  115. Sprague, R.H., Watson, H., Decision support for management, Prentice-Hall, 1996.

    Google Scholar 

  116. Tanler, R., The intranet data warehouse: tools and techniques for connecting data warehouses to anets, John Wiley & Sons, 1997.

    Google Scholar 

  117. Thierauf, R.J., On-line analytical processing systems for business, Quorum Books, 1997.

    Google Scholar 

  118. Thomsen, E., OLAP solutions: building multidimensional information systems, John Wiley & Sons, 1997.

    Google Scholar 

  119. Thomsen, E., Spofford, G., Chase, D., Microsoft OLAP solutions, John Wiley & Sons, 1999.

    Google Scholar 

  120. Turban, E., Decision support systems and expert systems, Prentice-Hall, 1998.

    Google Scholar 

  121. Venerable, M., Adamson, C., Data warehouse design solutions, John Wiley & Sons, 1998.

    Google Scholar 

  122. Westphal, C., Blaxton, T., Data mining solutions: methods and tools for solving real-world problems, n Wiley & Sons, 1998.

    Google Scholar 

  123. Welbrock, P.R., Strategic data warehousing principles using SAS soft-ware, SAS Institute, 1998.

    Google Scholar 

  124. Wetherbe, J.C., Executive information requirements: getting it right, MIS Quarterly,1991.

    Google Scholar 

  125. Watson, H., Gray, P., Decision support in the data warehouse, Prentice-Hall, 1997.

    Google Scholar 

  126. Watson, H.J., Houdeshel, G., Rainer, R.K., Building executive information systems and other decision support applications, John Wiley & Sons, 1997.

    Google Scholar 

  127. Weiss, S.M., Indurkhya, N., Predictive data mining: a practical guide, Morgan Kaufmann, 1997.

    Google Scholar 

  128. Wood, J., Silver, D., Joint application development, 2nd edition, John Wiley & Sons, 1995.

    Google Scholar 

  129. Whitehorn, M., Whitehorn, M., Business intelligence: the IBM solution, Springer, 1999.

    Google Scholar 

  130. Whitehorn, M., Whitehorn, M., SQL server: data warehousing and OLAP, Springer-Verlag, 1999.

    Google Scholar 

  131. Youness, S., Professional data warehousing with SQL Server 7.0 and OLAP services,Wrox, 2000.

    Google Scholar 

  132. Yazdani, S., Wong, S., Data warehousing with Oracle: an administrator’s handbook, Prentice Hall, 1997.

    Google Scholar 

  133. Yang, J., Widom, J., Making temporal views self-maintainable for data warehousing, Proc. 7th International Conference on Extending Database Technology, 2000, 395–412.

    Google Scholar 

  134. Zhuge, Y., Garcia-Molina, H., Hammer, J., Widom, J., View maintenance in a warehousing environment, Proc. ACM SIGMOD Conference,1995, 316–327.

    Google Scholar 

  135. Zhuge, Y., Garcia-Molina, H., Wiener, J.L., The strobe algorithms for multi-source warehouse consistency, Proc. Conference on Parallel and Distributed Information Systems, 1996, 146–157.

    Google Scholar 

  136. Zhuge, Y., Garcia-Molina, H., Wiener, J.L., Consistency algorithms for multi-source warehouse view maintenance, Journal of Distributed and Parallel Databases 6, 1998, 7–40.

    Article  Google Scholar 

  137. Zhuge, Y., Wiener, J.L., Garcia-Molina, H., Multiple view consistency for data warehousing, Proc. International Conference on Data Engineering, 1997, 289–300.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Dorndorf, U., Pesch, E. (2003). Data Warehouses. In: Błażewicz, J., Kubiak, W., Morzy, T., Rusinkiewicz, M. (eds) Handbook on Data Management in Information Systems. International Handbooks on Information Systems. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24742-5_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24742-5_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-53441-6

  • Online ISBN: 978-3-540-24742-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics