Abstract
Abstract. Data warehouse systems have become a key component of the corporate information system architecture. Data warehouses are built in the interest of business decision support and contain historical data obtained from a variety of enterprise internal and external sources. By collecting and consolidating data that was previously spread over several heterogeneous systems, data warehouses try to provide a homogenous information basis for enterprise planning and decision making.
After an intuitive introduction to the concept of a data warehouse, the initial situation starting from operational systems or decision support systems is described in Section 2. Section 3 discusses the most important aspects of the database of a data warehouse, including a global view on data sources and the data transformation process, data classification and the fundamental modelling and design concepts for a warehouse database. Section 4 deals with the data warehouse architecture and reviews design alternatives such as local databases, data marts, operational data stores and virtual data warehouses. Section 5 is devoted to data evaluation tools with a focus on data mining systems and online analytical processing, a real time access and analysis tool that allows multiple views into the same detailed data. The chapter concludes with a discussion of concepts and procedures for building a data warehouse as well as an outlook on future research directions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agosta, L., The essential guide to data warehousing: aligning technology with business imperatives, Prentice-Hall, 1999.
Anahory, S., Murray, D., Data warehousing in the real world, Addison-Wesley, 1997.
Anand, S., Foundations of data mining, Addison-Wesley, 2000.
Adamson, C., Venerable, M., Data warehouse design solutions, John Wiley Si Sons, 1998.
Adriaans, P., Zantiage, D., Data mining, Addison-Wesley, 1996.
Bischoff, J., Alexander, T. (eds.), Data warehouse: practical advice from the experts, Prentice-Hall, 1997.
Barquin, R., Edelstein, H. (eds.), Planning and designing the data warehouse, Prentice-Hall, 1996.
Barquin, R., Edelstein, H. (eds.), Building, using and managing the data warehouse, Prentice-Hall, 1997.
Bigus, J.P., Data mining with neural networks, McGraw-Hill, 1996.
Bischoff, J., Achieving warehouse success, Database Programming 8 Design 7, 1994, 27–33.
Berry, M., Linoff, G., Mastering data mining, John Wiley & Sons, 2000.
Brackett, M.H., The data warehouse challenge — taming data chaos, John Wiley Si Sons, 1996.
Brosius, G., Microsoft OLAP services, Addison-Wesley, 1999.
Bontempo, C.J., Saracco, C., Database management: principles and products, Prentice-Hall, 1996.
Berson, A., Smith, S.J., Data warehousing, data mining, and OLAP, McGraw-Hill, 1997.
Burleson, D., High performance Oracle data warehousing, Coriolis Group, 1997.
Corey, M., Abbey, M., Oracle data warehousing, McGraw-Hill, 1996.
Corey, M., Abbey, M., Abramson, I., Taub, B., Oracle8 data warehousing, McGraw-Hill, 1998.
Corey, M., Abbey, M., Abramson, I., Venkitachalam, R., Barnes, L., Taub, B., SQL Server 7 data warehousing, McGraw-Hill, 1999.
Cabena, P., Discovering datamining: from concept to implementation, Prentice-Hall, 1997.
Chamoni, P., Gluchowski, P. (eds.), Analytische Informationssysteme, Springer, Berlin, 1998.
Craig, R.S., Vivona, J.A., Bercovitch, D., Microsoft data warehousing: building distributed decision support systems, John Wiley & Sons, 1999.
Cui, Y., Widom, J., Lineage tracing in a data warehousing system, Proc. 16th International Conference on Data Engineering, 2000, 683684.
Cui, Y., Widom, J., Wiener, J.L., Tracing the lineage of view data in a data warehousing environment, Technical Report, Stanford University, 1999.
Debevoise, T., The data warehouse method, Prentice-Hall, 1998.
Devlin, B., Data warehouse: from architecture to implementation, Addison-Wesley, 1997.
Dyer, R., Forman, E., An analytic approach to marketing decisions, Prentice-Hall, 1995.
Dodge, G., Gorman, T., Essential Oracle8i data warehousing, John Wiley & Sons, 2000.
Dubes, R., Jain, A.K., Clustering methodologies in exploratory data analysis, Advances in Computers 19, 1980, 113–228.
Dorndorf, U., Pesch, E., Fast clustering algorithms, ORSA Journal on Computing 6, 1994, 141–153.
Dhar, V., Stein, R., Intelligent decision support methods: the science of knowledge work, Prentice-Hall, 1997.
Dyché, J., e-Data: turning data into information with data warehousing, Addison-Wesley, 2000.
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R., Advances in knowledge discovery and data mining, MIT Press, 1995.
Franco, J.M., Le datawarehouse, Eyrolles, 1998.
Gabriel, R., Gluchowski, P., Semantische Modellierungstechniken für multidimensionale Datenstrukturen, HMD,Theorie and Praxis der Wirtschaftsinformatik 34, 1997, 18–37.
Gluchowski, P., Gabriel, R., Chamoni, P., Management Support Systeme, Computergestützte Informationssysteme für Führungskräfte and Entscheidungsträger, Springer-Verlag, Berlin, 1997.
Gupta, A., Harinarayan, V., Quass, D., Aggregate-query processing in data warehousing environments, Proc. 21st Conf. on Very Large Data Bases (VLDB), 1995, 358–369.
Gupta, H., Harinarayan, V., Rajaraman, A., Ullman, J., Index selection for OLAP, Proc.International Conference on Data Engineering, 1997, 208–219.
Giovinazzo, W., Object-oriented data warehouse design, Prentice-Hall, 2000.
Garcia-Molina, H., Labio, W.J., Wiener, J.L., Zhuge, Y., Distributed and parallel computing issues in data warehousing, Proc. ACM Principles of Distributed Computing Conference, 1999, 7–10.
Goglin, J.-F., La construction du datawarehouse, Éditions Hermes, 1998.
Groffmann, H.-D., Das Data Warehouse Konzept, HMD, Theorie und Praxis der Wirtschaftsinformatik 34, 1997, 8–17.
Groth, R., Data mining: a hands on approach for business professionals, Prentice-Hall, 1997.
Groth, R., Data mining: building competitive advantage, Prentice-Hall, 1999.
Grötschel, M., Wakabayashi, Y., A cutting-plane algorithm for a clustering problem, Mathematical Programming 45, 1989, 59–96.
Hackathorn, R.D., Data warehousing energizes your enterprise, Datamation 41, 1995, 38–45.
Hackney, D., Understanding and implementing successful data marts, Addison-Wesley, 1997.
Hackathorn, R.D., Web farming for the data warehouse, Morgan Kaufmann, 1999.
[]Hammergren, T.C., Data warehousing: building the corporate knowledgebase, The Coriolis Group, 1997.
[]Hammergren, T.C., Official sybase data warehousing on the internet, The Coriolis Group, 1997.
Hashmi, N., Business information warehouse for SAP, Prima Publishing, 2000.
Humphreys, P., Bannon. L., Migliarese, P., Pomerol, J.-C., Mc-Cosh, A., Implementing systems for supporting management decisions, Chapman & Hall, 1996.
Hillson, S., Hobbs, L., Oracle8i data warehousing, Digital Press, 1999. Humphries, M.W., Hawkins, M.W., Dy, M.C., Data warehousing: architecture and implementation, Prentice-Hall, 1999.
Han, J., Kamber, M., Data mining — concepts and techniques, Morgan Kaufmann, 2001.
[]Huang, K.-T., Lee, Y.W., Wang, R.Y., Quality information and knowledge, Prentice-Hall, 1998.
Holthuis, J., Multidimensionale Datenstrukturen — Modellierung, Strukturkomponenten, Implementierungsaspekte, H. Mucksch, W. Behme (eds.), Das Data Warehouse-Konzept, Gabler, 1997, 137–186.
Harinarayan, V., Rajaraman, A., Ullman, J., Implementing data cubes efficiently, Proc. ACM SIGMOD Conference,1996, 205–216.
Huyn, N., Efficient view self-maintenance, Proc. ACM Workshop on Materialized Views: Techniques and Applications, 1996, 17–25.
Huyn, N., Multiple-view self-maintenance in data warehousing environments, Proc. 23rd Conf. on Very Large Data Bases (VLDB), 1997, 26–35.
[]Inmon, W.H., Hackathorn, R.D., Using the data warehouse, John Wiley & Sons, 1994.
Inmon, W.H., Imhoff, C., Sousa, R., Corporate information factory, John Wiley & Sons, 1997.
[]Inmon, W.H., Building the data warehouse, 3rd edition, John Wiley & Sons, 2002.
[]Inmon, W.H., Building the operational data store, John Wiley & Sons, 1999.
Inmon, W.H., Exploration warehousing, John Wiley & Sons, 2000.
Inmon, W.H., Rudin, K., Buss, C.K., Sousa, R., Data warehouse performance, John Wiley & Sons, 1998.
Inmon, W.H., Welch, J.D., Glassey, K., Managing the data warehouse, John Wiley & Sons, 1997.
Inmon, W.H., Zachman, J., Geiger, J., Data stores, data warehousing, and the Zachman framework, McGraw-Hill, 1997.
Jarke, M., Lenzerini, M., Vassiliou, Y., Vassiliadis, P., Fundamentals of data warehouses, 2nd edition, Springer-Verlag, 2000.
Kaiser, B.-U., Corporate information with SAP-EIS, Morgan Kaufmann, 1998.
Kelly, S., Data warehousing: the route to mass customization, John Wiley & Sons, 1994.
Kelly, B.W., AS/400 data warehousing: the complete implementation guide, Midrange Computing, 1997.
Kelly, S., Data warehousing in action,John Wiley & Sons, 1997.
Kimball, R., The data warehouse toolkit, John Wiley & Sons, 1996.
Kirchner, J., Transformationsprogramme und Extraktionsprozesse entscheidungsrelevanter Basisdaten, H. Mucksch, W. Behme (eds.), Das Data Warehouse-Konzept, Gabler, 1997, 237–266.
Kawaguchi, A., Lieuwen, D., Mumick, I., Quass, D., Ross, K., Con-currency control theory for deferred materialized views, Proc. International Conference on Database Theory, 1997, 306–320.
Kimball, R., Merz, R., The data webhouse toolkit: building the webenabled data warehouse, John Wiley & Sons, 2000.
Kimball, R., Reeves, L., Ross, M., Thornwaite, W., The data warehouse lifecycle toolkit: tools and techniques for designing, developing and deploying data marts and data warehouses, John Wiley & Sons, 1998.
Laudon, K.C., Laudon, J.P., Management information systems, organization and technology, 4th edition, Prentice-Hall, New Jersey 1996.
Lusti, M., Data warehousing und Data Mining, 2nd edition, Springer- Verlag, 2002.
Labio, W.J., Yerneni, R., Garcia-Molina, H., Shrinking the warehouse update window, Proc. ACM MOD Conference, 1999, 383–394.
Labio, W.J., Zhuge, Y., Wiener, J.L., Gupta, H., Garcia-Molina, H., Widom, J., The WHIPS prototype for data warehouse creation and maintenance, Proc. ACM SIGMOD Conference,1997, 557–559.
Moss, L., Adelman, S., Data warehouse project management, Addison-Wesley, 2000.
Mallach, E., Understanding decision support systems and expert systems, McGraw-Hill, 1994.
Marakas, G., Decision support systems in the 21st century, Prentice-Hall, 1999.
Mattison, R., Data warehousing: strategies,tools and techniques, McGraw-Hill, 1996.
Mattison, R., Data warehousing and data mining for telecommunications, Artech House, 1997.
Mattison, R., Web warehousing and knowledge management, McGraw-Hill, 1999.
Mucksch, H., Behme, W. (eds.), Das Data Warehouse-Konzept, 2nd edition, Gabler, 1997.
Meyer, D., Cannon, C., Building a better data warehouse, Prentice-Hall, 1998.
Mena, J., Data mining your website, Digital Press, 1999.
Mucksch, H., Holthuis, J., Reiser, M., Das Data Warehouse-Konzept — ein Überblick, Wirtschaftsinformatik 38, 1996, 421–433.
Morse, S., Issac, D., Parallel systems in the data warehouse, Prentice- Hall, 1997.
Microsoft Press, Microsoft SQL Server 7.0 data warehousing training kit, 1999.
Mentzl, R., Ludwig, C., Das Data Warehouse als Bestandteil eines Database Marketing-Systems, H. Mucksch, W. Behme (eds.), Das Data Warehouse-Konzept,Gabler, 1997, 469–484.
Mumick, I., Quass, D., Mumick, B., Maintenance of data cubes and summary tables in a warehouse, Proc. ACM SIGMOD Conference, 1997, 100–111.
O’Neil, P., Database: principles, programming,performance, Morgan Kaufmann, 1994.
O’Neil, P., Quass, D., Improved query performance with variant indexes, Proc. ACM SIGMOD Conference, 1997, 38–49.
Poe, V., Building a data warehouse for decision support, Prentice-Hall, 1997.
Ponniah, P., Data warehousing fundamentals,John Wiley & Sons, 2001.
Peterson, T., Pinkelman, J., Darroch, R., Microsoft OLAP unleashed, SAMS, 1999.
Pyle, D., Data preparation for data mining,Morgan Kaufmann, 1998.
Quass, D., Gupta, A., Mumick, I., Widom, J., Making views self-maintainable for data warehousing, Proc. Conference on Parallel and Distributed Information Systems, 1996, 158–169.
Quass, D., Widom, J., On-line warehouse view maintenance for batch updates, Proc. ACM SIGMOD Conference, 1997, 393–404.
Ramalho, J., Data warehousing with MS SQL 7.0,Wordware, 2000.
Reed, D., Managing the Oracle data warehouse,Prentice-Hall, 2000.
Ryan, C., Evaluating and selecting data warehousing tools, Prentice-Hall, 2000.
Sanchez, A., Data warehousing with Informix: best practices, Prentice-Hall, 1998.
Sauter, V.L., Decision support systems, John Wiley & Sons, 1996.
Schreier, U., Verarbeitungsprinzipien in Data-Warehousing-Systemen, HMD,Theorie and Praxis der Wirtschaftsinformatik 33, 1996, 78–93.
Silverston, L., Inmon, W.H., Graziano, K., The data model resource book: a library of logical data models and data warehouse designs, John Wiley & Sons, 1997.
Simon, A.R., 90 days to the data mart, John Wiley & Sons, 1998.
Singh, H.S., Data warehousing: concepts, technology, and applications, Prentice-Hall, 1997.
Singh, H.S., Interactive data warehousing via the web, Prentice-Hall, 1998.
Sperley, E., The enterprise data warehouse,vol. 1, Planning, building and implementation, Prentice-Hall, 9.
Sprague, R.H., Watson, H., Decision support for management, Prentice-Hall, 1996.
Tanler, R., The intranet data warehouse: tools and techniques for connecting data warehouses to anets, John Wiley & Sons, 1997.
Thierauf, R.J., On-line analytical processing systems for business, Quorum Books, 1997.
Thomsen, E., OLAP solutions: building multidimensional information systems, John Wiley & Sons, 1997.
Thomsen, E., Spofford, G., Chase, D., Microsoft OLAP solutions, John Wiley & Sons, 1999.
Turban, E., Decision support systems and expert systems, Prentice-Hall, 1998.
Venerable, M., Adamson, C., Data warehouse design solutions, John Wiley & Sons, 1998.
Westphal, C., Blaxton, T., Data mining solutions: methods and tools for solving real-world problems, n Wiley & Sons, 1998.
Welbrock, P.R., Strategic data warehousing principles using SAS soft-ware, SAS Institute, 1998.
Wetherbe, J.C., Executive information requirements: getting it right, MIS Quarterly,1991.
Watson, H., Gray, P., Decision support in the data warehouse, Prentice-Hall, 1997.
Watson, H.J., Houdeshel, G., Rainer, R.K., Building executive information systems and other decision support applications, John Wiley & Sons, 1997.
Weiss, S.M., Indurkhya, N., Predictive data mining: a practical guide, Morgan Kaufmann, 1997.
Wood, J., Silver, D., Joint application development, 2nd edition, John Wiley & Sons, 1995.
Whitehorn, M., Whitehorn, M., Business intelligence: the IBM solution, Springer, 1999.
Whitehorn, M., Whitehorn, M., SQL server: data warehousing and OLAP, Springer-Verlag, 1999.
Youness, S., Professional data warehousing with SQL Server 7.0 and OLAP services,Wrox, 2000.
Yazdani, S., Wong, S., Data warehousing with Oracle: an administrator’s handbook, Prentice Hall, 1997.
Yang, J., Widom, J., Making temporal views self-maintainable for data warehousing, Proc. 7th International Conference on Extending Database Technology, 2000, 395–412.
Zhuge, Y., Garcia-Molina, H., Hammer, J., Widom, J., View maintenance in a warehousing environment, Proc. ACM SIGMOD Conference,1995, 316–327.
Zhuge, Y., Garcia-Molina, H., Wiener, J.L., The strobe algorithms for multi-source warehouse consistency, Proc. Conference on Parallel and Distributed Information Systems, 1996, 146–157.
Zhuge, Y., Garcia-Molina, H., Wiener, J.L., Consistency algorithms for multi-source warehouse view maintenance, Journal of Distributed and Parallel Databases 6, 1998, 7–40.
Zhuge, Y., Wiener, J.L., Garcia-Molina, H., Multiple view consistency for data warehousing, Proc. International Conference on Data Engineering, 1997, 289–300.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Dorndorf, U., Pesch, E. (2003). Data Warehouses. In: Błażewicz, J., Kubiak, W., Morzy, T., Rusinkiewicz, M. (eds) Handbook on Data Management in Information Systems. International Handbooks on Information Systems. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24742-5_9
Download citation
DOI: https://doi.org/10.1007/978-3-540-24742-5_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-53441-6
Online ISBN: 978-3-540-24742-5
eBook Packages: Springer Book Archive