Advertisement

Data Warehouse Schema and Instance Design

  • Dimitri Theodoratos
  • Timos Sellis
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1507)

Abstract

A Data Warehouse (DW) is a database that collects and stores data from multiple remote and heterogeneous information sources. When a query is posed, it is evaluated locally, without accessing the original information sources. In this paper we deal with the issue of designing a DW, in the context of the relational model, by selecting a set of views to materialize in it. Views allow to compute both the schema and the instance of the DW from the schemas and the instances of the source relations.

We briefly present a theoretical framework for the DW design problem, which concerns the selection of a set of views that (a) fits in the space allocated to the DW, (b) answers all the queries of interest, and (c) minimizes the total query evaluation and view maintenance cost. We then formalize it as a state space search problem by taking into account multiquery optimization over the maintenance queries (i.e. queries that compute changes to the materialized views) and the use of auxiliary views for reducing the view maintenance cost. Finally, incremental algorithms and heuristics for pruning the search space are presented.

Keywords

Space Constraint Query Evaluation Simple View Incremental Algorithm Source Relation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Ceri, S., Widom, J.: Deriving production rules for incremental view maintenance. In: Proc. of the 20th Intl. Conf. on Very Large Data Bases, pp. 577–589 (1991)Google Scholar
  2. 2.
    Chaudhuri, S., Dayal, U.: An Overview of Data Warehousing and OLAP Technology. SIGMOD Record 26(1), 65–74 (1997)CrossRefGoogle Scholar
  3. 3.
    Finkelstein, S.: Common Expression Analysis in Database Applications. In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, pp. 235–245 (1982)Google Scholar
  4. 4.
    Gupta, H.: Selection of Views to Materialize in a Data Warehouse. In: Intl. Conf. on Database Theory, pp. 98–112 (1997)Google Scholar
  5. 5.
    Gupta, H., Harinarayan, V., Rajaraman, A., Ullman, J.D.: Index Selection for OLAP. In: Proc. of the 13th Intl. Conf. on Data Engineering, pp. 208–219 (1997)Google Scholar
  6. 6.
    Harinarayan, V., Rajaraman, A., Ullman, J.D.: Implementing Data Cubes Efficiently. In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data (1996)Google Scholar
  7. 7.
    Immon, W., Kelley, C.: Rdb/VMS: Developing the Data warehouse. QED Publishing Group, Boston (1993)Google Scholar
  8. 8.
    Labio, W., Quass, D., Adelberg, B.: Physical Database Design for Data Warehousing. In: Proc. of the 13th Intl. Conf. on Data Engineering (1997)Google Scholar
  9. 9.
    Levy, A., Mendelson, A.O., Sagiv, Y., Srivastava, D.: Answering Queries using Views. In: Proc. of the ACM Symp. on Principles of Database Systems, pp. 95–104 (1995)Google Scholar
  10. 10.
    Quass, D., Gupta, A., Mumick, I.S., Widom, J.: Making Views Self Maintainable for Data Warehousing. In: PDIS (1996)Google Scholar
  11. 11.
    Ross, K.A., Srivastava, D., Sudarshan, S.: Materialized View Maintenance and Integrity Constraint Checking: Trading Space for Time. In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, pp. 447–458 (1996)Google Scholar
  12. 12.
    Roussopoulos, N.: View Indexing in Relational Databases. ACM Transactions on Database Systems 7(2), 258–290 (1982)zbMATHCrossRefGoogle Scholar
  13. 13.
    Sellis, T.K.: Multiple Query Optimization. ACM Transactions on Database Systems 13(1), 23–52 (1988)CrossRefGoogle Scholar
  14. 14.
    Shim, K., Sellis, T.K., Nau, D.: Improvements on a heuristic algorithm for multiple-query optimization. Data & Knowledge Engineering 12, 197–222 (1994)CrossRefGoogle Scholar
  15. 15.
    Theodoratos, D., Sellis, T.: Data Warehouse Configuration. In: Proc. of the 23nd Intl. Conf. on Very Large Data Bases, pp. 126–135 (1997)Google Scholar
  16. 16.
    Theodoratos, D., Sellis, T.: Designing Data Warehouses. Technical Report, Knowledge and data Base Systems Laboratory, Electrical and Computer Engineering Dept., National Technical University of Athens, pp. 1–29 (1997)Google Scholar
  17. 17.
    Widom, J. (ed.): Data Engineering, Special Issue on Materialized Views and Data Warehousing, vol. 18(2). IEEE, Los Alamitos (1995)Google Scholar
  18. 18.
    Yang, J., Karlapalem, K., Li, Q.: Algorithms for Materilaized View Design in Data Warehousing Environment. In: Proc. of the 23nd Intl. Conf. on Very Large Data Bases, pp. 136–145 (1997)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1998

Authors and Affiliations

  • Dimitri Theodoratos
    • 1
  • Timos Sellis
    • 1
  1. 1.Department of Electrical and Computer EngineeringComputer Science Division, National Technical University of AthensAthensGreece

Personalised recommendations