Skip to main content

D’MART: A Tool for Building and Populating Data Warehouse Model from Existing Reports and Tables

  • Conference paper
  • 1626 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7418))

Abstract

As companies grow (organically or inorganically), Data Administration (i.e. Stage 5 of Nolan’s IT growth model) becomes the next logical step in their IT evolution. Designing a Data Warehouse model, especially in the presence of legacy systems, is a challenging task. A lot of time and effort is consumed in understanding the existing data requirements, performing Dimensional and Fact modeling etc. This problem is further exacerbated if enterprise outsource their IT needs to external vendors. In such a situation no individual has a complete and in-depth view of the existing data setup. For such settings, a tool that can assist in building a data warehouse model from existing data models such that there is minimal impact to the business can be of immense value. In this paper we present the D’MART tool which addresses this problem. D’MART analyzes the existing data model of the enterprise and proposes alternatives for building the new data warehouse model. D’MART models the problem of identifying Fact/Dimension attributes of a warehouse model as a graph cut on a Dependency Analysis Graph (DAG). The DAG is built using the existing data models and the BI Report generation (SQL) scripts. The D’MART tool also uses the DAG for generation of ETL scripts that can be used to populate the newly proposed data warehouse from data present in the existing schemas. D’MART was developed and validated as part of an engagement with Indian Railways which operates one of the largest rail networks in the world.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arora, S., Rao, S., Vazirani, U.: Expander flows, geometric embeddings and graph partitioning (2009)

    Google Scholar 

  2. Chowdhary, P., Mihaila, G., Lei, H.: Model driven data warehousing for business performance management (2006)

    Google Scholar 

  3. Doan, A., Domingos, P., Halevy, A.: Reconciling schemas of disparate data sources: a machine-learning approach (2001)

    Google Scholar 

  4. Edmonds, J., Karp, R.: Theoretical improvements in algorithmic efficiency for network flow problems (1972)

    Google Scholar 

  5. Golfarelli, M., Maio, D., Rizzi, S.: Conceptual design of data warehouses from e/r schemes (1998)

    Google Scholar 

  6. Kimball, R.: The Data Warehouse Toolkit: Practical Techniques For Building Dimensional Data Warehouse. John Wiley & Sons (1996)

    Google Scholar 

  7. Rahm, E., Bernstein, P.: A survey of approaches to automatic schema matching (2001)

    Google Scholar 

  8. Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms. MIT Press and McGraw-Hill (2009)

    Google Scholar 

  9. Westerman, P.: Data Warehousing using the Wal-Mart Model. Morgan Kaufmann (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Negi, S., Bhide, M.A., Batra, V.S., Mohania, M.K., Bajpai, S. (2012). D’MART: A Tool for Building and Populating Data Warehouse Model from Existing Reports and Tables. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32281-5_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32280-8

  • Online ISBN: 978-3-642-32281-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics