Skip to main content

Data Integration for Business Analytics: A Conceptual Approach

  • Conference paper
Knowledge Science, Engineering and Management (KSEM 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5914))

  • 1241 Accesses

Abstract

We present first and basic ideas for a modeling environment for business analytics. Main emphasis is on modeling components for data preparation, in particular data integration. The model is based on combination of knowledge and techniques from statistical metadata management and from workflow processes. All modeling concepts are presented in a problem oriented formulation. The approach is embedded into an open model framework which aims for a modeling platform for all kinds of models useful in business applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Batini, C., Scannapieco, M.: Data Quality - Concepts, Methodologies and Techniques. Springer, Heidelberg (2006)

    MATH  Google Scholar 

  2. Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, G., Wirth, R.: CRISP-DM1.0 Cross Industry Standard Process for Data Mining (2000), http://www.crisp-dm.org/CRISPWP-0800.pdf

  3. Dasu, T., Johnson, T.: Exploratory Data Mining and Data Cleaning, Wiley Series in Probability and Statistics. Hoboken, New Jersey (2003)

    Google Scholar 

  4. Data Documentation Initiative (DDI) (2009), http://www.icpsr.umich.edu/DDI/

  5. Fellegi, I.P., Holt, D.: A systematic approach to automatic edit and imputation. Journal of the American Statistical Association 71, 17–35 (1976)

    Article  Google Scholar 

  6. Froeschl, K.A., Grossmann, W., Del Vecchio, V.: The concept of metadata: A report on the nature of metadata and how these concepts can be used in practice. Deliverable 5 of the METANET project (2003), http://www.epros.ed.ac.uk/metanet/

  7. Grossmann, W.: Metadata Usage in Statistical Computing. In: Braverman, A., Hesterberg, T., Minotte, M., Symanizik, J. (eds.) Proceedings of the 35th Symposium on the Interface, pp. 648–663. Interface Foundation of North America (2003)

    Google Scholar 

  8. Grossmann, W., Moschner, M.: Towards an Ontology for Data in Business Decisions. In: Karagiannis, D., Reimer, U. (eds.) PAKM 2004. LNCS (LNAI), vol. 3336, pp. 397–407. Springer, Heidelberg (2004)

    Google Scholar 

  9. Grossmann, W., Moschner, M.: Knowledge Integration from Mulidimensional Data Sources. In: Moreno Díaz, R., Pichler, F., Quesada Arencibia, A. (eds.) EUROCAST 2007. LNCS, vol. 4739, pp. 345–351. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  10. Karagiannis, D., Grossmann, W., Höfferer, P.: Open Models A feasibility Study, http://www.openmodels.at/web/omi/home

  11. Nordholt, E.S.: Imputation: Methods, Simulation, Experiments and Practical Examples. International Statistical Review 66, 157–180 (1998)

    Article  MATH  Google Scholar 

  12. Open Models, http://www.openmodels.at/web/omi/home

  13. PASW Modeler (formerly Clementine®), http://www.spss.com/software/modeling/modeler/

  14. Pentaho/Kettle, http://kettle.pentaho.org/

  15. Petrakos, G., Kalogeropoulos, K., Farmakis, G., Stavropoulos, P.: A Classification Scheme of Validation Rules Applied to Statistical Data Bases. In: Nanoupoulos, P., Wilkinson, D. (eds.) Proc. ETK_NTTS_2001. EUROSTAT, pp. 685–693 (2001)

    Google Scholar 

  16. Pourabbas, E., Shoshani, A.: The Composite OLAP–Object Data Model: Removing an Unnecessary Barrier. In: Froeschl, K.A., Grossmann, W. (eds.) Proc. 18th Int. Conf. on Scientific and Statistical Database Management – SSDBM 2006, pp. 291–300. IEEE, Los Alamitos (2006)

    Google Scholar 

  17. Raessler, S.: Statistical Matching. Springer, Heidelberg (2002)

    MATH  Google Scholar 

  18. Vardaki, M., Papageorgiou, H.: Statistical Data and Metadata Quality Assessment. In: Handbook of Research on Public Information Technology. Information Science Reference. IGI Global, New York (2007)

    Google Scholar 

  19. Winkler, W.E.: Overview of Record Linkage and Current Research Directions. Report RRS2006/02, Washington, D.C.: U.S. Bureau of the Census (2006)

    Google Scholar 

  20. Zhang, S., Zhang, C., Wi, X.: Knowledge Discovery from Multiple Databases. Springer, Heidelberg (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Grossmann, W. (2009). Data Integration for Business Analytics: A Conceptual Approach. In: Karagiannis, D., Jin, Z. (eds) Knowledge Science, Engineering and Management. KSEM 2009. Lecture Notes in Computer Science(), vol 5914. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10488-6_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-10488-6_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-10487-9

  • Online ISBN: 978-3-642-10488-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics