Skip to main content

The Impact of Data Dispersion on the Accuracy of the Data Warehouse Federation’s Response

  • Conference paper
  • First Online:
Computational Collective Intelligence (ICCCI 2018)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11055))

Included in the following conference series:

  • 1249 Accesses

Abstract

This article is a preliminary attempt to verify how the values of the metrics that characterize data warehouses affect the accuracy of the data warehouse federation’s response. The federation can be build almost always, but sometimes the heterogeneity of data stored in the component data warehouse is too big to properly handle the user’s request. If that happens, the effort put on the integration is a waste. Some work was done, but the federation does not give accurate response. In this paper some dependencies between the accuracy of federation’s response and data warehouse metrics are discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Berger, S., Schrefl, M.: From federated databases to a federated data warehouse system. In: Proceedings of the 41st Annual Hawaii International Conference on System Sciences, HICSS 2008, pp. 394–404 (2008)

    Google Scholar 

  2. Dong, X.L., Berti-Equille, L., Srivastava, D.: Data fusion: resolving conflicts from multiple sources. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds.) WAIM 2013. LNCS, vol. 7923, pp. 64–76. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38562-9_7

    Chapter  Google Scholar 

  3. Fan, W., Lu, H., Madnick, S.E., Cheung, D.: Discovering and reconciling value conflicts for numerical data integration. Inf. Syst. 26(8), 635–656 (2001)

    Article  Google Scholar 

  4. Jindal, R., Acharya, A.: Federated Data Warehouse Architecture. White paper, Wipro Technologies (2003)

    Google Scholar 

  5. Kern, R., Ryk, K., Nguyen, N.T.: A framework for building logical schema and query decomposition in data warehouse federations. In: Jȩdrzejowicz, P., Nguyen, N.T., Hoang, K. (eds.) ICCCI 2011, Part I. LNCS (LNAI), vol. 6922, pp. 612–622. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23935-9_60

    Chapter  Google Scholar 

  6. Kern, R., Stolarczyk, T., Nguyen, N.T.: A formal framework for query decomposition and knowledge integration in data warehouse federations. Expert Syst. Appl. 40, 2592–2606 (2013)

    Article  Google Scholar 

  7. Kern, R., Dobrowolski, G., Nguyen, N.T.: A method for response integration in federated data warehouses. In: Camacho, D., Kim, S.-W., Trawiński, B. (eds.) New Trends in Computational Collective Intelligence. SCI, vol. 572, pp. 63–73. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-10774-5_6

    Chapter  Google Scholar 

  8. Lenzerini, M.: Data integration: a theoretical perspective. In: Proceedings of the Twentyfirst ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2002, pp. 233–246 (2002)

    Google Scholar 

  9. Motro, A., Anokhin, P.: Fusionplex: Resolution of data inconsistencies in the integration of heterogeneous information sources. Inf. Fusion 7(2), 176–196 (2006)

    Article  Google Scholar 

  10. Nguyen, N.T.: Advanced Methods for Inconsistent Knowledge Management. Advanced Information and Knowledge Processing. Springer, New York (2007). https://doi.org/10.1007/978-1-84628-889-0

    Book  Google Scholar 

  11. Vo, T.N.C., Nguyen, H.P., Vo, T.N.T.: Making kernel-based vector quantization robust and effective for incomplete educational data clustering. Vietnam J. Comput. Sci. 3(2), 93–102 (2016)

    Article  Google Scholar 

  12. Schneider, M.: Integrated vision of federated data warehouses CEUR-WS. In: Proceedings of International Workshop on Data Integration and Semantic Web (DisWeb 2006), vol. 238, pp. 336–347 (2006)

    Google Scholar 

  13. Marcos, M., Maldonado, J.A., Martnez-Salvador, B., Bosc, D., Robles, M.: Interoperability of clinical decision-support systems and electronic health records using archetypes: a case study in clinical trial eligibility. J. Biomed. Inf. 46(4), 676 (2013)

    Article  Google Scholar 

  14. Banek, M., Tjoa, A.M., Stolba, N.: Integrating different grain levels in a medical data warehouse federation. In: Tjoa, A.M., Trujillo, J. (eds.) DaWaK 2006. LNCS, vol. 4081, pp. 185–194. Springer, Heidelberg (2006). https://doi.org/10.1007/11823728_18

    Chapter  Google Scholar 

  15. Kern, R.: Data warehouses federation as a single data warehouse. In: Nguyen, N.-T., Manolopoulos, Y., Iliadis, L., Trawiński, B. (eds.) ICCCI 2016, Part I. LNCS (LNAI), vol. 9875, pp. 356–366. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45243-2_33

    Chapter  Google Scholar 

  16. Getta, J.R., Handoko: On transformation of query scheduling strategies in distributed and heterogeneous database systems. In: Nguyen, N., Trawiński, B., Kosala, R. (eds.) Intelligent Information and Database Systems. Lecture Notes in Computer Science, vol. 9011. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-15702-3_14

  17. Waddington, R.: An Architected Approach to Integrated Information. White paper, Kalido (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rafał Kern .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kern, R. (2018). The Impact of Data Dispersion on the Accuracy of the Data Warehouse Federation’s Response. In: Nguyen, N., Pimenidis, E., Khan, Z., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2018. Lecture Notes in Computer Science(), vol 11055. Springer, Cham. https://doi.org/10.1007/978-3-319-98443-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-98443-8_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-98442-1

  • Online ISBN: 978-3-319-98443-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics