Abstract
Prior to start of any data warehouse project, developers/architects have to finalize architecture to be followed during project life cycle. Two possibilities are (a) Use of commercial ETL tool or (b) the development of in-house ETL program. Both options are having merits and demerits. The scope of this article is to optimize the ETL process with retrieval by making use of technology mix and keep in consideration all the factors which are not been considered by ETL tools.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Simitisis, P. Vassiliadis, S. Skiadopoulos, and T. Sellis, “Data warehouse refreshment,” Data Warehouses Ol. Concepts, Archit. Solut., pp. 111–134, 2006.
Kamal & Theresa, “ETL Evolution for Real-Time Data Warehousing”, Proceedings of the Conference on Information Systems Applied Research New Orleans Louisiana, USA; ISSN: 2167-1508 v5 n2214, pp. 7–8 (2012).
Mummana, S. and R. Kiran Rompella, “An Empirical Data Cleaning Technique for CFDs”, International Journal of Engineering Trends and Technology (IJETT). 4(9). 3730–3735., pp. 3730–3731 (2013).
Bloomberg Business Week Research Services, “The Current State of Business Analytics: Where Do We Go From Here?” A white paper produced in collaboration with SAS, pp. 5–8, 2011. [Online]. Available: SAS, http://www.sas.com/resources/asset/busanalyticsstudy_wp_08232011.pdf. [Accessed Sep 09, 2015].
Agrawal, D., “The Reality of Real-Time Business Intelligence”, Proceedings of the 2nd International Workshop on Business Intelligence for the Real Time Enterprise (BIRTE 2008), Editors: M. Castellanos, U. Dayal, and T. Sellis, Springer, LNBIP 27, pp. 75–88 (2009).
Razi O. Mohammed and Samani A. Talab, “Clinical Data Warehouse Issues and Challenges”, International Journal of u-and e-Service, Science and Technology, Vol. 7, No. 5, pp. 251–262 (2014).
Bergamaschi, S., Guerra et al., “A Semantic Approach to ETL Technologies, Data & Knowledge Engineering”, 70(8), pp. 717–731. (2011).
P Lane, “Oracle Database Data Warehousing Guide 11 g”, Oracle Corporation USA, Release 2 (11.2) E25554–02, (2013).
Kushaoor & JNTUA.: ETL Process Modeling In DWH Using Enhanced Quality Techniques, International Journal of Database Theory and Application. Vol. 6. No. 4, pp. 181–182 (2013).
W.H. Inmon, “Building the Data Warehouse”,(third ed.), John Wiley and Sons, USA, (2002).
IBM, “IBM Data Warehouse Manager”, 2003 [Online], Available: http://www3.ibm.com/software/data/db2/datawarehouse, [Accessed Sep 20, 2015].
Simitsis, A., “Modeling and optimization of extraction-transformation-loading(ETL) processes in data warehouse environments”, Doctoral Thesis, NTU Athens, Greece (2004).
Muhammad Arif,Ghulam Mujtaba, “A Survey: Data Warehouse Architecture”, International Journal of Hybrid Information Technology, Vol. 8, No. 5, pp. 349–356, (2015).
Immanuel Chan, Lance Ashdown, “Oracle Database Performance Tuning Guide, 11 g”, Oracle Corporation USA, Release 2 (11.2) E41573–04 (2014).
Qin Hanlin; Jin Xianzhen; Zhang Xianrong, “Research on Extract, Transform and Load (ETL) in land and Resources Star Schema data Warehouses” Computational Intelligence and design (ISCID), 2012 fifth International IEEE Symposium on (Volume: 1), pp. 120–123, ISBN-978-1-4673-2646-9 (2012).
Burleson Consulting, “Hypercharge SQL*Loader load speed performance” [Online], Available: http://www.dba-oracle.com/art_orafaq_data_load.htm (2008).
H. Galhardas, “Achieving Data Quality With AJAX”, [Online], Available: https://fenix.tecnico.ulisboa.pt/downloadFile/3779571376272/ajax.pdf, 2006–07, Accessed: Sep 04, 2015.
V. Raman and J. M. Hellerstein, “Potter’s Wheel: An Interactive Data Cleaning System,” Data Base, vol. 01, pp. 381–390, 2001.
Shaker H. Ali El-Sappagha, Abdeltawab M. Ahmed Hendawib, Ali Hamed El Bastawissyb, “A proposed model for data warehouse ETL processes”, Journal of King Saud University - Computer and Information Sciences.
P. Vassiliadis et al., “A generic and customizable framework for the design of ETL scenarios”, Information Systems, vol. 30, no. 7, pp. 492–525, Elsevier Science Ltd.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media Singapore
About this paper
Cite this paper
Sharma Sachin, Kumar Kamal (2017). ETLR—Effective DWH Design Paradigm. In: Satapathy, S., Bhateja, V., Joshi, A. (eds) Proceedings of the International Conference on Data Engineering and Communication Technology. Advances in Intelligent Systems and Computing, vol 469. Springer, Singapore. https://doi.org/10.1007/978-981-10-1678-3_14
Download citation
DOI: https://doi.org/10.1007/978-981-10-1678-3_14
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-1677-6
Online ISBN: 978-981-10-1678-3
eBook Packages: EngineeringEngineering (R0)