Recommended Reading
Abadi DJ. Data Management in the cloud: limitations and opportunities. IEEE Data Eng Bull. 2009;32(1):3–12.
Abouzeid A, Bajda-Pawlikowski K, Abadi D, Silberschatz A, Rasin A. HadoopDB: an architectural hybrid of MapReduce and DBMS technologies for analytical workloads. PVLDB 2009;2(1):922–933. doi:10.14778/1687627.1687731.
Agarwal S, Mozafari B, Panda A, Milner H, Madden S, Stoica I. BlinkDB: queries with bounded errors and bounded response times on very large data. Eurosys 2013. doi:10.1145/2465351.2465355.
Armbrust M, Xin RS, Lian C, et al. Spark SQL: relational data processing in spark. SIGMOD 2015. doi:10.1145/2723372.2742797.
Chan L. Presto: interacting with petabytes of data at Facebook. 2016. https://www.facebook.com/notes/facebook-engineering/presto-interacting-with-petabytes-of-data-at-facebook/10151786197628920. Accessed 28 Jun 2016.
Dean J, Ghemawat S. MapReduce: a flexible data processing tool. CACM 2010;53(1):72–77. doi:10.1145/1629175.1629198.
Gupta A, Agarwal D, Tan D, et al. Amazon redshift and the case for simpler data warehouses. SIGMOD 2015. doi:10.1145/2723372.2742795.
Liu X, Thomsen C, Pedersen TB. ETLMR: a highly scalable dimensional ETL framework based on MapReduce. DaWaK 2011. doi:10.1007/978-3-642-23544-3_8.
Liu X, Thomsen C, Pedersen TB. CloudETL: scalable dimensional ETL for hive. IDEAS 2014. doi:10.1145/2628194.2628249.
Olston C, Reed B, Srivastava U, Kumar R, Tomkins A. Pig Latin: a not-so-foreign language for data processing. SIGMOD 2008. doi:10.1145/1376616.1376726.
Özcan F, Hoa D, Beyer KS, Balmin A, Liu CJ, Li Y. Emerging trends in the enterprise analytics: connecting Hadoop and DB2 warehouse. SIGMOD 2011. doi:10.1145/1989323.1989446.
Pavlo A, Paulson E, Rasin A, Abadi DJ, DeWitt DJ, Madden S, Stonebraker M. A comparison of approaches to large-scale data processing. SIGMOD 2009. doi:10.1145/1559845.1559865.
Pike R, Dorward S, Griesemer R, Quinlan S. Interpreting the data: parallel analysis with Sawzall. Sci Program. 2005;13(4):277–298.
Stonebreaker M, Abadi D, DeWitt DJ, Madden S, Paulson E, Pavlo A, Rasin A. MapReduce and parallel DBMSs: friends of foes? CACM 2010;53(1):64–71. doi:10.1145/1629175.1629197.
Thusso A, Sarma JS, Jain N, Shao Z, Chakka P, Anthony S, et al. Hive – a warehousing solution over a Map-Reduce framework. VLDB 2009. doi:10.14778/1687553.1687609.
Xin R, Rosen J, Zaharia M, Franklin MJ, Shenker S, Stoica I. Shark: SQL and rich analytics at scale. SIGMOD 2013. doi:10.1145/2463676.2465288
Zaharia M, Chowdhury M, Das T, et al. Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. NSDI 2012.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media LLC
About this entry
Cite this entry
Thomsen, C., Pedersen, T.B. (2017). Data Warehousing in Cloud Environments. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_80623-1
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7993-3_80623-1
Received:
Accepted:
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4899-7993-3
Online ISBN: 978-1-4899-7993-3
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering