Abstract
One approach to big data benchmarking is to study the implications of the underlying computing platform, both hardware and software. The software stack for big data analytics comprises a suite of technology solutions including MPP SQL Engines. This paper provides an overview of the architectural choices in the underlying hardware infrastructure, the architectural choices in the design of parallel SQL Engines, and the implications of both on benchmarking. A benchmark suite developed at XtremeData for internal engineering use is also described.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Double Data Rate.
- 2.
Quad Data Rate.
- 3.
Storage Area Network.
- 4.
Network Attached Storage.
References
Big data: The next frontier for innovation, competition, and productivity, McKinsey&Company. http://www.mckinsey.com/insights/business_technology/big_data_the_next_frontier_for_innovation
Cloudera-Hadoop. http://www.cloudera.com/content/cloudera/en/products-and-services/cdh.html
Apache CouchDB. http://couchdb.apache.org
MongoDB: A Document Oriented Database. http://www.mongodb.org/about/
NoSQL Distilled. http://martinfowler.com/books/nosql.html
Amazon Web Services. https://aws.amazon.com/products/
Google Compute Engine:Quickstart: Creating an instance and launching Apache - Google Developers. https://developers.google.com/compute/docs/quickstart
VMware 10GE QoS Design Deep Dive with Cisco UCS, Nexus. http://bradhedlund.com/2010/09/15/vmware-10ge-qos-designs-cisco-ucs-nexus/
OpenStack Open Source Cloud Computing Software. https://www.openstack.org/
PostgreSQL: The world’s most advanced open source database. http://www.postgresql.org/
TPC – Homepage. http://www.tpc.org/default.asp
Seng, J.-L., Yao, S.B., Hevner, A.R.: Requirements-driven database systems benchmark method. Decis. Support Syst. 38(4), 629–648 (2005). doi:10.1016/j.dss.2003.06.002, http://dx.doi.org/10.1016/j.dss.2003.06.002
Darmont, J., Bentayeb, F., Boussaid, O.: Benchmarking data warehouses. Int. J. Bus. Intell. Data Min. 2(1), 79–104 (2007). doi:10.1504/IJBIDM.2007.012947, http://dx.doi.org/10.1504/IJBIDM.2007.012947
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Chandran, R., Sridhar, K.T., Sakkeer, M.A. (2014). MPP SQL Engines: Architectural Choices and Their Implications on Benchmarking. In: Rabl, T., Raghunath, N., Poess, M., Bhandarkar, M., Jacobsen, HA., Baru, C. (eds) Advancing Big Data Benchmarks. WBDB WBDB 2013 2013. Lecture Notes in Computer Science(), vol 8585. Springer, Cham. https://doi.org/10.1007/978-3-319-10596-3_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-10596-3_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10595-6
Online ISBN: 978-3-319-10596-3
eBook Packages: Computer ScienceComputer Science (R0)