Skip to main content

Parallel Database Systems

  • Chapter
  • First Online:
  • 4148 Accesses

Abstract

Many data-intensive applications require support for very large databases (e.g., hundreds of terabytes or exabytes). Supporting very large databases efficiently for either OLTP or OLAP can be addressed by combining parallel computing and distributed database management.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   49.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   89.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Change history

  • 09 September 2020

    The figures included in the original version of this book has been replaced. The figures have been updated throughout the book in this version of the book.

References

  • Abadi, D. J., Carney, D., Çetintemel, U., Cherniack, M., Convey, C., Lee, S., Stonebraker, M., Tatbul, N., and Zdonik, S. (2003). Aurora: a new model and architecture for data stream management. VLDB J., 12 (2): 120–139.

    Article  Google Scholar 

  • Abadi, D. J., Ahmad, Y., Balazinska, M., Çetintemel, U., Cherniack, M., Hwang, J.-H., Lindner, W., Maskey, A., Rasin, A., Ryvkina, E., Tatbul, N., Xing, Y., and Zdonik, S. B. (2005). The design of the Borealis stream processing engine. In Proc. 2nd Biennial Conf. on Innovative Data Systems Research, pages 277–289.

    Google Scholar 

  • Abadi, D. J., Marcus, A., Madden, S. R., and Hollenbach, K. (2007). Scalable semantic web data management using vertical partitioning. In Proc. 33rd Int. Conf. on Very Large Data Bases, pages 411–422.

    Google Scholar 

  • Abadi, D. J., Marcus, A., Madden, S., and Hollenbach, K. (2009). SW-Store: a vertically partitioned DBMS for semantic web data management. VLDB J., 18 (2): 385–406.

    Article  Google Scholar 

  • Aberer, K. (2001). P-grid: A self-organizing access structure for P2P information systems. In Proc. Int. Conf. on Cooperative Inf. Syst., pages 179–194.

    Google Scholar 

  • Aberer, K. (2003). Guest editor’s introduction. ACM SIGMOD Rec., 32 (3): 21–22.

    Article  Google Scholar 

  • Aberer, K., Cudré-Mauroux, P., Datta, A., Despotovic, Z., Hauswirth, M., Punceva, M., and Schmidt, R. (2003a). P-grid: a self-organizing structured P2P system. ACM SIGMOD Rec., 32 (3): 29–33.

    Article  Google Scholar 

  • Aberer, K., Cudré-Mauroux, P., and Hauswirth, M. (2003b). Start making sense: The chatty web approach for global semantic agreements. J. Web Semantics, 1 (1): 89–114.

    Article  Google Scholar 

  • Abiteboul, S., Quass, D., McHugh, J., Widom, J., and Wiener, J. (1997). The Lorel query language for semistructured data. Int. J. Digit. Libr., 1 (1): 68–88.

    Article  Google Scholar 

  • Abiteboul, S., Buneman, P., and Suciu, D. (1999). Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufmann.

    Google Scholar 

  • Abiteboul, S., Manolescu, I., Rigaux, P., Rousset, M.-C., and Senellart, P. (2011). Web Data Management. Cambridge University Press.

    Book  Google Scholar 

  • Abou-Rjeili, A. and Karypis, G. (2006). Multilevel algorithms for partitioning power-law graphs. In Proc. 20th IEEE Int. Parallel & Distributed Processing Symp., pages 124–124.

    Google Scholar 

  • Abouzeid, A., Bajda-Pawlikowski, K., Abadi, D., Silberschatz, A., and Rasin, A. (2009). HadoopDB: an architectural hybrid of MapReduce and DBMS technologies for analytical workloads. Proc. VLDB Endowment, 2 (1): 922–933.

    Article  Google Scholar 

  • Adali, S., Candan, K. S., Papakonstantinou, Y., and Subrahmanian, V. S. (1996a). Query caching and optimization in distributed mediator systems. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 137–148.

    Google Scholar 

  • Adali, S., Candan, K. S., Papakonstantinou, Y., and Subrahmanian, V. S. (1996b). Query caching and optimization in distributed mediator systems. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 137–148.

    Google Scholar 

  • Adamic, L. and Huberman, B. (2000). The nature of markets in the world wide web. Quart. J. Electron. Comm., 1: 5–12.

    Google Scholar 

  • Adiba, M. (1981). Derived relations: A unified mechanism for views, snapshots and distributed data. In Proc. 7th Int. Conf. on Very Data Bases, pages 293–305.

    Google Scholar 

  • Adiba, M. and Lindsay, B. (1980). Database snapshots. In Proc. 6th Int. Conf. on Very Data Bases, pages 86–91.

    Google Scholar 

  • Adler, M. and Mitzenmacher, M. (2001). Towards compressing web graphs. In Proc. Data Compression Conf., pages 203–212.

    Google Scholar 

  • Aggarwal, C. C., editor. (2007). Data Streams: Models and Algorithms. Springer.

    Google Scholar 

  • Agichtein, E., Lawrence, S., and Gravano, L. (2004). Learning to find answers to questions on the web. ACM Trans. Internet Tech., 4 (3): 129—162.

    Article  Google Scholar 

  • Agrawal, D. and Sengupta, S. (1993). Modular synchronization in distributed, multiversion databases: Version control and concurrency control. IEEE Trans. Knowl. and Data Eng., 5 (1): 126 –137.

    Article  Google Scholar 

  • Agrawal, D., Das, S., and El Abbadi, A. (2012). Data Management in the Cloud: Challenges and Opportunities. Synthesis Lectures on Data Management. Morgan & Claypool Publishers.

    Google Scholar 

  • Agrawal, S., Narasayya, V., and Yang, B. (2004). Integrating vertical and horizontal partitioning into automated physical database design. In Proc. ACM SIGMOD Int. Conf. on Management of Data.

    Google Scholar 

  • Akal, F., Böhm, K., and Schek, H.-J. (2002). Olap query evaluation in a database cluster: A performance study on intra-query parallelism. In Proc. 6th East European Conf. Advances in Databases and Information Systems, pages 218–231.

    Google Scholar 

  • Akal, F., Türker, C., Schek, H.-J., Breitbart, Y., Grabs, T., and Veen, L. (2005). Fine-grained replication and scheduling with freshness and correctness guarantees. In Proc. 31st Int. Conf. on Very Large Data Bases, pages 565–576.

    Google Scholar 

  • Akbarinia, R. and Martins, V. (2007). Data management in the APPA system. J. Grid Comp., 5 (3): 303–317.

    Article  Google Scholar 

  • Akbarinia, R., Martins, V., Pacitti, E., and Valduriez, P. (2006). Design and implementation of Atlas P2P architecture. In Baldoni, R., Cortese, G., and Davide, F., editors, Global Data Management, pages 98–123. IOS Press.

    Google Scholar 

  • Akbarinia, R., Pacitti, E., and Valduriez, P. (2007a). Processing top-k queries in distributed hash tables. In Proc. 13th Int. Euro-Par Conf., pages 489–502.

    Google Scholar 

  • Akbarinia, R., Pacitti, E., and Valduriez, P. (2007b). Query processing in P2P systems. Technical Report 6112, INRIA, Rennes, France.

    Google Scholar 

  • Akbarinia, R., Pacitti, E., and Valduriez, P. (2007c). Best position algorithms for top-k queries. In Proc. 33rd Int. Conf. on Very Large Data Bases, pages 495–506.

    Google Scholar 

  • Akbarinia, R., Pacitti, E., and Valduriez, P. (2007d). Data currency in replicated dhts. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 211–222.

    Google Scholar 

  • Akidau, T., Balikov, A., Bekiroglu, K., Chernyak, S., Haberman, J., Lax, R., McVeety, S., Mills, D., Nordstrom, P., and Whittle, S. (2013). MillWheel: Fault-tolerant stream processing at internet scale. Proc. VLDB Endowment, 6 (11): 1033–1044.

    Article  Google Scholar 

  • Alagiannis, I., Borovica, R., Branco, M., Idreos, S., and Ailamaki, A. (2012). NoDB: efficient query execution on raw data files. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 241–252.

    Google Scholar 

  • Alagiannis, I., Idreos, S., and Ailamaki, A. (2014). H2O: A hands-free adaptive store. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 1103–1114.

    Google Scholar 

  • Alamoudi, A. A., Grover, R., Carey, M. J., and Borkar, V. R. (2015). External data access and indexing in AsterixDB. In Proc. 24th ACM Int. Conf. on Information and Knowledge Management, pages 3–12.

    Google Scholar 

  • Albutiu, M.-C., Kemper, A., and Neumann, T. (2012). Massively parallel sort-merge joins in main memory multi-core database systems. Proc. VLDB Endowment, 5 (10): 1064–1075.

    Article  Google Scholar 

  • Allard, T., Hébrail, G., Masseglia, F., and Pacitti, E. (2015). Chiaroscuro: Transparency and privacy for massive personal time-series clustering. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 779–794.

    Google Scholar 

  • Alomari, M., Cahill, M., Fekete, A., and Rohm, U. (2008). The cost of serializability on platforms that use snapshot isolation. In Proc. 24th Int. Conf. on Data Engineering, pages 576 –585.

    Google Scholar 

  • Alomari, M., Fekete, A., and Rohm, U. (2009). A robust technique to ensure serializable executions with snapshot isolation DBMS. In Proc. 25th Int. Conf. on Data Engineering, pages 341–352.

    Google Scholar 

  • Alsberg, P. A. and Day, J. D. (1976). A principle for resilient sharing of distributed resources. In Proc. 2nd Int. Conf. on Software Engineering, pages 562–570.

    Google Scholar 

  • Alsubaiee, S., Altowim, Y., Altwaijry, H., Behm, A., Borkar, V. R., Bu, Y., Carey, M. J., Cetindil, I., Cheelangi, M., Faraaz, K., Gabrielova, E., Grover, R., Heilbron, Z., Kim, Y., Li, C., Li, G., Ok, J. M., Onose, N., Pirzadeh, P., Tsotras, V. J., Vernica, R., Wen, J., and Westmann, T. (2014). AsterixDB: A scalable, open source DBMS. Proc. VLDB Endowment, 7 (14): 1905–1916.

    Article  Google Scholar 

  • Altingövde, I. S. and Ulusoy, Ö. (2004). Exploiting interclass rules for focused crawling. IEEE Intelligent Systems, 19 (6): 66–73.

    Article  Google Scholar 

  • Aluç, G. (2015). Workload Matters: A Robust Approach to Physical RDF Database Design. PhD thesis, University of Waterloo.

    Google Scholar 

  • Alvarez, V., Schuhknecht, F. M., Dittrich, J., and Richter, S. (2014). Main memory adaptive indexing for multi-core systems. In Proc. 10th Workshop on Data Management on New Hardware, pages 3:1—-3:10.

    Google Scholar 

  • Amdahl, G. M. (1967). Validity of the single processor approach to achieving large scale computing capabilities. In Proc. Spring Joint Computer Conf., pages 483–485.

    Google Scholar 

  • Amsaleg, L., Franklin, M. J., Tomasic, A., and Urhan, T. (1996). Scrambling query plans to cope with unexpected delays. In Proc. 4th Int. Conf. on Parallel and Distributed Information Systems, pages 208–219.

    Google Scholar 

  • Andreev, K. and Racke, H. (2006). Balanced graph partitioning. Theor. Comp. Sci., 39 (6): 929–939.

    MathSciNet  MATH  Google Scholar 

  • Angles, R. and Gutierrez, C. (2008). The expressive power of SPARQL. In Proc. 7th Int. Semantic Web Conf., pages 114–129.

    Google Scholar 

  • Antoniou, G. and Plexousakis, D. (2018). Semantic web. In Liu, L. and Özsu, M. T., editors, Encyclopedia of Database Systems, pages 3425–3429. Springer New York, New York, NY.

    Chapter  Google Scholar 

  • Apache. (2016). Apache Giraph. http://giraph.apache.org. Last accessed June 2019.

  • Apers, P., van den Berg, C., Flokstra, J., Grefen, P., Kersten, M., and Wilschut, A. (1992). Prisma/DB: a parallel main-memory relational DBMS. IEEE Trans. Knowl. and Data Eng., 4: 541–554.

    Article  Google Scholar 

  • Apers, P. M. G. (1981). Redundant allocation of relations in a communication network. In Proc. 5th Berkeley Workshop on Distributed Data Management and Computer Networks, pages 245–258.

    Google Scholar 

  • Arasu, A. and Widom, J. (2004). A denotational semantics for continuous queries over streams and relations. ACM SIGMOD Rec., 33 (3): 6–11.

    Article  Google Scholar 

  • Arasu, A., Cho, J., Garcia-Molina, H., Paepcke, A., and Raghavan, S. (2001). Searching the web. ACM Trans. Internet Tech., 1 (1): 2–43.

    Article  Google Scholar 

  • Arasu, A., Babu, S., and Widom, J. (2006). The CQL continuous query language: Semantic foundations and query execution. VLDB J., 15 (2): 121–142.

    Article  Google Scholar 

  • Armbrust, M., Xin, R. S., Lian, C., Huai, Y., Liu, D., Bradley, J. K., Meng, X., Kaftan, T., Franklin, M. J., Ghodsi, A., and Zaharia, M. (2015). Spark SQL: Relational data processing in Spark. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 1383–1394.

    Google Scholar 

  • Arocena, G. and Mendelzon, A. (1998). WebOQL: Restructuring documents, databases and webs. In Proc. 14th Int. Conf. on Data Engineering, pages 24–33.

    Google Scholar 

  • Asad, O. and Kemme, B. (2016). Adaptcache: Adaptive data partitioning and migration for distributed object caches. In Proc. ACM/IFIP/USENIX 17th Int. Middleware Conf., pages 7:1–7:13.

    Google Scholar 

  • Aspnes, J. and Shah, G. (2003). Skip graphs. In Proc. 14th Annual ACM-SIAM Symp. on Discrete Algorithms, pages 384–393.

    Google Scholar 

  • Avnur, R. and Hellerstein, J. (2000). Eddies: Continuously adaptive query processing. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 261–272.

    Google Scholar 

  • Ayad, A. and Naughton, J. (2004). Static optimization of conjunctive queries with sliding windows over unbounded streaming information sources. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 419–430.

    Google Scholar 

  • Azar, Y., Broder, A. Z., Karlin, A. R., and Upfal, E. (1999). Balanced allocations. SIAM J. on Comput., 29 (1): 180–200.

    Article  MathSciNet  MATH  Google Scholar 

  • Babb, E. (1979). Implementing a relational database by means of specialized hardware. ACM Trans. Database Syst., 4 (1): 1–29.

    Article  Google Scholar 

  • Babcock, B., Babu, S., Datar, M., Motwani, R., and Widom, J. (2002). Models and issues in data stream systems. In Proc. ACM SIGACT-SIGMOD Symp. on Principles of Database Systems, pages 1–16.

    Google Scholar 

  • Balazinska, M., Kwon, Y., Kuchta, N., and Lee, D. (2007). Moirae: History-enhanced monitoring. In Proc. 3rd Biennial Conf. on Innovative Data Systems Research, pages 375–386.

    Google Scholar 

  • Balke, W.-T., Nejdl, W., Siberski, W., and Thaden, U. (2005). Progressive distributed top-k retrieval in peer-to-peer networks. In Proc. 21st Int. Conf. on Data Engineering, pages 174–185.

    Google Scholar 

  • Bancilhon, F. and Spyratos, N. (1981). Update semantics of relational views. ACM Trans. Database Syst., 6 (4): 557–575.

    Article  MATH  Google Scholar 

  • Barbara, D., Garcia-Molina, H., and Spauster, A. (1986). Policies for dynamic vote reassignment. In Proc. 6th IEEE Int. Conf. on Distributed Computing Systems, pages 37–44.

    Google Scholar 

  • Barbara, D., Molina, H. G., and Spauster, A. (1989). Increasing availability under mutual exclusion constraints with dynamic voting reassignment. ACM Trans. Comp. Syst., 7 (4): 394–426.

    Article  Google Scholar 

  • Barthels, C., Loesing, S., Alonso, G., and Kossmann, D. (2015). Rack-scale in-memory join processing using RDMA. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 1463–1475.

    Google Scholar 

  • Batini, C. and Lenzirini, M. (1984). A methodology for data schema integration in entity-relationship model. IEEE Trans. Softw. Eng., SE-10 (6): 650–654.

    Article  Google Scholar 

  • Batini, C., Lenzirini, M., and Navathe, S. B. (1986). A comparative analysis of methodologies for database schema integration. ACM Comput. Surv., 18 (4): 323–364.

    Article  Google Scholar 

  • Beeri, C., Bernstein, P. A., and Goodman, N. (1989). A model for concurrency in nested transaction systems. J. ACM, 36 (2): 230–269.

    Article  MathSciNet  MATH  Google Scholar 

  • Bell, D. and Grimson, J. (1992). Distributed Database Systems. Addison Wesley. Reading.

    MATH  Google Scholar 

  • Bell, D. and Lapuda, L. (1976). Secure computer systems: Unified exposition and Multics interpretation. Technical Report MTR-2997 Rev.1, MITRE Corp, Bedford, MA.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Özsu, M.T., Valduriez, P. (2020). Parallel Database Systems. In: Principles of Distributed Database Systems. Springer, Cham. https://doi.org/10.1007/978-3-030-26253-2_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-26253-2_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-26252-5

  • Online ISBN: 978-3-030-26253-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics