Skip to main content

Parallel Implementation of External Sort and Join Operations on a Multi-core Network-Optimized System on a Chip

  • Conference paper
Algorithms and Architectures for Parallel Processing (ICA3PP 2011)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7016))

Abstract

In a commercial Relational Database Management System (RDBMS), sort and join are the most demanding operations, and it is quite beneficial to improve the performance of external sort and external join algorithms that handle large input data sizes. This paper proposes parallel implementations of multi-threaded external sort and external hash join algorithms to accelerate IBM DB2, one of leading RDBMSs, using an IBM Power Edge of Network (IBM PowerENTM) Peripheral Component Interconnect Express (PCIe) card as an accelerator. The preliminary results show that the proposed parallel implementation of the algorithms on PowerENTM PCIe card can speed up the DB2 sort and join performance about two times.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arefin, A.S., Hasan, M.A.: An Improvement of Bitonic Sorting for Parallel Computing. In: Proceedings of the 9th WSEAS International Conference on Distributed Computing, Athens, Greece (2005)

    Google Scholar 

  2. Chhugani, J., Nguyen, A.D., Lee, V.W., Macy, W., Hagog, M., Chen, Y.K., Baransi, A., Kumar, S., Dubey, P.: Efficient Implementation of Sorting on Multi-core SIMD CPU architecture. In: Proceedings of the VLDB Endowment, pp. 1313–1314 (2008)

    Google Scholar 

  3. Ramprasad, N., Baruah, P.K.: Radix Sort on the Cell Broadband Engine. In: International Conference on High Performance Computing, HiPC (2007)

    Google Scholar 

  4. Zagha, M., Blelloch, G.E.: Radix Sort for Vector Multiprocessors. In: Proceedings Supercomputing, pp. 712–721 (1991)

    Google Scholar 

  5. Satish, N., Harris, M., Garland, M.: Designing Efficient Sorting Algorithms for Manycore GPUs. In: The 23rd IEEE Internal Parallel and Distributed Processing Symposium, pp. 1–10 (2009)

    Google Scholar 

  6. Azadegan, S., Tripathi, A.: A Parallel Join Algorithm for SIMD Architectures. Journal of Systems and Software, 265–280 (1997)

    Google Scholar 

  7. Lu, H., Tan, K.L., Sahn, M.C.: Hash-based Join Algorithms for Multiprocessor Computers with Shared Memory. In: Proceedings of the Sixteenth International Conference on Very Large Database (1990)

    Google Scholar 

  8. Garcia, P., Korth, H.F.: Database Hash-Join Algorithms on Multithreaded Computer Architectures. In: Proceedings of the third Conference on Computing Frontiers (2006)

    Google Scholar 

  9. Martin, T.P., Larson, P.A., Deshpande, V.: Parallel Hash-Based Join Algorithms for a Shared-Everything Environment. IEEE Transactions on Knowledge and Data Engineering 6 (1994)

    Google Scholar 

  10. Knuth, D.E.: The Art of Computer Programming. Sorting and Searching, vol. 3. Addison-Wesley, Reading (1973)

    MATH  Google Scholar 

  11. Dewitt, D.J., Katz, R.H., Olken, F., Shapiro, L.D., Stonebraker, M.R., Wood, D.A.: Implementation Techniques for Main Memory Database Systems. Proceedings of the 1984 ACM SIGMOD International Conference on Management of Data 14 (1984)

    Google Scholar 

  12. LaPotin, D.P., Daijavad, S., Johnson, C.L., Hunter, S.W., Ishizaki, K., Franke, H., Achilles, H.D., Dumarot, D.P., Greco, N.A., Davari, B.: Workload and Network-Optimized Computing Systems. IBM Journal of Research and Development 54(1) (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Khorasani, E., Paulovicks, B.D., Sheinin, V., Yeo, H. (2011). Parallel Implementation of External Sort and Join Operations on a Multi-core Network-Optimized System on a Chip. In: Xiang, Y., Cuzzocrea, A., Hobbs, M., Zhou, W. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2011. Lecture Notes in Computer Science, vol 7016. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24650-0_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-24650-0_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-24649-4

  • Online ISBN: 978-3-642-24650-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics