OmpSs-OpenCL Programming Model for Heterogeneous Systems
The advent of heterogeneous computing has forced programmers to use platform specific programming paradigms in order to achieve maximum performance. This approach has a steep learning curve for programmers and also has detrimental influence on productivity and code re-usability. To help with this situation, OpenCL an open-source, parallel computing API for cross platform computations was conceived. OpenCL provides a homogeneous view of the computational resources (CPU and GPU) thereby enabling software portability across different platforms. Although OpenCL resolves software portability issues, the programming paradigm presents low programmability and additionally falls short in performance. In this paper we focus on integrating OpenCL framework with the OmpSs task based programming model using Nanos run time infrastructure to address these shortcomings. This would enable the programmer to skip cumbersome OpenCL constructs including OpenCL plaform creation, compilation, kernel building, kernel argument setting and memory transfers, instead write a sequential program with annotated pragmas. Our proposal mainly focuses on how to exploit the best of the underlying hardware platform with greater ease in programming and to gain significant performance using the data parallelism offered by the OpenCL run time for GPUs and multicore architectures. We have evaluated the platform with important benchmarks and have noticed substantial ease in programming with comparable performance.
KeywordsMemory Transfer Multicore Architecture Task Parallelism Kernel Code Data Dependency Graph
Unable to display preview. Download preview PDF.
- 1.OpenCL programming, http://www.khronos.org/registry/cl/specs/OpenCL-1.1.pdf
- 2.Duran, A., Ayguadé, E., Badia, R.M., et al.: OmpSs: a Proposal for Programming Heterogeneous Multi-Core Architectures. Parallel Processing Letters, 173–193 (2011)Google Scholar
- 3.Perez, J.M., Badia, R.M., Labarta, J.: Handling task dependencies under strided and aliased references. In: Proceeding ICS 2010 Proceedings of the 24th ACM International Conference on Supercomputing (2010)Google Scholar
- 6.Parallel Program Visualization and Analysis Tool, http://www.bsc.es/media/1364.pdf
- 8.Munshi, A., Gaster, B.R., Mattson, T.G., Fung, J., Ginsburg, D.: OpenCL Programming Guide, 1st edn. Addison-Wesley Professional (July 25, 2011) ISBN-10: 0321749642Google Scholar
- 9.Lee, J., et al.: An OpenCL framework for heterogeneous multicores with local memory. In: Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, PACT (2010)Google Scholar
- 11.Aoki, R., et al.: Hybrid OpenCL: Enhancing OpenCL for Distributed Processing. In: Parallel and Distributed Processing with Applications, ISPA (2011)Google Scholar
- 12.Gregg, C., et al.: Contention-Aware Scheduling of Parallel Code for Heterogeneous Systems. In: Poster at HotPar 2010 (2010)Google Scholar