Abstract
One outcome of DARPA’s HPCS program has been the creation of three new high-productivity languages: Chapel, Fortress, and X10. While these languages improve language expressiveness and programmer productivity, several technical challenges remain in delivering high performance with them. In the absence of optimization, the high-level language constructs that improve productivity can degrade runtime performance by an order of magnitude.
This paper addresses the problem of efficient code generation for high-level array accesses in the X10 language. Two aspects of high-level array accesses in X10 are important for productivity but also pose significant performance challenges: the accesses are performed through Point objects rather than integer indices, and variables containing references to arrays are rank-independent. Our solution to the first challenge is to extend the X10 compiler with automatic inlining and scalar replacement of Point objects. Our partial solution to the second challenge is to use X10’s dependent type system to let the programmer annotate array variable declarations with the rank and region of the variable, so that the compiler can generate efficient code wherever the dependent type information is available. Although this paper focuses on high-level array accesses in X10, our approach is applicable to similar constructs in other languages.
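To make the two challenges concrete, the following Java sketch contrasts a rank-independent, Point-indexed access with the kind of code that results once the Point is scalar-replaced and the rank is known to be 2. It is an illustration only, not the X10 compiler's output: the `Point` and `GeneralArray` classes, the row-major layout, and all method names are hypothetical stand-ins introduced here.

```java
public class PointAccessSketch {
    /** Hypothetical stand-in for an X10 Point: an immutable tuple of integer coordinates. */
    static final class Point {
        private final int[] coords;
        Point(int... coords) { this.coords = coords.clone(); }
        int rank() { return coords.length; }
        int get(int dim) { return coords[dim]; }
    }

    /** Hypothetical rank-independent array: row-major storage indexed by a Point (assumed layout). */
    static final class GeneralArray {
        private final int[] extents;
        private final double[] data;

        GeneralArray(int... extents) {
            this.extents = extents.clone();
            int size = 1;
            for (int e : extents) size *= e;
            this.data = new double[size];
        }

        // The rank is only known at run time, so every access loops over the dimensions.
        private int offset(Point p) {
            if (p.rank() != extents.length) throw new IllegalArgumentException("rank mismatch");
            int off = 0;
            for (int d = 0; d < extents.length; d++) {
                int c = p.get(d);
                if (c < 0 || c >= extents[d]) throw new ArrayIndexOutOfBoundsException(c);
                off = off * extents[d] + c;
            }
            return off;
        }

        double get(Point p) { return data[offset(p)]; }
        void set(Point p, double v) { data[offset(p)] = v; }
    }

    /** Unoptimized shape: one Point allocation and one generic offset loop per access. */
    static double sumWithPoints(GeneralArray a, int rows, int cols) {
        double sum = 0.0;
        for (int i = 0; i < rows; i++)
            for (int j = 0; j < cols; j++)
                sum += a.get(new Point(i, j));
        return sum;
    }

    /** Optimized shape: the Point is replaced by the scalars i and j, and the rank is fixed at 2. */
    static double sumScalarReplaced(double[] data, int rows, int cols) {
        double sum = 0.0;
        for (int i = 0; i < rows; i++)
            for (int j = 0; j < cols; j++)
                sum += data[i * cols + j];
        return sum;
    }

    /** Fills a 3x4 array and returns the sum computed both ways. */
    static double[] demoSums() {
        int rows = 3, cols = 4;
        GeneralArray a = new GeneralArray(rows, cols);
        for (int i = 0; i < rows; i++)
            for (int j = 0; j < cols; j++)
                a.set(new Point(i, j), i * cols + j);
        return new double[] { sumWithPoints(a, rows, cols), sumScalarReplaced(a.data, rows, cols) };
    }

    public static void main(String[] args) {
        double[] sums = demoSums();
        System.out.println("point-based sum:     " + sums[0]);
        System.out.println("scalar-replaced sum: " + sums[1]);
    }
}
```

Both loops compute the same result; the second avoids the per-access object allocation and the per-dimension offset loop, which is the effect the rank and region annotations are meant to make possible.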
Our experimental results for single-thread performance demonstrate that these compiler optimizations enable high-level X10 array accesses with implicit ranks and Points to run up to 5.4× faster than unoptimized X10 code, and to achieve performance comparable (from 48% to 100%) to that of lower-level Java programs. These results underscore the importance of the optimization techniques presented in this paper for achieving high performance with high productivity.
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Joyner, M., Budimlić, Z., Sarkar, V. (2007). Optimizing Array Accesses in High Productivity Languages. In: Perrott, R., Chapman, B.M., Subhlok, J., de Mello, R.F., Yang, L.T. (eds) High Performance Computing and Communications. HPCC 2007. Lecture Notes in Computer Science, vol 4782. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75444-2_43
DOI: https://doi.org/10.1007/978-3-540-75444-2_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75443-5
Online ISBN: 978-3-540-75444-2
eBook Packages: Computer Science (R0)