
Run-Time Support to Register Allocation for Loop Parallelization of Image Processing Programs

  • Conference paper
High Performance Computing and Networking (HPCN-Europe 2000)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1823)

Abstract

When Image Processing Programs (IPP) are targeted to Instruction Level Parallel architectures that perform dynamic instruction scheduling, register allocation is the key step in exposing the high degree of parallelism typically present in the loops of such programs.

This paper presents two main contributions to register allocation for IPP loop parallelization: i) a framework to identify the inefficiencies of the two basic approaches to register allocation, the first based on compiler techniques and the second based on hardware mechanisms for register renaming; ii) a novel technique that eliminates the inefficiencies of both approaches. Experimental results show the effectiveness of this technique.

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zingirian, N., Maresca, M. (2000). Run-Time Support to Register Allocation for Loop Parallelization of Image Processing Programs. In: Bubak, M., Afsarmanesh, H., Hertzberger, B., Williams, R. (eds) High Performance Computing and Networking. HPCN-Europe 2000. Lecture Notes in Computer Science, vol 1823. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45492-6_34

  • DOI: https://doi.org/10.1007/3-540-45492-6_34

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67553-2

  • Online ISBN: 978-3-540-45492-2

  • eBook Packages: Springer Book Archive
