Skip to main content

A Low Energy Clustered Instruction Memory Hierarchy for Long Instruction Word Processors

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2451))

Abstract

In the current embedded processors for media applications, up to 30% of the total processor power is consumed in the instruction memory hierarchy. In this context, we present an inherently low energy clustered instruction memory hierarchy template. Small instruction memories are distributed over groups of functional units and the interconnects are localized in order to minimize energy consumption. Furthermore, we present a simple profile based algorithm to optimally synthesize the L0 clusters, for a given application. Using a few representative multimedia benchmarks we show that up to 45% of the L0 buffer energy can be reduced using our clustering approach.

This work is supported in part by MESA under the MEDEA+ program

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. S. Bajwa, et al., “Instruction Buffering to Reduce Power in Processors for Signal Processing”, IEEE Transactions on VLSI systems, vol 5, no 4, Dec 1997.

    Google Scholar 

  2. N. Bellas, et al., “Architectural and Compiler Support for Energy Reduction in Memory Hierarchy of High Performance Microprocessors”, ISLPED 1998.

    Google Scholar 

  3. L. H. Lee, et al., “Instruction Fetch Energy Reduction Using Loop Caches For Applications with Small Tight Loops”, ISLPED 1999.

    Google Scholar 

  4. M. Jacome, et al., “Design Challenges for New Application Specific Processors”, Special Issue System Design of Embedded Systems, IEEE Design & Test of Computers, April–June 2000.

    Google Scholar 

  5. M. Jacome, et al., “Exploring Performance Tradeofis for Clustered VLIW ASIPs”, ICCAD, November 2000.

    Google Scholar 

  6. Texas Instruments Inc, Technical Report, “TMS3206000 Power Consumption Summary”, http://www.ti.com

  7. Texas Instruments Inc, “TMS320C6000 CPU and Instruction Set Reference Guide”, http://www.ti.com

  8. M. Jayapala, et al., “Loop Cache (Buffer) Organization: Energy Analysis and Partitioning”, Technical Report K.U.Leuven/ESAT, 22 Jan 2002.

    Google Scholar 

  9. A. Wolfe, et al., “Datapath Design for a VLIW Video Signal Processor”, IEEE Symposium on High-Performance Computer Architecture (HPCA’97).

    Google Scholar 

  10. S. Kim, et al., “Power-aware Partitioned Cache Architectures”, ISLPED 2001.

    Google Scholar 

  11. M. Huang, et al., “L1 Data Cache Decomposition for Energy Efficiency”, ISLPED 2001.

    Google Scholar 

  12. Trimaran, An Infrastructure for Research in Instruction-Level Parallelism, 1999. http://www.trimaran.org

  13. Ching-Long Su, et al., “Cache Design Trade-offs for Power and Performance Optimization: A Case Study”, ISLPED 1995.

    Google Scholar 

  14. L. Nachtergaele, V. Tiwari and N. Dutt, “System and Architectural-Level Power Reduction of Microprocessor-based Communication and Multimedia Applications”, ICCAD 2000.

    Google Scholar 

  15. D. Brooks, et al., “Wattch: A Framework for Architectural-Level Power Analysis and Optimizations”, ISCA 2000.

    Google Scholar 

  16. S.J.E. Wilton and N.P. Jouppi, “CACTI: an enhanced cache access and cycle time model”, IEEE Journal of Solid-State Circuits, 31(5):677–688, 1996.

    Article  Google Scholar 

  17. P. Petrov and A. Orailoglu, “Power Efficient Embedded Processor IPs through Application-Specific Tag Compression in Data Caches”, Design and Test in Europe Conf (DATE), April 2002.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jayapala, M., Barat, F., de Beeck, P.O., Catthoor, F., Deconinck, G., Corporaal, H. (2002). A Low Energy Clustered Instruction Memory Hierarchy for Long Instruction Word Processors. In: Hochet, B., Acosta, A.J., Bellido, M.J. (eds) Integrated Circuit Design. Power and Timing Modeling, Optimization and Simulation. PATMOS 2002. Lecture Notes in Computer Science, vol 2451. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45716-X_26

Download citation

  • DOI: https://doi.org/10.1007/3-540-45716-X_26

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44143-4

  • Online ISBN: 978-3-540-45716-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics