The Bandwidth Expansion Effectiveness of Cache Levels Block Prefetch

Ju, Youngkwan; Uh, Bongyong; Kim, Sukil

doi:10.1007/978-3-540-77704-5_17

Youngkwan Ju¹,
Bongyong Uh¹ &
Sukil Kim¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4759))

Included in the following conference series:

782 Accesses

Abstract

Most cache architectures exploit only a second level cache prefetch. In this paper, we propose the hierarchical prefetch cache architecture which allows prefetch between all levels of caches. We discovered that this architecture has a virtual effect of expanding memory bus bandwidth. According to an experimental analysis using 10 benchmark programs, the proposed architecture that employs all level cache prefetcher obtained a maximum 11% increased performance when compared to both architecture with expanded bus bandwidth and architecture with employment only a level 2 cache prefetcher. This shows our proposed architecture has an effectiveness of memory-bus bandwidth expansion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Grama, A., Gupta, A., Karapis, G., Kumar, V.: Introduction to Parallel Computing, 2nd edn. Addison Wesley, Reading (2003)
Google Scholar
Fritts, J.: Multi-Level Memory Prefetching for Media and Streaming Processing. In: Proceedings International Conference on Multimedia and Expo (2002)
Google Scholar
Bear, J.L., Wang, W.H.: Architectural Choices for Multi-level Cache Hierarchies. In: Proceedings 16th international Conference on Parallel Processing, pp. 258–256 (1987)
Google Scholar
Moon, H.J., Jeon, J.N., Kim, S.: Design of A Media Processor Equipped with Dual Cache. Journal Korean Institution Science Society 29(9), 573–581 (2002)
Google Scholar
Gaddis, N.B., Butler, J.R., Kumar, A., Queen, W.J.: A 56-entry instruction reorder buffer, Solid-State Circuits Conference. In: IEEE International Digest of Technical Papers. 43rd ISSCC, pp. 212–213 (February 1996)
Google Scholar
Joseph, D., Grunwald, D.: Prefetching Using Markov Predictors. In: Proceedings 24th Inl. Symp. Computer Architecture, pp. 252–263 (June 1997)
Google Scholar
Zhang, X., Lee, H.S.: A hardware-based cache pollution filtering mechanism for aggressive prefetches. In: Proceedings 2003 International Conference on Parallel Processing, pp. 286–293 (October 6-9, 2003)
Google Scholar
Smith, A.: Sequential Program Prefetching in Memory Hierarchies. IEEE Computer 11(2), 7–21 (1997)
Google Scholar
Jouppi, N.P.: Improving Direct-mapped Cache Performance by the Addition of a Small Fully associative Cache and Prefetch Buffers. In: Proceedings of the 17th Annual International Symposium on Computer Architecture, pp. 364–373 (May 1990)
Google Scholar
Horel, T., Lauterbach, G.: UltraSPARC-III: Designing Third-generation 64-bit Performance. IEEE Micro 19(3), 73–85 (1999)
Article Google Scholar
Chen, T.F., Baer, J.L.: Effective Hardware-Based Data Prefetching for High Performance Processors. IEEE Transactions on Computers 44(5), 609–623 (1995)
Article MATH Google Scholar
Jeon, Y.S., Moon, H.J., Jeon, J.N., Kim, S.: A Hardware Cache Prefetching Scheme for Multimedia Data with Intermittently Irregular Strides. KIPS Architecture 31(11), 658–672 (2004)
Google Scholar
Chan, K.K., Hay, C.C., Keller, J.R., Kurpanek, G.P., Schumacher, F.X., Zheng, J.: Design of the HP PA 7200 CPU. Hewlett-Packard Journal 47(1), 25–33 (1996)
Google Scholar
Pentium Processor User’s Manual, Vol.1, Pentium Processor Databook, Intel (1993)
Google Scholar
IA-32 Intel Architecture Software Developer s Manual, Vol.1, Basic Architecture, Intel (2004)
Google Scholar
Denamn, M.: PowerPC 604. Hot Chips VI, 193–200 (1994)
Google Scholar
Mutlu, O., Kim, H.S., Armstrong, D.N., Patt, Y.N.: Cache Filtering Techniques to Reduce the Negative Impact of Useless Speculative Memory References on Processor Performance. In: SBAC-PAD 2004. 16th Symposium Computer Architecture and High Performance Computing, October 27-29, 2004, pp. 2–9 (2004)
Google Scholar
Lee, J.S., Hong, W.K., Kim, S.D.: Design and Evaluation of On-Chip Cache Compression Technology. In: Proceedings the 17th IEEE International Conference on Computer Design, pp. 184–191 (1999)
Google Scholar
Rivers, J.A., Tyson, G.S., Davidson, E.S., Austin, T.M.: On High-Bandwidth Data Cache Design for Multi-Issue Processors. In: Proceedings of the 30th Annual International Symposium on Micro architecture, pp. 46–56 (December 1997)
Google Scholar
Lee, J.H., et al.: An Intelligent Cache System with Hardware Prefetching for High Performance. IEEE Transactions on Computers 5(5), 607–617 (2003)
Google Scholar
Solihin, Y., Lee, J., Torrellas, J.: Correlation prefetching with a user-level memory thread. IEEE Transactions on Parallel and Distributed Systems 14, 563–580 (2003)
Article Google Scholar
Srivastava, A., Eustace, A.: ATOM: A System for Building Customized Program Analysis Tools. In: Proceedings ACM SIGPLAN 1994, pp. 196–205 (1994)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science, Chungbuk National University, Gashindong 12, Cheongju, Chungbuk, Republic of Korea
Youngkwan Ju, Bongyong Uh & Sukil Kim

Authors

Youngkwan Ju
View author publications
You can also search for this author in PubMed Google Scholar
Bongyong Uh
View author publications
You can also search for this author in PubMed Google Scholar
Sukil Kim
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Jesús Labarta Kazuki Joe Toshinori Sato

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ju, Y., Uh, B., Kim, S. (2008). The Bandwidth Expansion Effectiveness of Cache Levels Block Prefetch. In: Labarta, J., Joe, K., Sato, T. (eds) High-Performance Computing. ISHPC ALPS 2005 2006. Lecture Notes in Computer Science, vol 4759. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77704-5_17

Download citation

DOI: https://doi.org/10.1007/978-3-540-77704-5_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77703-8
Online ISBN: 978-3-540-77704-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics