Abstract
In this paper, we propose a novel compressed cache hierarchy that uses a unified compression algorithm in both L1 data cache and L2 cache, called Simple Frequent Pattern Compression(S-FPC). This scheme can increase the cache capacity of L1 data cache and L2 cache without any sacrifice of the L1 cache access latency. The layout of compressed data in L1 data cache enables partial cache line prefetching and does not introduce prefetch buffers or increase cache pollution and memory traffic. Compared to a baseline cache hierarchy not supporting data compression in cache, on average, our cache hierarchy design increases the average L1 cache capacity(in terms of the average number of valid words in cache per cycle) by about 33%, reduces the data cache miss rate by 21%, and speeds up program execution by 13%.
This work was supported by National Natural Science Foundation of China, grant No. 60376018.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hennessy, J., Patterson, D.: Computer Architecture: A Quantitative Approach. Morgan Kaufmann, San Francisco (1996)
Alameldeen, A., Wood, D.: Adaptive Cache Compression for High-Performance Processor. In: Proc. ISCA-31 (2004)
Chen, D., Peserico, E., Rudolph, L.: A Dynamically Partitionable Compressed Cache. In: Proceeding of the Singapore-MIT Alliance Symposium (January 2003)
Lee, J.-S., Hong, W.-K., Kim, S.-D.: Design and Evaluation of a Selective Compressed Memory System. In: Proceedings of International Conference on Computer Design (ICCD’99), October 1999, pp. 184–191 (1999)
Lee, J.-S., Hong, W.-K., Kim, S.-D.: Adaptive Methods to Minimize Decompression Overhead for Compressed On-chip Cache. International Journal of Computers and Application 25(2) (2003)
Yang, J., Zhang, Y., Gupta, R.: Frequent Value Compression in Data Caches. In: Proceedings of the 33rd Annual IEEE/ACM International Symposium on Microarchitecture, December 2000, pp. 258–265 (2000)
Hallnor, E., Reinhardt, S.: A Unified Compressed Memory Hierarchy. In: Proceedings of the 11th Int’l Symposium on High-Performance Computer Architecture, HPCA-11 (2005)
Franaszek, P., Robinson, J., Thomas, J.: Parallel Compression with Cooperative Dictionary Construction. In: Proc. Data Compression Conf., pp. 200–209 (1996)
Abali, B., Franke, H., Xiaowei, S., et al.: Performance of Hardware Compressed Main Memory. In: Proc. 7th Int’l Symp. on High-Performance Computer Architecture, pp. 73–81 (2001)
Burger, D., Austin, T.M.: The SimpleScalar Tool Set, Version 2.0. Computer Arch. News (1997)
Alameldeen, A.R., Wood, D.A.: Frequent Pattern Compression: A Significance-Based Compression Scheme for L2 Caches. Technical Report 1500, Computer Sciences Department, University of Wisconsin-Madison (April 2004)
Kjelso, M., et al.: Design and Performance of a Main Memory Hardware Data Compressor. In: Proc. EUROMICRO Conference (1996)
Zhang, Y., et al.: Frequent Value Locality and Value Centric Data Cache Design. In: Proc. ASPLOS (2000)
Pujara, P., Aggarwal, A.: Restrictive Compression Techniques to Increase Level 1 Cache Capacity. In: International Conference on Computer Design (2005)
Zhang, Y., Gupta, R.: Enabling Partial Cache Line Prefetching Through Data Compression. In: ICPP 2000, pp. 277–285 (2003), http://www.informatik.uni-trier.de/~ley/db/conf/icpp/icpp2003.html
Lee, J.-S., Hong, W.-K., Kim, S.-D.: Design and Evaluation of a Selective Compressed Memory System. In: Proceedings of International Conference on Computer Design (ICCD’99), October 1999, pp. 184–191 (1999)
Arramreddy, S., Har, D., Mak, K., et al.: IBM X-Press Memory Compression Technology Debuts in a ServerWorks NorthBridge. In: Hot Chips 12 (2000)
Lee, J.S., Hong, W.K., Kim, S.D.: An on-chip cache compression technique to reduce decompression overhead and design complexity. Journal of Systems Architecture 46, 1365–1382 (2000)
Roy, S., Kumar, R., Prvulovic, M.: Improving System Performance with Compressed Memory. In: Proc. 15th Int’l Parallel and Distributed Processing Symp., Apr. 2001, pp. 630–636 (2001)
Kumar, S., Pujara, P., Aggarwal, A.: Bit-Sliced Datapath for Energy-Efficient High Performance Microprocessors. In: Falsafi, B., VijayKumar, T.N. (eds.) PACS 2004. LNCS, vol. 3471, pp. 30–45. Springer, Heidelberg (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Tian, X., Zhang, M. (2007). A Unified Compressed Cache Hierarchy Using Simple Frequent Pattern Compression and Partial Cache Line Prefetching. In: Lee, YH., Kim, HN., Kim, J., Park, Y., Yang, L.T., Kim, S.W. (eds) Embedded Software and Systems. ICESS 2007. Lecture Notes in Computer Science, vol 4523. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72685-2_14
Download citation
DOI: https://doi.org/10.1007/978-3-540-72685-2_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72684-5
Online ISBN: 978-3-540-72685-2
eBook Packages: Computer ScienceComputer Science (R0)