Abstract
This paper examines the use of coarse-grained multithreading to lessen the negative impact of memory access latencies on the performance of uniprocessor on-line transaction processing systems. It considers the effect of switching threads on cache misses in a two-level cache system. It also examines several different thread-switch policies. The results suggest that multithreading with a small number (3–5) of active threads can significantly improve the performance of such commercial environments.
Preview
Unable to display preview. Download preview PDF.
References
A. Agarwal, J. Kubiatowicz, D. Kranz, B.-H. Lim, D. Yeung, G. D'Souza, and M. Parkin. Sparcle: An evolutionary design for large-scale multiprocessors. IEEE Micro, 13(3):48–60, 1993.
R. Alverson, D. Callahan, D. Cummings, B. Koblenz, A. Porterfield, and B. Smith. The Tera computer system. In Proc. of the Intl. Conf. on Supercomputing, pp. 1–6, Jun. 1990.
T. Anderson, H. Levy, B. Bershad, and E. D. Lazowska. The interaction of architecture and operating system design. In Proc. of the Intl. Conf. on Arch. Support for Prog. Lang. and Operating Systems, 1991.
K. Boland and A. Dollas. Predicting and precluding problems with memory latency. IEEE Micro, 14(4):59–67, 1994.
B. Calder, D. Grunwald, and B. Zom. Quantifying behavioral differences between C and C++ programs. Technical Report CU-CS-698-94, University of Colorado, Jan. 1994.
Z. Cvetanovic and C. Bhandarkar. Characterization of Alpha performance using TP and SPEC workloads. In Proc. of the Intl. Symp. on Comp. Arch., pp. 60–70, Apr. 1994.
R. J. Eickemeyer, R. E. Johnson, S. R. Kunkel, B.-H. Lim, M. S. Squillante, and C. E. Wu. Evaluation of multithreaded processors and thread-switch policies. Technical report, IBM Research Division, Jul. 1996.
R. J. Eickemeyer, R. E. Johnson, S. R. Kunkel, M. S. Squillante, and S. Liu. Evaluation of multithreaded uniprocessors for commercial application environments. In Proc. of the Intl. Symp. on Comp. Arch., pp. 203–212, May 1996.
M. K. Farrens and A. R. Pleszkun. Strategies for achieving improved processor throughput. In Proc. of the Intl. Symp. on Comp. Arch., pp. 362–369, May 1991.
A. Gupta, J. Hennessy, K. Gharachorloo, T. Mowry, and W.-D. Weber. Comparative evaluation of latency reducing and tolerating techniques. In Proc. of the Intl. Symp. on Comp. Arch., May 1991.
J. Laudon, A. Gupta, and M. Horowitz. Interleaving: A multithreading technique targeting multiprocessors and workstations. In Proc. of the Intl. Conf. on Arch. Support for Prog. Lang. and Operating Systems, pp. 308–318, Oct. 1994.
A. M. Maynard, C. M. Donnelly, and B. R. Olszewski. Contrasting characteristics and cache performance of technical and multi-user commercial workloads. In Proc. of the Intl. Conf. on Arch. Support for Prog. Lang. and Operating Systems, pp. 145–156, Oct. 1994.
D. M. Tullsen, S. J. Eggers, and H. Levy. Simultaneous multithreading: Maximizing on-chip parallelism. In Proc. of the Intl. Symp. on Comp. Arch., pp. 392–403, Jun. 1995.
W. Wulf and S. McKee. Hitting the memory wall: Implications of the obvious. Comp. Arch. News, 23(1):20–24, 1995.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Eickemeyer, R.J., Johnson, R.E., Kunkel, S.R., Lime, BH., Squillante, M.S., Wu, C.E. (1997). Evaluation of multithreaded processors and thread-switch policies. In: Polychronopoulos, C., Joe, K., Araki, K., Amamiya, M. (eds) High Performance Computing. ISHPC 1997. Lecture Notes in Computer Science, vol 1336. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0024207
Download citation
DOI: https://doi.org/10.1007/BFb0024207
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63766-0
Online ISBN: 978-3-540-69644-5
eBook Packages: Springer Book Archive