Journal of Computer Science and Technology

, Volume 15, Issue 5, pp 445–452 | Cite as

Supporting flexible data distribution in software DSMs

  • Hong Jinwei Email author
  • Chen Guoliang 
  • Zhang Zhaoqing 


Page-based software DSM systems suffer from false sharing caused by the large sharing granularity, and only support one-dimensionBlock orCyclicblock data distribution schemes. Thus applications running on them will suffer from poor data locality and will be able to exploit parallelism only when using a large number of processors. In this paper, a way towards supporting flexible data distribution (FDD) on software DSM system is presented. Small granularity-tunable blocks, the size of which can be set by compiler or programmer, are used to overlap the working data sets distributed among processors. The FDD was implemented on a software DSM system called JIAJIA. Compared withBlock/Cyclic-block distribution schemes used by most DSM systems now, experiments show that the proposed way of flexible data distribution is more effective. The performance of the applications used in the experiments is significantly improved.


DSM JIAJIA data distribution address computation Dawning 2000-I 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [1]
    Li K, Hudak P. Memory coherence in shared virtual memory systems. InProceedings of the 5th Annual ACM Symposium on the Principles of Distributed Computing, Aug. 1986, pp.229–239.Google Scholar
  2. [2]
    Keleher P. The relative importance of concurrent writes and weak consistency models. In16th International Conference on Distributed Computing Systems, Hong Kong, May 1996.Google Scholar
  3. [3]
    Keleher P, Cox A L, Dwarkadas S, Zwaenepoel W. TreadMark: Distributed shared memory on Standford workstations and operating systems. InProceedings of the Winter USENIX Conference, January 1994, pp.115–131.Google Scholar
  4. [4]
    Hu Weiwu, Shi Weisong, Tang Zhimin. JIAJIA: An SVM system based on a new cache coherence protocol. InProceedings of the High Performance Computing and Networking (HPCN’99),LNCS 1593, April 1999, Amsterdam, Netherlands, Springer-Verlag, pp.463–472.Google Scholar
  5. [5]
    Anant Agarwal, David Kranz, Venkat Natarajan. Automatic partitioning of parallel loops for cachecoherent multiprocessors. InProceedings of International Conference on Parallel Processing, 1993, pp.I-2–I-11.Google Scholar
  6. [6]
    Barua R, Kranz D, Agarwal A. Communication-minimal partitioning of parallel loops and data arrays for cache-coherent distributed-memory multiprocessors. InLanguages and Compilers for Parallel Computing, Berlin: Springer-Verlag, August 1996.Google Scholar
  7. [7]
    Gupta M, Banerjee P. Demonstration of automatic data partitioning techniques for parallelizing compilers on multicomputers.IEEE Transactions on Parallel and Distributed Systems, March 1992, 3(2): 179–193.CrossRefGoogle Scholar
  8. [8]
    Kennedy K, Mckinley K S. Optimizing for parallelism and data locality. InProceedings of the 1992 ACM International Conference on Sypercomputing, Washington D.C., July 1992.Google Scholar
  9. [9]
    Jennifer M Anderson, Monica S Lam. Global optimizations for parallelism and locality on scalable parallel machines. InProceedings of SIGPLAN’93 Conference on Programming Languages Designed and Implementation, ACM, June 1993.Google Scholar
  10. [10]
    Hu Weiwu, Shi Weisong, Tang Zhimin, Eskiciogiu M R. The JIAJIA programming manual. Technical Report No.TR980007., July 1998.Google Scholar
  11. [11]
    Barua R, Kranz D A, Agarwal A. Addressing partitioned arrays in distributed memory multiprocessors —The software virtual memory approach. InProceedings of the MIT Student Workshop, Wellesley, MA, July 1995.Google Scholar
  12. [12]
    Anant Agarwal, Richardo Bianchiniet al. The MIT Alewife machine: Architecture and performance. InProceedings of the 22nd Annual International Symposium on Computer Architecture (ISCA’95), June 1995, pp.2–13.Google Scholar
  13. [13]
    Feng Xiaobing. Data distribution — A global optimizing technology [dissertation]. Institute of Computing Technology, Chinese Academy of Sciences, 1999.Google Scholar

Copyright information

© Science Press, Beijing China and Allerton Press Inc. 2000

Authors and Affiliations

  • Hong Jinwei 
    • 1
    Email author
  • Chen Guoliang 
    • 1
  • Zhang Zhaoqing 
    • 2
  1. 1.Department of Computer ScienceUniversity of Science and Technology of ChinaHefeiP.R. China
  2. 2.Institute of Computing TechnologyChinese Academy of SciencesBeijingP.R. China

Personalised recommendations