Abstract
In this paper, a fast serial link, Serial Transparent Asynchronous First-in First-out Link (STAFF-Link), is introduced. Using such links, we construct a parallel processing system based on a workstation cluster. The workstation cluster implements a distributed sharedmemory mechanism for inter-process communication using a software controlled cache using a STAFF-Link router board. The board has a chained multicast capability with which we have implemented efficient invalidation protocol based on Eager Release Consistency (ERC) model in the DSM system. Performance results on several application programs from the SPLASH2 benchmark suites have been measured.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
REFERENCES
V. S. Sunderam, PVM: A Framework for Parallel Distributed Computing, Concurrency: Practice and Experience 2(4):315–339 (1990).
Message Passing Interface Forum, MPI: A Message-Passing Interface Standard, Int'l J. Supercomputer Applications and High Performance Computing 8(3-4):159–416 (1994).
Craig C. Douglas, Timothy G. Mattson, and Martin H. Schultz, Parallel Programming Systems for Workstation Clusters, Technical Report TR-975, Yale University Department of Computer Science Research (1993).
K. Li and P. Hudak, Memory Coherency in Shared Virtual Memory Systems, ACM Trans. Comput. Syst. 7(4):321–359 (1989).
Pete Keleher, Sandhya Dwarkadas, Alan L. Cox, and Willy Zwaenepoel, TreadMarks: Distributed Shared Memory on Standard Workstations and Operating Systems, Rice COMP TR93214 (1993).
Dilip Khandekar, Quarks: Portable Distributed SharedMemory on UNIX, quarks-doc/tech-report.ps including ftp://jaguar.cs.utah.edu-pub-dsm-Quarks.tar.Z (1995).
Mark D. Hill, James R. Larus, and David A. Wook, Tempest: A Substrate for Portable Parallel Programs, Proc. of COMPCON'95, pp. 327–332 (1995).
H. Nakajo, K. Kuramae, Y. Kaneda, and S. Maekawa, The Implementation and Evaluation of Software Distributed SharedMemory (DSM) for Workstation Clusters (in Japanese), Trans. IPS Japan 36(7):1719–1728 (1995).
H. Nakajo, S. Ohtani, T. Matsumoto, M. Kohata, K. Hiraki, and Y. Kaneda, An I-O Network Architecture of the Distributed Shared-Memory Massively Parallel Computer JUMP-1, Proc. of 11th Int'l. Conf. on Supercomputing (ICS97) (1997) (to appear).
Advanced Micro Devices, Inc. Am7968-Am7969-175 TAXI-175 Transmitter-Receiver Data Sheet and Technical Manual (1992).
J. B. Carter, J. K. Bennet, and W. Zwaenepoel, Implementation and Performance of Munin, Proc. 13th ACM Symp. Operat. Syst. Principles, pp. 152–164 (1991).
Pete Keleher, Alan L. Cox, and Willy Zwaenepoel, Lazy Release Consistency for Software Distributed Shared Memory, ACM SIGARCH Computer Architecture News 20(2):13–21 (1992).
Daniel E. Lenoski and Wolf-Dietrich Weber, Scalable Shared-Memory Multiprocessing, Morgan Kaufmann Publishers (1995).
Hironori Nakajo, Takeshi Yoshinaga, Koichi Wada, and Yukio Kaneda, Ring-Connected Parallel Computer KORP-Coherence Protocol for Distributed Shared-Memory, Proc. Int'l. Conf. Parallel and Distrib. Syst. ICPADS'92, pp. 504–511 (1992).
N. J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. L. Seitz, J. N. Seizovic, and W. K. Su, Myrinet: A Gigabitper Second Local Area Network, IEEE Micro 15(1):29–36 (1995).
Steven Cameron Woo, Moriyoshi Ohara, Evan Torrie, Jaswinder Pal Singh, and Anoop Gupta, SPLASH-2 Programs: Characterization and Methodological Considerations, Proc. 22nd Int'l. Symp. Computer Architecture, pp. 24–36 (1995).
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Nakajo, H., Ichikawa, A. & Kaneda, Y. A Distributed Shared-Memory System on a Workstation Cluster Using Fast Serial Links. International Journal of Parallel Programming 28, 179–194 (2000). https://doi.org/10.1023/A:1007548216327
Issue Date:
DOI: https://doi.org/10.1023/A:1007548216327