Improving Memory Hierarchy Performance for Irregular Applications Using Data and Computation Reorderings John Mellor-CrummeyDavid WhalleyKen Kennedy OriginalPaper Pages: 217 - 247
The Architectural and Operating System Implications on the Performance of Synchronization on ccNUMA Multiprocessors Dimitrios S. NikolopoulosTheodore S. Papatheodorou OriginalPaper Pages: 249 - 282
A Comparison of MPI, SHMEM and Cache-Coherent Shared Address Space Programming Models on a Tightly-Coupled Multiprocessors Hongzhang ShanJaswinder Pal Singh OriginalPaper Pages: 283 - 318
Data-Centric Transformations for Locality Enhancement Induprakas KodukulaKeshav Pingali OriginalPaper Pages: 319 - 364