Principles of Memory-Centric Programming for High Performance Computing
The memory wall challenge -- the growing disparity between CPU speed and memory speed -- has been one of the most critical and long-standing challenges in computing. For high performance computing, programming to achieve efficient execution of parallel ...
Persistent Memory: The Value to HPC and the Challenges
This paper provides an overview of the expected value of emerging persistent memory technologies to high performance computing (HPC) use cases. These values are somewhat speculative at the time of writing, based on what has been announced by vendors to ...
Bit Contiguous Memory Allocation for Processing In Memory
Given the recent resurgence of research into processing in or near memory systems, we find an ever increasing need to augment traditional system software tools in order to make efficient use of the PIM hardware abstractions. One such architecture, the ...
Beyond 16GB: Out-of-Core Stencil Computations
Stencil computations are a key class of applications, widely used in the scientific computing community, and a class that has particularly benefited from performance improvements on architectures with high memory bandwidth. Unfortunately, such ...
NUMA Distance for Heterogeneous Memory
Experience with Intel Xeon Phi suggests that NUMA alone is inadequate for assignment of pages to devices in heterogeneous memory systems. We argue that this is because NUMA is based on a single distance metric between all domains (i.e., number of ...
Evaluating GPGPU Memory Performance Through the C-AMAT Model
General Purpose Graphics Processing Units (GPGPUs) have become a popular platform to accelerate high performance applications. Although they provide exceptional computing power, GPGPUs impose significant pressure on the off-chip memory system. Evaluating, ...




