Abstract
The shared memory model helps parallel programming productivity, but it also has a high hardware cost and imposes scalability constraints. Ultimately, higher performance will use distributed memories, which scales better but requires programmers to manually transfer data between local memories, which is a complex task. Distributed memories are also more energy efficient than shared memories, and are used in a family of embedded computing solutions called multi processor system on chip (MPSoC).
- Kalray. http://www.kalrayinc.com.Google Scholar
- W. W. Carlson, J. M. Draper, D. E. Culler, K. Yelick, E. Brooks, and K. Warren. Introduction to UPC and language specification. Center for Computing Sciences, Institute for Defense Analyses, 1999.Google Scholar
- B. D. de Dinechin, P. G. de Massas, G. Lager, C. Léger, B. Orgogozo, J. Reybert, and T. Strudel. A distributed run-time environment for the kalray mppa®-256 integrated manycore processor. Procedia Computer Science, 18:1654--1663, 2013.Google Scholar
Cross Ref
- S. Lyberis, P. Pratikakis, D. S. Nikolopoulos, M. Schulz, T. Gamblin, and B. R. de Supinski. The myrmics memory allocator: hierarchical, message-passing allocation for global address spaces. ACM SIGPLAN Notices, 47(11):15--24, 2013. Google Scholar
Digital Library
- J. Nelson, B. Holt, B. Myers, P. Briggs, L. Ceze, S. Kahan, and M. Oskin. Latency-tolerant software distributed shared memory. Technical report, Technical Report UW-CSE-14-05-03, Univeristy of Washington, 2014.Google Scholar
- S. Schneider, C. D. Antonopoulos, and D. S. Nikolopoulos. Scalable locality-conscious multithreaded memory allocation. In Proceedings of the 5th international symposium on Memory management, pages 84--94. ACM, 2006. Google Scholar
Digital Library
- S. Treichler, M. Bauer, and A. Aiken. Realm: An event-based low-level runtime for distributed memory architectures. In Proceedings of the 23rd international conference on Parallel architectures and compilation, pages 263--276. ACM, 2014. Google Scholar
Digital Library
Index Terms
An interval constrained memory allocator for the Givy GAS runtime
Recommendations
An interval constrained memory allocator for the Givy GAS runtime
PPoPP '16: Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel ProgrammingThe shared memory model helps parallel programming productivity, but it also has a high hardware cost and imposes scalability constraints. Ultimately, higher performance will use distributed memories, which scales better but requires programmers to ...
A bounded memory allocator for software-defined global address spaces
ISMM '16This paper presents a memory allocator targeting manycore architec- tures with distributed memory. Among the family of Multi Processor System on Chip (MPSoC), these devices are composed of multiple nodes linked by an on-chip network; most nodes have ...
A bounded memory allocator for software-defined global address spaces
ISMM 2016: Proceedings of the 2016 ACM SIGPLAN International Symposium on Memory ManagementThis paper presents a memory allocator targeting manycore architec- tures with distributed memory. Among the family of Multi Processor System on Chip (MPSoC), these devices are composed of multiple nodes linked by an on-chip network; most nodes have ...






Comments