Author image not provided
 Peter Thoman

Authors:
Add personal information
  Affiliation history
Bibliometrics: publication history
Average citations per article3.43
Citation Count48
Publication count14
Publication years2008-2017
Available for download4
Average downloads per article225.75
Downloads (cumulative)903
Downloads (12 Months)129
Downloads (6 Weeks)51
SEARCH
ROLE
Arrow RightAuthor only


AUTHOR'S COLLEAGUES
See all colleagues of this author

SUBJECT AREAS
See all subject areas




BOOKMARK & SHARE


14 results found Export Results: bibtexendnoteacmrefcsv

Result 1 – 14 of 14
Sort by:

1 published by ACM
December 2017 ACM Transactions on Architecture and Code Optimization (TACO): Volume 14 Issue 4, December 2017
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 36,   Downloads (12 Months): 36,   Downloads (Overall): 36

Full text available: PDFPDF
Shared memory machines continue to increase in scale by adding more parallelism through additional cores and complex memory hierarchies. Often, executing multiple applications concurrently, dividing among them hardware threads, provides greater efficiency rather than executing a single application with large thread counts. However, contention for shared resources can limit the ...
Keywords: dynamic concurrency throttling, Multi-program co-scheduling, speedup modeling

2 published by ACM
May 2017 CF'17: Proceedings of the Computing Frontiers Conference
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 6,   Downloads (12 Months): 34,   Downloads (Overall): 34

Full text available: PDFPDF
Achieving high performance in task-parallel runtime systems, especially with high degrees of parallelism and fine-grained tasks, requires tuning a large variety of behavioral parameters according to program characteristics. In the current state of the art, this tuning is generally performed in one of two ways: either by a group of ...

3
November 2016 MTAGS '16: Proceedings of the 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers
Publisher: IEEE Press
Bibliometrics:
Citation Count: 0

Extreme scale HPC systems are expected to reach exascale performance around the year 2020. While it is widely known that theses systems pose new challenges regarding energy efficiency of architectures, concurrency and resiliency, they also challenge developers of applications trying to efficiently utilizing resources: Managing parallel control flows, hardware resources ...
Keywords: runtime systems, distributed computing, parallel computing, programming models

4
March 2015 PDP '15: Proceedings of the 2015 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 1

Providing standardized building blocks for task-parallel programs within a language and its standard library has several advantages over other solutions. Close integration with compilers and runtime systems allows for potentially higher performance and portability facilitates wide-spread use. In the recently ratified C++11 standard, language constructs have been added along with ...
Keywords: C++11, task parallelism, async, inncabs, benchmarks, synchronization, std::async, std::lock, performance analysis

5
September 2014 Concurrency and Computation: Practice & Experience: Volume 26 Issue 14, September 2014
Publisher: John Wiley and Sons Ltd.
Bibliometrics:
Citation Count: 2

Task parallelism is a programming technique that has been shown to be applicable in a wide variety of problem domains. A central parameter that needs to be controlled to ensure efficient execution of task parallel programs is the granularity of tasks. When they are too coarse grained, scalability and load ...
Keywords: parallel computing, multiversioning, runtime system, compiler, recursion, task parallelism

6
October 2013 PACT '13: Proceedings of the 22nd international conference on Parallel architectures and compilation techniques
Publisher: IEEE Press
Bibliometrics:
Citation Count: 4
Downloads (6 Weeks): 6,   Downloads (12 Months): 31,   Downloads (Overall): 377

Full text available: PDFPDF
Programming standards like OpenMP, OpenCL and MPI are frequently considered programming languages for developing parallel applications for their respective kind of architecture. Nevertheless, compilers treat them like ordinary APIs utilized by an otherwise sequential host language. Their parallel control flow remains hidden within opaque runtime library calls which are embedded ...
Keywords: high-level program analysis, parallel computation, intermediate representation

7
August 2013 Euro-Par'13: Proceedings of the 19th international conference on Parallel Processing
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 3

Task parallelism is a programming technique that has been shown to be applicable in a wide variety of problem domains. A central parameter that needs to be controlled to ensure efficient execution of task-parallel programs is the granularity of tasks. When they are too coarse-grained, scalability and load balance suffer, ...
Keywords: parallel computing, multiversioning, runtime system, compiler, recursion, task parallelism

8
November 2012 SC '12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Computer Society Press
Bibliometrics:
Citation Count: 13
Downloads (6 Weeks): 3,   Downloads (12 Months): 28,   Downloads (Overall): 456

Full text available: PDFPDF
In this paper we introduce a multi-objective auto-tuning framework comprising compiler and runtime components. Focusing on individual code regions, our compiler uses a novel search technique to compute a set of optimal solutions, which are encoded into a multi-versioned executable. This enables the runtime system to choose specifically tuned code ...

9
November 2012 SC '12: Proceedings of the 2012 International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 0

In this paper we introduce a multi-objective autotuning framework comprising compiler and runtime components. Focusing on individual code regions, our compiler uses a novel search technique to compute a set of optimal solutions, which are encoded into a multi-versioned executable. This enables the runtime system to choose specifically tuned code ...

10
June 2012 IWOMP'12: Proceedings of the 8th international conference on OpenMP in a Heterogeneous World
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 1

The scheduling of parallel loops in OpenMP has been a research topic for over a decade. While many methods have been proposed, most focus on adapting the loop schedule purely at runtime, and without regard for the overall system state. We present a fully automatic loop scheduling policy that can ...

11
August 2011 Euro-Par'11: Proceedings of the 17th international conference on Parallel processing - Volume Part II
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 10

The OpenCL standard allows targeting a large variety of CPU, GPU and accelerator architectures using a single unified programming interface and language. While the standard guarantees portability of functionality for complying applications and platforms, performance portability on such a diverse set of hardware is limited. Devices may vary significantly in ...

12
June 2010 IWOMP'10: Proceedings of the 6th international conference on Beyond Loop Level Parallelism in OpenMP: accelerators, Tasking and more
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 0

Multi-core multi-processor machines provide parallelism at multiple levels, including CPUs, cores and hardware multithreading. Elements at each level in this hierarchy potentially exhibit heterogeneous memory access latencies. Due to these issues and the high degree of hardware parallelism, existing OpenMP applications often fail to use the whole system effectively. To ...

13
May 2008 ICVS'08: Proceedings of the 6th international conference on Computer vision systems
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 7

Multigrid methods provide fast solvers for a wide variety of problems encountered in computer vision. Recent graphics hardware is ideally suited for the implementation of such methods, but this potential has not yet been fully realized. Typically, work in that area focuses on linear systems only, or on implementation of ...
Keywords: optical flow, GPGPU, partial differential equations, multigrid methods

14
April 2008
Bibliometrics:
Citation Count: 1




The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2018 ACM, Inc.
Terms of Usage   Privacy Policy   Code of Ethics   Contact Us