Author image not provided
 Ganesh C Gopalakrishnan

Authors:
Add personal information
  Affiliation history
Bibliometrics: publication history
Average citations per article6.02
Citation Count807
Publication count134
Publication years1986-2017
Available for download38
Average downloads per article332.37
Downloads (cumulative)12,630
Downloads (12 Months)791
Downloads (6 Weeks)97
SEARCH
ROLE
Arrow RightAuthor only
· Editor only
· Advisor only
· Other only
· All roles


AUTHOR'S COLLEAGUES
See all colleagues of this author

SUBJECT AREAS
See all subject areas




BOOKMARK & SHARE


134 results found Export Results: bibtexendnoteacmrefcsv

Result 1 – 20 of 134
Result page: 1 2 3 4 5 6 7

Sort by:

1 published by ACM
January 2017 POPL 2017: Proceedings of the 44th ACM SIGPLAN Symposium on Principles of Programming Languages
Publisher: ACM
Bibliometrics:
Citation Count: 2
Downloads (6 Weeks): 13,   Downloads (12 Months): 109,   Downloads (Overall): 251

Full text available: PDFPDF
Virtually all real-valued computations are carried out using floating-point data types and operations. The precision of these data types must be set with the goals of reducing the overall round-off error, but also emphasizing performance improvements. Often, a mixed-precision allocation achieves this optimum; unfortunately, there are no techniques available to ...
Keywords: Energy-efficient computing, Floating-point arithmetic, Precision allocation, Program optimization, Rigorous compilation
Also published in:
May 2017  ACM SIGPLAN Notices - POPL '17: Volume 52 Issue 1, January 2017

2 published by ACM
October 2016 OOPSLA 2016: Proceedings of the 2016 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications
Publisher: ACM
Bibliometrics:
Citation Count: 2
Downloads (6 Weeks): 8,   Downloads (12 Months): 54,   Downloads (Overall): 138

Full text available: PDFPDF
Despite the growing popularity of GPGPU programming, there is not yet a portable and formally-specified barrier that one can use to synchronise across workgroups. Moreover, the occupancy-bound execution model of GPUs breaks assumptions inherent in traditional software execution barriers, exposing them to deadlock. We present an occupancy discovery protocol that ...
Keywords: GPU, OpenCL, barrier, portability, synchronisation
Also published in:
December 2016  ACM SIGPLAN Notices - OOPSLA '16: Volume 51 Issue 10, October 2016

3 published by ACM
May 2016 FormaliSE '16: Proceedings of the 4th FME Workshop on Formal Methods in Software Engineering
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 7,   Downloads (12 Months): 32,   Downloads (Overall): 87

Full text available: PDFPDF
The advent of data center, cloud computing and IoT has thrust distributed systems building into the programming mainstream. Building correct distributed systems is notoriously hard, yet today's developers have little training and few tools to aid them in reasoning about these complex systems. To that end, we present DS2 -- ...
Keywords: fault tolerance, formal methods, actors, concurrency, distributed systems

4
November 2015 Electronic Notes in Theoretical Computer Science (ENTCS): Volume 317 Issue C, November 2015
Publisher: Elsevier Science Publishers B. V.
Bibliometrics:
Citation Count: 3

We present a new, open-source formalization of fixed and floating-point numbers for arbitrary radix and precision that is now part of the HOL Light distribution John Harrison. HOL Light: A tutorial introduction. In Formal Methods in Computer-Aided Design, pages 265-269. Springer, 1996. We prove correctness and error bounds for the ...
Keywords: fixed point, formalization, IEEE-754-2008, floating point

5
November 2015 Electronic Notes in Theoretical Computer Science (ENTCS): Volume 317 Issue C, November 2015
Publisher: Elsevier Science Publishers B. V.
Bibliometrics:
Citation Count: 0

Numerical programs usually include type-casting instructions which convert data among different types. Identifying unsafe type-casting is important for preventing undefined program behaviors which cause serious problems such as security vulnerabilities and result non-reproducibility. While many tools had been proposed for handling sequential programs, to our best knowledge, there isn't a ...
Keywords: unsafe type-casting check, GPU program analysis, static analysis, numerical program analysis

6
September 2015 LCPC 2015: Revised Selected Papers of the 28th International Workshop on Languages and Compilers for Parallel Computing - Volume 9519
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 2

Reducing floating-point precision allocation in HPC programs is of considerable interest from the point of view of obtaining higher performance. However, this can lead to unexpected behavioral deviations from the programmer's intent. In this paper, we focus on the problem of divergence detection: when a given floating-point program exhibits different ...

7 published by ACM
June 2015 SCREAM '15: Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 1,   Downloads (12 Months): 7,   Downloads (Overall): 43

Full text available: PDFPDF
Many disruptive changes are happening in the arena of parallel computing, including the use of multiple compute element types (CPUs and GPUs), memory and interconnect types, as well as multiple concurrency models. In the face of these changes, cybersoftware development and debugging will become increasingly hard, requiring principled (formal) debugging ...
Keywords: concurrent program debugging, heterogeneous concurrency, test coverage

8 published by ACM
March 2015 ASPLOS '15: Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems
Publisher: ACM
Bibliometrics:
Citation Count: 27
Downloads (6 Weeks): 12,   Downloads (12 Months): 126,   Downloads (Overall): 653

Full text available: PDFPDF
Concurrency is pervasive and perplexing, particularly on graphics processing units (GPUs). Current specifications of languages and hardware are inconclusive; thus programmers often rely on folklore assumptions when writing software. To remedy this state of affairs, we conducted a large empirical study of the concurrent behaviour of deployed GPUs. Armed with ...
Keywords: GPU, Nvidia PTX, formal model, litmus testing, memory consistency, openCL, test generation
Also published in:
May 2015  ACM SIGPLAN Notices - ASPLOS '15: Volume 50 Issue 4, April 2015 May 2015  ACM SIGARCH Computer Architecture News - ASPLOS'15: Volume 43 Issue 1, March 2015

9
November 2014 SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Press
Bibliometrics:
Citation Count: 7
Downloads (6 Weeks): 0,   Downloads (12 Months): 14,   Downloads (Overall): 112

Full text available: PDFPDF
Even the careful GPU programmer can inadvertently introduce data races while writing and optimizing code. Currently available GPU race checking methods fall short either in terms of their formal guarantees, ease of use, or practicality. Existing symbolic methods: (1) do not fully support existing CUDA kernels; (2) may require user-specified ...
Keywords: CUDA, GPU, data flow analsis, formal verification, parallelism, symbolic execution, taint analysis, virtual machine

10
November 2014 LLVM-HPC '14: Proceedings of the 2014 LLVM Compiler Infrastructure in HPC
Publisher: IEEE Press
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 1,   Downloads (12 Months): 22,   Downloads (Overall): 95

Full text available: PDFPDF
Neither static nor dynamic data race detection methods, by themselves, have proven to be sufficient for large HPC applications, as they often result in high runtime overheads and/or low race-checking accuracy. While combined static and dynamic approaches can fare better, creating such combinations, in practice, requires attention to many details. ...

11 published by ACM
February 2014 PPoPP '14: Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming
Publisher: ACM
Bibliometrics:
Citation Count: 19
Downloads (6 Weeks): 6,   Downloads (12 Months): 39,   Downloads (Overall): 246

Full text available: PDFPDF
Tools for floating-point error estimation are fundamental to program understanding and optimization. In this paper, we focus on tools for determining the input settings to a floating point routine that maximizes its result error. Such tools can help support activities such as precision allocation, performance optimization, and auto-tuning. We benchmark ...
Keywords: floating-point error estimation methods, guided search, sequential and parallel programming
Also published in:
November 2014  ACM SIGPLAN Notices - PPoPP '14: Volume 49 Issue 8, August 2014

12 published by ACM
November 2013 SE-HPCCSE '13: Proceedings of the 1st International Workshop on Software Engineering for High Performance Computing in Computational Science and Engineering
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 3,   Downloads (12 Months): 10,   Downloads (Overall): 70

Full text available: PDFPDF
Reproducibility, the ability to repeat program executions with the same numerical result or code behavior, is crucial for computational science and engineering applications. However, non-determinism in concurrency scheduling often hampers achieving this ability on high performance computing (HPC) systems. To aid in managing the adverse effects of non-determinism, prior work ...

13 published by ACM
June 2013 ICS '13: Proceedings of the 27th international ACM conference on International conference on supercomputing
Publisher: ACM
Bibliometrics:
Citation Count: 2
Downloads (6 Weeks): 6,   Downloads (12 Months): 24,   Downloads (Overall): 145

Full text available: PDFPDF
With the increasing cost of developing robust HPC software, precise data-flow analysis for MPI programs -- the mainstay of HPC programming -- are essential. The knowledge of communication is essential for precise data-flow analysis and the difficulty of statically determining it makes the conventional techniques insufficient. Hybrid methods combining static ...
Keywords: data-flow analysis, MPI, parallel control-flow graph

14
May 2013 SE-CSE '13: Proceedings of the 5th International Workshop on Software Engineering for Computational Science and Engineering
Publisher: IEEE Press
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 1,   Downloads (12 Months): 6,   Downloads (Overall): 15

Full text available: PDFPDF
While formal correctness checking methods have been deployed at scale in a number of important practical domains, we believe that such an experiment has yet to occur in the domain of high performance computing at the scale of a million CPU cores. This paper presents preliminary results from the Uintah ...

15
November 2012 SC '12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Computer Society Press
Bibliometrics:
Citation Count: 15
Downloads (6 Weeks): 2,   Downloads (12 Months): 16,   Downloads (Overall): 155

Full text available: PDFPDF
The growing scale of concurrency requires automated abstraction techniques to cut down the effort in concurrent system analysis. In this paper, we show that the high degree of behavioral symmetry present in GPU programs allows CUDA race detection to be dramatically simplified through abstraction. Our abstraction techniques is one of ...
Keywords: GPU programming, formal analysis, data races, parameterized reasoning

16
September 2012 SBMF'12: Proceedings of the 15th Brazilian conference on Formal Methods: foundations and applications
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 0

Formal dynamic analysis of Message Passing Interface (MPI) programs is crucially important in the context of developing HPC applications. Existing dynamic verification tools for MPI programs suffer from exponential schedule explosion, especially when multiple non-deterministic receive statements are issued by a process. In this paper, we focus on detecting message-orphaning ...

17 published by ACM
February 2012 PPoPP '12: Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
Publisher: ACM
Bibliometrics:
Citation Count: 39
Downloads (6 Weeks): 11,   Downloads (12 Months): 57,   Downloads (Overall): 445

Full text available: PDFPDF
Programs written for GPUs often contain correctness errors such as races, deadlocks, or may compute the wrong result. Existing debugging tools often miss these errors because of their limited input-space and execution-space exploration. Existing tools based on conservative static analysis or conservative modeling of SIMD concurrency generate false alarms resulting ...
Keywords: formal verification, symbolic execution, virtual machine, automatic test generation, CUDA, GPU, parallelism
Also published in:
September 2012  ACM SIGPLAN Notices - PPOPP '12: Volume 47 Issue 8, August 2012

18 published by ACM
December 2011 Communications of the ACM: Volume 54 Issue 12, December 2011
Publisher: ACM
Bibliometrics:
Citation Count: 18
Downloads (6 Weeks): 13,   Downloads (12 Months): 120,   Downloads (Overall): 2,987

Full text available: HtmlHtml  PDFPDF
The goal is reliable parallel simulations, helping scientists understand nature, from how foams compress to how ribosomes construct proteins.

19
May 2011 IPDPSW '11: Proceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 0

We examine the problem of automatically and efficiently verifying the absence of communication related bugs in message passing systems, specifically in programs written using Message Passing Interface (MPI) API. A typical debugging or testing tool will fail to achieve this goal because they do not provide any guarantee of coverage ...

20 published by ACM
March 2011 SIGCSE '11: Proceedings of the 42nd ACM technical symposium on Computer science education
Publisher: ACM
Bibliometrics:
Citation Count: 6
Downloads (6 Weeks): 1,   Downloads (12 Months): 14,   Downloads (Overall): 384

Full text available: PDFPDF
Multicore computers are now the norm. Taking advantage of these multiple cores entails parallel and concurrent programming. There is therefore a pressing need for courses that teach effective programming on multicore architectures. We believe that such courses should emphasize high-level abstractions for performance and correctness and be supported by tools. ...
Keywords: concurrency, distributed, multicore, parallelism, tools



The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2018 ACM, Inc.
Terms of Usage   Privacy Policy   Code of Ethics   Contact Us