Author image not provided
 Ehsan Totoni

Authors:
Add personal information
  Affiliation history
Bibliometrics: publication history
Average citations per article4.86
Citation Count68
Publication count14
Publication years2011-2017
Available for download9
Average downloads per article322.11
Downloads (cumulative)2,899
Downloads (12 Months)912
Downloads (6 Weeks)64
SEARCH
ROLE
Arrow RightAuthor only


AUTHOR'S COLLEAGUES
See all colleagues of this author

SUBJECT AREAS
See all subject areas




BOOKMARK & SHARE


15 results found Export Results: bibtexendnoteacmrefcsv

Result 1 – 15 of 15
Sort by:

1
September 2017 International Journal of High Performance Computing Applications: Volume 31 Issue 5, 9 2017
Publisher: Sage Publications, Inc.
Bibliometrics:
Citation Count: 0

Operating chips at high energy efficiency is one of the major challenges for modern large-scale supercomputers. Low-voltage operation of transistors increases the energy efficiency but leads to frequency and power variation across cores on the same chip. Finding energy-optimal configurations for such chips is a hard problem. In this work, ...
Keywords: integer programming, multicore chips, heterogeneity, optimization, energy, low-voltage computing, near-threshold voltage computing, quadratic integer programming, power, process variation

2 published by ACM
June 2017 ICS '17: Proceedings of the International Conference on Supercomputing
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 13,   Downloads (12 Months): 181,   Downloads (Overall): 242

Full text available: PDFPDF
Big data analytics requires high programmer productivity and high performance simultaneously on large-scale clusters. However, current big data analytics frameworks (e.g. Apache Spark) have prohibitive runtime overheads since they are library-based. We introduce a novel auto-parallelizing compiler approach that exploits the characteristics of the data analytics domain such as the ...
Keywords: automatic parallelization, high performance computing, big data analytics

3 published by ACM
May 2017 HotOS '17: Proceedings of the 16th Workshop on Hot Topics in Operating Systems
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 11,   Downloads (12 Months): 203,   Downloads (Overall): 250

Full text available: PDFPDF
Big data systems such as Spark are built around the idea of splitting an iterative parallel program into tiny tasks with other aspects of system design built around this basic design principle. Unfortunately, in spite of immense engineering effort, tiny tasks have unavoidably large overheads. We use the example of ...

4 published by ACM
June 2016 PLDI '16: Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation
Publisher: ACM
Bibliometrics:
Citation Count: 3
Downloads (6 Weeks): 34,   Downloads (12 Months): 431,   Downloads (Overall): 1,492

Full text available: PDFPDF
Deep neural networks (DNNs) have undergone a surge in popularity with consistent advances in the state of the art for tasks including image recognition, natural language processing, and speech recognition. The computationally expensive nature of these networks has led to the proliferation of implementations that sacrifice abstraction for high performance. ...
Keywords: Compiler, Deep Learning, Domain Specific Language, Neural Networks, Optimization
Also published in:
August 2016  ACM SIGPLAN Notices - PLDI '16: Volume 51 Issue 6, June 2016

5 published by ACM
February 2015 ACM Transactions on Parallel Computing - Special Issue on PPOPP 2012: Volume 1 Issue 2, January 2015
Publisher: ACM
Bibliometrics:
Citation Count: 3
Downloads (6 Weeks): 1,   Downloads (12 Months): 16,   Downloads (Overall): 109

Full text available: PDFPDF
Networks are among major power consumers in large-scale parallel systems. During execution of common parallel applications, a sizeable fraction of the links in the high-radix interconnects are either never used or are underutilized. We propose a runtime system based adaptive approach to turn off unused links, which has various advantages ...

6 published by ACM
February 2015 PMAM '15: Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores
Publisher: ACM
Bibliometrics:
Citation Count: 4
Downloads (6 Weeks): 2,   Downloads (12 Months): 19,   Downloads (Overall): 131

Full text available: PDFPDF
Power and energy efficiency is one of the major challenges to achieve exascale computing in the next several years. While chips operating at low voltages have been studied to be highly energy-efficient, low voltage operations lead to heterogeneity across cores within the microprocessor chip. In this work, we study chips ...
Keywords: energy, low voltage computing, quadratic integer programming, near threshold voltage computing, power, process variation, integer programming, multicore chips, heterogeneity, optimization

7
November 2014 SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Press
Bibliometrics:
Citation Count: 3
Downloads (6 Weeks): 0,   Downloads (12 Months): 14,   Downloads (Overall): 144

Full text available: PDFPDF
The cache hierarchy often consumes a large portion of a processor's energy. To save energy in HPC environments, this paper proposes software-controlled reconfiguration of the cache hierarchy with an adaptive runtime system. Our approach addresses the two major limitations associated with other methods that reconfigure the caches: predicting the application's ...

8
November 2014 SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Press
Bibliometrics:
Citation Count: 25
Downloads (6 Weeks): 2,   Downloads (12 Months): 29,   Downloads (Overall): 179

Full text available: PDFPDF
The advent of petascale computing has introduced new challenges (e.g. heterogeneity, system failure) for programming scalable parallel applications. Increased complexity and dynamism in science and engineering applications of today have further exacerbated the situation. Addressing these challenges requires more emphasis on concepts that were previously of secondary importance, including migratability, ...

9
October 2014 SBAC-PAD '14: Proceedings of the 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 0

Consumers of personal devices such as desktops, tablets, or smart phones run applications based on image or video processing, as they enable a natural computer-user interaction. The challenge with these computationally demanding applications is to execute them efficiently. One way to address this problem is to use on-chip heterogeneous systems, ...
Keywords: Evaluation of algorithms and systems, SIMD, Energy-aware systems, OpenCL

10 published by ACM
December 2013 ACM Transactions on Architecture and Code Optimization (TACO): Volume 10 Issue 4, December 2013
Publisher: ACM
Bibliometrics:
Citation Count: 2
Downloads (6 Weeks): 1,   Downloads (12 Months): 14,   Downloads (Overall): 286

Full text available: PDFPDF
We optimize a visual object detection application (that uses Vision Video Library kernels) and show that OpenCL is a unified programming paradigm that can provide high performance when running on the Ivy Bridge heterogeneous on-chip architecture. We evaluate different mapping techniques and show that running each kernel where it fits ...
Keywords: Energy efficiency, OpenCL, SIMD, heterogeneous on-chip architectures, portable (mobile) devices

11
May 2013 IPDPSW '13: Proceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 2

Higher radix networks, such as high-dimensional tori and multi-level directly connected networks, are being used for supercomputers as they become larger but need lower diameter. These networks have more resources (e.g. links) in order to provide good performance for a range of applications. We observe that a sizeable fraction of ...

12
December 2012 IEEE Transactions on Computers: Volume 61 Issue 12, December 2012
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 9

As we move to exascale machines, both peak power demand and total energy consumption have become prominent challenges. A significant portion of that power and energy consumption is devoted to cooling, which we strive to minimize in this work. We propose a scheme based on a combination of limiting processor ...
Keywords: Energy consumption,Energy efficiency,Load management,Runtime,Energy management,Green design,Temperature measurement,DVFS,Green IT,temperature aware,load balancing,cooling energy

13
April 2012 ISPASS '12: Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 9

Power dissipation and energy consumption are becoming increasingly important architectural design constraints in different types of computers, from embedded systems to large-scale supercomputers. To continue the scaling of performance, it is essential that we build parallel processor chips that make the best use of exponentially increasing numbers of transistors within ...

14
December 2011 ICPADS '11: Proceedings of the 2011 IEEE 17th International Conference on Parallel and Distributed Systems
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 5

Hardware and software co-design is becoming increasingly important due to complexities in supercomputing architectures. Simulating applications before there is access to the real hardware can assist machine architects in making better design decisions that can optimize application performance. At the same time, the application and runtime can be optimized and ...
Keywords: simulation, performance prediction, mapping, system noise, collective communication

15 published by ACM
November 2011 SC '11 Companion: Proceedings of the 2011 companion on High Performance Computing Networking, Storage and Analysis Companion
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 0,   Downloads (12 Months): 5,   Downloads (Overall): 66

Full text available: PDFPDF
Communication algorithms play a crucial role in the performance of large-scale parallel systems. They are implemented in runtime systems and used in most parallel applications as a critical component. As vendors are willing to design new custom networks with significantly different performance properties for their new supercomputers, designing new efficient ...
Keywords: PERCS, all-to-all, bigsim, communication algorithm, pairwise-exchange, simulation



The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2018 ACM, Inc.
Terms of Usage   Privacy Policy   Code of Ethics   Contact Us