 Tareq M Malas

Bibliometrics: publication history
Average citations per article: 1.00
Citation Count: 4
Publication count: 4
Publication years: 2013-2017
Available for download: 2
Average downloads per article: 142.50
Downloads (cumulative): 285
Downloads (12 Months): 285
Downloads (6 Weeks): 96


4 results found


1 published by ACM
December 2017 ACM Transactions on Parallel Computing (TOPC): Volume 4 Issue 3, January 2018
Publisher: ACM
Citation Count: 0
Downloads (6 Weeks): 35, Downloads (12 Months): 35, Downloads (Overall): 35

Full text available: PDF
Optimizing the performance of stencil algorithms has been the subject of intense research over the last two decades. Since many stencil schemes have low arithmetic intensity, most optimizations focus on increasing the temporal data access locality, thus reducing the data traffic through the main memory interface with the ultimate goal ...
Keywords: Wireless sensor networks, media access control, multi-channel, radio interference, time synchronization

2 published by ACM
November 2017 SC '17: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: ACM
Citation Count: 0
Downloads (6 Weeks): 62, Downloads (12 Months): 251, Downloads (Overall): 251

Full text available: PDF
This paper presents the first 15-PetaFLOP Deep Learning system for solving scientific pattern classification problems on contemporary HPC architectures. We develop supervised convolutional architectures for discriminating signals in high-energy physics data as well as semi-supervised architectures for localizing and classifying extreme weather in climate data. Our Intelcaffe-based implementation obtains ~2TFLOP/s ...

November 2016 PMBS '16: Proceedings of the 7th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems
Publisher: IEEE Press
Citation Count: 0

NERSC has partnered with 20 representative application teams to evaluate performance on the Xeon-Phi Knights Landing architecture and develop an application-optimization strategy for the greater NERSC workload on the recently installed Cori system. In this article, we present early case studies and summarized results from a subset of the 20 ...

May 2013 International Journal of High Performance Computing Applications: Volume 27 Issue 2, May 2013
Publisher: Sage Publications, Inc.
Citation Count: 3

Several emerging petascale architectures use energy-efficient processors with vectorized computational units and in-order thread processing. On these architectures the sustained performance of streaming numerical kernels, ubiquitous in the solution of partial differential equations, represents a challenge despite the regularity of memory access. Sophisticated optimization techniques are required to fully utilize ...
Keywords: Blue Gene/P, code generation, performance optimization, high-performance computing, SIMD

The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2018 ACM, Inc.