Author image not provided
 Robert Brian Ross

Authors:
Add personal information
  Affiliation history
Bibliometrics: publication history
Average citations per article11.78
Citation Count1,178
Publication count100
Publication years1996-2018
Available for download41
Average downloads per article296.07
Downloads (cumulative)12,139
Downloads (12 Months)673
Downloads (6 Weeks)74
SEARCH
ROLE
Arrow RightAuthor only


AUTHOR'S COLLEAGUES
See all colleagues of this author

SUBJECT AREAS
See all subject areas




BOOKMARK & SHARE


95 results found Export Results: bibtexendnoteacmrefcsv

Result 1 – 20 of 95
Result page: 1 2 3 4 5

Sort by:

1 published by ACM
October 2018 ACM Transactions on Storage (TOS) - Special Issue on FAST 2018 and Regular Papers: Volume 14 Issue 3, November 2018
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 4,   Downloads (12 Months): 108,   Downloads (Overall): 108

Full text available: PDFPDF
Fail-slow hardware is an under-studied failure mode. We present a study of 114 reports of fail-slow hardware incidents, collected from large-scale cluster deployments in 14 institutions. We show that all hardware types such as disk, SSD, CPU, memory, and network components can exhibit performance faults. We made several important observations ...
Keywords: Hardware fault, fail-slow, fail-stutter, jitter, limpware, performance

2
February 2018 FAST'18: Proceedings of the 16th USENIX Conference on File and Storage Technologies
Publisher: USENIX Association
Bibliometrics:
Citation Count: 0

Fail-slow hardware is an under-studied failure mode. We present a study of 101 reports of fail-slow hardware incidents, collected from large-scale cluster deployments in 12 institutions. We show that all hardware types such as disk, SSD, CPU, memory and network components can exhibit performance faults. We made several important observations ...

3 published by ACM
November 2017 PDSW-DISCS '17: Proceedings of the 2nd Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 6,   Downloads (12 Months): 18,   Downloads (Overall): 87

Full text available: PDFPDF
Data management is a critical component of high-performance computing, with storage as a cornerstone. Yet the traditional model of parallel file systems fails to meet users' needs, in terms of both performance and features. In this paper, we propose CoSS, a new storage model based on contracts. Contracts encapsulate in ...
Keywords: CoSS, HPC, I/O, contract, data model, metadata, storage

4 published by ACM
November 2017 WORKS '17: Proceedings of the 12th Workshop on Workflows in Support of Large-Scale Science
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 0,   Downloads (12 Months): 23,   Downloads (Overall): 58

Full text available: PDFPDF
While the use of workflows for HPC is growing, MPI interoperability remains a challenge for workflow management systems. The MPI standard and/or its implementations provide a number of ways to build multiple-programs-multiple-data (MPMD) applications. These methods present limitations related to fault tolerance , and are not easy to use. In ...
Keywords: Cram, MPI, MPI_Comm_launch, MPMD, Swift/T, ensemble simulations, workflows

5
November 2016 ESPT '16: Proceedings of the 5th Workshop on Extreme-Scale Programming Tools
Publisher: IEEE Press
Bibliometrics:
Citation Count: 4
Downloads (6 Weeks): 0,   Downloads (12 Months): 6,   Downloads (Overall): 38

Full text available: PDFPDF
Contemporary high-performance computing (HPC) applications encompass a broad range of distinct I/O strategies and are often executed on a number of different compute platforms in their lifetime. These large-scale HPC platforms employ increasingly complex I/O subsystems to provide a suitable level of I/O performance to applications. Tuning I/O workloads for ...

6
November 2016 PDSW-DISCS '16: Proceedings of the 1st Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems
Publisher: IEEE Press
Bibliometrics:
Citation Count: 0

As the amount of memory per core decreases in post-petascale machines, the memory footprint of any libraries and middleware used by HPC applications must be reduced. While scientific data can contain a great deal of entropy and require specialized compression techniques, the descriptions of scientific data layouts, as opposed to ...

7
October 2016 Parallel Computing: Volume 58 Issue C, October 2016
Publisher: Elsevier Science Publishers B. V.
Bibliometrics:
Citation Count: 1

Rich metadata in high-performance computing (HPC) systems contains extended information about users, jobs, data files, and their relationships. Property graphs are a promising data model to represent heterogeneous rich metadata flexibly. Specifically, a property graph can use vertices to represent different entities and edges to record the relationships between vertices ...
Keywords: Graph partitioning, Parallel file systems, Property graph, Rich metadata management, Graph traversal

8
May 2016 CCGRID '16: Proceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing
Publisher: IEEE Press
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 0,   Downloads (12 Months): 0,   Downloads (Overall): 0

Full text available: PDFPDF
This paper explores novel techniques for improving the performance of many-task workflows based on the Swift scripting language. We propose novel programmer options for automated distributed data placement and task scheduling. These options trigger a data placement mechanism used for distributing intermediate workflow data over the servers of Hercules, a ...
Keywords: data locality, file systems, high performance, load balance, scientific workflows

9
May 2016 CCGRID '16: Proceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing
Publisher: IEEE Press
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 0,   Downloads (12 Months): 0,   Downloads (Overall): 0

Full text available: PDFPDF
On current large-scale HPC platforms the data path from compute nodes to final storage passes through several networks interconnecting a distributed hierarchy of nodes serving as compute nodes, I/O nodes, and file system servers. Although applications compete for resources at various system levels, the current system software offers no mechanisms ...
Keywords: HPC, I/O scheduling, collective I/O, data staging, parallel I/O, storage

10
September 2015 CLUSTER '15: Proceedings of the 2015 IEEE International Conference on Cluster Computing
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 3

The optimization of parallel I/O has become challenging because of the increasing storage hierarchy, performance variability of shared storage systems, and the number of factors in the hardware and software stacks that impact performance. In this paper, we perform an in-depth study of the complexity involved in I/O autotuning and ...
Keywords: I/O performance modeling, statistical and analytical performance models, model-based tuning

11
September 2015 CLUSTER '15: Proceedings of the 2015 IEEE International Conference on Cluster Computing
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 1

Property graphs are a promising data model for rich metadata management in high-performance computing (HPC) systems because of their ability to represent not only metadata attributes but also the relationships between them. A property graph can be used to record the relationships between users, jobs, and data, for example, with ...
Keywords: Parallel File Systems, Rich Metadata Management, Property Graph, Graph Traversal, Property Graph Databases

12 published by ACM
June 2015 HPDC '15: Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing
Publisher: ACM
Bibliometrics:
Citation Count: 21
Downloads (6 Weeks): 18,   Downloads (12 Months): 141,   Downloads (Overall): 638

Full text available: PDFPDF
We examine the I/O behavior of thousands of supercomputing applications "in the wild," by analyzing the Darshan logs of over a million jobs representing a combined total of six years of I/O behavior across three leading high-performance computing platforms. We mined these logs to analyze the I/O behavior of applications ...
Keywords: hpc, input/output, parallel i/o, performance analysis

13
November 2014 PDSW '14: Proceedings of the 9th Parallel Data Storage Workshop
Publisher: IEEE Press
Bibliometrics:
Citation Count: 4
Downloads (6 Weeks): 1,   Downloads (12 Months): 2,   Downloads (Overall): 40

Full text available: PDFPDF
HPC platforms are capable of generating huge amounts of metadata about different entities including jobs, users, and files. Simple metadata , which describe the attributes of these entities (e.g., file size, name, and permissions mode), has been well recorded and used in current systems. However, only a limited amount of ...

14
November 2014 SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Press
Bibliometrics:
Citation Count: 3
Downloads (6 Weeks): 2,   Downloads (12 Months): 11,   Downloads (Overall): 165

Full text available: PDFPDF
Object storage is considered a promising solution for next-generation (exascale) high-performance computing platform because of its flexible and high-performance object interface. However, delivering high burst-write throughput is still a critical challenge. Although deploying more storage servers can potentially provide higher throughput, it can be ineffective because the burst-write throughput can ...

15
November 2014 SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Press
Bibliometrics:
Citation Count: 4
Downloads (6 Weeks): 0,   Downloads (12 Months): 7,   Downloads (Overall): 179

Full text available: PDFPDF
The increasing gap between the computation performance of post-petascale machines and the performance of their I/O subsystem has motivated many I/O optimizations including prefetching, caching, and scheduling techniques. In order to further improve these techniques, modeling and predicting spatial and temporal I/O patterns of HPC applications as they run has ...
Keywords: prediction, I/O, grammar, HPC, Omnisc'IO, exascale, storage

16
November 2014 International Journal of High Performance Computing Applications: Volume 28 Issue 4, November 2014
Publisher: Sage Publications, Inc.
Bibliometrics:
Citation Count: 0

The message passing interface (MPI) is one of the most portable high-performance computing (HPC) programming models, with platform-optimized implementations typically delivered with new HPC systems. Therefore, for distributed services requiring portable, high-performance, user-level network access, MPI promises to be an attractive alternative to custom network portability layers, platform-specific methods, or ...
Keywords: Message passing interface (MPI), distributed services, fault-tolerance, storage, cancellation

17 published by ACM
June 2014 HPDC '14: Proceedings of the 23rd international symposium on High-performance parallel and distributed computing
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 2,   Downloads (12 Months): 6,   Downloads (Overall): 139

Full text available: PDFPDF
In recent years, more and more applications in cloud have needed to process large-scale on-line data sets that evolve over time as entries are added or modified. Several programming frameworks, such as Percolator and Oolong, are proposed for such incremental data processing and can achieve efficient updates with an event-driven ...
Keywords: cloud computing, incremental computing, mapreduce

18
May 2014 CCGRID '14: Proceedings of the 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing
Publisher: IEEE Press
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 0,   Downloads (12 Months): 0,   Downloads (Overall): 0

Full text available: PDFPDF
Provenance, also known as lineage, is metadata that describes the history of an object [1], [2]. It reveals the detailed information about applications and data sets, which can be used to capture the system status. However, although there have been numerous research studies about the design and implementation of provenance ...

19
May 2014 IPDPS '14: Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 8

Unmatched computation and storage performance in new HPC systems have led to a plethora of I/O optimizations ranging from application-side collective I/O to network and disk-level request scheduling on the file system side. As we deal with ever larger machines, the interference produced by multiple applications accessing a shared parallel ...
Keywords: I/O, Parallel File Systems, Cross-Application Contention, Interference, CALCioM

20
March 2014 Cluster Computing: Volume 17 Issue 1, March 2014
Publisher: Kluwer Academic Publishers
Bibliometrics:
Citation Count: 0

Programming development tools are a vital component for understanding the behavior of parallel applications. Event tracing is a principal ingredient to these tools, but new and serious challenges place event tracing at risk on extreme-scale machines. As the quantity of captured events increases with concurrency, the additional data can overload ...
Keywords: Atomic append, Event tracing, I/O forwarding



The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2019 ACM, Inc.
Terms of Usage   Privacy Policy   Code of Ethics   Contact Us