|
ROLE
BOOKMARK & SHARE
|
|
1
August 2015
E-SCIENCE '15: Proceedings of the 2015 IEEE 11th International Conference on e-Science
Publisher: IEEE Computer Society
Cosmological N-body simulations are essential for studies of the large-scale distribution of matter and galaxies in the Universe. This analysis often involves finding clusters of particles and retrieving their properties. Detecting such "halos" among a very large set of particles is a computationally intensive problem, usually executed on the same ...
Keywords:
Stream Algorithm, Halo Finder, N-body Simulation, Cosmology
2
February 2015
FAST'15: Proceedings of the 13th USENIX Conference on File and Storage Technologies
Publisher: USENIX Association
Graph analysis performs many random reads and writes, thus, these workloads are typically performed in memory. Traditionally, analyzing large graphs requires a cluster of machines so the aggregate memory exceeds the graph size. We demonstrate that a multicore server can process graphs with billions of vertices and hundreds of billions ...
3
December 2014
WSC '14: Proceedings of the 2014 Winter Simulation Conference
Publisher: IEEE Press
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 2, Downloads (12 Months): 4, Downloads (Overall): 40
Full text available:
PDF
High Performance Computing is becoming an instrument in its own right. The largest simulations performed on our supercomputers are now approaching petabytes. As the volume of these simulations is growing, it is becoming harder to access, analyze and visualize these data. At the same time for a broad community buy ...
4
December 2014
WSC '14: Proceedings of the 2014 Winter Simulation Conference
Publisher: IEEE Press
Bibliometrics:
Citation Count: 2
Downloads (6 Weeks): 3, Downloads (12 Months): 6, Downloads (Overall): 55
Full text available:
PDF
Computerized decision making is becoming a reality with exponentially growing data and machine capabilities. Some decision making is extremely complex, historically reserved for governing bodies or market places where the collective human experience and intelligence come to play. Other decision making can be trusted to computers that are on a ...
5
November 2014
SenSys '14: Proceedings of the 12th ACM Conference on Embedded Network Sensor Systems
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 2, Downloads (12 Months): 14, Downloads (Overall): 119
Full text available:
PDF
Time synchronization is an essential service in many sensor network applications. Harsh environment which causes nodes to fail, go offline, or reboot can challenge many time synchronization protocols. In this work, we first characterize this challenge and use a real time clock in one of the nodes in the network ...
6
June 2014
SSDBM '14: Proceedings of the 26th International Conference on Scientific and Statistical Database Management
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 9, Downloads (12 Months): 20, Downloads (Overall): 77
Full text available:
PDF
We present a case study about the spatial indexing and regional classification of billions of geographic coordinates from geo-tagged social network data using Hierarchical Triangular Mesh (HTM) implemented for Microsoft SQL Server. Due to the lack of certain features of the HTM library, we use it in conjunction with the ...
7
June 2014
SSDBM '14: Proceedings of the 26th International Conference on Scientific and Statistical Database Management
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 6, Downloads (12 Months): 41, Downloads (Overall): 176
Full text available:
PDF
We introduce the concept of the point cloud database, a new kind of database system aimed primarily towards scientific applications. Many scientific observations, experiments, feature extraction algorithms and large-scale simulations produce enormous amounts of data that are better represented as sparse (but often highly-clustered) points in a k-dimensional ( k ...
Keywords:
multi-dimensional database, proximity join, spatial indexing
8
November 2013
SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Publisher: ACM
Bibliometrics:
Citation Count: 5
Downloads (6 Weeks): 6, Downloads (12 Months): 56, Downloads (Overall): 527
Full text available:
PDF
We describe a storage system that removes I/O bottlenecks to achieve more than one million IOPS based on a userspace file abstraction for arrays of commodity SSDs. The file abstraction refactors I/O scheduling and placement for extreme parallelism and non-uniform memory and I/O. The system includes a set-associative, parallel page ...
Keywords:
millions of IOPS, data-intensive computing, solid-state storage devices, low cost, page cache optimization
9
July 2013
SSDBM: Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 1, Downloads (12 Months): 20, Downloads (Overall): 87
Full text available:
PDF
Molecular dynamics (MD) simulations generate detailed time-series data of all-atom motions. These simulations are leading users of the world's most powerful supercomputers, and are standard-bearers for a wide range of high-performance computing (HPC) methods. However, MD data exploration and analysis is in its infancy in terms of scalability, ease-of-use, and ...
10
Randal Burns,
Kunal Lillaney,
Daniel R. Berger,
Logan Grosenick,
Karl Deisseroth,
R. Clay Reid,
William Gray Roncal,
Priya Manavalan,
Davi D. Bock,
Narayanan Kasthuri,
Michael Kazhdan,
Stephen J. Smith,
Dean Kleissas,
Eric Perlman,
Kwanghun Chung,
Nicholas C. Weiler,
Jeff Lichtman,
Alexander S. Szalay,
Joshua T. Vogelstein,
R. Jacob Vogelstein
July 2013
SSDBM: Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 4, Downloads (12 Months): 41, Downloads (Overall): 250
Full text available:
PDF
We describe a scalable database cluster for the spatial analysis and annotation of high-throughput brain imaging data, initially for 3-d electron microscopy image stacks, but for time-series and multi-channel data as well. The system was designed primarily for workloads that build connectomes ---neural connectivity maps of the brain---using the parallel ...
Keywords:
connectomics, data-intensive computing
11
July 2013
SSDBM: Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Publisher: ACM
Bibliometrics:
Citation Count: 3
Downloads (6 Weeks): 2, Downloads (12 Months): 9, Downloads (Overall): 76
Full text available:
PDF
We describe the challenges arising from tracking dark matter particles in state of the art cosmological simulations. We are in the process of running the Indra suite of simulations, with an aggregate count of more than 35 trillion particles and 1.1PB of total raw data volume. However, it is not ...
Keywords:
inverted index, cosmological N-body simulations
12
July 2013
SSDBM: Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Publisher: ACM
Bibliometrics:
Citation Count: 2
Downloads (6 Weeks): 1, Downloads (12 Months): 12, Downloads (Overall): 109
Full text available:
PDF
Many fields of science rely on relational database management systems to analyze, publish and share data. Since RDBMS are originally designed for, and their development directions are primarily driven by, business use cases they often lack features very important for scientific applications. Horizontal scalability is probably the most important missing ...
13
May 2013
Computing in Science and Engineering: Volume 15 Issue 3, May 2013
Publisher: IEEE Educational Activities Department
Astronomical discoveries often happen at the edge of our observational capabilities. To fully analyze telescopic images, researchers must combine data from separate telescopes, but large volumes of data with intrinsic differences make this difficult. SkyQuery, a scalable query engine, helps with this process.
Keywords:
Image processing,Databases,Astronomy,Telescopes,Photonics,Distributed databases,Scientific computing,Query processing,Virtual environments,Probability,scientific computing,distributed database,query language,probabilistic cross-match,virtual observatory
14
December 2012
IEEE Transactions on Visualization and Computer Graphics: Volume 18 Issue 12, December 2012
Publisher: IEEE Educational Activities Department
Despite the ongoing efforts in turbulence research, the universal properties of the turbulence small-scale structure and the relationships between small- and large-scale turbulent motions are not yet fully understood. The visually guided exploration of turbulence features, including the interactive selection and simultaneous visualization of multiple features, can further progress our ...
15
November 2012
SCC '12: Proceedings of the 2012 SC Companion: High Performance Computing, Networking Storage and Analysis
Publisher: IEEE Computer Society
Stream processing methods and online algorithms are increasingly appealing in the scientific and large-scale data management communities due to increasing ingestion rates of scientific instruments, the ability to produce and inspect results interactively, and the simplicity and efficiency of sequential storage access over enormous datasets. This article will showcase our ...
Keywords:
Streaming analysis, streaming algorithm, principal component analysis, galaxy spectra, streaming PCA, robust PCA
16
November 2012
SC '12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Computer Society Press
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 0, Downloads (12 Months): 2, Downloads (Overall): 138
Full text available:
PDF
We present a query processing framework for the efficient evaluation of spatial filters on large numerical simulation datasets stored in a data-intensive cluster. Previously, filtering of large numerical simulations stored in scientific databases has been impractical owing to the immense data requirements. Rather, filtering is done during simulation or by ...
17
November 2012
SC '12: Proceedings of the 2012 International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Computer Society
We present a query processing framework for the efficient evaluation of spatial filters on large numerical simulation datasets stored in a data-intensive cluster. Previously, filtering of large numerical simulations stored in scientific databases has been impractical owing to the immense data requirements. Rather, filtering is done during simulation or by ...
18
June 2012
SSDBM'12: Proceedings of the 24th international conference on Scientific and Statistical Database Management
Publisher: Springer-Verlag
Multi-wavelength astronomical studies require cross-identification of detections of the same celestial objects in multiple catalogs based on spherical coordinates and other properties. Because of the large data volumes and spherical geometry, the symmetric N-way association of astronomical detections is a computationally intensive problem, even when sophisticated indexing schemes are used ...
Keywords:
query optimization and languages, probabilistic join, workflow, astronomical catalogs, computational statistics
19
June 2012
DIDC '12: Proceedings of the fifth international workshop on Data-Intensive Distributed Computing Date
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 2, Downloads (12 Months): 13, Downloads (Overall): 111
Full text available:
PDF
Scientific computing is increasingly revolving around massive amounts of data. From physical sciences to numerical simulations to high throughput genomics and homeland security, we are soon dealing with Petabytes if not Exabytes of data. This new, data-centric computing requires a new look at computing architectures and strategies. We will revisit ...
Keywords:
fourth paradigm, simulations, big data, astronomy
20
June 2012
HotStorage'12: Proceedings of the 4th USENIX conference on Hot Topics in Storage and File Systems
Publisher: USENIX Association
We present a set-associative page cache for scalable parallelism of IOPS in multicore systems. The design eliminates lock contention and hardware cache misses by partitioning the global cache into many independent page sets, each requiring a small amount of metadata that fits in few processor cache lines. We extend this ...
|
|