Abstract
We explore the emerging application area of physics-based simulation for computer animation and visual special effects. In particular, we examine its parallelization potential and characterize its behavior on a chip multiprocessor (CMP). Applications in this domain model and simulate natural phenomena, and often direct visual components of motion pictures. We study a set of three workloads that exemplify the span and complexity of physical simulation applications used in a production environment: fluid dynamics, facial animation, and cloth simulation. They are computationally demanding, requiring from a few seconds to several minutes to simulate a single frame; therefore, they can benefit greatly from the acceleration possible with large-scale CMPs.
Starting with serial versions of these applications, we parallelize code accounting for at least 96% of the serial execution time, targeting a large number of threads. We then study the most expensive modules using a simulated 64-core CMP.
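As a rough illustration of why the parallelized fraction matters, Amdahl's law bounds whole-application speedup when only part of the serial time scales. The sketch below is not from the paper: it assumes exactly 96% coverage (the text says "at least 96%"), perfect scaling of the parallel part, and no synchronization or memory overheads. The function name `amdahl_speedup` is hypothetical.

```python
def amdahl_speedup(parallel_fraction: float, cores: int) -> float:
    """Upper bound on speedup when only `parallel_fraction` of serial time scales."""
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be in [0, 1]")
    if cores < 1:
        raise ValueError("cores must be >= 1")
    serial_fraction = 1.0 - parallel_fraction
    # The serial part is unchanged; the parallel part is divided across the cores.
    return 1.0 / (serial_fraction + parallel_fraction / cores)


# Assumed inputs: 96% coverage (lower bound quoted in the text) on 64 cores.
bound_64 = amdahl_speedup(0.96, 64)
print(f"Amdahl bound at 64 cores: {bound_64:.2f}x")
```

Under these assumptions the bound is roughly 18x, which suggests that any remaining serial code limits whole-application gains well below the scaling of individual modules; higher coverage raises the bound.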
For the code representing key modules, we achieve parallel scaling of 45x, 50x, and 30x for fluid, face, and cloth simulations, respectively. The modules have a spectrum of parallel task granularity and locking behavior, and all but one are dominated by loop-level parallelism. Many modules operate on streams of data. In some cases, modules iterate over their data, leading to significant temporal locality. This streaming behavior leads to very high on-die and main memory bandwidth requirements. Finally, most modules have little inter-thread communication since they are data-parallel, but a few require heavy communication between data-parallel operations.
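One way to read these scaling figures is as parallel efficiency, that is, speedup divided by core count. The sketch below assumes the reported numbers were measured using all 64 cores of the simulated CMP, which the abstract implies but does not state outright; the names `reported_scaling` and `parallel_efficiency` are hypothetical.

```python
CORES = 64  # assumed: scaling measured on the full simulated 64-core CMP

# Parallel scaling of the key modules, as reported in the abstract.
reported_scaling = {"fluid": 45.0, "face": 50.0, "cloth": 30.0}


def parallel_efficiency(speedup: float, cores: int) -> float:
    """Fraction of ideal linear speedup actually achieved."""
    return speedup / cores


efficiency = {
    name: parallel_efficiency(speedup, CORES)
    for name, speedup in reported_scaling.items()
}

for name, value in efficiency.items():
    print(f"{name}: {value:.1%} of ideal scaling")
```

Read this way, the face and fluid modules would sit at roughly 70 to 80% of ideal scaling and cloth at just under half, consistent with the text's note that a few modules require heavy communication between data-parallel operations.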
Physical simulation for animation and visual effects: parallelization and characterization for chip multiprocessors
ISCA '07: Proceedings of the 34th Annual International Symposium on Computer Architecture