Abstract
Ray tracing within a uniform grid volume is a fundamental process invoked frequently by many applications, especially radiation-dose calculation methods in radiotherapy. However, the conflicting features between the GPU memory architecture and the memory-accessing patterns of volume ray tracing lead to inefficient usage of GPU memory bandwidth and waste of capability of modern GPUs. To improve the ray tracing performance on GPU, we propose a lookup-table-based ray tracing method which is specially optimized towards the GPU memory system for processing a massive number of rays. The proposed method is based on a key observation that many of these applications normally involves a massive number of rays, but their ray tracing may not need to follow a specific execution order. Therefore, we divide the 3D space into many regions (called pyramids) and group together the rays falling into the same pyramid. For each ray group, the volume is rotated and resampled for their raytracing. This divide-and-rotate strategy allows the memory access of the ray tracing process to adopt a table-lookup approach and leads to better memory coalescing on GPU. Our proposed method was thoroughly evaluated in four volume setups with randomly-generated rays. The collapsed-cone convolution/superposition (CCCS) dose calculation method is also implemented with/without the proposed approach to verify the feasibility of our method. Compared with the direct GPU implementation of the popular 3DDDA algorithm, our method provides a speedup in the range of 1.91--2.94X for the volume settings we used. Major performance factors, including ray origins, volume size, and pyramid size, are also analyzed. The proposed technique was also found to be able to give a speedup of 1.61--2.17X over the original GPU implementation of the CCCS algorithm. Our experiment results indicate that the proposed approach is capable of offering better coalesced memory access which eventually boosts the raytracing performance on GPU. Moreover, our approach is conceptually simple and can be readily included into various applications.
- Ahnesj¨, A. 1997. Cone discretization for the collapsed cone algorithm, In Proceedings of the International Conference on the Use of Computers in Radiation Therapy. 114--16.Google Scholar
- Ahnesj¨, A. 1989. Collapsed cone convolution of radiant energy for photon dose calculation in heterogeneous media. Med. Phys. 16, 577--92.Google Scholar
Cross Ref
- Ahnesj¨, A. and Aspradakis, M. M. 1999. Dose calculations for external photon beams in radiotherapy, Phys. Med. Biol. 44, 99--155Google Scholar
Cross Ref
- Ahunbay, E., Chen, G., Peng, C., Yu, C., Narayanan, S., and Li, X. A. 2007. An on-line adaptive planning strategy for inter-fraction image guidance. Int. J. Radiat. Oncol. Biol. Phys. 69, 3, S23.Google Scholar
Cross Ref
- Amanatides, J. and Woo, A. 1987. A fast voxel traversal algorithm for ray tracing. In Proceedings of the European Computer Graphics Conference and Exhibition (Eurographics). 3--10.Google Scholar
- Boulos, S., Edwards, D., Lacewell, J. D., Kniss, J., Kautz, J., Wald, I., and Shirley, P. 2007. Packet-based Whitted and distribution ray tracing. In Proceedings of Graphics Interface. 177--184. Google Scholar
Digital Library
- Cameron, C. 2005. Sweeping-window arc therapy: An implementation of rotational IMRT with automatic beam-weight calculation. Phys. Med. Biol. 50, 4317--4336.Google Scholar
Cross Ref
- Chen, Q., Chen, M., and, Lu, W. 2011. Ultrafast convolution/superposition using tabulated and exponential kernels on GPU. Med. Phy. 38, 1150--1161.Google Scholar
Cross Ref
- De Greef, M., Crezee, J., van Eijk, J. C., Pool, R., and Bel, A. 2009. Accelerated ray tracing for radiotherapy dose calculations on a GPU. Med. Phys. 36, 9, 4095--4192.Google Scholar
Cross Ref
- Engel, K., Hadwiger, M., Kniss, J. M., Lefohn, A. E., Salama, C. R, and Weiskopf, D. 2004. Real-time volume graphics. SIGGRAPH 2004 Course Notes. Google Scholar
Digital Library
- Ernst, M. and Greiner, G. 2007. Early split clipping for bounding volume hierarchies. In Proceedings of the European Computer Graphics Conference and Exhibition/(Eurographics)/IEEE Symposium on Interactive Ray Tracing. 73--78 Google Scholar
Digital Library
- Feng, Y., Castro-Pareja, C., Shekhar, R., and Yu, C. 2006. Direct aperture deformation—An inter-fraction image guidance strategy. Med. Phys. 33, 12, 4490--4498.Google Scholar
Cross Ref
- Hadwiger, M., Ljung, P., Salama, C. R., and Ropinski, T. 2008. Advanced illumination techniques for GPU volume raycasting. In Proceedings of the SIGGRAPH Asia 2008 Courses. 1--166. Google Scholar
Digital Library
- Hissoiny, S., Ozell, B., and Depres, P. 2009. Fast convolution-superposition dose calculation on graphics hardware. Med. Phys., 36, 6, 1998--2005.Google Scholar
Cross Ref
- Jacques, R., Taylor, R., Wong, J., and McNutt, T. 2008. Towards real-time radiation therapy: GPU accelerated superposition/convolution. In Proceedings of the High-Performance Medical Image Computing and Computer Aided Intervention Workshop.Google Scholar
- Jia, X., Gu, X., Sempau, J., Choi, D., Majumdar, A., and Jiang, S. B. 2010. Development of a GPU-based Monte Carlo dose calculation code for coupled electron-photon transport. Phys. Med. Biol. 55, 11, 3077--3086.Google Scholar
Cross Ref
- Kniss, J., Kindlmann, G., and Charles, H. 2001. Interactive volume rendering using multi-dimensional transfer functions and direct manipulation widgets. In Proceedings of the Conference on Visualization. 255--262. Google Scholar
Digital Library
- Mohan, R., Zhang, X., Wang, H., Kang, Y., Wang, X., Liu, H., Ang, K., Kuban, D., and Dong, L. 2005. Use of deformed intensity distributions for online modification of image-guided IMRT to account for interfractional anatomic changes. Int. J. Radiat. Oncol. Biol. Phys. 61, 4, 1258--1266.Google Scholar
Cross Ref
- Parker, S., Parker, M., Livnat, Y., Sloan, P., Hansen, C., and Shirley, P. 1999. Interactive ray tracing for volume visualization. IEEE Trans. Comput. Graphics Visual. 5, 3, 238--250. Google Scholar
Digital Library
- Parker, S. G., Bigler, J., Dietrich, A., Friedrich, H., Hoberock, J., Luebke, D., McAllister, D., McGuire, M., Morley, K., Robison, A., and Stich, M. 2010. OptiX: A general purpose ray tracing engine. ACM Trans. Graph. 29, 4, 1--13. Google Scholar
Digital Library
- Purcell, T. J., Buck, I., Mark, W. R., and Hanrahan, P. 2002. Ray tracing on programmable graphics hardware. In Proceedings of the International Conference on Computer Graphics and Interactive Techniques (SIGGRAPH'02), 703--712. Google Scholar
Digital Library
- Reckwerdt, P. J. and Mackie, T. R. 1992. Superposition/convolution speed improvements using run-length raytracing. Med. Phys. 19, 784.Google Scholar
- Reshetov, A. 2006. Omnidirectional ray tracing traversal algorithm for kd-trees. In Proceedings of the IEEE Symposium on Interactive Ray Tracing. 57--60.Google Scholar
Cross Ref
- Spackman, J. and Willis, P. 1991. The SMART navigation of a ray through an oct-tree. Comput. Graphics, 15, 2, 185--194.Google Scholar
Cross Ref
- Shepard, D. M., Cao, D., Afghan, M. K. N., and Earl, M. A. 2007. An arc-sequencing algorithm for intensity modulated arc therapy. Med. Phys. 34, 464--470.Google Scholar
Cross Ref
- Wald, I., Gribble, C. P., Boulos, S., and Kensler, A. 2007. SIMD Ray Stream Tracing - SIMD Ray Traversal with Generalized Ray Packets and On-the-fly Re-Ordering. Tech. rep., UUSCI-2007-012, Scientific Computing and Imaging Institute, University of Utah.Google Scholar
- Wald, I., Boulos, S., and Shirley, P. 2007. Ray tracing deformable scenes using dynamic bounding volume hierarchies. ACM Trans. Graphics 26, 1, 1--18. Google Scholar
Digital Library
- Wald, I., Friedrich, H., Marmitt, G., Slusallek, P., and Seidel, H. P. 2005. Faster isosurface ray tracing using implicit KD-trees. IEEE Trans. Visual. Comput. Graphics 11, 5, 562--573. Google Scholar
Digital Library
- Wald, I., Ize, T., Kensler, A., Knoll, A., and Parker, S. 2006. Ray tracing animated scenes using coherent grid traversal. ACM Trans. Graphics 25, 3, 485--493. Google Scholar
Digital Library
- Zhou, B., Yu, C. X., Chen, D. Z., and Hu, X. S. 2010. GPU-accelerated Monte Carlo convolution/superposition implementation for dose calculation. Med. Phys. 37, 11, 5593--5603.Google Scholar
Cross Ref
- Zhou, B., Hu, X. S., and Chen, D. Z. 2011. Memory-efficient volume ray tracing on GPU for radiotherapy. In Proceedings of the 9th IEEE Symposium on Application Specific Processors. 42--51. Google Scholar
Digital Library
Index Terms
GPU-optimized volume ray tracing for massive numbers of rays in radiotherapy
Recommendations
Real-Time Ray Tracing with CUDA
ICA3PP '09: Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel ProcessingThe graphics processors (GPUs) have recently emerged as a low-cost alternative for parallel programming. Since modern GPUs have great computational power as well as high memory bandwidth, running ray tracing on them has been an active field of research ...
Ray tracing via GPU rasterization
Ray tracing is a dominant method for generating a wide variety of global illumination effects, such as reflections/refractions, shadows, etc. In this paper, we propose an efficient technique to perform nearly accurate ray tracing using the programmable ...
Improving the Performance of a Ray Tracing Algorithm Using a GPU
SCCC '10: Proceedings of the 2010 XXIX International Conference of the Chilean Computer Science SocietyThis article presents the application of parallel computing techniques using a Graphics Processing Unit (GPU)in order to improve the computational efficiency of a ray tracing algorithm. Three different GPU implementations of the ray tracing algorithm ...






Comments