ABSTRACT
We propose a high performance, GPU integrated, hardware ray tracing system. We present and make use of a new analysis of ray traversal in axis aligned bounding volume hierarchies. This analysis enables compact traversal hardware through the use of reduced precision arithmetic. We also propose a new cache based technique for scheduling ray traversal. With the addition of our compact fixed function traversal unit and cache mechanism, we show that current GPU architectures are well suited for hardware accelerated ray tracing, requiring only small modifications to provide high performance. By making use of existing GPU resources we are able to keep all rays and scheduling traffic on chip and out of caches. We used simulations to estimate the performance of our architecture. Our system achieves an average ray rate of 3.4 billion rays per second while path tracing our test scenes.
- {AK10} Aila T., Karras T.: Architecture considerations for tracing incoherent rays. In Proc. High-Performance Graphics 2010 (2010), pp. 113--122. 1, 2, 5, 7, 8, 10 Google Scholar
- {AL09} Aila T., Laine S.: Understanding the efficiency of ray traversal on gpus. In Proc. High-Performance Graphics 2009 (2009), pp. 145--149. 2 Google Scholar
- {AMD13a} AMD: Opencl programming guide. http://developer.amd.com/tools-and-sdks/heterogeneous-computing/amd-accelerated-parallel-processing-app-sdk/documentation/amd-app-documentation/, November 2013. 5, 6Google Scholar
- {AMD13b} AMD: Sea islands series instruction set architecture. http://developer.amd.com/tools-and-sdks/heterogeneous-computing/amd-accelerated-parallel-processing-app-sdk/documentation/related-documentation/, December 2013. 5Google Scholar
- {BW09} Benthin C., Wald I.: Efficient ray traced soft shadows using multi-frusta tracing. In Proceedings of High-Performance Graphics 2009 (HPG09) (2009). 2 Google Scholar
- {BWW*12} Benthin C., Wald I., Woop S., Ernst M., Mark W. R.: Combining single and packet-ray tracing for arbitrary ray distributions on the intel mic architecture. IEEE Transactions on Visualization and Computer Graphics 18, 9 (Sept. 2012), 1438--1448. 2 Google Scholar
- {CNS*11} Crassin C., Neyret F., Sainz M., Green S., Eisemann E.: Interactive indirect illumination using voxel cone tracing. Computer Graphics Forum 30, 7 (2011), 1921--1930. 9Google Scholar
- {EHDR11} Egan K., Hecht F., Durand F., Ramamoorthi R.: Frequency analysis and sheared filtering for shadow light fields of complex occluders. ACM Trans. Graph. 30, 2 (Apr. 2011), 9:1--9:13. 11 Google Scholar
- {GPSS07} Günther J., Popov S., Seidel H.-P., Slusallek P.: Realtime ray tracing on GPU with BVH-based packet traversal. In Proceedings of the IEEE/Eurographics Symposium on Interactive Ray Tracing 2007 (sep 2007), pp. 113--118. 2 Google Scholar
- {Ize13} Ize T.: Robust BVH ray traversal. Journal of Computer Graphics Techniques (JCGT) 2, 2 (July 2013), 12--27. 3Google Scholar
- {KDK*11} Keckler S. W., Dally W. J., Khailany B., Garland M., Glasco D.: Gpus and the future of parallel computing. IEEE Micro 31, 5 (Sept. 2011), 7--17. 2 Google Scholar
- {KKW*13} Keller A., Karras T., Wald I., Aila T., Laine S., Bikker J., Gribble C., Lee W.-J., McCombe J.: Ray tracing is the future and ever will be... In ACM SIGGRAPH 2013 Courses (New York, NY, USA, 2013), SIGGRAPH '13, ACM, pp. 9:1--9:7. 2, 5 Google Scholar
- {KSS*13} Kopta D., Shkurko K., Spjut J., Brunvand E., Davis A.: An energy and bandwidth efficient ray tracing architecture. In Proceedings of the 5th High-Performance Graphics Conference (New York, NY, USA, 2013), HPG '13, ACM, pp. 121--128. 1, 2, 4, 5, 9, 10 Google Scholar
- {LALD12} Lehtinen J., Aila T., Laine S., Durand F.: Reconstructing the indirect light field for global illumination. ACM Trans. Graph. 31, 4 (2012). 11 Google Scholar
- {LKA13} Laine S., Karras T., Aila T.: Megakernels considered harmful: Wavefront path tracing on gpus. In Proceedings of the 5th High-Performance Graphics Conference (New York, NY, USA, 2013), HPG '13, ACM, pp. 137--143. 6 Google Scholar
- {LSL*13a} Lee W.-J., Shin Y., Lee J., Kim J.-W., Nah J.-H., Jung S., Lee S., Park H.-S., Han T.-D.: Sgrt: A mobile gpu architecture for real-time ray tracing. In Proceedings of the 5th High-Performance Graphics Conference (New York, NY, USA, 2013), HPG '13, ACM, pp. 109--119. 2, 3, 5, 10 Google Scholar
- {LSL*13b} Lee W.-J., Shin Y., Lee J., Lee S., Ryu S., Kim J.: Real-time ray tracing on future mobile computing platform. In SIGGRAPH Asia 2013 Symposium on Mobile Graphics and Interactive Applications (New York, NY, USA, 2013), SA '13, ACM, pp. 56:1--56:5. 10 Google Scholar
- {Mah05} Mahovsky J. A.: Ray Tracing with Reduced-precision Bounding Volume Hierarchies. PhD thesis, Calgary, Alta., Canada, Canada, 2005. AAINR06958. 4 Google Scholar
- {Mic13} Michael Mantor: AMD's Radeon R9-290X, One Big dGPU. http://www.slideshare.net/DevCentralAMD/gs4152-michael-mantor, November 2013. 5, 6Google Scholar
- {Mij12} Mijat R.: Take gpu processing power beyond graphics with mali gpu computing. http://malideveloper.arm.com/downloads/WhitePaper_GPU_Computing_on_Mali.pdf, August 2012. 6Google Scholar
- {MWR12} Mehta S. U., Wang B., Ramamoorthi R.: Axis-aligned filtering for interactive sampled soft shadows. ACM Trans. Graph. 31, 6 (Nov. 2012), 163:1--163:10. 11 Google Scholar
- {NFLC12} NavrÃątil P. A., Fussell D. S., Lin C., Childs H.: Dynamic Scheduling for Large-Scale Distributed-Memory Ray Tracing. In Eurographics Symposium on Parallel Graphics and Visualization (2012), pp. 61--70. 11Google Scholar
- {NFLM07} Navratil P., Fussell D., Lin C., Mark W.: Dynamic ray scheduling to improve ray coherence and bandwidth utilization. In Interactive Ray Tracing, 2007. RT '07. IEEE Symposium on (Sept 2007), pp. 95--104. 1, 2 Google Scholar
- {ORM08} Overbeck R., Ramamoorthi R., Mark W.: Large ray packets for real-time whitted ray tracing. In Interactive Ray Tracing, 2008. RT 2008. IEEE Symposium on (Aug 2008), pp. 41--48. 2Google Scholar
- {PKGH97} Pharr M., Kolb C., Gershbein R., Hanrahan P.: Rendering complex scenes with memory-coherent ray tracing. In Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques (New York, NY, USA, 1997), SIGGRAPH '97, ACM Press/Addison-Wesley Publishing Co., pp. 101--108. 11 Google Scholar
- {RGD09} Ramani K., Gribble C. P., Davis A.: Streamray: A stream filtering architecture for coherent ray tracing. In Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems (New York, NY, USA, 2009), ASPLOS XIV, ACM, pp. 325--336. 2 Google Scholar
- {SE10} Segovia B., Ernst M.: Memory efficient ray tracing with hierarchical mesh quantization. In Proceedings of Graphics Interface 2010 (Toronto, Ont., Canada, Canada, 2010), GI '10, Canadian Information Processing Society, pp. 153--160. 8 Google Scholar
- {SKBD12} Spjut J., Kopta D., Brunvand E., Davis A.: A mobile accelerator architecture for ray tracing. In Workshop on SoCs, Heterogeneous Architectures and Workloads (2012). 2Google Scholar
- {SKKB09} Spjut J., Kensler A., Kopta D., Brunvand E.: Trax: A multicore hardware architecture for real-time ray tracing. Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on 28, 12 (Dec 2009), 1802--1815. 2 Google Scholar
- {SLL*13} Shin Y., Lee W.-J., Lee J., Lee S.-H., Ryu S., Kim J.: Energy efficient data transmission for ray tracing on mobile computing platform. In SIGGRAPH Asia 2013 Symposium on Mobile Graphics and Interactive Applications (New York, NY, USA, 2013), SA '13, ACM, pp. 64:1--64:5. 2 Google Scholar
- {Smi98} Smits B.: Efficiency issues for ray tracing. Journal of Graphics Tools 3, 2 (Feb. 1998), 1--14. 3 Google Scholar
- {SWS02} Schmittler J., Wald I., Slusallek P.: Saarcor: A hardware architecture for ray tracing. In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware (Aire-la-Ville, Switzerland, Switzerland, 2002), HWWS '02, Eurographics Association, pp. 27--36. 2 Google Scholar
- {WDS05} Wald I., Dietrich A., Slusallek P.: An interactive out-of-core rendering framework for visualizing massively complex models. In ACM SIGGRAPH 2005 Courses (New York, NY, USA, 2005), SIGGRAPH '05, ACM. 11 Google Scholar
- {WSS05} Woop S., Schmittler J., Slusallek P.: Rpu: A programmable ray processing unit for realtime ray tracing. ACM Trans. Graph. 24, 3 (July 2005), 434--444. 2 Google Scholar
- {WWB*14} Wald I., Woop S., Benthin C., Johnson G. S., Ernst M.: Embree--A Ray Tracing Kernel Framework for Efficient CPU Ray Tracing. In conditionally accepted at ACM Transactions on Graphics (Proceedings of ACM SIGGRAPH) (2014). (to appear, preprint available from the authors). 2, 8 Google Scholar
Index Terms
Reduced precision for hardware ray tracing in GPUs
Recommendations
Use of hardware Z-buffered rasterization to accelerate ray tracing
SAC '07: Proceedings of the 2007 ACM symposium on Applied computingRay tracing is a rendering technique for producing realistic 3D computer graphics. Compared to traditional scan-line rendering which is generally adopted by graphics pipeline, ray tracing can simulate more realistic global illumination, however, with ...
A Hardware Acceleration Method for Volumetric Ray Tracing
VIS '95: Proceedings of the 6th conference on Visualization '95In this paper we present an acceleration method for volumetric ray tracing which utilizes standard graphics hardware without compromising image accuracy. The graphics hardware is employed to identify those segments of each ray that could possibly ...
Distributed ray tracing
Ray tracing is one of the most elegant techniques in computer graphics. Many phenomena that are difficult or impossible with other techniques are simple with ray tracing, including shadows, reflections, and refracted light. Ray directions, however, ...




Comments