skip to main content
10.5555/2980009.2980013guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
research-article

Reduced precision for hardware ray tracing in GPUs

Published:23 June 2014Publication History

ABSTRACT

We propose a high performance, GPU integrated, hardware ray tracing system. We present and make use of a new analysis of ray traversal in axis aligned bounding volume hierarchies. This analysis enables compact traversal hardware through the use of reduced precision arithmetic. We also propose a new cache based technique for scheduling ray traversal. With the addition of our compact fixed function traversal unit and cache mechanism, we show that current GPU architectures are well suited for hardware accelerated ray tracing, requiring only small modifications to provide high performance. By making use of existing GPU resources we are able to keep all rays and scheduling traffic on chip and out of caches. We used simulations to estimate the performance of our architecture. Our system achieves an average ray rate of 3.4 billion rays per second while path tracing our test scenes.

References

  1. {AK10} Aila T., Karras T.: Architecture considerations for tracing incoherent rays. In Proc. High-Performance Graphics 2010 (2010), pp. 113--122. 1, 2, 5, 7, 8, 10 Google ScholarGoogle Scholar
  2. {AL09} Aila T., Laine S.: Understanding the efficiency of ray traversal on gpus. In Proc. High-Performance Graphics 2009 (2009), pp. 145--149. 2 Google ScholarGoogle Scholar
  3. {AMD13a} AMD: Opencl programming guide. http://developer.amd.com/tools-and-sdks/heterogeneous-computing/amd-accelerated-parallel-processing-app-sdk/documentation/amd-app-documentation/, November 2013. 5, 6Google ScholarGoogle Scholar
  4. {AMD13b} AMD: Sea islands series instruction set architecture. http://developer.amd.com/tools-and-sdks/heterogeneous-computing/amd-accelerated-parallel-processing-app-sdk/documentation/related-documentation/, December 2013. 5Google ScholarGoogle Scholar
  5. {BW09} Benthin C., Wald I.: Efficient ray traced soft shadows using multi-frusta tracing. In Proceedings of High-Performance Graphics 2009 (HPG09) (2009). 2 Google ScholarGoogle Scholar
  6. {BWW*12} Benthin C., Wald I., Woop S., Ernst M., Mark W. R.: Combining single and packet-ray tracing for arbitrary ray distributions on the intel mic architecture. IEEE Transactions on Visualization and Computer Graphics 18, 9 (Sept. 2012), 1438--1448. 2 Google ScholarGoogle Scholar
  7. {CNS*11} Crassin C., Neyret F., Sainz M., Green S., Eisemann E.: Interactive indirect illumination using voxel cone tracing. Computer Graphics Forum 30, 7 (2011), 1921--1930. 9Google ScholarGoogle Scholar
  8. {EHDR11} Egan K., Hecht F., Durand F., Ramamoorthi R.: Frequency analysis and sheared filtering for shadow light fields of complex occluders. ACM Trans. Graph. 30, 2 (Apr. 2011), 9:1--9:13. 11 Google ScholarGoogle Scholar
  9. {GPSS07} Günther J., Popov S., Seidel H.-P., Slusallek P.: Realtime ray tracing on GPU with BVH-based packet traversal. In Proceedings of the IEEE/Eurographics Symposium on Interactive Ray Tracing 2007 (sep 2007), pp. 113--118. 2 Google ScholarGoogle Scholar
  10. {Ize13} Ize T.: Robust BVH ray traversal. Journal of Computer Graphics Techniques (JCGT) 2, 2 (July 2013), 12--27. 3Google ScholarGoogle Scholar
  11. {KDK*11} Keckler S. W., Dally W. J., Khailany B., Garland M., Glasco D.: Gpus and the future of parallel computing. IEEE Micro 31, 5 (Sept. 2011), 7--17. 2 Google ScholarGoogle Scholar
  12. {KKW*13} Keller A., Karras T., Wald I., Aila T., Laine S., Bikker J., Gribble C., Lee W.-J., McCombe J.: Ray tracing is the future and ever will be... In ACM SIGGRAPH 2013 Courses (New York, NY, USA, 2013), SIGGRAPH '13, ACM, pp. 9:1--9:7. 2, 5 Google ScholarGoogle Scholar
  13. {KSS*13} Kopta D., Shkurko K., Spjut J., Brunvand E., Davis A.: An energy and bandwidth efficient ray tracing architecture. In Proceedings of the 5th High-Performance Graphics Conference (New York, NY, USA, 2013), HPG '13, ACM, pp. 121--128. 1, 2, 4, 5, 9, 10 Google ScholarGoogle Scholar
  14. {LALD12} Lehtinen J., Aila T., Laine S., Durand F.: Reconstructing the indirect light field for global illumination. ACM Trans. Graph. 31, 4 (2012). 11 Google ScholarGoogle Scholar
  15. {LKA13} Laine S., Karras T., Aila T.: Megakernels considered harmful: Wavefront path tracing on gpus. In Proceedings of the 5th High-Performance Graphics Conference (New York, NY, USA, 2013), HPG '13, ACM, pp. 137--143. 6 Google ScholarGoogle Scholar
  16. {LSL*13a} Lee W.-J., Shin Y., Lee J., Kim J.-W., Nah J.-H., Jung S., Lee S., Park H.-S., Han T.-D.: Sgrt: A mobile gpu architecture for real-time ray tracing. In Proceedings of the 5th High-Performance Graphics Conference (New York, NY, USA, 2013), HPG '13, ACM, pp. 109--119. 2, 3, 5, 10 Google ScholarGoogle Scholar
  17. {LSL*13b} Lee W.-J., Shin Y., Lee J., Lee S., Ryu S., Kim J.: Real-time ray tracing on future mobile computing platform. In SIGGRAPH Asia 2013 Symposium on Mobile Graphics and Interactive Applications (New York, NY, USA, 2013), SA '13, ACM, pp. 56:1--56:5. 10 Google ScholarGoogle Scholar
  18. {Mah05} Mahovsky J. A.: Ray Tracing with Reduced-precision Bounding Volume Hierarchies. PhD thesis, Calgary, Alta., Canada, Canada, 2005. AAINR06958. 4 Google ScholarGoogle Scholar
  19. {Mic13} Michael Mantor: AMD's Radeon R9-290X, One Big dGPU. http://www.slideshare.net/DevCentralAMD/gs4152-michael-mantor, November 2013. 5, 6Google ScholarGoogle Scholar
  20. {Mij12} Mijat R.: Take gpu processing power beyond graphics with mali gpu computing. http://malideveloper.arm.com/downloads/WhitePaper_GPU_Computing_on_Mali.pdf, August 2012. 6Google ScholarGoogle Scholar
  21. {MWR12} Mehta S. U., Wang B., Ramamoorthi R.: Axis-aligned filtering for interactive sampled soft shadows. ACM Trans. Graph. 31, 6 (Nov. 2012), 163:1--163:10. 11 Google ScholarGoogle Scholar
  22. {NFLC12} NavrÃątil P. A., Fussell D. S., Lin C., Childs H.: Dynamic Scheduling for Large-Scale Distributed-Memory Ray Tracing. In Eurographics Symposium on Parallel Graphics and Visualization (2012), pp. 61--70. 11Google ScholarGoogle Scholar
  23. {NFLM07} Navratil P., Fussell D., Lin C., Mark W.: Dynamic ray scheduling to improve ray coherence and bandwidth utilization. In Interactive Ray Tracing, 2007. RT '07. IEEE Symposium on (Sept 2007), pp. 95--104. 1, 2 Google ScholarGoogle Scholar
  24. {ORM08} Overbeck R., Ramamoorthi R., Mark W.: Large ray packets for real-time whitted ray tracing. In Interactive Ray Tracing, 2008. RT 2008. IEEE Symposium on (Aug 2008), pp. 41--48. 2Google ScholarGoogle Scholar
  25. {PKGH97} Pharr M., Kolb C., Gershbein R., Hanrahan P.: Rendering complex scenes with memory-coherent ray tracing. In Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques (New York, NY, USA, 1997), SIGGRAPH '97, ACM Press/Addison-Wesley Publishing Co., pp. 101--108. 11 Google ScholarGoogle Scholar
  26. {RGD09} Ramani K., Gribble C. P., Davis A.: Streamray: A stream filtering architecture for coherent ray tracing. In Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems (New York, NY, USA, 2009), ASPLOS XIV, ACM, pp. 325--336. 2 Google ScholarGoogle Scholar
  27. {SE10} Segovia B., Ernst M.: Memory efficient ray tracing with hierarchical mesh quantization. In Proceedings of Graphics Interface 2010 (Toronto, Ont., Canada, Canada, 2010), GI '10, Canadian Information Processing Society, pp. 153--160. 8 Google ScholarGoogle Scholar
  28. {SKBD12} Spjut J., Kopta D., Brunvand E., Davis A.: A mobile accelerator architecture for ray tracing. In Workshop on SoCs, Heterogeneous Architectures and Workloads (2012). 2Google ScholarGoogle Scholar
  29. {SKKB09} Spjut J., Kensler A., Kopta D., Brunvand E.: Trax: A multicore hardware architecture for real-time ray tracing. Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on 28, 12 (Dec 2009), 1802--1815. 2 Google ScholarGoogle Scholar
  30. {SLL*13} Shin Y., Lee W.-J., Lee J., Lee S.-H., Ryu S., Kim J.: Energy efficient data transmission for ray tracing on mobile computing platform. In SIGGRAPH Asia 2013 Symposium on Mobile Graphics and Interactive Applications (New York, NY, USA, 2013), SA '13, ACM, pp. 64:1--64:5. 2 Google ScholarGoogle Scholar
  31. {Smi98} Smits B.: Efficiency issues for ray tracing. Journal of Graphics Tools 3, 2 (Feb. 1998), 1--14. 3 Google ScholarGoogle Scholar
  32. {SWS02} Schmittler J., Wald I., Slusallek P.: Saarcor: A hardware architecture for ray tracing. In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware (Aire-la-Ville, Switzerland, Switzerland, 2002), HWWS '02, Eurographics Association, pp. 27--36. 2 Google ScholarGoogle Scholar
  33. {WDS05} Wald I., Dietrich A., Slusallek P.: An interactive out-of-core rendering framework for visualizing massively complex models. In ACM SIGGRAPH 2005 Courses (New York, NY, USA, 2005), SIGGRAPH '05, ACM. 11 Google ScholarGoogle Scholar
  34. {WSS05} Woop S., Schmittler J., Slusallek P.: Rpu: A programmable ray processing unit for realtime ray tracing. ACM Trans. Graph. 24, 3 (July 2005), 434--444. 2 Google ScholarGoogle Scholar
  35. {WWB*14} Wald I., Woop S., Benthin C., Johnson G. S., Ernst M.: Embree--A Ray Tracing Kernel Framework for Efficient CPU Ray Tracing. In conditionally accepted at ACM Transactions on Graphics (Proceedings of ACM SIGGRAPH) (2014). (to appear, preprint available from the authors). 2, 8 Google ScholarGoogle Scholar

Index Terms

  1. Reduced precision for hardware ray tracing in GPUs

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image Guide Proceedings
        HPG '14: Proceedings of High Performance Graphics
        June 2014
        130 pages

        Publisher

        Eurographics Association

        Goslar, Germany

        Publication History

        • Published: 23 June 2014

        Qualifiers

        • research-article