skip to main content
column

Exploring the Efficiency of the OpenCL Pipe Semantic on an FPGA

Published:22 April 2016Publication History
Skip Abstract Section

Abstract

This paper evaluates the potential benefits of leveraging the OpenCL Pipe semantic to accelerate FPGA-based applications. Our work focuses on streaming applications in the embedded vision processing domain. These applications are well-suited for concurrent kernel execution support and inter-kernel communication enabled by using OpenCL pipes. We analyze the impact of multiple design factors and application optimizations to improve the performance offered by OpenCL Pipes. The design tradeoffs considered include: the execution granularity across kernels, the rate and volume of data transfers, and the Pipe size. For our case study application of vision ow, we observe a 2.8X increase in throughput for tuned pipelined kernels, as compared to non-pipelined execution. In addition, we propose a novel mechanism to efficiently capture the behavior for 2-dimensional (2D) vision algorithms to benefit Pipe-based execution.

References

  1. Altera sdk for opencl. http://www.altera.com/literature/lit-opencl-sdk.jsp.Google ScholarGoogle Scholar
  2. Altera. Altera sdk for opencl: Best practice guide. Technical report, 2014.Google ScholarGoogle Scholar
  3. J. Andrade, G. Falco, V. Silva, and K. Kasai. Flexible non-binary ldpc decoding on fpgas. In IEEE International Conf. on Acoustics, Speech, and Signal Processing - ICASSP, volume 1, pages 1--5, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  4. D. Chen and D. Singh. Fractal video compression in opencl: An evaluation of cpus, gpus, and fpgas as acceleration platforms. 2013.Google ScholarGoogle Scholar
  5. B. Gaster, L. Howes, D. R. Kaeli, P. Mistry, and D. Schaa. Heterogeneous Computing with OpenCL: Revised OpenCL 1.2 Edition. Morgan Kaufmann Publishers Inc., 2 edition, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Q. Gautier, A. Shearer, J. Matai, D. Richmond, P. Meng, and R. Kastner. Real-time 3d reconstruction for fpgas: A case study for evaluating the performance, area, and programmability trade-offs of the altera opencl. In International Conference on Field-Programmable Technology (FPT), 2014.Google ScholarGoogle ScholarCross RefCross Ref
  7. J.-M. Geusebroek, A. Smeulders, and J. van de Weijer. Fast anisotropic gauss filtering. Image Processing, IEEE Transactions on, 12(8):938--943, Aug 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. S. Orr, B. M. Beckmann, S. K. Reinhardt, and D. A. Wood. Fine-grain task aggregation and coordination on gpus. In Proceeding of the 41st Annual International Symposium on Computer Architecuture, pages 181--192, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. K. Ratnayake and A. Amer. Embedded architecture for noise-adaptive video object detection using parameter-compressed background modeling. Journal of Real-Time Image Processing, pages 1--18, 2014.Google ScholarGoogle Scholar
  10. S. O. Settle. High-performance dynamic programming on fpgas with opencl. 2013.Google ScholarGoogle Scholar
  11. C. Stauffer and W. E. L. Grimson. Adaptive background mixture models for real-time tracking. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, volume 2, pages 246--252, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  12. Y. Ukidave, C. Kalra, D. Kaeli, P. Mistry, and D. Schaa. Runtime support for adaptive spatial partitioning and inter-kernel communication on gpus. In Computer Architecture and High Performance Computing (SBAC-PAD), pages 168--175, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. Wjcikowski, R. aglewski, and B. Pankiewicz. Fpga-based real-time implementation of detection algorithm for automatic traffic surveillance sensor network. Journal of Signal Processing Systems, 68:1--18, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

  • Published in

    cover image ACM SIGARCH Computer Architecture News
    ACM SIGARCH Computer Architecture News  Volume 43, Issue 4
    HEART '15
    September 2015
    98 pages
    ISSN:0163-5964
    DOI:10.1145/2927964
    Issue’s Table of Contents

    Copyright © 2016 Authors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 22 April 2016

    Check for updates

    Qualifiers

    • column

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader
About Cookies On This Site

We use cookies to ensure that we give you the best experience on our website.

Learn more

Got it!