Abstract
This paper evaluates the potential benefits of leveraging the OpenCL Pipe semantic to accelerate FPGA-based applications. Our work focuses on streaming applications in the embedded vision processing domain. These applications are well-suited for concurrent kernel execution support and inter-kernel communication enabled by using OpenCL pipes. We analyze the impact of multiple design factors and application optimizations to improve the performance offered by OpenCL Pipes. The design tradeoffs considered include: the execution granularity across kernels, the rate and volume of data transfers, and the Pipe size. For our case study application of vision ow, we observe a 2.8X increase in throughput for tuned pipelined kernels, as compared to non-pipelined execution. In addition, we propose a novel mechanism to efficiently capture the behavior for 2-dimensional (2D) vision algorithms to benefit Pipe-based execution.
- Altera sdk for opencl. http://www.altera.com/literature/lit-opencl-sdk.jsp.Google Scholar
- Altera. Altera sdk for opencl: Best practice guide. Technical report, 2014.Google Scholar
- J. Andrade, G. Falco, V. Silva, and K. Kasai. Flexible non-binary ldpc decoding on fpgas. In IEEE International Conf. on Acoustics, Speech, and Signal Processing - ICASSP, volume 1, pages 1--5, 2014.Google Scholar
Cross Ref
- D. Chen and D. Singh. Fractal video compression in opencl: An evaluation of cpus, gpus, and fpgas as acceleration platforms. 2013.Google Scholar
- B. Gaster, L. Howes, D. R. Kaeli, P. Mistry, and D. Schaa. Heterogeneous Computing with OpenCL: Revised OpenCL 1.2 Edition. Morgan Kaufmann Publishers Inc., 2 edition, 2013. Google Scholar
Digital Library
- Q. Gautier, A. Shearer, J. Matai, D. Richmond, P. Meng, and R. Kastner. Real-time 3d reconstruction for fpgas: A case study for evaluating the performance, area, and programmability trade-offs of the altera opencl. In International Conference on Field-Programmable Technology (FPT), 2014.Google Scholar
Cross Ref
- J.-M. Geusebroek, A. Smeulders, and J. van de Weijer. Fast anisotropic gauss filtering. Image Processing, IEEE Transactions on, 12(8):938--943, Aug 2003. Google Scholar
Digital Library
- M. S. Orr, B. M. Beckmann, S. K. Reinhardt, and D. A. Wood. Fine-grain task aggregation and coordination on gpus. In Proceeding of the 41st Annual International Symposium on Computer Architecuture, pages 181--192, 2014. Google Scholar
Digital Library
- K. Ratnayake and A. Amer. Embedded architecture for noise-adaptive video object detection using parameter-compressed background modeling. Journal of Real-Time Image Processing, pages 1--18, 2014.Google Scholar
- S. O. Settle. High-performance dynamic programming on fpgas with opencl. 2013.Google Scholar
- C. Stauffer and W. E. L. Grimson. Adaptive background mixture models for real-time tracking. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, volume 2, pages 246--252, 1999.Google Scholar
Cross Ref
- Y. Ukidave, C. Kalra, D. Kaeli, P. Mistry, and D. Schaa. Runtime support for adaptive spatial partitioning and inter-kernel communication on gpus. In Computer Architecture and High Performance Computing (SBAC-PAD), pages 168--175, 2014. Google Scholar
Digital Library
- M. Wjcikowski, R. aglewski, and B. Pankiewicz. Fpga-based real-time implementation of detection algorithm for automatic traffic surveillance sensor network. Journal of Signal Processing Systems, 68:1--18, 2012. Google Scholar
Digital Library
Recommendations
Nuclear Reactor Simulations on OpenCL FPGA Platform
FPGA '19: Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysField-programmable gate arrays (FPGAs) are becoming a promising choice as a heterogeneous computing component for scientific computing when floating-point optimized architectures are added to the current FPGAs. The maturing high-level synthesis (HLS) ...
Nuclear Reactor Simulation on OpenCL FPGA: a Case Study of RSBench
IWOCL '18: Proceedings of the International Workshop on OpenCLField-programmable gate arrays (FPGAs) are becoming a promising choice as a heterogeneous computing component for scientific computing when floating-point optimized architectures are added to the current FPGAs. The emerging high-level synthesis tools ...
Base64 Encoding on OpenCL FPGA Platform
FPGA '19: Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysBase64 encoding has many applications on the Web. Previous studies are focused on improving the efficiency of Base64 encoding on central processing units (CPUs). As field-programmable gate arrays (FPGAs) are becoming promising heterogeneous computing ...






Comments