Abstract
Algorithms used in microwave imaging for breast cancer detection require hardware acceleration to speed up execution time and reduce power consumption. In this article, we present the hardware implementation of two accelerators for two alternative imaging algorithms that we obtain entirely from SystemC specifications via high-level synthesis. The two algorithms present opposite characteristics that stress the design process and the capabilities of commercial HLS tools in different ways: the first is communication bound and requires overlapping and pipelining of communication and computation in order to maximize the application throughput; the second is computation bound and uses complex mathematical functions that HLS tools do not directly support. Despite these difficulties, thanks to HLS, in the span of only 4 months we were able to explore a large design space and derive about 100 implementations with different cost-performance profiles, targeting both a Field-Programmable Gate Array (FPGA) platform and a 32-nm standard-cell Application Specific Integrated Circuit (ASIC) library. In addition, we could obtain results that outperform a previous Register-Transfer Level (RTL) implementation, which confirms the remarkable progress of HLS tools.
- J. Amaro, B. Y. S. Yiu, G. Falcao, M. A. C. Gomes, and A. C. H. Yu. 2015. Software-based high-level synthesis design of FPGA beamformers for synthetic aperture imaging. IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control 62, 5 (May 2015), 862--870. Google Scholar
Cross Ref
- V. Bhatnagar, G. S. Ouedraogo, M. Gautier, A. Carer, and O. Sentieys. 2013. An FPGA software defined radio platform with a high-level synthesis design flow. In Proceedings of the 2013 IEEE 77th Vehicular Technology Conference (VTC Spring). 1--5. Google Scholar
Cross Ref
- E. J. Bond, Xu Li, S. C. Hagness, and B. D. Van Veen. 2003. Microwave imaging via space-time beamforming for early detection of breast cancer. IEEE Transactions on Antennas and Propagation 51, 8 (Aug. 2003), 1690--1705. Google Scholar
Cross Ref
- A. Bui, Kwang-Ting Cheng, J. Cong, L. Vese, Yi-Chu Wang, Bo Yuan, and Yi Zou. 2012. Platform characterization for domain-specific computing. In Proceedings of the 17th ASP-DAC. 94--99.Google Scholar
Cross Ref
- M. J. Burfeindt, T. J. Colgan, R. O. Mays, J. D. Shea, N. Behdad, B. D. Van Veen, and S. C. Hagness. 2012. MRI-derived 3-d-printed breast phantom for microwave breast imaging validation. IEEE Antennas and Wireless Propagation Letters 11 (2012), 1610--1613. Google Scholar
Cross Ref
- S. A. Butt, S. Mancini, F. Rousseau, and L. Lavagno. 2014. Design of a pseudo-log image transform hardware accelerator in a high-level synthesis-based memory management framework. Journal of Electronic Imaging 23, 5 (2014), 053012.Google Scholar
Cross Ref
- S. A. Butt, M. Roozmeh, and L. Lavagno. 2016. Designing parameterizable hardware IPs in a model-based design environment for high-level synthesis. ACM Transactions on Embedded Computing Systems 15, 2, Article 32 (Feb. 2016), 28 pages. Google Scholar
Digital Library
- M. R. Casu, F. Colonna, M. Crepaldi, D. Demarchi, M. Graziano, and M. Zamboni. 2014. UWB microwave imaging for breast cancer detection: Many-core, GPU, or FPGA? ACM Transactions on Embedded Computing Systems 13, 3s, Article 109 (March 2014), 22 pages. Google Scholar
Digital Library
- F. Colonna, M. Graziano, M. R. Casu, X. Guo, and M. Zamboni. 2013. Hardware acceleration of beamforming in a UWB imaging unit for breast cancer detection. VLSI Design 2013 (2013), 11. Google Scholar
Digital Library
- J. Cong, V. Sarkar, G. Reinman, and A. Bui. 2011. Customizable domain-specific computing. IEEE Design 8 Test of Computers 28, 2 (2011), 6--15. Google Scholar
Digital Library
- J. Cong, M. A. Ghodrat, M. Gill, B. Grigorian, and G. Reinman. 2012. Architecture support for accelerator-rich CMPs. In Proceedings of the 49th Design Automation Conference (DAC’12). ACM, New York, 843--849. Google Scholar
Digital Library
- E. G. Cota, P. Mantovani, G. Di Guglielmo, and L. P. Carloni. 2015. An analysis of accelerator coupling in heterogeneous architectures. In Proceedings of the 52nd Annual Design Automation Conference (DAC’15). ACM, New York, Article 202, 6 pages. Google Scholar
Digital Library
- P. Coussy and A. Morawiec. 2008. High-Level Synthesis: From Algorithm to Digital Circuit. Springer. Google Scholar
Digital Library
- M. A. Elahi, A. Shahzad, M. Glavin, E. Jones, and M. O’Halloran. 2015. GPU accelerated confocal microwave imaging algorithms for breast cancer detection. In Proceedings of EuCAP 2015. 1--2.Google Scholar
- F. Hannig, M. Schmid, J. Teich, and H. Hornegger. 2010. A deeply pipelined and parallel architecture for denoising medical images. In Proceedings of the 2010 International Conference on Field-Programmable Technology (FPT). 485--490. Google Scholar
Cross Ref
- IEEE. 2011. IEEE Standard for SystemC Language Reference Manual. IEEE Computer Society.Google Scholar
- G. Inggs, S. Fleming, D. B. Thomas, and W. Luk. 2015. FPGA Based Accelerators for Financial Applications. Springer International Publishing, Cham, 97--115. Google Scholar
Digital Library
- D. J. Pagliari, M. R. Casu, and L. P. Carloni. 2015. Acceleration of microwave imaging algorithms for breast cancer detection via high-level synthesis. In Proceedings of the 2015 33rd ICCD. 475--478. Google Scholar
Digital Library
- I. Koren. 2002. Computer Arithmetic Algorithms. AK Peters. Google Scholar
Digital Library
- S. Li, J. H. Ahn, R. D. Strong, J. B. Brockman, D. M. Tullsen, and N. P. Jouppi. 2013. The McPAT framework for multicore and manycore architectures: Simultaneously modeling power, area, and timing. ACM Transactions on Architecture and Code Optimization 10, 1 (April 2013), 5:1--5:29. Google Scholar
Digital Library
- X. Li, E. J. Bond, B. D. Van Veen, and S. C. Hagnes. 2005. An overview of ultra-wideband microwave imaging via space-time beamforming for early-stage breast-cancer detection. IEEE Antennas and Propagation Magazine 47, 1 (Feb. 2005), 19--34. Google Scholar
Cross Ref
- H.-Y. Liu, M. Petracca, and L. P. Carloni. 2012. Compositional system-level design exploration with planning of high-level synthesis. In Proceedings of DATE. 641--646. Google Scholar
Digital Library
- X. Liu, Y. Chen, T. Nguyen, S. Gurumani, K. Rupnow, and D. Chen. 2016. High level synthesis of complex applications: An H.264 video decoder. In Proceedings of the ACM/SIGDA FPGA (FPGA’16). 224--233. Google Scholar
Digital Library
- G. A. Malazgirt, N. Sonmez, A. Yurdakul, O. Unsal, and A. Cristal. 2015. Accelerating complete decision support queries through high-level synthesis technology (abstract only). In Proceedings of the ACM/SIGDA FPGA (FPGA’15). ACM, New York, 277--277. Google Scholar
Digital Library
- G. Martin and G. Smith. 2009. High-level synthesis: Past, present, and future. IEEE Design 8 Test of Computers 26, 4 (2009), 18--25. Google Scholar
Digital Library
- D. Navarro, Ó Lucía, L. A. Barragán, I. Urriza, and Ó Jiménez. 2013. High-level synthesis for accelerating the FPGA implementation of computationally demanding control algorithms for power converters. IEEE Transactions on Industrial Informatics 9, 3 (Aug. 2013), 1371--1379. Google Scholar
Cross Ref
- N. K. Nikolova. 2011. Microwave imaging for breast cancer. IEEE Microwave Magazine 12, 7 (Dec. 2011), 78--94. Google Scholar
Cross Ref
- J. Oppermann, A. Koch, Ting Yu, and O. Sinnen. 2015. Domain-specific optimisation for the high-level synthesis of CellML-based simulation accelerators. In Proceedings of the 2015 25th International Conference on Field Programmable Logic and Applications (FPL’15). 1--7. Google Scholar
Cross Ref
- B. Parhami. 2010. Computer Arithmetic: Algorithms and Hardware Designs (2nd ed.). Oxford University Press. Google Scholar
Digital Library
- A. Prost-Boucle, O. Muller, and F. Rousseau. 2013. A fast and autonomous HLS methodology for hardware accelerator generation under resource constraints. In Proceedings of the Euromicro Conference on Digital System Design (DSD’13). 201--208. Google Scholar
Digital Library
- G. Ruvio, R. Solimene, A. Cuccaro, and M. Ammann. 2013. Comparison of noncoherent linear breast cancer detection algorithms applied to a 2-d numerical model. IEEE Antennas and Wireless Propagation Letters 12 (2013), 853--856. Google Scholar
Cross Ref
- R. O. Schmidt. 1986. Multiple emitter location and signal parameter estimation. IEEE Transactions on Antennas and Propagation 34, 3 (Mar. 1986), 276--280. Google Scholar
Cross Ref
- A. Shahzad, M. O’Halloran, M. Glavin, and E. Jones. 2014. A novel optimized parallelization strategy to accelerate microwave tomography for breast cancer screening. In Proceedings of the 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’14). 2456--2459. Google Scholar
Cross Ref
- R. Solimene, A. Dell’Aversano, and G. Leone. 2012. Interferometric time reversal music for small scatterer localization. Progress In Electromagnetics Research 131 (2012), 243--258. Google Scholar
Cross Ref
- Xilinx. 2013. Zynq-7000 All Programmable SoC First Generation Architecture. Xilinx. (December 2013).Google Scholar
- M. Xu, P. Thulasiraman, and S. Noghanian. 2012. Microwave tomography for breast cancer detection on cell broadband engine processors. Journal of Parallel and Distributed Computing 72, 9 (2012), 1106--1116. Google Scholar
Digital Library
Index Terms
Accelerators for Breast Cancer Detection
Recommendations
On-Chip Reconfigurable Hardware Accelerators for Popcount Computations
Popcount computations are widely used in such areas as combinatorial search, data processing, statistical analysis, and bio- and chemical informatics. In many practical problems the size of initial data is very large and increase in throughput is ...
From software to accelerators with LegUp high-level synthesis
CASES '13: Proceedings of the 2013 International Conference on Compilers, Architectures and Synthesis for Embedded SystemsEmbedded system designers can achieve energy and performance benefits by using dedicated hardware accelerators. However, implementing custom hardware accelerators for an application can be difficult and time intensive. LegUp is an open-source high-level ...
Increasing hardware efficiency with multifunction loop accelerators
CODES+ISSS '06: Proceedings of the 4th international conference on Hardware/software codesign and system synthesisTo meet the conflicting goals of high-performance low-cost embedded systems, critical application loop nests are commonly executed on specialized hardware accelerators. These loop accelerators are traditionally designed in a single-function manner, ...






Comments