skip to main content
research-article

CORDIC-Based Enhanced Systolic Array Architecture for QR Decomposition

Authors Info & Claims
Published:14 December 2015Publication History
Skip Abstract Section

Abstract

Multiple input multiple output (MIMO) with orthogonal frequency division multiplexing (OFDM) systems typically use orthogonal-triangular (QR) decomposition. In this article, we present an enhanced systolic array architecture to realize QR decomposition based on the Givens rotation (GR) method for a 4 × 4 real matrix. The coordinate rotation digital computer (CORDIC) algorithm is adopted and modified to speed up and simplify the process of GR. To verify the function and evaluate the performance, the proposed architectures are validated on a Virtex 5 FPGA development platform. Compared to a commercial implementation of vectoring CORDIC, the enhanced vectoring CORDIC is presented that uses 37.7% less hardware resources, dissipates 71.6% less power, and provides a 1.8 times speedup while maintaining the same computation accuracy. The enhanced QR systolic array architecture based on the enhanced vectoring CORDIC saves 24.5% in power dissipation, provides a factor of 1.5-fold improvement in throughput, and the hardware efficiency is improved 1.45-fold with no accuracy penalty when compared to our previously proposed QR systolic array architecture.

References

  1. Ray Andraka. 1998. A survey of CORDIC algorithms for FPGAs based computers. In Proceedings of the 1998 ACM/SIGDA 6th International Symposium on Field Programmable Gate Arrays (FPGA’98). ACM, New York, NY, 191--200. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Aslan, S. Niu, and J. Saniie. 2012. FPGA implementation of fast QR decomposition based on Givens rotation. In Proceedings of the 55th International Midwest Symposium on Circuits and Systems (MWSCAS’12). IEEE, Los Alamitos, CA, 470--473.Google ScholarGoogle Scholar
  3. Dongdong Chen and Mihai Sima. 2011. Fixed-point CORDIC-based QR decomposition by Givens rotations on FPGA. In Proceedings of the International Conference on Reconfigurable Computing and FPGAs (ReConFig’11). IEEE, Los Alamitos, CA, 327--332. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Kyongkuk Cho and Dongweon Yoon. 2002. On the general BER expression of one- and two-dimensional amplitude modulations. IEEE Transactions on Communications 50, 7, 1074--1080.Google ScholarGoogle ScholarCross RefCross Ref
  5. Florent De Dinechin, Matei Istoan, and Guillaume Sergent. 2013. Fixed-point trigonometric functions on FPGAs. In Proceedings of the 4th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies. 1--6.Google ScholarGoogle Scholar
  6. H. Ekstrom, A. Furuskar, J. Karlsson, M. Meyer, S. Parkvall, J. Torsner, and M. Wahiqvist. 2006. Technical solutions for the 3G long-term evolution. IEEE Communications Magazine 44, 3, 38--45. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Lajos L. Hanzo and Thomas Keller. 2006. OFDM and MC-CDMA: A Primer. Wiley-IEEE Press, Chichester West Sussex, England.Google ScholarGoogle Scholar
  8. IEEE. 1999. Wireless LAN Medium Access Control (MAC) and Physical Layer (PHY) Specifications. IEEE, New York, NY.Google ScholarGoogle Scholar
  9. Anita Jain and Kavita Khare. 2013. Hardware efficient scaling free vectoring and rotational CORDIC for DSP applications. International Journal of VLSI Design and Communication Systems 4, 3, 15--22.Google ScholarGoogle ScholarCross RefCross Ref
  10. Marjan Karkooti, Joseph R. Cavallaro, and Chris Dick. 2005. FPGA implementation of matrix inversion using QRD-RLS algorithm. In Proceedings of the 39th Asilomar Conference on Signals, Systems, and Computers.Google ScholarGoogle ScholarCross RefCross Ref
  11. Andrew Kerr, Dan Campbell, and Mark Richards. 2009. QR decomposition on GPUs. In Proceedings of the 2nd Workshop on General Purpose Processing on Graphics Processing Units (GPGPU-2). ACM, New York, NY, 71--78. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Iput Heri Kurniawan, Ji-Hwan Yoon, and Jongsun Park. 2013. Multidimentional Householder based high-speed QR decomposition architecture for MIMO receviers. In Proceedings of the 2013 International Symposium on Circuits and Systems (ISCAS’13). IEEE, Los Alamitos, CA, 2159--2162.Google ScholarGoogle ScholarCross RefCross Ref
  13. Qinghua Li, Guangjie Li, Wookbong Lee, and Moon Il Lee. 2010. MIMO techniques in WiMAX and LTE: A future overview. IEEE Communications Magazine 48, 5, 86--92. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. P. Luethi, C. Studer, S. Duetsch, E. Zgraggen, H. Kaeslin, N. Felber, and W. Fichtner. 2008. Gram-Schmidt-based QR decomposition for MIMO detection: VLSI implementation and comparison. In Proceedings of the Asia Pacific Conference on Circuits and Systems (APCCAS’08). IEEE, Los Alamitos, CA, 830--833.Google ScholarGoogle Scholar
  15. K. Maharatna, S. Banerjee, E. Grass, M. Krstic, and A. Troya. 2005. Modified virtually scaling-free adaptive CORDIC rotator algorithm and architecture. IEEE Transactions on Circuits and Systems 15, 11, 1463--1474. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Michael P. McGraw-Herdeg, Douglas P. Enright, and B. Scott Michel. 2007. Benchmarking the NVIDIA 8800GTX with the CUDA development platform. In Proceedings of High Performance Embedded Computing (HPEC’07).Google ScholarGoogle Scholar
  17. P. K. Meher, J. Valls, T.-B. Juang, K. Sridharan, and K. Maharatna. 2009. 50 years of CORDIC: Algorithms, architectures, and applications. IEEE Transactions on Circuits and Systems I: Regular Papers 56, 9, 1893--1907. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Yuanxi Peng, Manuel Saldana, Chris Madill, Xiaofeng Zou, and Paul Chow. 2014. Benefits of adding hardware support for broadcast and reduce operations in MPSoC applications. ACM Transactions on Reconfigurable Technology and Systems 7, 3, 17:1--17:23. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. E. Volder. 1959. The CORDIC trigonometric computing technique. IRE Transactions on Electronic Computers 8, 3, 330--334.Google ScholarGoogle ScholarCross RefCross Ref
  20. C. R. Wana and D. J. Evans. 1994. A systolic array architecture for QR decomposition of block structured sparse systems. Parallel Computing 20, 6, 903--914. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Shaoyun Wang and Earl E. Swartzlander. 1996. The critically damped CORDIC algorithm for QR decomposition. In Proceedings of the 30th Asilomar Conference on Signals, Systems, and Computers. IEEE, Los Alamitos, CA, 908--911.Google ScholarGoogle Scholar
  22. Xilinx Inc. COREGen. 2015. Xilinx CORE Generator System. Retrieved November 23, 2015, from http://www.xilinx.com/tools/coregen.htm.Google ScholarGoogle Scholar
  23. Xilinx Inc. Xpower. 2015. XPower. Retrieved November 23, 2015, from http://www.xilinx.com/products/des ign_tools/logic_design/verification/xpo wer.htm.Google ScholarGoogle Scholar
  24. Jianfeng Zhang, Paul Chow, and Hengzhu Liu. 2014. An efficient FPGA implementation of QR decomposition using a novel systolic array architecture based on enhanced vectoring CORDIC. In Proceedings of the International Conference on Field Programmable Technology (FPT’14). IEEE, Los Alamitos, CA, 123--130.Google ScholarGoogle ScholarCross RefCross Ref
  25. J. Zhang, H. Liu, W. Hu, D. Liu, and B. Zhang. 2012. Adaptive recoding CORDIC. IEICE Electronics Express 9, 8, 765--771.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. CORDIC-Based Enhanced Systolic Array Architecture for QR Decomposition

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader
          About Cookies On This Site

          We use cookies to ensure that we give you the best experience on our website.

          Learn more

          Got it!