skip to main content
research-article

Runtime Performance and Power Optimization of Parallel Disparity Estimation on Many-Core Platforms

Published:14 November 2017Publication History
Skip Abstract Section

Abstract

This article investigates the use of many-core systems to execute the disparity estimation algorithm, used in stereo vision applications, as these systems can provide flexibility between performance scaling and power consumption. We present a learning-based runtime management approach that achieves a required performance threshold while minimizing power consumption through dynamic control of frequency and core allocation. Experimental results are obtained from a 61-core Intel Xeon Phi platform for the aforementioned investigation. The same performance can be achieved with an average reduction in power consumption of 27.8% and increased energy efficiency by 30.04% when compared to Dynamic Voltage and Frequency Scaling control alone without runtime management.

References

  1. K. Ambrosch and W. Kubinger. 2010. Accurate hardware-based stereo vision. Comput. Vis. Image Underst. 114, 11 (Nov. 2010), 1303--1316. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. N. Baha and S. Larabi. 2012. Accurate real-time neural disparity MAP estimation with FPGA. Pattern Recogn. 45, 3 (Mar. 2012), 1195--1204. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. C. Banz, S. Hesselbarth, H. Flatt, H. Blume, and P. Pirsch. 2010. Real-time stereo vision system using semi-global matching disparity estimation: Architecture and FPGA-implementation. In Proceedings of the 2010 International Conference on Embedded Computer Systems (SAMOS’10). 93--101.Google ScholarGoogle Scholar
  4. P. Bellasi, G. Massari, and W. Fornaciari. 2015. Effective runtime resource management using linux control groups with the barbequertrm framework. ACM Trans. Embed. Comput. Syst. 14, 2, Article 39 (Mar. 2015), 17 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Bleyer and C. Rhemann. 2011. PatchMatch stereo—Stereo matching with slanted support windows. In British Machine Vision Conference 2011. 1--11. http://publik.tuwien.ac.at/files/PubDat_201949.pdfGoogle ScholarGoogle Scholar
  6. A. Burbano, S. Bouaziz, and M. Vasiliu. 2015. 3D-sensing distributed embedded system for people tracking and counting. In Proceedings of the 2015 International Conference on Computational Science and Computational Intelligence (CSCI’15). 470--475. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Cochran, C. Hankendi, A. K. Coskun, and S. Reda. 2011. Pack 8 cap: Adaptive DVFS and thread packing under power caps. In Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture. 175--185. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Cohen, P. Cohen, S. G. West, and L. S. Aiken. 2013. Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences. Taylor 8 Francis.Google ScholarGoogle Scholar
  9. M. Curtis-Maury, A. Shah, F. Blagojevic, D. S. Nikolopoulos, B. R. de Supinski, and M. Schulz. 2008. Prediction models for multi-dimensional power-performance optimization on many cores. In Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques. 250--259. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. B. Cyganek and J. P. Siebert. 2009. Introduction to 3D Computer Vision Techniques and Algorithms. Wiley-Blackwell. Google ScholarGoogle ScholarCross RefCross Ref
  11. J. Ding, J. Liu, W. Zhou, H. Yu, Y. Wang, and X. Gong. 2011. Real-time stereo vision system using adaptive weight cost aggregation approach. EURASIP J. Image Vid. Process. 2011, 1 (2011), 1--19. Google ScholarGoogle ScholarCross RefCross Ref
  12. N. R. Draper and H. Smith. 1998. Applied Regression Analysis (3rd ed.). Wiley-Blackwell. Google ScholarGoogle ScholarCross RefCross Ref
  13. M. Etinski, J. Corbalan, J. Labarta, and M. Valero. 2012. Understanding the future of energy-performance trade-off via DVFS in HPC environments. J. Parallel Distrib. Comput. 72, 4 (Apr. 2012), 579--590. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. X. Fu and X. Wang. 2011. Utilization-controlled task consolidation for power optimization in multi-core real-time systems. In Proceedings of the 2011 IEEE 17th International Conference on Embedded and Real-Time Computing Systems and Applications, Vol. 1. 73--82. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. D. Gadioli, S. Libutti, G. Massari, E. Paone, M. Scandale, P. Bellasi, G. Palermo, V. Zaccaria, G. Agosta, W. Fornaciari, and C. Silvano. 2014. OpenCL application auto-tuning and run-time resource management for multi-core platforms. In Proceedings of the 2014 IEEE International Symposium on Parallel and Distributed Processing with Applications. 127--133. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. K. Gehrig, F. Eberli, and T. Meyer. 2009. A Real-Time Low-Power Stereo Vision Engine Using Semi-Global Matching. Springer, Berlin, 134--143. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. K. He, J. Sun, and X. Tang. 2013. Guided image filtering. IEEE Trans. Pattern Anal. Mach. Intell. 35, 6 (Jun. 2013), 1397--1409. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. H. Hirschmuller. 2008. Stereo processing by semiglobal matching and mutual information. IEEE Trans. Pattern Anal. Mach. Intell. 30, 2 (Feb. 2008), 328--341. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Hosni, M. Bleyer, C. Rhemann, M. Gelautz, and C. Rother. 2011. REal-time local stereo matching using guided image filtering. In Proceedings of the 2011 IEEE International Conference on Multimedia and Expo. 1--6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Y. S. Hwang and K. S. Chung. 2013. Dynamic power management technique for multicore based embedded mobile devices. IEEE Trans. Industr. Inform. 9, 3 (Aug. 2013), 1601--1612. Google ScholarGoogle ScholarCross RefCross Ref
  21. Intel. 2015. Intel Xeon Phi Product Family. https://ark.intel.com/products/80555/Intel-Xeon-Phi-Coprocessor-7120A-16GB-1_238-GHz-61-core.Google ScholarGoogle Scholar
  22. M. Jin and T. Maruyama. 2012. A real-time stereo vision system using a tree-structured dynamic programming on FPGA. In Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays. 21--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. Jin and T. Maruyama. 2014. Fast and accurate stereo vision system on FPGA. ACM Trans. Reconfigurable Technol. Syst. 7, 1, Article 3 (Feb. 2014), 24 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. Karakaya, G. Kkyildiz, C. Toprak, and H. Ocak. 2014. Development of a human tracking indoor mobile robot platform. In Proceedings of the 16th International Conference on Mechatronics (Mechatronika’14). 683--687. Google ScholarGoogle ScholarCross RefCross Ref
  25. G. C. Sirakoulis L. Nalpantidis and A. Gasteratos. 2008. Review of stereo vision algorithms: From software to hardware. Int. J. Optomechatron. 2, 4 (Jan. 2008), 435--462. Google ScholarGoogle ScholarCross RefCross Ref
  26. B. Lewis and D. J. Berg. 1998. Multithreaded Programming with Pthreads. Prentice-Hall. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. G. Mariani, C. Ykman-Couvreur, K. Zhang, L. Zhang, and G. Lafruit. 2010. An efficient run-time management methodology for stereo matching application. In Proceedings of the 23th International Conference on Architecture of Computing Systems 2010. 1--6. http://ieeexplore.ieee.org/document/5759019/.Google ScholarGoogle Scholar
  28. X. Mei, X. Sun, M. Zhou, S. Jiao, H. Wang, and Xiaopeng Zhang. 2011. On building an accurate stereo matching system on graphics hardware. In Proceedings of the 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops’11). 467--474. Google ScholarGoogle ScholarCross RefCross Ref
  29. C. C. T. Mendes and D. F. Wolf. 2013. Real time autonomous navigation and obstacle avoidance using a semi-global stereo method. In Proceedings of the 28th Annual ACM Symposium on Applied Computing. 235--236. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. H. Oleynikova, D. Honegger, and M. Pollefeys. 2015. Reactive avoidance using embedded stereo vision for MAV flight. In Proceedings of the 2015 IEEE International Conference on Robotics and Automation (ICRA’15). 50--56. Google ScholarGoogle ScholarCross RefCross Ref
  31. E. Paone, D. Gadioli, G. Palermo, V. Zaccaria, and C. Silvano. 2014. Evaluating orthogonality between application auto-tuning and run-time resource management for adaptive opencl applications. In Proceedings of the 2014 IEEE 25th International Conference on Application-Specific Systems, Architectures and Processors. 161--168. Google ScholarGoogle ScholarCross RefCross Ref
  32. M. G. Park, J. Park, Y. Shin, E. G. Lim, and K. J. Yoon. 2015. Stereo vision with image-guided structured-light pattern matching. Electron. Lett. 51, 3 (2015), 238--239. Google ScholarGoogle ScholarCross RefCross Ref
  33. S. Perri, P. Corsonello, and G. Cocorullo. 2013. Adaptive census transform: A novel hardware-oriented stereovision algorithm. Comput. Vis. Image Underst. 117, 1 (2013), 29--41. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. A. K. Porterfield, S. L. Olivier, S. Bhalachandra, and J. F. Prins. 2013. Power measurement and concurrency throttling for energy reduction in openmp programs. In Proceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum. 884--891. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. D. Scharstein and R. Szeliski. 2002. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Comput. Vis. 47, 1–3 (2002), 7--42. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. R. A. Shafik, A. Das, S. Yang, G. Merrett, and B. M. Al-Hashimi. 2015. Adaptive energy minimization of openmp parallel applications on many-core systems. In Proceedings of the 6th Workshop on Parallel Programming and Run-Time Management Techniques for Many-Core Architectures. 19--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Y. Shan, Y. Hao, W. Wang, Y. Wang, X. Chen, H. Yang, and W. Luk. 2014. Hardware acceleration for an accurate stereo vision system using mini-census adaptive support region. ACM Trans. Embed. Comput. Syst. 13, 4s, Article 132 (2014), 24 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. S. Solak and E. D. Bolat. 2015. Distance estimation using stereo vision for indoor mobile robot applications. In Proceedings of the 2015 9th International Conference on Electrical and Electronics Engineering (ELECO’15). 685--688. Google ScholarGoogle ScholarCross RefCross Ref
  39. H. Son, K. Bae, S. Ok, Y. Lee, and B. Moon. 2012. A Rectification Hardware Architecture for an Adaptive Multiple-Baseline Stereo Vision System. Springer, Berlin, 147--155.Google ScholarGoogle Scholar
  40. J. Stowers, M. Hayes, and A. Bainbridge-Smith. 2011. Altitude control of a quadrotor helicopter using depth map from microsoft kinect sensor. In Proceedings of the 2011 IEEE International Conference on Mechatronics. 358--362. Google ScholarGoogle ScholarCross RefCross Ref
  41. T. Tahara, R. Kawahara, S. Nobuhara, and T. Matsuyama. 2015. Interference-free epipole-centered structured light pattern for mirror-based multi-view active stereo. In Proceedings of the 2015 International Conference on 3D Vision. 153--161. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. C. Ttofis, C. Kyrkou, and T. Theocharides. 2016. A low-cost real-time embedded stereo vision system for accurate disparity estimation based on guided image filtering. IEEE Trans. Comput. 65, 9 (2016), 2678--2693. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. P. Usachokcharoen, Y. Washizawa, and K. Pasupa. 2015. Sign language recognition with microsoft Kinect’s depth and colour sensors. In Proceedings of the 2015 IEEE International Conference on Signal and Image Processing Applications (ICSIPA’15). 186--190.Google ScholarGoogle Scholar
  44. W. Wang, J. Yan, N. Xu, Y. Wang, and F. H. Hsu. 2013. Real-time high-quality stereo vision system in FPGA. In Proceedings of the 2013 International Conference on Field-Programmable Technology (FPT’13). 358--361. Google ScholarGoogle ScholarCross RefCross Ref
  45. S. Yang, R. A. Shafik, G. V. Merrett, E. Stott, J. M. Levine, J. Davis, and B. M. Al-Hashimi. 2015. Adaptive energy minimization of embedded heterogeneous systems using regression-based learning. In Proceedings of the 2015 25th International Workshop on Power and Timing Modeling, Optimisation and Simulation (PATMOS’15). 103--110.Google ScholarGoogle Scholar
  46. K. Yoon and In S. Kweon. 2006. Adaptive support-weight approach for correspondence search. IEEE Trans. Pattern Anal. Mach. Intell. 28, 4 (2006), 650--656. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. K. Zhang, J. Lu, and G. Lafruit. 2009. Cross-based local stereo matching using orthogonal integral images. IEEE Trans. Circ. Syst. Video Technol. 19, 7 (2009), 1073--1079. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. L. Zhang, K. Zhang, T. S. Chang, G. Lafruit, G. K. Kuzmanov, and D. Verkest. 2011. Real-time high-definition stereo matching on FPGA. In Proceedings of the 19th ACM/SIGDA International Symposium on Field Programmable Gate Arrays. 55--64. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Runtime Performance and Power Optimization of Parallel Disparity Estimation on Many-Core Platforms

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader
            About Cookies On This Site

            We use cookies to ensure that we give you the best experience on our website.

            Learn more

            Got it!