skip to main content
research-article
Public Access

DyPO: Dynamic Pareto-Optimal Configuration Selection for Heterogeneous MpSoCs

Published:27 September 2017Publication History
Skip Abstract Section

Abstract

Modern multiprocessor systems-on-chip (MpSoCs) offer tremendous power and performance optimization opportunities by tuning thousands of potential voltage, frequency and core configurations. As the workload phases change at runtime, different configurations may become optimal with respect to power, performance or other metrics. Identifying the optimal configuration at runtime is infeasible due to the large number of workloads and configurations. This paper proposes a novel methodology that can find the Pareto-optimal configurations at runtime as a function of the workload. To achieve this, we perform an extensive offline characterization to find classifiers that map performance counters to optimal configurations. Then, we use these classifiers and performance counters at runtime to choose Pareto-optimal configurations. We evaluate the proposed methodology by maximizing the performance per watt for 18 single- and multi-threaded applications. Our experiments demonstrate an average increase of 93%, 81% and 6% in performance per watt compared to the interactive, ondemand and powersave governors, respectively.

References

  1. A. Aalsaud et al. 2016. Power--Aware Performance Adaptation of Concurrent Applications in Heterogeneous Many-Core Systems. In Proc. of the Intl. Symp. on Low Power Elec. and Design. 368--373. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L. Benini, A. Bogliolo, and G. De Micheli. 2000. A Survey of Design Techniques For System-Level Dynamic Power Management. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 8, 3 (2000), 299--316. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. C. Bienia, S. Kumar, J. P. Singh, and K. Li. 2008. The PARSEC Benchmark Suite: Characterization and Architectural Implications. In Proc. of the Intl. Conf. on Parallel Arch. and Compilation Tech. 72--81. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. Bogdan, R. Marculescu, S. Jain, and R. T. Gavila. 2012. An Optimal Control Approach to Power Management for Multi-Voltage and Frequency Islands Multiprocessor Platforms under Highly Variable Workloads. In Proc. of the Intl. Symp. on Networks on Chip. 35--42. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. X. Chen et al. 2013. Dynamic Voltage and Frequency Scaling for Shared Resources in Multicore Processor Designs. In Proc. of the Design Autom. Conf. 114. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R. Cochran, C. Hankendi, A. K. Coskun, and S. Reda. 2011. Pack 8 Cap: Adaptive DVFS and Thread Packing Under Power Caps. In Proc. of the Intl. Symp. on Microarch. 175--185. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Cortex. 2013. A15 MPCore Processor Technical Reference Manual. ARM Holdings PLC 24 (2013).Google ScholarGoogle Scholar
  8. A. K. Coskun, T. S. Rosing, and K. Whisnant. 2007. Temperature Aware Task Scheduling in MPSoCs. In Proc. of the Conf. on Design, Autom. and Test in Europe. 1659--1664. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. C. de Melo. 2010. The New Linux Perf Tools. In Linux Kongress, Vol. 18.Google ScholarGoogle Scholar
  10. E. Del Sozzo et al. 2016. Workload-aware Power Optimization Strategy for Asymmetric Multiprocessors. In Proc. of the Design, Auto. 8 Test in Europe Conf. 8 Exhib. 531--534. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. G. Dhiman and T. S. Rosing. 2009. System-Level Power Management Using Online Learning. IEEE Trans. Comput.-Aided Design Integr. Circuits and Syst. 28, 5 (2009), 676--689. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. B. Donyanavard, T. Mück, S. Sarma, and N. Dutt. 2016. SPARTA: Runtime Task Allocation for Energy Efficient Heterogeneous Many-cores. In Proc. of the Intl. Conf. on Hardware/Software Codesign and Sys. Syn. 27. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Friedman, T. Hastie, and R. Tibshirani. 2001. The Elements of Statistical Learning. Vol. 1. Springer Series in Statistics, Berlin.Google ScholarGoogle Scholar
  14. U. Gupta et al. 2017. Dynamic Power Budgeting for Mobile Systems Running Graphics Workloads. IEEE Trans. on Multi-Scale Comp. Sys.Google ScholarGoogle Scholar
  15. M. R. Guthaus et al. 2001. Mibench: A Free, Commercially Representative Embedded Benchmark Suite. In Proc. of the Intl. Work. on Workload Char. 3--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Henkel et al. 2015. Dark Silicon: From Computation to Communication. In Proc. of the Intl. Symp. on Networks-on-Chip. 23. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Herbert and D. Marculescu. 2007. Analysis of Dynamic Voltage/Frequency Scaling in Chip-Multiprocessors. In Proc. of the Intl. Symp. on Low Power Elec. and Design. 38--43. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. C. Isci, G. Contreras, and M. Martonosi. 2006. Live, Runtime Phase Monitoring and Prediction on Real Systems With Application to Dynamic Power Management. In Proc. of the Intl. Symp. on Microarch. 359--370. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. G. James, D. Witten, T. Hastie, and R. Tibshirani. 2013. An Introduction to Statistical Learning. Vol. 6. Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. R. G. Kim et al. 2016. Wireless NoC and Dynamic VFI Codesign: Energy Efficiency Without Performance Penalty. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 24, 7 (2016), 2488--2501.Google ScholarGoogle ScholarCross RefCross Ref
  21. C. Lattner. 2008. LLVM and Clang: Next Generation Compiler Technology. In Proc. of the BSD. 1--2.Google ScholarGoogle Scholar
  22. C. Lattner and V. Adve. 2004. LLVM: A Compilation Framework for Lifelong Program Analysis 8 Transformation. In Proc. of the Intl. Symp. on Code Gen. and Opt.: Feedback-directed and Runtime Opt. 75. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Li and J. F. Martinez. 2006. Dynamic Power-Performance Adaptation of Parallel Computation on Chip Multiprocessors. In Proc. of the Intl. Symp. on High-Perf. Comp. Arch. 77--87.Google ScholarGoogle Scholar
  24. P. Mochel. 2005. The Sysfs Filesystem. In Proc. of the Linux Symp.Google ScholarGoogle Scholar
  25. P. J. Mucci, S. Browne, C. Deane, and G. Ho. 1999. PAPI: A Portable Interface to Hardware Performance Counters. In Proc. of the Department of Defense HPCMP Users Group Conf.Google ScholarGoogle Scholar
  26. T. S. Muthukaruppan, M. Pricopi, V. Venkataramani, T. Mitra, and S. Vishin. 2013. Hierarchical Power Management for Asymmetric Multi-Core in Dark Silicon Era. In Proc. of the Design Autom. Conf. 1--9. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. ODROID. Platforms, ODROID - XU3. http://www.hardkernel.com/main/products/prdt_info.php?g_code=G143452239825, accessed 6 April 2017.Google ScholarGoogle Scholar
  28. U. Y. Ogras and R. Marculescu. 2013. Modeling, Analysis and Optimization of Network-on-Chip Communication Architectures. Vol. 184. Springer Science 8 Business Media. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. G. Palermo, C. Silvano, and V. Zaccaria. 2005. Multi-objective Design Space Exploration of Embedded Systems. Jrnl of Embd. Comp. 1.3 (2005), 305--316. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Palesi and T. Givargis. 2002. Multi-objective Design Space Exploration Using Genetic Algorithms. In Proc. of the Intl. Symp. on Hardware/Software Codesign. 67--72. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. V. Pallipadi, S. Li, and A. Belay. 2007. Cpuidle: Do Nothing, Efficiently. In Proc. of the Linux Symp., Vol. 2. 119--125.Google ScholarGoogle Scholar
  32. V. Pallipadi and A. Starikovskiy. 2006. The Ondemand Governor. In Proc. of the Linux Symp., Vol. 2.Google ScholarGoogle Scholar
  33. T. Sherwood, E. Perelman, G. Hamerly, S. Sair, and B. Calder. 2003. Discovering and Exploiting Program Phases. IEEE micro 23, 6 (2003), 84--93. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. G. Singla, G. Kaur, A. K. Unver, and U. Y. Ogras. 2015. Predictive Dynamic Thermal and Power Management for Heterogeneous Mobile Platforms. In Proc. of the Conf. on Design, Automation 8 Test in Europe. 960--965. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. S. Thomas et al. 2014. CortexSuite: A Synthetic Brain Benchmark Suite. In Proc. of the Intl. Symp. on Workload Char. 76--79.Google ScholarGoogle ScholarCross RefCross Ref
  36. TI-INA231. http://www.ti.com/lit/ds/symlink/ina231.pdf, accessed April 06, 2017.Google ScholarGoogle Scholar
  37. N. Vallina-Rodriguez and J. Crowcroft. 2012. Energy Management Techniques in Modern Mobile Handsets. IEEE Comm. Surveys 8 Tutorials 99 (2012), 1--20.Google ScholarGoogle Scholar
  38. W. Wang, P. Mishra, and S. Ranka. 2012. Dynamic Reconfiguration in Real-Time Systems. Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. X. Wang et al. 2016. A Pareto-Optimal Runtime Power Budgeting Scheme for Many-Core Systems. Microprocessors and Microsystems 46 (2016), 136--148.Google ScholarGoogle ScholarCross RefCross Ref
  40. XDA-Developers Forums. https://forum.xda-developers.com/general/general/ref-to-date-guide-cpu-governors-o-t3 048957, accessed April 06, 2017.Google ScholarGoogle Scholar
  41. X. Zheng, L. K. John, and A. Gerstlauer. 2016. Accurate Phase-level Cross-platform Power and Performance Estimation. In Proc. of Design Autom. Conf. 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Y. Zhu and V. J. Reddi. 2013. High-Performance and Energy-Efficient Mobile Web Browsing on Big/Little Systems. In Intl. Symp. on High Perf. Comput. Arch. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. DyPO: Dynamic Pareto-Optimal Configuration Selection for Heterogeneous MpSoCs

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader
        About Cookies On This Site

        We use cookies to ensure that we give you the best experience on our website.

        Learn more

        Got it!