Abstract
Modern multiprocessor system-on-chips (SoCs) integrate multiple heterogeneous cores to achieve high energy efficiency. The power consumption of each core contributes to an increase in the temperature across the chip floorplan. In turn, higher temperature increases the leakage power exponentially, and leads to a positive feedback with nonlinear dynamics. This paper presents a power-temperature stability and safety analysis technique for multiprocessor systems. This analysis reveals the conditions under which the power-temperature trajectory converges to a stable fixed point. We also present a simple formula to compute the stable fixed point and maximum thermally-safe power consumption at runtime. Hardware measurements on a state-of-the-art mobile processor show that our analytical formulation can predict the stable fixed point with an average error of 2.6%. Hence, our approach can be used at runtime to ensure thermally safe operation and guard against thermal threats.
- Kendall E Atkinson. 2008. An Introduction to Numerical Analysis. John Wiley 8 SonGoogle Scholar
- Francesco Beneventi, Andrea Bartolini, Andrea Tilli, and Luca Benini. 2014. An effective gray-box identification procedure for multicore thermal modeling. IEEE Trans. Comput. 63, 5 (2014), 1097--1110. Google Scholar
Digital Library
- Christian Bienia, Sanjeev Kumar, Jaswinder Pal Singh, and Kai Li. 2008. The PARSEC benchmark suite: Characterization and architectural implications. In Proc. 17th Int. Conf. on Parallel Arch. and Compilation Techniques. 72--81. Google Scholar
Digital Library
- David Brooks, Robert P. Dick, Russ Joseph, and Li Shang. 2007. Power, thermal, and reliability modeling in nanometer-scale microprocessors. IEEE Micro 27, 3 (2007), 49--62. Google Scholar
Digital Library
- Guoqing Chen, Yi Xu, Xing Hu, Xiangyang Guo, Jun Ma, Yu Hu, and Yuan Xie. 2016. TSocket: Thermal sustainable power budgeting. ACM Trans. Des. Autom. of Electron. Syst. 21, 2 (2016), 29:1--29:22. Google Scholar
Digital Library
- Ryan Cochran and Sherief Reda. 2013. Thermal prediction and adaptive control through workload phase detection. ACM Trans. Des. Autom. of Electron. Syst. 18, 1 (2013), 7:1--7:19. Google Scholar
Digital Library
- Puyan Dadvar and Kevin Skadron. 2005. Potential thermal security risks. In 21st IEEE Semicond. Thermal Meas. 8 Management Symp. 229--234.Google Scholar
- Begum Egilmez, Gokhan Memik, Seda Ogrenci-Memik, and Oguz Ergin. 2015. User-specific skin temperature-aware DVFS for smartphones. In Proc. 2015 Design, Autom. 8 Test in Europe Conf. 8 Exhibition. 1217--1220. Google Scholar
Digital Library
- Ujjwal Gupta, Raid Ayoub, Michael Kishinevsky, David Kadjo, Niranjan Soundararajan, Ugurkan Tursun, and Umit Ogras. 2017. Dynamic power budgeting for mobile systems running graphics workloads. IEEE Trans. Multi-Scale Comput. Syst. (2017).Google Scholar
- M. R. Guthaus, J. S. Ringenberg, D. Ernst, T. M. Austin, T. Mudge, and R. B. Brown. 2001. Mibench: A free, commercially representative embedded benchmark suite. In Proc. Int. Workshop on Workload Char. 3--14. Google Scholar
Digital Library
- Vinay Hanumaiah, Sarma Vrudhula, and Karam S. Chatha. 2011. Performance optimal online DVFS and task migration techniques for thermally constrained multi-core processors. IEEE Trans. Comput.-Aided Design Integr. Circuits Syst. 30, 11 (2011), 1677--1690. Google Scholar
Digital Library
- Hardkernel. 2014. ODROID-XU3. http://www.hardkernel.com/main/products/prdt_info.php?g_code=g140448267127 Accessed 07/14/2017. (2014).Google Scholar
- Jahangir Hasan, Ankit Jalote, T. N. Vijaykumar, and Carla E. Brodley. 2005. Heat stroke: Power-density-based denial of service in SMT. In 11th Int. Symp. on High-Performance Computer Arch. 166--177. Google Scholar
Digital Library
- John L. Henning. 2006. SPEC CPU2006 benchmark descriptions. ACM SIGARCH Comput. Arch. News 34, 4 (2006), 1--17. Google Scholar
Digital Library
- Seongmoo Heo, Kenneth Barr, and Krste Asanović. 2003. Reducing power density through activity migration. In Proc. 2003 Int. Symp. on Low Power Electron. and Design. 217--222. Google Scholar
Digital Library
- Pei-Yu Huang and Yu-Min Lee. 2009. Full-chip thermal analysis for the early design stage via generalized integral transforms. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 17, 5 (2009), 613--626. Google Scholar
Digital Library
- Wei Huang, Shougata Ghosh, Siva Velusamy, Karthik Sankaranarayanan, Kevin Skadron, and Mircea R. Stan. 2006. HotSpot: A compact thermal modeling methodology for early-stage VLSI design. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 14, 5 (2006), 501--513. Google Scholar
Digital Library
- Canturk Isci, Alper Buyuktosunoglu, Chen-Yong Cher, Pradip Bose, and Margaret Martonosi. 2006. An analysis of efficient multi-core global power management policies: Maximizing performance for a given power budget. In Proc. Int. Symp. on Microarchitecture. 347--358. Google Scholar
Digital Library
- Nam Sung Kim et al. 2003. Leakage current: Moore’s law meets static power. Computer 36, 12 (2003), 68--75. Google Scholar
Digital Library
- Joonho Kong, Johnsy K. John, Eui-Young Chung, Sung Woo Chung, and Jie Hu. 2010. On the thermal attack in instruction caches. IEEE Trans. Depend. and Sec. Comput. 7, 2 (2010), 217--223. Google Scholar
Digital Library
- Amit Kumar, Li Shang, Li-Shiuan Peh, and Niraj K. Jha. 2008. System-level dynamic thermal management for high-performance microprocessors. IEEE Trans. Comput.-Aided Design of Integr. Circuits Syst. 27, 1 (2008), 96--108. Google Scholar
Digital Library
- Peng Li, Lawrence T. Pileggi, Mehdi Asheghi, and Rajit Chandra. 2004. Efficient full-chip thermal modeling and analysis. In Proc. Int. Conf. on Comput.-Aided Design. 319--326. Google Scholar
Digital Library
- Weiping Liao and Lei He. 2003. Coupled power and thermal simulation with active cooling. In Int. Workshop on Power-Aware Comput. Syst. 148--163. Google Scholar
Digital Library
- Yongpan Liu, Robert P. Dick, Li Shang, and Huazhong Yang. 2007. Accurate temperature-dependent integrated circuit leakage power estimation is easy. In Proc. Conf. on Design, Autom. and Test in Europe. 1526--1531. Google Scholar
Digital Library
- Zao Liu, Tailong Xu, Sheldon X.-D. Tan, and Hai Wang. 2013. Dynamic thermal management for multi-core microprocessors considering transient thermal effects. In 18th Asia and South Pacific Design Autom. Conf. 473--478.Google Scholar
- Santiago Pagani, Heba Khdr, Jian-Jia Chen, Muhammad Shafique, Minming Li, and Jörg Henkel. 2017. Thermal safe power (TSP): Efficient power budgeting for heterogeneous manycore systems in dark silicon. IEEE Trans. Comput. 66, 1 (2017), 147--162. Google Scholar
Digital Library
- Alok Prakash, Hussam Amrouch, Muhammad Shafique, Tulika Mitra, and Jörg Henkel. 2016. Improving mobile gaming performance through cooperative CPU-GPU thermal management. In Proc. of the Design Autom. Conf. 47:1--47:6. Google Scholar
Digital Library
- Onur Sahin and A. Coskun. 2016. QScale: Thermally-efficient QoS management on heterogeneous mobile platforms. In Proc. Int. Conf. on Comput.-Aided Design. 125:1--125:8. Google Scholar
Digital Library
- Samsung Electronics Co.2016. Samsung Expands Recall to All Galaxy Note7 Devices. http://www.samsung.com/us/note7recall/. Accessed 07/14/2017. (2016).Google Scholar
- Shervin Sharifi, Dilip Krishnaswamy, and Tajana Simunic Rosing. 2013. PROMETHEUS: A proactive method for thermal management of heterogeneous MPSoCs. IEEE Trans. Comput.-Aided Design Integr. Circuits Syst. (2013), 1110--1123. Google Scholar
Digital Library
- Gaurav Singla, Gurinderjit Kaur, Ali K. Unver, and Umit Y. Ogras. 2015. Predictive dynamic thermal and power management for heterogeneous mobile platforms. In Proc. 2015 Design, Autom. 8 Test in Europe Conf. 8 Exhibition. 960--965. Google Scholar
Digital Library
- Arman Vassighi and Manoj Sachdev. 2006. Thermal runaway in integrated circuits. IEEE Trans. Device Mater. Rel. 6, 2 (2006), 300--305.Google Scholar
Cross Ref
- Qing Xie, Jaemin Kim, Yanzhi Wang, Donghwa Shin, Naehyuck Chang, and Massoud Pedram. 2013. Dynamic thermal management in mobile devices considering the thermal coupling between battery and application processor. In Proc. Int. Conf. on Comput.-Aided Design. 242--247. Google Scholar
Digital Library
- Yonghong Yang, Zhenyu Gu, Changyun Zhu, Robert P. Dick, and Li Shang. 2007. ISAC: Integrated space-and-time-adaptive chip-package thermal analysis. IEEE Trans. Comput.-Aided Design Integr. Circuits Syst. 26, 1 (2007), 86--99. Google Scholar
Digital Library
- Yong Zhan and Sachin S. Sapatnekar. 2005. A high efficiency full-chip thermal simulation algorithm. In Proc. Int. Conf. on Comput.-Aided Design. 635--638. Google Scholar
Digital Library
Index Terms
Power-Temperature Stability and Safety Analysis for Multiprocessor Systems
Recommendations
Evaluation of Low-Power Computing when Operating on Subsets of Multicore Processors
Given the accelerated growth in tablet devices, smartphones, and netbooks, designers are faced with serious challenges to meet the needs of mobility in terms of battery life and form factor. It is vital to investigate how to deliver the best mobile ...
Power-efficient time-sensitive mapping in heterogeneous systems
PACT '12: Proceedings of the 21st international conference on Parallel architectures and compilation techniquesHeterogeneous systems that contain multiple types of resources, such as CPUs and GPUs, are becoming increasingly popular thanks to the potential of achieving high performance and energy efficiency. In such systems, the problem of data mapping and ...
Economic learning for thermal-aware power budgeting in many-core architectures
CODES+ISSS '11: Proceedings of the seventh IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesisOne of the key challenges for multi-core processors in the nano-CMOS era is dealing with the increased temperatures. It is imperative that peak temperatures are reduced and that heat is spread as evenly on the chip as possible to avoid mutual heating ...






Comments