Abstract
Multiprocessors have become the main architecture trend in modern systems due to the superior performance; nevertheless, the power consumption remains a critical challenge. Global power management (GPM) aims at dynamically finding the power state combination that satisfies the power budget constraint while maximizing the overall performance (or vice versa). Due to the increasing number of cores in a multiprocessor system, the scalability of GPM policies has become critical when searching satisfactory state combinations within acceptable time. This article proposes a highly scalable policy based on combinatorial optimization with theoretical proofs, whereas previous works take exhaustive search or heuristic methods. The proposed policy first applies an optimum algorithm to construct a state combination table in pseudo--polynomial time using dynamic programming. Then, the state combination is assigned to cores with minimum transition cost in linear time by mapping to the network flow problem. Simulation results show that the proposed policy achieves better system performance for any given power budget when compared to the state-of-the-art heuristic. Furthermore, the proposed policy demonstrates its prominent scalability with 125 times faster policy runtime for 512 cores.
- ACPI. 2011. ACPI—Advanced Configuration and Power Interface Specification. Available at http://www.acpi.info/Google Scholar
- AMD. 2013. AMD PowerNow! Technology. Available at http://www.amd.com/us/products/technologies/amd-powernow-technology/Pages/amd-powernow-technology.aspxGoogle Scholar
- ARM. 2005. ARM Intelligent Energy Controller Technical Overview. http://www.arm.com/Google Scholar
- ARM. 2008. Cortex-A9 MPCore Technical Reference Manual. ARM.Google Scholar
- Intel. 2013. Enhanced Intel SpeedStep Technology. Retrieved August 13, 2015, from http://www3.intel.com/cd/channel/reseller/asmo-na/eng/203838.htmGoogle Scholar
- Luca Benini, Alessandro Bogliolo, and Giovanni De Micheli. 2000. A survey of design techniques for system-level dynamic power management. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 8, 3, 299--316. Google Scholar
Digital Library
- Khurram Bhatti, Cecile Belleudy, and Michel Auguin. 2010. Power management in real time embedded systems through online and adaptive interplay of DPM and DVFS policies. In Proceedings of the IEEE/IFIP International Conference on Embedded and Ubiquitous Computing. 184--191. Google Scholar
Digital Library
- Juan M. Cebrian, Juan L. Aragon, and Stefanos Kaxiras. 2011. Power token balancing: Adapting CMPs to power constraints for parallel multithreaded workloads. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium. Google Scholar
Digital Library
- Ryan Cochran and Sherief Reda. 2012. Thermal prediction and adaptive control through workload phase detection. ACM Transactions on Design Automation of Electronic Systems 18, 1, Article No. 7. Google Scholar
Digital Library
- Jack Edmonds and Richard M. Karp. 1972. Theoretical improvements in algorithmic efficiency for network flow problems. Journal of the Association for Computing Machinery 19, 2, 248--264. Google Scholar
Digital Library
- Yang Ge, Parth Malani, and Qinru Qiu. 2010. Distributed task migration for thermal management in many-core systems. In Proceedings of the 47th ACM/IEEE Design Automation Conference. 579--584. Google Scholar
Digital Library
- Vinay Hanumaiah, Sarma Vrudhula, and Karam S. Chatha. 2011. Performance optimal online DVFS and task migration techniques for thermally constrained multi-core processors. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 30, 11, 1677--1690. Google Scholar
Digital Library
- Jim Held, Jerry Bautista, and Sean Koehi. 2006. From a Few Cores to Many: A Tera-Scale Computing Research Overview. Technical Report. Intel Corporation.Google Scholar
- John L. Henning. 2006. SPEC CPU2006 benchmark descriptions. SIGARCH Computer Architecture News 34, 4, 1--17. Google Scholar
Digital Library
- Sebastian Herbert and Diana Marculescu. 2007. Analysis of dynamic voltage/frequency scaling in chip-multiprocessors. In Proceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design. 38--43. Google Scholar
Digital Library
- Sebastian Herbert and Diana Marculescu. 2009. Variation-aware dynamic voltage/frequency scaling. In Proceedings of the International Symposium on High Performance Computer Architecture. 1--12.Google Scholar
Cross Ref
- Canturk Isci, Alper Buyuktosunoglu, Chen-Yong Cher, Pradip Bose, and Margaret Martonosi. 2006. An analysis of efficient multi-core global power management policies: Maximizing performance for a given power budget. In Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture. 347--358. Google Scholar
Digital Library
- Niraj K. Jha. 2001. Low power system scheduling and synthesis. In Proceedings of the 2001 IEEE/ACM International Conference on Computer Aided Design. 259--263. Google Scholar
Digital Library
- Philo Juang, Qiang Wu, Li-Shiuan Peh, Margaret Martonosi, and Douglas W. Clark. 2005. Coordinated, distributed, formal energy management of chip multiprocessors. In Proceedings of the 2005 International Symposium on Low Power Electronics and Design. 127--130. Google Scholar
Digital Library
- Wonyoung Kim, Meeta S. Gupta, Gu-Yeon Wei, and David Brooks. 2008. System level analysis of fast, per-core DVFS using on-chip switching regulators. In Proceedings of the IEEE 14th International Symposium on High Performance Computer Architecture. 123--134.Google Scholar
- Matthias Knoth. 2009. Power management in an embedded multiprocessor cluster. In Proceedings of the Embedded World Conference.Google Scholar
- Harold W. Kuhn. 2010. The Hungarian method for the assignment problem. In 50 Years of Integer Programming 1958--2008, J. Junger, Th. M. Liebling, D. Naddef, G. L. Nemhauser, W. R. Pullyblank, G. Reinelt, G. Rinaldi, and L. A. Wolsey (Eds.). Springer, 29--47.Google Scholar
- Jungseob Lee and Nam Sung Kim. 2009. Optimizing total power of many-core processors considering voltage scaling limit and process variations. In Proceedings of the 14th ACM/IEEE International Symposium on Low Power Electronics and Design. 201--206. Google Scholar
Digital Library
- Jian Li and Jose F. Martinez. 2005. Power-performance considerations of parallel computing on chip multiprocessors. ACM Transactions on Architecture and Code Optimization 2, 4, 397--422. Google Scholar
Digital Library
- Sheng Li, Jung Ho Ahn, Richard D. Strong, Jay B. Brockman, Dean M. Tullsen, and Norman P. Jouppi. 2009. McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures. In Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture. 469--480. Google Scholar
Digital Library
- Kai Ma, Xue Li, Ming Chen, and Xiaorui Wang. 2011a. Scalable power control for many-core architectures running multi-threaded applications. ACM SIGARCH Computer Architecture News 39, 3, 449--460. Google Scholar
Digital Library
- Kai Ma, Xiaorui Wang, and Yefu Wang. 2011b. DPPC: Dynamic power partitioning and capping in chip multiprocessors. In Proceedings of the 2011 IEEE 29th International Conference on Computer Design. 39--44. Google Scholar
Digital Library
- Martina Maggio, Henry Hoffmann, Alessandro V. Papadopoulos, Jacopo Panerati, Marco D. Santambrogio, Anant Agarwal, and Alberto Leva. 2012. Comparison of decision-making strategies for self-optimization in autonomic computing systems. ACM Transactions on Autonomous and Adaptive Systems 7, 4, 36:1--36:32. Google Scholar
Digital Library
- Grigorios Magklis, Pedro Chaparro, Jose Gonzalez, and Antonio Gonzalez. 2006. Independent front-end and back-end dynamic voltage scaling for a GALS microarchitecture. In Proceedings of the 2006 International Symposium on Low Power Electronics and Design. 49--54. Google Scholar
Digital Library
- Ke Meng, Russ Joseph, Robert P. Dick, and Li Shang. 2008. Multi-optimization power management for chip multiprocessors. In Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques. 177--186. Google Scholar
Digital Library
- Asit K. Mishra, Shekhar Srikantaiah, Mahmut Kandemir, and Chita R. Das. 2010. CPM in CMPs: Coordinated power management in chip-multiprocessors. In Proceedings of the 2010 International Conference for High Performance Computing, Networking, Storage, and Analysis. 1--12. Google Scholar
Digital Library
- Massoud Pedram. 1996. Power minimization in IC design: Principles and applications. ACM Transactions on Design Automation of Electronic Systems 1, 1, 3--56. Google Scholar
Digital Library
- Karthick Rajamani, Heather Hanson, Juan Rubio, Soraya Ghiasi, and Freeman Rawson. 2006. Application-aware power management. In Proceedings of the 2006 IEEE International Symposium on Workload Characterization. 39--48.Google Scholar
Cross Ref
- Krishna K. Rangan, Gu-Yeon Wei, and David Brooks. 2009. Thread motion: Fine-grained power management for multi-core systems. In Proceedings of the 36th Annual International Symposium on Computer Architecture. 302--313. Google Scholar
Digital Library
- John Sartori and Rakesh Kumar. 2009. Distributed peak power management for many-core architectures. In Proceedings of the Conference on Design, Automation, and Test in Europe. 1--4. Google Scholar
Digital Library
- Joseph Sharkey, Alper Buyuktosunoglu, and Pradip Bose. 2007. Evaluating design tradeoffs in on-chip power management for CMPs. In Proceedings of the 2007 International Symposium on Low Power Electronics and Design. 44--49. Google Scholar
Digital Library
- Hao Shen, Ying Tan, Jun Lu, Qing Wu, and Qinru Qiu. 2013. Achieving autonomous power management using reinforcement learning. ACM Transactions on Design Automation of Electronic Systems 18, 2, Article No. 24. Google Scholar
Digital Library
- Meeta Srivastav, Michael B. Henry, and Leyla Nazhandali. 2012. Design of energy-efficient, adaptable throughput systems at near/sub-threshold voltage. ACM Transactions on Design Automation of Electronic Systems 18, 1, Article No. 3. Google Scholar
Digital Library
- Emil Talpes and Diana Marculescu. 2005. Toward a multiple clock/voltage island design style for power-aware processors. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 13, 5, 591--603. Google Scholar
Digital Library
- Radu Teodorescu and Josep Torrellas. 2008. Variation-aware application scheduling and power management for chip multiprocessors. In Proceedings of the 35th Annual International Symposium on Computer Architecture. 363--374. Google Scholar
Digital Library
- Rafael Ubal, Julio Sahuquillo, Salvadore Petit, and Pedro López. 2007. Multi2Sim: A simulation framework to evaluate multicore-multithreaded processors. In Proceedings of the 19th International Symposium on Computer Architecture and High Performance Computing. 62--68.Google Scholar
Cross Ref
- Xiaorui Wang, Kai Ma, and Yefu Wang. 2011. Adaptive power control with online model estimation for chip multiprocessors. IEEE Transactions on Parallel and Distributed Systems 22, 10, 1681--1696. Google Scholar
Digital Library
- Jonathan A. Winter, David H. Albonesi, and Christine A. Shoemaker. 2010. Scalable thread scheduling and global power management for heterogeneous many-core architectures. In Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques. 29--40. Google Scholar
Digital Library
- Frances Yao, Alan Demers, and Scott Shenker. 1995. A scheduling model for reduced CPU energy. In Proceedings of the 36th Annual Symposium on Foundations of Computer Science. 374--382. Google Scholar
Digital Library
- Francesco Zanini, David Atienza, Colin N. Jones, Luca Benini, and Giovanni De Micheli. 2012. Online thermal control methods for multiprocessor systems. ACM Transactions on Design Automation of Electronic Systems 18, 1, Article No. 6. Google Scholar
Digital Library
Index Terms
Scalable Global Power Management Policy Based on Combinatorial Optimization for Multiprocessors
Recommendations
Coordinated power management of voltage islands in CMPs
SIGMETRICS '10: Proceedings of the ACM SIGMETRICS international conference on Measurement and modeling of computer systemsMultiple clock domain architectures have recently been proposed to alleviate the power problem in CMPs by having different frequency/voltage values assigned to each domain based on workload requirements. However, accurate allocation of power to these ...
Distributed power management of real-time applications on a GALS multiprocessor SOC
EMSOFT '15: Proceedings of the 12th International Conference on Embedded SoftwareIt is generally desirable to reduce the power consumption of embedded systems. Dynamic Voltage and Frequency Scaling (DVFS) is a commonly applied technique to achieve power reduction at the cost of computational performance. Multiprocessor System on ...
Cooperative shared resource access control for low-power chip multiprocessors
ISLPED '09: Proceedings of the 2009 ACM/IEEE international symposium on Low power electronics and designIn a single-chip multiprocessor (CMP), the last-level cache and its lower memory hierarchy components are typically shared by multiple processors. Conflicts in these resources lead to poor overall performance of the CMP and/or unpredictable performance ...






Comments