Abstract
Mobile virtualization adds extra layers to the software stack, which degrades performance. Notably, each I/O operation has to pass through several software layers to reach the NAND-flash-based storage system. This article targets I/O optimization for mobile virtualization, since I/O is one of the major bottlenecks that seriously affect the performance of mobile devices. A large percentage of all I/O operations are metadata updates. Frequently updated metadata not only degrade overall I/O performance but also severely reduce flash memory lifetime.
In this article, we propose a novel I/O optimization technique that identifies the frequently updated metadata of a guest file system stored in a virtual machine image file. These metadata are then stored in a small additional non-volatile memory (NVM), which is faster and has higher endurance than flash, greatly improving flash memory’s performance and lifetime. To the best of our knowledge, this is the first work to distinguish file system metadata from regular data in a guest OS image file with NVM optimization. The proposed scheme is evaluated on a real embedded hardware platform. The experimental results show that the proposed techniques can improve write performance by up to 45.21% in mobile devices with virtualization.
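The routing idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the class name `WriteRouter`, the block-range representation, and the example ranges are all hypothetical, and the sketch assumes the metadata block ranges of the guest file system have already been identified inside the image file. It only shows how a write to a guest block might be classified and sent to NVM or to flash.

```python
from bisect import bisect_right


class WriteRouter:
    """Route guest block writes to NVM (metadata) or flash (regular data).

    Hypothetical sketch: metadata_ranges is assumed to be given as
    non-overlapping, inclusive (first_block, last_block) pairs. A real
    system would derive them from the guest file system layout stored
    in the virtual machine image file.
    """

    def __init__(self, metadata_ranges):
        self.metadata_ranges = sorted(metadata_ranges)
        # Start blocks only, kept sorted for binary search.
        self._starts = [first for first, _ in self.metadata_ranges]
        self.nvm_writes = 0
        self.flash_writes = 0

    def is_metadata(self, block):
        # Find the last range whose start is <= block, then check its end.
        i = bisect_right(self._starts, block) - 1
        return i >= 0 and block <= self.metadata_ranges[i][1]

    def write(self, block):
        # Metadata writes go to the small NVM; everything else to flash.
        if self.is_metadata(block):
            self.nvm_writes += 1
            return "nvm"
        self.flash_writes += 1
        return "flash"


# Example with made-up ranges: blocks 0-3 and 100-103 hold metadata.
router = WriteRouter([(0, 3), (100, 103)])
targets = [router.write(b) for b in (2, 50, 103, 104)]
```

Counting the writes per target, as the sketch does, is one simple way to see how much update traffic would be diverted away from flash under a given set of metadata ranges.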
Image-Content-Aware I/O Optimization for Mobile Virtualization