Abstract
Cloud services are being adopted as a utility by large numbers of tenants who rent Virtual Machines (VMs). For cloud storage, however, unpredictable IO characteristics make accurate Service-Level-Objective (SLO) enforcement challenging. As a result, it has been very difficult to support simple-to-use, technology-agnostic SLOs that specify a particular value for a specific metric (e.g., storage bandwidth). This is because the quality of SLO enforcement depends on two measures of its precision: performance error and fluctuation. High-precision SLO enforcement is critical for user-oriented performance customization and user experience. To address this challenge, this article presents V-Cup, a framework for VM-oriented customizable SLOs and their near-precise enforcement. It consists of multiple auto-tuners, each of which exports an interface through which a tenant customizes the desired storage bandwidth for a VM, and drives the storage bandwidth of that VM to converge on the target value with predictable precision. We design and implement V-Cup in the Xen hypervisor, based on the fair-sharing scheduler for VM-level resource management. Our evaluation of the V-Cup prototype shows that it achieves satisfactory performance guarantees through near-precise SLO enforcement.
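The abstract does not give the control law used by the auto-tuners or the exact definitions of performance error and fluctuation. The following is therefore only a minimal sketch of the general idea, assuming a simple integral controller that adjusts a per-VM scheduler share, a toy model in which bandwidth is proportional to that share, and two common metric definitions (relative error of the mean, and coefficient of variation). The names `AutoTuner`, `performance_error`, `fluctuation`, and `simulate` are hypothetical and are not taken from V-Cup.

```python
from statistics import mean, pstdev


def performance_error(samples, target):
    """Assumed definition: relative gap between mean bandwidth and target."""
    return abs(mean(samples) - target) / target


def fluctuation(samples):
    """Assumed definition: coefficient of variation of measured bandwidth."""
    return pstdev(samples) / mean(samples)


class AutoTuner:
    """Hypothetical per-VM tuner: an integral controller on a scheduler share."""

    def __init__(self, target_mbps, gain=0.5, share=0.1):
        self.target = target_mbps  # tenant-specified bandwidth SLO
        self.gain = gain           # controller gain (assumed value)
        self.share = share         # fraction of device bandwidth given to the VM

    def step(self, measured_mbps, capacity_mbps):
        # Move the share in proportion to the remaining bandwidth error.
        error = self.target - measured_mbps
        self.share += self.gain * error / capacity_mbps
        # A share is a fraction, so keep it within [0, 1].
        self.share = min(1.0, max(0.0, self.share))
        return self.share


def simulate(target=50.0, capacity=200.0, steps=40):
    """Toy closed loop: measured bandwidth = share * device capacity."""
    tuner = AutoTuner(target)
    share = tuner.share
    samples = []
    for _ in range(steps):
        measured = share * capacity
        samples.append(measured)
        share = tuner.step(measured, capacity)
    return samples


if __name__ == "__main__":
    trace = simulate()
    tail = trace[-10:]
    print("final bandwidth:", trace[-1])
    print("performance error:", performance_error(tail, 50.0))
    print("fluctuation:", fluctuation(tail))
```

In this toy model the bandwidth error halves at every step, so the measured bandwidth converges on the target. A real storage stack is noisy and nonlinear, which is why both the error and the fluctuation of the converged bandwidth matter when judging enforcement precision.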
Customizable SLO and Its Near-Precise Enforcement for Storage Bandwidth