Abstract
With the intense competition between cloud providers, oversubscription is increasingly important to maintain profitability. Oversubscribing physical resources is not without consequences: it increases the likelihood of overload. Memory overload is particularly damaging. Contrary to traditional views, we analyze current data center logs and realistic Web workloads to show that overload is largely transient: up to 88.1% of overloads last for less than 2 minutes. Regarding overload as a continuum that includes both transient and sustained overloads of various durations points us to consider mitigation approaches also as a continuum, complete with tradeoffs with respect to application performance and data center overhead. In particular, heavyweight techniques, like VM migration, are better suited to sustained overloads, whereas lightweight approaches, like network memory, are better suited to transient overloads. We present Overdriver, a system that adaptively takes advantage of these tradeoffs, mitigating all overloads within 8% of well-provisioned performance. Furthermore, under reasonable oversubscription ratios, where transient overload constitutes the vast majority of overloads, Overdriver requires 15% of the excess space and generates a factor of four less network traffic than a migration-only approach.
- memcached. http://www.danga.com/memcached/, May 2003.Google Scholar
- T. E. Anderson, D. E. Culler, D. A. Patterson, and the NOW team. A case for NOW (Networks of Workstations). IEEE Micro, 15 (1): 54--64, Feb. 1995. Google Scholar
Digital Library
- M. Andreolini, S. Casolari, M. Colajanni, and M. Messori. Dynamic load management of virtual machines in a cloud architecture. In Proc. of ICST CLOUDCOMP, Munich, Germany, Oct. 2009.Google Scholar
- P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt, and A. Warfield. Xen and the art of virtualization. In Proc. of ACM SOSP, Bolton Landing, NY, Oct. 2003. Google Scholar
Digital Library
- C. Clark, K. Fraser, S. Hand, J. G. Hansen, E. Jul, C. Limpach, I. Pratt, and A. Warfield. Live migration of virtual machines. In Proc. of USENIX NSDI, Boston, MA, May 2005. Google Scholar
Digital Library
- B. Cully, G. Lefebvre, D. Meyer, M. Feeley, N. Hutchinson, and A. Warfield. Remus: high availability via asynchronous virtual machine replication. In Proc. of USENIX NSDI, San Francisco, CA, Apr. 2008. Google Scholar
Digital Library
- M. D. Dahlin, R. Y. Wang, T. E. Anderson, and D. A. Patterson. Cooperative caching: Using remote client memory to improve file system performance. In Proc. of USENIX OSDI, Monterey, CA, Nov. 1994. Google Scholar
Digital Library
- M. J. Feeley, W. E. Morgan, E. P. Pighin, A. R. Karlin, H. M. Levy, and C. A. Thekkath. Implementing global memory management in a workstation cluster. In Proc. of ACM SOSP, Copper Mountain, CO, Dec. 1995. Google Scholar
Digital Library
- K. Govil, D. Teodosiu, Y. Huang, and M. Rosenblum. Cellular Disco: Resource management using virtual clusters on shared-memory multiprocessors. In Proc. of ACM SOSP, Charleston, SC, Dec. 1999. Google Scholar
Digital Library
- D. Gupta, S. Lee, M. Vrable, S. Savage, A. C. Snoeren, G. Varghese, G. M. Voelker, and A. Vahdat. Difference engine: Harnessing memory redundancy in virtual machines. In Proc. of USENIX OSDI, San Diego, CA, Dec. 2008. Google Scholar
Digital Library
- F. Hermenier, X. Lorca, J.-M. Menaud, G. Muller, and J. Lawall. Entropy: a consolidation manager for clusters. In Proc. of ACM VEE, Washington, DC, Mar. 2009. Google Scholar
Digital Library
- M. Hines and K. Gopalan. Post-copy based live virtual machine migration using adaptive pre-paging and dynamic self-ballooning. In Proc. of ACM VEE, Washington, DC, Mar. 2009. Google Scholar
Digital Library
- M. R. Hines and K. Gopalan. MemX: supporting large memory workloads in Xen virtual machines. In Proc. of IEEE VTDC, Reno, NV, Nov. 2007. Google Scholar
Digital Library
- R. H. Katz. Tech Titans Building Boom. http://www.spectrum.ieee.org/feb09/7327, Feb. 2009. Google Scholar
Digital Library
- G. Khanna, K. Beaty, G. Kar, and A. Kochut. Application performace management in virtualized server environments. In Proc. of IEEE/IFIP NOMS, Vancouver, Canada, Apr. 2006.Google Scholar
- A. Kivity, Y. Kamay, and D. Laor. KVM: The kernel-based virtual machine for Linux. In Proc. of Ottawa Linux Symposium, Ottawa, Canada, June 2007.Google Scholar
- M. Kozuch and M. Satyanarayanan. Internet suspend/resume. In Proc. of IEEE WMCSA, Calicoon, NY, June 2002. Google Scholar
Digital Library
- H. A. Lagar-Cavilla, J. Whitney, A. Scannell, P. Patchin, S. M. Rumble, E. de Lara, M. Brudno, and M. Satyanarayanan. SnowFlock: Rapid virtual machine cloning for cloud computing. In Proc. of ACM EuroSys, Nuremberg, Germany, Apr. 2009. Google Scholar
Digital Library
- E. P. Markatos and G. Dramitinos. Implementation of a reliable remote memory pager. In Proc. of USENIX Annual Technical Conf., San Diego, CA, Jan. 1996. Google Scholar
Digital Library
- D. S. Milojicic, F. Douglis, Y. Paindaveine, and S. Zhou. Process migration. ACM Computing Surveys, 32 (3): 241---299, Sept. 2000. Google Scholar
Digital Library
- G. Miło's, D. G. Murray, S. Hand, and M. A. Fetterman. Satori: Enlightened page sharing. In Proc. of USENIX Annual Technical Conf., San Diego, CA, June 2009. Google Scholar
Digital Library
- B. Mitchell, J. Rosse, and T. Newhall. Reliability algorithms for network swapping systems with page migration. In Proc. of IEEE CLUSTER, San Diego, CA, Sept. 2004. Google Scholar
Digital Library
- M. Nelson, B.-H. Lim, and G. Hutchins. Fast transparent migration for virtual machines. In Proc. of USENIX Annual Technical Conf., Anaheim, CA, Apr. 2005. Google Scholar
Digital Library
- T. Newhall, S. Finney, K. Ganchev, and M. Spiegel. Nswap: A Network Swapping Module for Linux Clusters. In Proc. of Euro-Par, Klagenfurt, Austria, Aug. 2003.Google Scholar
Cross Ref
- H. Qian, E. Miller, W. Zhang, M. Rabinovich, and C. E. Wills. Agility in virtualized utility computing. In Proc. of IEEE VTDC, Reno, NV, Nov. 2007. Google Scholar
Digital Library
- C. P. Sapuntzakis, R. Chandra, B. Pfaff, J. Chow, M. S. Lam, and M. Rosenblum. Optimizing the migration of virtual computers. In Proc. of USENIX OSDI, Boston, MA, Dec. 2002. Google Scholar
Digital Library
- V. Shrivastava, P. Zerfos, K. won Lee, H. Jamjoom, Y.-H. Liu, and S. Banerjee. Application-aware virtual machine migration in data centers (to appear). In Proc. of IEEE INFOCOM Mini-conference, Shanghai, China, Apr. 2011.Google Scholar
Cross Ref
- A. Stage and T. Setzer. Network-aware migration control and scheduling of differentiated virtual machine workloads. In Proc. of ICSE Workshop on Software Engineering Challenges of Cloud Computing, Vancouver, Canada, May 2009. Google Scholar
Digital Library
- Standard Performance Evaluation Corporation. Specweb2009 release 1.10 banking workload design document. http://www.spec.org/web2009/docs/design/BankingDesign.html, Apr. 2009.Google Scholar
- H. N. Van, F. D. Tran, and J.-M. Menaud. Autonomic virtual resource management for service hosting platforms. In Proc. of ICSE Workshop on Software Engineering Challenges of Cloud Computing, Vancouver, Canada, May 2009. Google Scholar
Digital Library
- C. A. Waldspurger. Memory resource management in VMware ESX server. In Proc. of USENIX OSDI, Boston, MA, Dec. 2002. Google Scholar
Digital Library
- A. Warfield, S. Hand, K. Fraser, and T. Deegan. Facilitating the development of soft devices. In Proc. of USENIX Annual Technical Conf., Anaheim, CA, Apr. 2005. Google Scholar
Digital Library
- S. Weber and R. Hariharan. A new synthetic web server trace generation methodology. In Proc. of IEEE ISPASS, Austin, TX, Mar. 2003. Google Scholar
Digital Library
- T. Wood, P. Shenoy, and A. Venkataramani. Black-box and gray-box strategies for virtual machine migration. In Proc. of USENIX NSDI, Cambridge, MA, Apr. 2007. Google Scholar
Digital Library
- T. Wood, G. Tarasuk-Levin, P. Shenoy, P. Desnoyers, E. Cecchet, and M. D. Corner. Memory buddies: Exploiting page sharing for smart colocation in virtualized data centers. In Proc. of ACM VEE, Washington, DC, Mar. 2009. Google Scholar
Digital Library
Index Terms
Overdriver: handling memory overload in an oversubscribed cloud
Recommendations
Overdriver: handling memory overload in an oversubscribed cloud
VEE '11: Proceedings of the 7th ACM SIGPLAN/SIGOPS international conference on Virtual execution environmentsWith the intense competition between cloud providers, oversubscription is increasingly important to maintain profitability. Oversubscribing physical resources is not without consequences: it increases the likelihood of overload. Memory overload is ...
Enabling Efficient Hypervisor-as-a-Service Clouds with Ephemeral Virtualization
VEE '16When considering a hypervisor, cloud providers must balance conflicting requirements for simple, secure code bases with more complex, feature-filled offerings. This paper introduces Dichotomy, a new two-layer cloud architecture in which the roles of the ...
Enabling Efficient Hypervisor-as-a-Service Clouds with Ephemeral Virtualization
VEE '16: Proceedings of the12th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution EnvironmentsWhen considering a hypervisor, cloud providers must balance conflicting requirements for simple, secure code bases with more complex, feature-filled offerings. This paper introduces Dichotomy, a new two-layer cloud architecture in which the roles of the ...







Comments