Abstract
In this paper, we present techniques for provisioning CPU and network resources in shared hosting platforms running potentially antagonistic third-party applications. The primary contribution of our work is to demonstrate the feasibility and benefits of overbooking resources in shared platforms, to maximize the platform yield: the revenue generated by the available resources. We do this by first deriving an accurate estimate of application resource needs by profiling applications on dedicated nodes, and then using these profiles to guide the placement of application components onto shared nodes. By overbooking cluster resources in a controlled fashion, our platform can provide performance guarantees to applications even when overbooked, and combine these techniques with commonly used QoS resource allocation mechanisms to provide application isolation and performance guarantees at run-time. When compared to provisioning based on the worst-case, the efficiency (and consequently revenue) benefits from controlled overbooking of resources can be dramatic. Specifically, experiments on our Linux cluster implementation indicate that overbooking resources by as little as 1% can increase the utilization of the cluster by a factor of two, and a 5% overbooking yields a 300--500% improvement, while still providing useful resource guarantees to applications.
- M. Aron. Differentiated and Predictable Quality of Service in Web Server Systems. PhD Thesis, Computer Science, Rice University, October 2000.]] Google Scholar
Digital Library
- M. Aron, S. Iyer, and P. Druschel. A Resource Management Framework for Predictable Quality of Service in Web Servers. Submitted for publication]]Google Scholar
- M. Aron, P. Druschel, and W. Zwaenepoel. Cluster Reserves: A Mechanism for Resource Management in Cluster-based Network Servers. In Proceedings of the ACM SIGMETRICS Conference, Santa Clara, CA, June 2000.]] Google Scholar
Digital Library
- G. Banga, P. Druschel, and J. Mogul. Resource Containers: A New Facility for Resource Management in Server Systems. In Proceedings of the third Symposium on Operating System Design and Implementation (OSDI'99), New Orleans, pages 45--58, February 1999.]] Google Scholar
Digital Library
- C. A. Waldspurger. Memory Resource Management in VMware ESX Server. In Proceedings of the 5th Symposium on Operating System Design and Implementation (OSDI'02), Boston, MA, December 2002.]] Google Scholar
Digital Library
- R. Boorstyn, A. Burchard, J. Liebeherr, and C. Oottamakorn. Statistical Service Assurances for Traffic Scheduling Algorithms. IEEE Journal on Selected Areas in Communications, 18(12):2651--2664, December 2000.]]Google Scholar
Digital Library
- J. Chase, D. Anderson, P. Thakar, A. Vahdat, and R. Doyle. Managing Energy and Server Resources in Hosting Centers. In Proceedings of the Eighteenth ACM Symposium on Operating Systems Principles (SOSP), pages 103--116, October 2001.]] Google Scholar
Digital Library
- A. Fox, S. D. Gribble, Y. Chawathe, E. A. Brewer, and P. Gauthier. Cluster-based Scalable Network Services. In Proceedings of the sixteenth ACM symposium on Operating systems principles (SOSP'97), Saint-Malo, France, pages 78--91, December 1997.]] Google Scholar
Digital Library
- K. Govil, D. Teodosiu, Y. Huang, and M. Rosenblum. Cellular Disco: Resource Management using Virtual Clusters on Sharedmemory Multiprocessors. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP'99), Kiawah Island Resort, SC, pages 154--169, December 1999.]] Google Scholar
Digital Library
- P. Goyal, X. Guo, and H. M. Vin. A Hierarchical CPU Scheduler for Multimedia Operating Systems. In Proceedings Operating System Design and Implimentation (OSD1'96), Seattle, pages 107--122, October 1996.]] Google Scholar
Digital Library
- Global Grid Forum: Scheduling and Resource Management Working Group. http://www-unix.mcs.anl.gov/schopf/ggf-sched, 2002.]]Google Scholar
- M B. Jones, D Rosu, and M Rosu. CPU Reservations and Time Constraints: Efficient, Predictable Scheduling of Independent Activities. In Proceedings of the sixteenth ACM symposium on Operating Systems Principles (SOSP'97), Saint-Malo, France, pages 198--211, December 1997.]] Google Scholar
Digital Library
- I. Leslie, D. McAuley, R. Black, T. Roscoe, P. Barham, D. Evers, R. Fairbairns, and E. Hyden. The Design and Implementation of an Operating System to Support Distributed Multimedia Applications. IEEE Journal on Selected Areas in Communication, 14(7): 1280--1297, September 1996.]]Google Scholar
Digital Library
- Linux Trace Toolkit Project Page. http://www.opersys.com/LTT/,2002.]]Google Scholar
- M. Litzkow, M. Livny, and M. Mutka. Condor - A Hunter of Idle Workstations. In Proceedings of the 8th International Conference of Distributed Computing Systems, pages 104--111, June 1988.]]Google Scholar
Cross Ref
- The pgbench man page, PostgreSQL software distribution, 2002.]]Google Scholar
- T. Roscoe and B. Lyles. Distributing Computing without DPEs: Design Considerations for Public Computing Platforms. In Proceedings of the 9th ACM SIGOPS European Workshop, Kolding, Denmark, September 2000.]] Google Scholar
Digital Library
- Y. Saito, B. Bershad, and H. Levy. Manageability, Availability and Performance in Porcupine: A Highly Available, Scalable Clusterbased Mail Service. In Proceedings of the 17th SOSP, Kiawah Island Resort, SC, pages 1--15, December 1999.]] Google Scholar
Digital Library
- B C. Smith, J F. Leimkuhler, and R M. Darrow. Yield Management at American Airlines. Interfaces, 22(1):8--31, January-February 1992.]]Google Scholar
Digital Library
- The Standard Performance Evaluation Corporation (SPEC), http://www.spec.org. SPECWeb99 Benchmark Documentation.]]Google Scholar
- V Sundaram, A. Chandra, P. Goyal, P. Shenoy, J Sahni, and H Vin. Application Performance in the QLinux Multimedia Operating System. In Proceedings of the Eighth ACM Conference on Multimedia, Los Angeles, CA, November 2000.]] Google Scholar
Digital Library
- P. Tang and T. Tai. Network Traffic Characterization Using Token Bucket Model. In Proceedings of IEEE Infocom'99, New York, NY, March 1999.]]Google Scholar
- B. Urgaonkar and P. Shenoy. Sharc: Managing CPU and Network Bandwidth in Shared Clusters. Technical Report TR01--08, Department of Computer Science, University of Massachusetts, October 2001.]]Google Scholar
- B. Urgaonkar, P. Shenoy, and T. Roscoe. Resource Overbooking and Application Profiling in Shared Hosting Platforms. Technical report TR02-21, Department of Computer Science, University of Massachusetts, May 2002.]]Google Scholar
- H. M. Vin, P. Goyal, A. Goyal, and A. Goyal. A Statistical Admission Control Algorithm for Multimedia Servers. In Proceedings of the ACM Multimedia'94, San Francisco, pages 33--40, October 1994.]] Google Scholar
Digital Library
- C. A. Waldspurger and W. E. Weihl. Lottery Scheduling: Flexible Proportional-share Resource Management. In Proceedings of symposim on Operating System Design and Implementation, November 1994.]] Google Scholar
Digital Library
- T. Zhao and V. Karmacheti. Enforcing Resource Sharing Agreements among Distributed Server Clusters. In Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS), April 2002.]] Google Scholar
Digital Library
Index Terms
Resource overbooking and application profiling in shared hosting platforms
Recommendations
Resource overbooking and application profiling in a shared Internet hosting platform
In this article, we present techniques for provisioning CPU and network resources in shared Internet hosting platforms running potentially antagonistic third-party applications. The primary contribution of our work is to demonstrate the feasibility and ...
Resource overbooking and application profiling in shared hosting platforms
OSDI '02: Proceedings of the 5th symposium on Operating systems design and implementation (Copyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading)In this paper, we present techniques for provisioning CPU and network resources in shared hosting platforms running potentially antagonistic third-party applications. The primary contribution of our work is to demonstrate the feasibility and benefits of ...
Resource allocation algorithms for virtualized service hosting platforms
Commodity clusters are used routinely for deploying service hosting platforms. Due to hardware and operation costs, clusters need to be shared among multiple services. Crucial for enabling such shared hosting platforms is virtual machine (VM) technology,...





Comments