skip to main content
research-article

Models and framework for supporting runtime decisions in Web-based systems

Published:08 July 2008Publication History
Skip Abstract Section

Abstract

Efficient management of distributed Web-based systems requires several mechanisms that decide on request dispatching, load balance, admission control, request redirection. The algorithms behind these mechanisms typically make fast decisions on the basis of the load conditions of the system resources. The architecture complexity and workloads characterizing most Web-based services make it extremely difficult to deduce a representative view of a resource load from collected measures that show extreme variability even at different time scales. Hence, any decision based on instantaneous or average views of the system load may lead to useless or even wrong actions. As an alternative, we propose a two-phase strategy that first aims to obtain a representative view of the load trend from measured system values and then applies this representation to support runtime decision systems. We consider two classical problems behind decisions: how to detect significant and nontransient load changes of a system resource and how to predict its future load behavior. The two-phase strategy is based on stochastic functions that are characterized by a computational complexity that is compatible with runtime decisions. We describe, test, and tune the two-phase strategy by considering as a first example a multitier Web-based system that is subject to different classes of realistic and synthetic workloads. Also, we integrate the proposed strategy into a framework that we validate by applying it to support runtime decisions in a cluster Web system and in a locally distributed Network Intrusion Detection System.

References

  1. Abdelzaher, T., Shin, K. G., and Bhatti, N. 2002. Performance guarantees for Web server end-systems: A control-theoretical approach. IEEE Trans. Paral. Distrib. Syst. 13, 1, 80--96.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Andreolini, M. and Casolari, S. 2006. Load prediction models in Web-based systems. In Proceedings of the 1th International Performance Evaluation Methodologies and Tools Conference (VALUETOOLS'06). Pisa, Italy.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Andreolini, M., Colajanni, M., and Nuccio, M. 2003. Scalability of content-aware server switches for cluster-based Web information systems. In Proceedings of 12th International World Wide Web Conf. (WWW'03). Budapest, Hungary.]]Google ScholarGoogle Scholar
  4. Apache. 1999. Apache HTTP server project. http://www.apache.org.]]Google ScholarGoogle Scholar
  5. Arlitt, M., Krishnamurthy, D., and Rolia, J. 2001. Characterizing the scalability of a large Web-based shopping system. IEEE Trans. Intern. Techn. 1, 1, 44--69.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Bahi, J., Contassot-Vivier, S., and Couturier, R. 2006. Dynamic load balancing and efficient load estimators for asynchronous iterative algorithms. IEEE Trans. Paral. Distrib. Syst. 16, 4, 289--299.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Barford, P. and Crovella, M. E. 1998. Generating representative Web workloads for network and server performance evaluation. In Proceedings of the 1st the Joint International Conference on Measurement and Modeling of Computer Systems (ACM SIGMETRICS'98/Performance'98). Madison, WI.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Baryshnikov, Y., Coffman, E., Pierre, G., Rubenstein, D., Squillante, M., and Yimwadsana, T. 2005. Predictability of Web server traffic congestion. In Proceedings of 10th International Workshop of Web Content Caching and Distribution (WCW'05). Sophia Antipolis, France.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Bonett, D. 2006. Approximate confidence interval for standard deviation of nonnormal distributions. Comput. Statis. Data Anal. 50, 3, 775--882.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Box, G., Jenkins, G., and Reinsel, G. 1994. Time Series Analysis Forecasting and Control. Prentice Hall.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Brockwell, B. L. and Davis, R. A. 1987. Time Series: Theory and Methods. Springer-Verlag.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Bryhni, H. 2000. A comparison of load balancing techniques for scalable web servers. IEEE Netw. 14, 4, 58--64.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Cain, H. W., Rajwar, R., Marden, M., and Lipasti, M. H. 2001. An architectural evaluation of Java TPC-W. In Proceedings of the 7th International Symposium on High-Performance Computer Architecture (HPCA'01). Nuovo Leone, Mexico.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Canali, C., Xiao, Z., and Rabinovich, M. 2004. Utility computing for Internet applications. In Web Content Delivery, X. Tang, J. Xu, and S. Chanson, Eds. Vol. 2. Springer Verlag, 131--151.]]Google ScholarGoogle Scholar
  15. Cardellini, V., Casalicchio, E., Colajanni, M., and Yu, P. 2002. The state of the art in locally distributed Web-server system. ACM Comput. Surv. 2, 263--311.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Cardellini, V., Colajanni, M., and Yu, P. 2003. Request redirection algorithms for distributed Web systems. IEEE Trans. Paral. Distrib. Syst. 14, 5, 355--368.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Cardellini, V., Colajanni, M., and Yu, P. S. 2000. Geographic load balancing for scalable distributed Web systems. In Proceedings of the 8th International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS'00). San Francisco, CA.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Castro, M., Dwyer, M., and Rumsewicz, M. 1999. Load balancing and control for distributed World Wide Web servers. In Proceedings of the International Conference on Control Applications (CCA'99). Kohala Coast, HI.]]Google ScholarGoogle Scholar
  19. Cecchet, E., Chanda, A., Elnikety, S., Marguerite, J., and Zwaenepoel, W. 2003. Performance comparison of middleware architectures for generating dynamic Web content. In Proceedings of the 4th Middleware Conference. Rio de Janeiro, Brazil.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Challenger, J., Dantzig, P., Iyengar, A., Squillante, M., and Zhang, L. 2004. Efficiently serving dynamic data at highly accessed Web sites. IEEE/ACM Trans. on Networ. 12, 2, 233--246.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Chen, H. and Mohapatra, P. 2002. Session-based overload control in QoS-aware Web server. In Proceedings of the 21th IEEE International Conference on Computer Communications (INFOCOM'02).]]Google ScholarGoogle Scholar
  22. Chen, H. and Mohapatra, P. 2003. Overload control in QoS-aware Web servers. Comput. Netw. 42, 1, 119--133.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Chen, X. and Heidemann, J. 2005. Flash crowd mitigation via an adaptive admission control based on application-level observations. IEEE Trans. Inter. Tech. 5, 3, 532--569.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Cherkasova, L. and Phaal, P. 1999. Session based admission control: A mechanism for improving performance of commercial Web sites. In Proceedings of the 7th International Workshop on Quality of Service (IWQoS'99). London, UK, 226--235.]]Google ScholarGoogle Scholar
  25. Cherkasova, L. and Phaal, P. 2002. Session-based admission control: A mechanism for peak load management of commercial Web sites. IEEE Trans. Comput. 51, 6, 669--685.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Choi, B., Park, J., and Zhang, Z. 2003. Adaptive random sampling for traffic load measurement. In Proceedings of the 16th IEEE International Conference on Communications (ICC'03). Anchorage, AL.]]Google ScholarGoogle Scholar
  27. Colajanni, M. and Marchetti, M. 2006. A parallel architecture for stateful intrusion detection in high traffic networks. In Proceedings of the IEEE/IST Workshop on Monitoring, Attack Detection and Mitigation (MonAM'06). Tuebingen, Germany.]]Google ScholarGoogle Scholar
  28. Crovella, M. E., Taqqu, M. S., and Bestavros, A. 1998. Heavy-tailed probability distributions in the World Wide Web. In A Practical Guide To Heavy Tails. Chapman and Hall, 3--26.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Dahlin, M. 2000. Interpreting stale load information. IEEE Trans. Paral. Distrib. Syst. 11, 10, 1033--1047.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Dinda, P. and O'Hallaron, D. 2000. Host load prediction using linear models. Cluster Comput. 3, 4, 265--280.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Dodge, R. C., Menascé, D. A., and Barbará, D. 2001. Testing e-commerce site scalability with TPC-W. In Proceedings of the 27th International Computer Measurement Group Conference. Orlando, FL.]]Google ScholarGoogle Scholar
  32. Duffield, N. G. and Lo Presti, F. 2000. Multicast inference of packet delay variance at interior network links. In Proceedings of the 19th IEEE International Conference on Computer Communications (INFOCOM'00). Tel Aviv, Israel.]]Google ScholarGoogle Scholar
  33. Elnikety, S., Nahum, E., Tracey, J., and Zwaenepoel, W. 2004. A method for transparent admission control and request scheduling in e-commerce Web sites. In Proceedings of the 13th International World Wide Web Conference. New York, NY.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Eubank, R. L. and Eubank, E. 1999. Non parametric regression and spline smoothing. CRC Press.]]Google ScholarGoogle Scholar
  35. Ferrari, D. and Zhou, S. 1987. An empirical investigation of load indices for load balancing applications. In Proceedings of the 12th IFIP WG 7.3 International Symposium on Computer Performance Modeling, Measurement and Evaluation (PERFORMANCE'87). Brussels, Belgium.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Fishman, G. and Adan, I. 2006. How heavy-tailed distributions affect simulation-generated time averages. ACM Trans. Model. Comput. Simul. 16, 2, 152--173.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Floyd, S. and Paxson, V. 2001. Difficulties in simulating the Internet. IEEE/ACM Trans. Networ. 9, 3, 392--403.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Forsythe, G. E., Malcolm, M. A., and Moler, C. B. 1977. Computer Methods for Mathematical Computations. Prentice-Hall.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Ganek, A. G. and Corbi, T. 2003. The dawning of the autonomic computing era. IBM Syst. J. 42, 1, 5--18.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Gautama, H. and van Gemund, A. 2006. Low-cost static performance prediction of parallel stochastic task compositions. IEEE Trans. Paral. Distrib. Syst. 17, 1, 78--91.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Iyer, R. 2001. Exploring the cache design space for Web servers. In Proceedings of the 15th International Parallel and Distributed Processing Symposium (PDPS'01). San Francisco, CA.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Jung, J., Krishnamurthy, B., and Rabinovich, M. 2002. Flash crowds and denial of service attacks: Characterization and implications for CDNs and Web sites. In Proceedings of the 11th International World Wide Web Conference (WWW'02). Honolulu, HW.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Kamra, A., Misra, V., and Nahum, E. M. 2004. Yaksha: A self-tuning controller for managing the performance of 3-tiered sites. In Proceedings of the 12th International Workshop on Quality of Service (IWQOS'04). Montreal, Canada.]]Google ScholarGoogle Scholar
  44. Karbhari, P., Rabinovich, M., Xiao, Z., and Douglis, F. 2002. ACDN: A content delivery network for applications. In Proceedings of the 21st ACM International Conference on Management of Data (SIGMOD'02). Madison, WI.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Kelly, T. 2005. Detecting performance anomalies in global applications. In Proceedings of the USENIX Workshop on Real, Large Distributed Systems (WORLDS'05). San Francisco, CA.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Kendall, M. and Ord, J. 1990. Time Series. Oxford University Press.]]Google ScholarGoogle Scholar
  47. Kephart, J. O. and Chess, D. M. 2003. The vision of Autonomic Computing. IEEE Comput. 36, 1, 41--50.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Lilja, D. J. 2000. Measuring Computer Performance. A Practitioner's Guide. Cambridge University Press.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Lingyun, Y., Foster, I., and Schopf, J. M. 2003. Homeostatic and tendency-based CPU load predictions. In Proceedings of the 8th International Parallel and Distributed Processing Symposium (IPDPS'03). Nice, France.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Lippmann, R., Haines, J. W., Fried, D., and Korba, J. Das, K. 2000. Analysis and results of the 1999 DARPA off-line intrusion detection evaluation. In Proceedings of the 3rd International Workshop on Recent Advances in Intrusion Detection (RAID'00). London, UK.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Luo, S. and Marin, G. 2005. Realistic Internet traffic simulation through mixture modeling and a case study. In Proceedings of the 37th IEEE Winter Simulation Conference (WSC'05). Orlando, FL.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Menascé, D. and Kephart, J. 2007. Autonomic computing. IEEE Intern. Comput. 11, 1, 18--21.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Mitzenmacher, M. 2000. How useful is old information. IEEE Trans. Paral. Distrib. Syst. 11, 1, 6--20.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. MySQL 2005. MySQL Database server. http://www.mysql.com/.]]Google ScholarGoogle Scholar
  55. Pai, V. S., Aron, M., Banga, G., Svendsen, M., Druschel, P., Zwaenepoel, W., and Nahum, E. M. 1998. Locality-aware request distribution in cluster-based network servers. In Proceedings of the 8th ACM Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'98). San Jose, CA.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Pandey, R. and Barnes, J. F. Olsson, R. 1998. Supporting quality of service in HTTP servers. In Proceedings of the 17th ACM Symposium on Principles of Distributed Computing (PODC'98). Puerto Vallarta, Mexico.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Pierre, G. and Van Steen, M. 2001. Globule: A platform for self replicating Web documents. In Proceedings of the 6th Conference on Protocols for Multimedia systems (PROMS'01). Enschede, The Netherlands.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. Poirier, D. J. 1973. Piecewise regression using cubic spline. J. Amer. Statist. Ass. 68, 343, 515--524.]]Google ScholarGoogle Scholar
  59. Pradhan, P., Tewari, R., Sahu, S., Chandra, A., and Shenoy, P. 2002. An observation-based approach towards self-managing Web servers. In Proceedings of the 10th International Workshop on Quality of Service (IWQOS'02). Monterey, CA.]]Google ScholarGoogle Scholar
  60. Rabinovich, M., Triukose, S., Wen, Z., and Wang, L. 2006. DipZoom: The Internet measurement marketplace. In Proceedings of the 9th IEEE Global Internet Symposium. Barcelona, Spain.]]Google ScholarGoogle Scholar
  61. Rabinovich, M., Zhen, X., and Aggarwal, A. 2003. Computing on the edge: A platform for replicating Internet applications. In Proceedings of the 8th International Workshop of Web Content Caching and Distribution (WCW'03). Hawthorne, NY.]]Google ScholarGoogle Scholar
  62. Ramanathan, P. 1999. Overload management in real-time control applications using (m,k)-firm guarantee. Perform. Eval. Rev. 10, 6, 549--559.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. Sang, A. and Li, S. 2000. A predictability analysis of network traffic. In Proceedings of the 19th IEEE International Conference on Computer Communications (INFOCOM'00). Tel Aviv, Israel.]]Google ScholarGoogle Scholar
  64. Satyanarayanan, M., Narayanan, D., Tilton, J., Flinn, J., and Walker, K. 1997. Agile application-aware adaptation for mobility. In Proceedings of the 16th ACM International Symposium on Operating Systems Principles (SOSP'97). Saint-Malo, France.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  65. Sivasubramanian, S., Pierre, G., and Van Steen, M. 2004. Replication for Web hosting systems. ACM Comput. Surv. 36, 3, 291--334.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  66. SpecWEB05 2005. The SPECWeb05 benchmark. http://www.spec.org/osg/web2005/.]]Google ScholarGoogle Scholar
  67. SpecWEB96 1996. The SPECWeb96 benchmark. http://www.spec.org/osg/web96/.]]Google ScholarGoogle Scholar
  68. Stankovic, J. A. 1984. Simulations of three adaptive, decentralized controlled, job scheduling algorithms. Comput. Netw. 8, 3, 199--217.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  69. Tomcat 2005. The Tomcat Servlet Engine. http://jakarta.apache.org/tomcat/.]]Google ScholarGoogle Scholar
  70. TPC-W 2004. TPC-W transactional Web e-commerce benchmark. http://www.tpc.org/tpcw/.]]Google ScholarGoogle Scholar
  71. Tran, N. and Reed, D. 2004. Automatic ARIMA time series modeling for adaptive I/O prefetching. IEEE Trans. Paral. Distrib. Syst. 15, 4, 362--377.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  72. Utts, J. M. 2004. Seeing Through Statistics. Thomson Brooks/Cole.]]Google ScholarGoogle Scholar
  73. Wildstrom, J., Stone, P., Witchel, E., Mooney, R., and Dahlin, M. 2005. Towards self-configuring hardware for distributed computer systems. In Proceedings of the 2nd International Conference on Autonomic Computing (ICAC'05). Seattle, WA.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  74. Wolber, G. and Alfy, I. 1999. Monotonic cubic spline interpolation. In Proceedings of the International Conference on Computer Graphics. Canmore, CA, 188.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  75. Wolski, R., Spring, N. T., and Hayes, J. 1999. The Network Weather Service: A distributed resource performance forecasting service for metacomputing. Future Generation Comput. Syst. 15, 5, 757--768.]] Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Models and framework for supporting runtime decisions in Web-based systems

                    Recommendations

                    Comments

                    Login options

                    Check if you have access through your login credentials or your institution to get full access on this article.

                    Sign in

                    Full Access

                    PDF Format

                    View or Download as a PDF file.

                    PDF

                    eReader

                    View online with eReader.

                    eReader
                    About Cookies On This Site

                    We use cookies to ensure that we give you the best experience on our website.

                    Learn more

                    Got it!