Abstract
Efficient management of distributed Web-based systems requires several mechanisms that decide on request dispatching, load balance, admission control, request redirection. The algorithms behind these mechanisms typically make fast decisions on the basis of the load conditions of the system resources. The architecture complexity and workloads characterizing most Web-based services make it extremely difficult to deduce a representative view of a resource load from collected measures that show extreme variability even at different time scales. Hence, any decision based on instantaneous or average views of the system load may lead to useless or even wrong actions. As an alternative, we propose a two-phase strategy that first aims to obtain a representative view of the load trend from measured system values and then applies this representation to support runtime decision systems. We consider two classical problems behind decisions: how to detect significant and nontransient load changes of a system resource and how to predict its future load behavior. The two-phase strategy is based on stochastic functions that are characterized by a computational complexity that is compatible with runtime decisions. We describe, test, and tune the two-phase strategy by considering as a first example a multitier Web-based system that is subject to different classes of realistic and synthetic workloads. Also, we integrate the proposed strategy into a framework that we validate by applying it to support runtime decisions in a cluster Web system and in a locally distributed Network Intrusion Detection System.
- Abdelzaher, T., Shin, K. G., and Bhatti, N. 2002. Performance guarantees for Web server end-systems: A control-theoretical approach. IEEE Trans. Paral. Distrib. Syst. 13, 1, 80--96.]] Google Scholar
Digital Library
- Andreolini, M. and Casolari, S. 2006. Load prediction models in Web-based systems. In Proceedings of the 1th International Performance Evaluation Methodologies and Tools Conference (VALUETOOLS'06). Pisa, Italy.]] Google Scholar
Digital Library
- Andreolini, M., Colajanni, M., and Nuccio, M. 2003. Scalability of content-aware server switches for cluster-based Web information systems. In Proceedings of 12th International World Wide Web Conf. (WWW'03). Budapest, Hungary.]]Google Scholar
- Apache. 1999. Apache HTTP server project. http://www.apache.org.]]Google Scholar
- Arlitt, M., Krishnamurthy, D., and Rolia, J. 2001. Characterizing the scalability of a large Web-based shopping system. IEEE Trans. Intern. Techn. 1, 1, 44--69.]] Google Scholar
Digital Library
- Bahi, J., Contassot-Vivier, S., and Couturier, R. 2006. Dynamic load balancing and efficient load estimators for asynchronous iterative algorithms. IEEE Trans. Paral. Distrib. Syst. 16, 4, 289--299.]] Google Scholar
Digital Library
- Barford, P. and Crovella, M. E. 1998. Generating representative Web workloads for network and server performance evaluation. In Proceedings of the 1st the Joint International Conference on Measurement and Modeling of Computer Systems (ACM SIGMETRICS'98/Performance'98). Madison, WI.]] Google Scholar
Digital Library
- Baryshnikov, Y., Coffman, E., Pierre, G., Rubenstein, D., Squillante, M., and Yimwadsana, T. 2005. Predictability of Web server traffic congestion. In Proceedings of 10th International Workshop of Web Content Caching and Distribution (WCW'05). Sophia Antipolis, France.]] Google Scholar
Digital Library
- Bonett, D. 2006. Approximate confidence interval for standard deviation of nonnormal distributions. Comput. Statis. Data Anal. 50, 3, 775--882.]] Google Scholar
Digital Library
- Box, G., Jenkins, G., and Reinsel, G. 1994. Time Series Analysis Forecasting and Control. Prentice Hall.]] Google Scholar
Digital Library
- Brockwell, B. L. and Davis, R. A. 1987. Time Series: Theory and Methods. Springer-Verlag.]] Google Scholar
Digital Library
- Bryhni, H. 2000. A comparison of load balancing techniques for scalable web servers. IEEE Netw. 14, 4, 58--64.]] Google Scholar
Digital Library
- Cain, H. W., Rajwar, R., Marden, M., and Lipasti, M. H. 2001. An architectural evaluation of Java TPC-W. In Proceedings of the 7th International Symposium on High-Performance Computer Architecture (HPCA'01). Nuovo Leone, Mexico.]] Google Scholar
Digital Library
- Canali, C., Xiao, Z., and Rabinovich, M. 2004. Utility computing for Internet applications. In Web Content Delivery, X. Tang, J. Xu, and S. Chanson, Eds. Vol. 2. Springer Verlag, 131--151.]]Google Scholar
- Cardellini, V., Casalicchio, E., Colajanni, M., and Yu, P. 2002. The state of the art in locally distributed Web-server system. ACM Comput. Surv. 2, 263--311.]] Google Scholar
Digital Library
- Cardellini, V., Colajanni, M., and Yu, P. 2003. Request redirection algorithms for distributed Web systems. IEEE Trans. Paral. Distrib. Syst. 14, 5, 355--368.]] Google Scholar
Digital Library
- Cardellini, V., Colajanni, M., and Yu, P. S. 2000. Geographic load balancing for scalable distributed Web systems. In Proceedings of the 8th International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS'00). San Francisco, CA.]] Google Scholar
Digital Library
- Castro, M., Dwyer, M., and Rumsewicz, M. 1999. Load balancing and control for distributed World Wide Web servers. In Proceedings of the International Conference on Control Applications (CCA'99). Kohala Coast, HI.]]Google Scholar
- Cecchet, E., Chanda, A., Elnikety, S., Marguerite, J., and Zwaenepoel, W. 2003. Performance comparison of middleware architectures for generating dynamic Web content. In Proceedings of the 4th Middleware Conference. Rio de Janeiro, Brazil.]] Google Scholar
Digital Library
- Challenger, J., Dantzig, P., Iyengar, A., Squillante, M., and Zhang, L. 2004. Efficiently serving dynamic data at highly accessed Web sites. IEEE/ACM Trans. on Networ. 12, 2, 233--246.]] Google Scholar
Digital Library
- Chen, H. and Mohapatra, P. 2002. Session-based overload control in QoS-aware Web server. In Proceedings of the 21th IEEE International Conference on Computer Communications (INFOCOM'02).]]Google Scholar
- Chen, H. and Mohapatra, P. 2003. Overload control in QoS-aware Web servers. Comput. Netw. 42, 1, 119--133.]] Google Scholar
Digital Library
- Chen, X. and Heidemann, J. 2005. Flash crowd mitigation via an adaptive admission control based on application-level observations. IEEE Trans. Inter. Tech. 5, 3, 532--569.]] Google Scholar
Digital Library
- Cherkasova, L. and Phaal, P. 1999. Session based admission control: A mechanism for improving performance of commercial Web sites. In Proceedings of the 7th International Workshop on Quality of Service (IWQoS'99). London, UK, 226--235.]]Google Scholar
- Cherkasova, L. and Phaal, P. 2002. Session-based admission control: A mechanism for peak load management of commercial Web sites. IEEE Trans. Comput. 51, 6, 669--685.]] Google Scholar
Digital Library
- Choi, B., Park, J., and Zhang, Z. 2003. Adaptive random sampling for traffic load measurement. In Proceedings of the 16th IEEE International Conference on Communications (ICC'03). Anchorage, AL.]]Google Scholar
- Colajanni, M. and Marchetti, M. 2006. A parallel architecture for stateful intrusion detection in high traffic networks. In Proceedings of the IEEE/IST Workshop on Monitoring, Attack Detection and Mitigation (MonAM'06). Tuebingen, Germany.]]Google Scholar
- Crovella, M. E., Taqqu, M. S., and Bestavros, A. 1998. Heavy-tailed probability distributions in the World Wide Web. In A Practical Guide To Heavy Tails. Chapman and Hall, 3--26.]] Google Scholar
Digital Library
- Dahlin, M. 2000. Interpreting stale load information. IEEE Trans. Paral. Distrib. Syst. 11, 10, 1033--1047.]] Google Scholar
Digital Library
- Dinda, P. and O'Hallaron, D. 2000. Host load prediction using linear models. Cluster Comput. 3, 4, 265--280.]] Google Scholar
Digital Library
- Dodge, R. C., Menascé, D. A., and Barbará, D. 2001. Testing e-commerce site scalability with TPC-W. In Proceedings of the 27th International Computer Measurement Group Conference. Orlando, FL.]]Google Scholar
- Duffield, N. G. and Lo Presti, F. 2000. Multicast inference of packet delay variance at interior network links. In Proceedings of the 19th IEEE International Conference on Computer Communications (INFOCOM'00). Tel Aviv, Israel.]]Google Scholar
- Elnikety, S., Nahum, E., Tracey, J., and Zwaenepoel, W. 2004. A method for transparent admission control and request scheduling in e-commerce Web sites. In Proceedings of the 13th International World Wide Web Conference. New York, NY.]] Google Scholar
Digital Library
- Eubank, R. L. and Eubank, E. 1999. Non parametric regression and spline smoothing. CRC Press.]]Google Scholar
- Ferrari, D. and Zhou, S. 1987. An empirical investigation of load indices for load balancing applications. In Proceedings of the 12th IFIP WG 7.3 International Symposium on Computer Performance Modeling, Measurement and Evaluation (PERFORMANCE'87). Brussels, Belgium.]] Google Scholar
Digital Library
- Fishman, G. and Adan, I. 2006. How heavy-tailed distributions affect simulation-generated time averages. ACM Trans. Model. Comput. Simul. 16, 2, 152--173.]] Google Scholar
Digital Library
- Floyd, S. and Paxson, V. 2001. Difficulties in simulating the Internet. IEEE/ACM Trans. Networ. 9, 3, 392--403.]] Google Scholar
Digital Library
- Forsythe, G. E., Malcolm, M. A., and Moler, C. B. 1977. Computer Methods for Mathematical Computations. Prentice-Hall.]] Google Scholar
Digital Library
- Ganek, A. G. and Corbi, T. 2003. The dawning of the autonomic computing era. IBM Syst. J. 42, 1, 5--18.]] Google Scholar
Digital Library
- Gautama, H. and van Gemund, A. 2006. Low-cost static performance prediction of parallel stochastic task compositions. IEEE Trans. Paral. Distrib. Syst. 17, 1, 78--91.]] Google Scholar
Digital Library
- Iyer, R. 2001. Exploring the cache design space for Web servers. In Proceedings of the 15th International Parallel and Distributed Processing Symposium (PDPS'01). San Francisco, CA.]] Google Scholar
Digital Library
- Jung, J., Krishnamurthy, B., and Rabinovich, M. 2002. Flash crowds and denial of service attacks: Characterization and implications for CDNs and Web sites. In Proceedings of the 11th International World Wide Web Conference (WWW'02). Honolulu, HW.]] Google Scholar
Digital Library
- Kamra, A., Misra, V., and Nahum, E. M. 2004. Yaksha: A self-tuning controller for managing the performance of 3-tiered sites. In Proceedings of the 12th International Workshop on Quality of Service (IWQOS'04). Montreal, Canada.]]Google Scholar
- Karbhari, P., Rabinovich, M., Xiao, Z., and Douglis, F. 2002. ACDN: A content delivery network for applications. In Proceedings of the 21st ACM International Conference on Management of Data (SIGMOD'02). Madison, WI.]] Google Scholar
Digital Library
- Kelly, T. 2005. Detecting performance anomalies in global applications. In Proceedings of the USENIX Workshop on Real, Large Distributed Systems (WORLDS'05). San Francisco, CA.]] Google Scholar
Digital Library
- Kendall, M. and Ord, J. 1990. Time Series. Oxford University Press.]]Google Scholar
- Kephart, J. O. and Chess, D. M. 2003. The vision of Autonomic Computing. IEEE Comput. 36, 1, 41--50.]] Google Scholar
Digital Library
- Lilja, D. J. 2000. Measuring Computer Performance. A Practitioner's Guide. Cambridge University Press.]] Google Scholar
Digital Library
- Lingyun, Y., Foster, I., and Schopf, J. M. 2003. Homeostatic and tendency-based CPU load predictions. In Proceedings of the 8th International Parallel and Distributed Processing Symposium (IPDPS'03). Nice, France.]] Google Scholar
Digital Library
- Lippmann, R., Haines, J. W., Fried, D., and Korba, J. Das, K. 2000. Analysis and results of the 1999 DARPA off-line intrusion detection evaluation. In Proceedings of the 3rd International Workshop on Recent Advances in Intrusion Detection (RAID'00). London, UK.]] Google Scholar
Digital Library
- Luo, S. and Marin, G. 2005. Realistic Internet traffic simulation through mixture modeling and a case study. In Proceedings of the 37th IEEE Winter Simulation Conference (WSC'05). Orlando, FL.]] Google Scholar
Digital Library
- Menascé, D. and Kephart, J. 2007. Autonomic computing. IEEE Intern. Comput. 11, 1, 18--21.]] Google Scholar
Digital Library
- Mitzenmacher, M. 2000. How useful is old information. IEEE Trans. Paral. Distrib. Syst. 11, 1, 6--20.]] Google Scholar
Digital Library
- MySQL 2005. MySQL Database server. http://www.mysql.com/.]]Google Scholar
- Pai, V. S., Aron, M., Banga, G., Svendsen, M., Druschel, P., Zwaenepoel, W., and Nahum, E. M. 1998. Locality-aware request distribution in cluster-based network servers. In Proceedings of the 8th ACM Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'98). San Jose, CA.]] Google Scholar
Digital Library
- Pandey, R. and Barnes, J. F. Olsson, R. 1998. Supporting quality of service in HTTP servers. In Proceedings of the 17th ACM Symposium on Principles of Distributed Computing (PODC'98). Puerto Vallarta, Mexico.]] Google Scholar
Digital Library
- Pierre, G. and Van Steen, M. 2001. Globule: A platform for self replicating Web documents. In Proceedings of the 6th Conference on Protocols for Multimedia systems (PROMS'01). Enschede, The Netherlands.]] Google Scholar
Digital Library
- Poirier, D. J. 1973. Piecewise regression using cubic spline. J. Amer. Statist. Ass. 68, 343, 515--524.]]Google Scholar
- Pradhan, P., Tewari, R., Sahu, S., Chandra, A., and Shenoy, P. 2002. An observation-based approach towards self-managing Web servers. In Proceedings of the 10th International Workshop on Quality of Service (IWQOS'02). Monterey, CA.]]Google Scholar
- Rabinovich, M., Triukose, S., Wen, Z., and Wang, L. 2006. DipZoom: The Internet measurement marketplace. In Proceedings of the 9th IEEE Global Internet Symposium. Barcelona, Spain.]]Google Scholar
- Rabinovich, M., Zhen, X., and Aggarwal, A. 2003. Computing on the edge: A platform for replicating Internet applications. In Proceedings of the 8th International Workshop of Web Content Caching and Distribution (WCW'03). Hawthorne, NY.]]Google Scholar
- Ramanathan, P. 1999. Overload management in real-time control applications using (m,k)-firm guarantee. Perform. Eval. Rev. 10, 6, 549--559.]] Google Scholar
Digital Library
- Sang, A. and Li, S. 2000. A predictability analysis of network traffic. In Proceedings of the 19th IEEE International Conference on Computer Communications (INFOCOM'00). Tel Aviv, Israel.]]Google Scholar
- Satyanarayanan, M., Narayanan, D., Tilton, J., Flinn, J., and Walker, K. 1997. Agile application-aware adaptation for mobility. In Proceedings of the 16th ACM International Symposium on Operating Systems Principles (SOSP'97). Saint-Malo, France.]] Google Scholar
Digital Library
- Sivasubramanian, S., Pierre, G., and Van Steen, M. 2004. Replication for Web hosting systems. ACM Comput. Surv. 36, 3, 291--334.]] Google Scholar
Digital Library
- SpecWEB05 2005. The SPECWeb05 benchmark. http://www.spec.org/osg/web2005/.]]Google Scholar
- SpecWEB96 1996. The SPECWeb96 benchmark. http://www.spec.org/osg/web96/.]]Google Scholar
- Stankovic, J. A. 1984. Simulations of three adaptive, decentralized controlled, job scheduling algorithms. Comput. Netw. 8, 3, 199--217.]] Google Scholar
Digital Library
- Tomcat 2005. The Tomcat Servlet Engine. http://jakarta.apache.org/tomcat/.]]Google Scholar
- TPC-W 2004. TPC-W transactional Web e-commerce benchmark. http://www.tpc.org/tpcw/.]]Google Scholar
- Tran, N. and Reed, D. 2004. Automatic ARIMA time series modeling for adaptive I/O prefetching. IEEE Trans. Paral. Distrib. Syst. 15, 4, 362--377.]] Google Scholar
Digital Library
- Utts, J. M. 2004. Seeing Through Statistics. Thomson Brooks/Cole.]]Google Scholar
- Wildstrom, J., Stone, P., Witchel, E., Mooney, R., and Dahlin, M. 2005. Towards self-configuring hardware for distributed computer systems. In Proceedings of the 2nd International Conference on Autonomic Computing (ICAC'05). Seattle, WA.]] Google Scholar
Digital Library
- Wolber, G. and Alfy, I. 1999. Monotonic cubic spline interpolation. In Proceedings of the International Conference on Computer Graphics. Canmore, CA, 188.]] Google Scholar
Digital Library
- Wolski, R., Spring, N. T., and Hayes, J. 1999. The Network Weather Service: A distributed resource performance forecasting service for metacomputing. Future Generation Comput. Syst. 15, 5, 757--768.]] Google Scholar
Digital Library
Index Terms
Models and framework for supporting runtime decisions in Web-based systems
Recommendations
The design of AHPEC in web-based decision support system for making decision
AIC'10/BEBI'10: Proceedings of the 10th WSEAS international conference on applied informatics and communications, and 3rd WSEAS international conference on Biomedical electronics and biomedical informaticsThis paper presents the architecture of the WDSS using AHPEC for making decision. The WDSS-AHPEC involves client, user interface, server, web server, database, and DSS model. The DSS model in this study is using AHPEC method. In applying AHPEC, it ...
Progress in Web-based decision support technologies
World Wide Web technologies have transformed the design, development, implementation and deployment of decision support systems. This article reviews and summarizes recent technology developments, current usage of Web-based DSS, and trends in the ...
A Performance Study of Robust Load Sharing Strategies for Distributed Heterogeneous Web Server Systems
Replication of information across multiple servers is becoming a common approach to support popular Web sites. A distributed architecture with some mechanisms to assign client requests to Web servers is more scalable than any centralized or mirrored ...






Comments