Abstract
The overall increase in traffic on the World Wide Web is lengthening user-perceived response times from popular Web sites, especially in conjunction with special events. System platforms that do not replicate information content cannot provide the scalability needed to handle large traffic volumes and to match rapid and dramatic changes in the number of clients. The need to improve the performance of Web-based services has produced a variety of novel content delivery architectures. This article focuses on Web system architectures that consist of multiple server nodes distributed over a local area, with one or more mechanisms to spread client requests among the nodes. After years of continual proposals of new system solutions, routing mechanisms, and policies (the first dates back to 1994, when the NCSA Web site had to handle its first million requests per day), many problems concerning multiple-server architectures for Web sites have been solved. Other issues remain to be addressed, especially at the network application layer, but the main techniques and methodologies for building scalable Web content delivery architectures placed in a single location are now settled. This article classifies and describes the main mechanisms for splitting the traffic load among the server nodes, discussing both the alternative architectures and the load sharing policies. To this end, it focuses on architectures, internal routing mechanisms, and request dispatching algorithms for designing and implementing scalable Web-server systems under the control of one content provider. It also identifies some of the open research issues associated with the use of distributed systems for highly accessed Web sites.