Abstract
Recently the notion of self-similarity has been shown to apply to wide-area and local-area network traffic. In this paper we examine the mechanisms that give rise to the self-similarity of network traffic. We present a hypothesized explanation for the possible self-similarity of traffic by using a particular subset of wide area traffic: traffic due to the World Wide Web (WWW). Using an extensive set of traces of actual user executions of NCSA Mosaic, reflecting over half a million requests for WWW documents, we examine the dependence structure of WWW traffic. While our measurements are not conclusive, we show evidence that WWW traffic exhibits behavior that is consistent with self-similar traffic models. Then we show that the self-similarity in such traffic can be explained based on the underlying distributions of WWW document sizes, the effects of caching and user preference in file transfer, the effect of user "think time", and the superimposition of many such transfers in a local area network. To do this we rely on empirically measured distributions both from our traces and from data independently collected at over thirty WWW sites.
- 1 Martin F. Arlitt and Carey L. Williamson. Web server workload characterization: The search for invariants. In Proceedings of the 1996 SIGMETRICS Conference on Measurement and Modeling of Computer Systems, 1996.]] Google ScholarDigital Library
- 2 Jan Beran. Statistics for Long-Memory Processes. Monographs on Statistics and Applied Probability. Chapman and Hall, New York, NY, 1994.]]Google Scholar
- 3 T. Berners-Lee, L. Masinter, and M.McCahill. Uniform resource locators. RFC 1738, December 1994.]] Google ScholarDigital Library
- 4 Peter J. Brockwell and Richard A. Davis. Time Series: Theory and Methods. Springer Series in Statistics. Springer-Verlag, second edition, 1991.]] Google ScholarDigital Library
- 5 Lara D. Catledge and James E. Pitkow. Characterizing browsing strategies in the World-Wide Web. In Proceedings of the Third WWW Conference, 1994.]] Google ScholarDigital Library
- 6 Netscape Communications Corp. Netscape Navigator software. Available from http://w-~.netscape, com.]]Google Scholar
- 7 Mark E. Crovella and Azer Bestavros. Explaining world wide web traffic self-similarity. Technical Report TR-95-015 (Revised), Boston University Department of Computer Science, October 1995.]] Google ScholarDigital Library
- 8 Carlos R. Cunha, A2er Bostavros, and Mark E. Crovella. Characteristics of WWW client-based traces. Technical Report P,U- CS-95-010, Boston University Computer Science Department, 1995.]] Google ScholarDigital Library
- 9 National Center for Supercomputing Applications. Mosaic software. Available at ftp://ftp.ncsa, uiuc. edu/Mosaic.]]Google Scholar
- 10 Steven Glassman. A Caching Relay for the World Wide Web. In First International Conference on the World-Wide Web, CERN, Geneva (Switzerland)~ May 1994. Elsevier Science.]] Google ScholarDigital Library
- 11 B. M. Hill. A simple general approach to inference about the tail of a distribution. The Annals of Statistics, 3:1163-1174, 1975.]]Google ScholarCross Ref
- 12 Merit Network Inc. NSF Network statistics. Available at ftp :- //nis.nsf.net/statistics/nsfnet/, December 1994.]]Google Scholar
- 13 Gordon Irlam. Unix file size survey- 1993. Available at http ://www. base. com/gordoni/ufs93 .hCml, September 199,1.]]Google Scholar
- 14 W. LeIand, M. Taqqu, W. Willinger, and D. Wilson. On the self-similar nature of Ethernet traffic. In Proceedings of SIG- COMM '93, pages 183-193, September 1993.]] Google ScholarDigital Library
- 15 W. E. Leland and D. V. Wilson. High time-resolution measurement and analysis of LAN traffic: Implications for LAN interconnection. In Proceeedings of IEEE lnfocomm '91, pages 1360-1366, Bal Harbour, FL, 1991.]]Google Scholar
- 16 W.E. Leland, M.S. Taqqu, W. Willinger, and D.V. Wilson. (:)n the self-similar nature of Ethernet traffic (extended version). IEEE/ACM Transactions on Networking, 2:1-15, 1994.]] Google ScholarDigital Library
- 17 Benoit B. Mandelbrot. Long-run linearity, locally Gaussian processes, H-spectra and infinite variances. Intern. Econom. Rev.~ 10:82-113, 1969.]]Google Scholar
- 18 Benoit B. Mandelbrot. The Fractal Geometry of Nature. W. H. Freedman and Co., New York, 1983.]]Google Scholar
- 19 Vern Paxson. Empirically-derived analytic models of wide-axea TCP connections. IEEE/ACM Transactions on Networking, 2(4):316-336, August 1994.]] Google ScholarDigital Library
- 20 Vern Paxson and Sally Floyd. Wide-area traffic: The failure of poisson modeling. In Proceedings o/SIGCOMM '9~, 1994.]] Google ScholarDigital Library
- 21 James E. Pitkow and Margaret M. Recker. A Simple Yet Robust Caching Algorithm Based on Dynamic Access Patterns, In Electronic Prec. of the 2nd WWW Conference, 1994.]]Google Scholar
- 22 Regents of the University of California. www-stat 1.0 software. Available from http://~, ics .uci. edu/WebSoft/~stat/.]]Google Scholar
- 23 Jeff Sedayao. "Mosaic Will Kill My Network!" - Studying Network Traffic Patterns of Mosaic Use. In Electronic Proceedings of the Second World Wide Web Conference '95: Mosaic and the Web, Chicago, illinois, October 1994.]]Google Scholar
- 24 M. S. Taqqu, V. Teverovsky, and W. Willinger. Estimators for long-range dependence: an empirical study, 1995. Preprint.]]Google Scholar
- 25 Murad Taqqu. Personal communication.]]Google Scholar
- 26 Murad S. Taqqu and Joshua B. Levy. Using renewal processes to generate long-range dependence and high variability. In Ernst Eberlein and Murad S. Taqqu, editors, Dependence in Probability and Statistics, pages 73-90. Birkhauser, 1986.]]Google Scholar
- 27 Walter Willinger, Murad S. Taqqu, Will E. Leland, and Daniel V. Wilson. Self-similarity in high-speed packet traf tic: Analysis and modeling of Ethernet traffic measurements. Statistical Science, 10(1):67-85, 1995.]]Google ScholarCross Ref
- 28 Walter Willinger, Murad S. Taqqu, Robert Sherman, and Daniel V. Wilson. Self-similarity through high-variability: Statistical analysis of Ethernet LAN traffic at the source level. In Proceedings of SIGCOMM '95, pages 100-113, Boston, MA, 1995.]] Google ScholarDigital Library
Index Terms
- Self-similarity in World Wide Web traffic: evidence and possible causes
Recommendations
Self-similarity in World Wide Web traffic: evidence and possible causes
SIGMETRICS '96: Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systemsRecently the notion of self-similarity has been shown to apply to wide-area and local-area network traffic. In this paper we examine the mechanisms that give rise to the self-similarity of network traffic. We present a hypothesized explanation for the ...
Comments