skip to main content
article

Graph evolution: Densification and shrinking diameters

Published:01 March 2007Publication History
Skip Abstract Section

Abstract

How do real graphs evolve over time? What are normal growth patterns in social, technological, and information networks? Many studies have discovered patterns in static graphs, identifying properties in a single snapshot of a large network or in a very small number of snapshots; these include heavy tails for in- and out-degree distributions, communities, small-world phenomena, and others. However, given the lack of information about network evolution over long periods, it has been hard to convert these findings into statements about trends over time.

Here we study a wide range of real graphs, and we observe some surprising phenomena. First, most of these graphs densify over time with the number of edges growing superlinearly in the number of nodes. Second, the average distance between nodes often shrinks over time in contrast to the conventional wisdom that such distance parameters should increase slowly as a function of the number of nodes (like O(log n) or O(log(log n)).

Existing graph generation models do not exhibit these types of behavior even at a qualitative level. We provide a new graph generator, based on a forest fire spreading process that has a simple, intuitive justification, requires very few parameters (like the flammability of nodes), and produces graphs exhibiting the full range of properties observed both in prior work and in the present study.

We also notice that the forest fire model exhibits a sharp transition between sparse graphs and graphs that are densifying. Graphs with decreasing distance between the nodes are generated around this transition point.

Last, we analyze the connection between the temporal evolution of the degree distribution and densification of a graph. We find that the two are fundamentally related. We also observe that real networks exhibit this type of relation between densification and the degree distribution.

References

  1. Abello, J. 2004. Hierarchical graph maps. Comput. Graph. 28, 3, 345--359.]]Google ScholarGoogle ScholarCross RefCross Ref
  2. Abello, J., Buchsbaum, A. L., and Westbrook, J. 1998. A functional approach to external graph algorithms. In Proceedings of the 6th Annual European Symposium on Algorithms. Springer-Verlag, 332--343.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Abello, J., Pardalos, P. M., and Resende, M. G. C. 2002. Handbook of Massive Data Sets. Kluwer Academic Publishing.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Adamic, L. A. 2000. Zipf, power-law, pareto---a ranking tutorial. http://www.hpl.hp.com/research/idl/papers/ranking.]]Google ScholarGoogle Scholar
  5. Albert, R. and Barabasi, A.-L. 1999. Emergence of scaling in random networks. Science. 509--512.]]Google ScholarGoogle Scholar
  6. Albert, R., Jeong, H., and Barabasi, A.-L. 1999. Diameter of the World-Wide Web. Nature 401, 130--131.]]Google ScholarGoogle ScholarCross RefCross Ref
  7. Alderson, D., Doyle, J. C., Li, L., and Willinger, W. 2005. Towards a theory of scale-free graphs: Definition, properties, and implications. Internet Math. 2, 4.]]Google ScholarGoogle Scholar
  8. Bi, Z., Faloutsos, C., and Korn, F. 2001. The dgx distribution for mining massive, skewed data. In Proceedings of Knowledge Discovery and Data Mining (KDD). 17--26.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Bollobas, B. and Riordan, O. 2004. The diameter of a scale-free random graph. Combinatorica 24, 1, 5--34.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., and Wiener, J. 2000a. Graph structure in the Web. In Proceedings of the 9th International World Wide Web Conference on Computer Networks: The International Journal of Computer and Telecommunications Netowrking. North-Holland Publishing Co., Amsterdam, The Netherlands, 309--320.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., and Wiener, J. 2000b. Graph structure in the web: experiments and models. In Proceedings of World Wide Web Conference.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Chakrabarti, D. and Faloutsos, C. 2006. Graph mining: Laws, generators, and algorithms. ACM Comput. Sur. 38, 1.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Chakrabarti, D., Zhan, Y., and Faloutsos, C. 2004. R-mat: A recursive model for graph mining. In Proceedings of the SIAM Conference on Data Mining (SDM).]]Google ScholarGoogle Scholar
  14. Chung, F. and Lu, L. 2002. The average distances in random graphs with given expected degrees. Proceedings of the National Academy of Sciences 99, 25, 15879--15882.]]Google ScholarGoogle ScholarCross RefCross Ref
  15. Cooper, C. and Frieze, A. 2003. A general model of web graphs. Random Struct. Algo. 22, 3, 311--335.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Dorogovtsev, S. and Mendes, J. 2001a. Effect of the accelerated growth of communications networks on their structure. Phys. Rev. E 63, 025101.]]Google ScholarGoogle ScholarCross RefCross Ref
  17. Dorogovtsev, S. and Mendes, J. 2001b. Language as an evolving word web. Proceedings of the Royal Society of London B 268, 2603.]]Google ScholarGoogle ScholarCross RefCross Ref
  18. Dorogovtsev, S. and Mendes, J. 2002. Accelerated growth of networks. In Handbook of Graphs and Networks: From the Genome to the Internet, S. Bornholdt and H.G. Schuster. Eds. Wiley-VCH, Berlin, Germany.]]Google ScholarGoogle Scholar
  19. Dorogovtsev, S. and Mendes, J. 2003. Evolution of Networks: From Biological Nets to the Internet and WWW. Oxford University Press, Oxford, UK.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Faloutsos, M., Faloutsos, P., and Faloutsos, C. 1999. On power-law relationships of the Internet topology. In SIGCOMM. 251--262.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Gehrke, J., Ginsparg, P., and Kleinberg, J. M. 2003. Overview of the 2003 kdd cup. SIGKDD Explora. 5, 2, 149--151.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Hall, B. H., Jaffe, A. B., and Trajtenberg, M. 2001. The nber patent citation data file: Lessons, insights and methodological tools. NBER Working Papers 8498, National Bureau of Economic Research, Inc. (Oct.)]]Google ScholarGoogle Scholar
  23. Huberman, B. A. and Adamic, L. A. 1999. Growth dynamics of the world-wide web. Nature 399, 131.]]Google ScholarGoogle ScholarCross RefCross Ref
  24. Katz, J. S. 1999. The self-similar science system. Resear. Policy 28, 501--517.]]Google ScholarGoogle ScholarCross RefCross Ref
  25. Katz, J. S. 2005. Scale independent bibliometric indicators. Measure.: Interdisciplin. Resea. Perspect. 3, 24--28.]]Google ScholarGoogle ScholarCross RefCross Ref
  26. Kleinberg, J. M. 2002. Small-world phenomena and the dynamics of information. In Advances in Neural Information Processing Systems 14.]]Google ScholarGoogle Scholar
  27. Kleinberg, J. M., Kumar, R., Raghavan, P., Rajagopalan, S., and Tomkins, A. 1999. The Web as a graph: Measurements, models, and methods. In Proceedings of the International Conference on Combinatorics and Computing. 1--17.]]Google ScholarGoogle Scholar
  28. Kossinets, G. and Watts, D. J. 2006. Empirical analysis of an evolving social network. Science 311, 88--90.]]Google ScholarGoogle ScholarCross RefCross Ref
  29. Krapivsky, P. L. and Redner, S. 2001. Organization of growing random networks. Phys. Rev. E 63, 066123.]]Google ScholarGoogle ScholarCross RefCross Ref
  30. Krapivsky, P. L. and Redner, S. 2005. Network growth by copying. Phys. Rev. E 71, 036118.]]Google ScholarGoogle ScholarCross RefCross Ref
  31. Kumar, R., Raghavan, P., Rajagopalan, S., Sivakumar, D., Tomkins, A., and Upfal, E. 2000. Stochastic models for the web graph. In Proceedings of the 41st IEEE Symposium on Foundations of Computer Science.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Kumar, R., Raghavan, P., Rajagopalan, S., and Tomkins, A. 1999. Trawling the Web for emerging cyber-communities. In Proceedings of the 8th International World Wide Web Conference.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Leskovec, J., Adamic, L., and Huberman, B. 2006. The dynamics of viral marketing. ACM Conference on Electronic Commerce.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Leskovec, J., Chakrabarti, D., Kleinberg, J. M., and Faloutsos, C. 2005. Realistic, mathematically tractable graph generation and evolution, using kronecker multiplication. In Proceedings of the European International Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD'05). 133--145.]]Google ScholarGoogle Scholar
  35. Leskovec, J. and Faloutsos, C. 2006. Sampling from large graphs. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06). ACM Press, New York, NY, USA, 631--636.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Leskovec, J., Kleinberg, J., and Faloutsos, C. 2005. Graphs over time: densification laws, shrinking diameters and possible explanations. In Proceedings of the International Conference on Knowledge Discovery and Data Mining (KDD'05). Chicago, IL.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Menczer, F. 2002. Growing and navigating the small world web by local content. Proceedings of the National Academy of Sciences 99, 22, 14014--14019.]]Google ScholarGoogle ScholarCross RefCross Ref
  38. Milgram, S. 1967. The small-world problem. Psycholo. Today 2, 60--67.]]Google ScholarGoogle Scholar
  39. Mitzenmacher, M. 2004. A brief history of generative models for power law and lognormal distributions. Internet Math. 1, 2, 226--251.]]Google ScholarGoogle ScholarCross RefCross Ref
  40. Newman, M. E. J. 2003. The structure and function of complex networks. SIAM Review 45, 167--256.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Newman, M. E. J. 2005. Power laws, pareto distributions and zipf's law. Contemp. Phys. 46, 323--351.]]Google ScholarGoogle ScholarCross RefCross Ref
  42. Ntoulas, A., Cho, J., and Olston, C. 2004. What's new on the web? the evolution of the web from a search engine perspective. In World Wide Web Conference. New York, 1--12.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Oregon. 1997. University of Oregon route views project. online data and reports. http://www.routeviews.org.]]Google ScholarGoogle Scholar
  44. Palmer, C. R., Gibbons, P. B., and Faloutsos, C. 2002. Anf: A fast and scalable tool for data mining in massive graphs. In SIGKDD. Edmonton, AB, Canada.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Redner, S. 2004. Citation statistics from more than a century of physical review. Tech. rep. physics/0407137, arXiv.]]Google ScholarGoogle Scholar
  46. Schroeder, M. 1991. Fractals, Chaos, Power Laws: Minutes from an Infinite Paradise. W.H. Freeman and Company, New York, NY.]]Google ScholarGoogle Scholar
  47. Tauro, S. L., Palmer, C., Siganos, G., and Faloutsos, M. 2001. A simple conceptual model for the internet topology. In Global Internet. San Antonio, TX.]]Google ScholarGoogle Scholar
  48. Vazquez, A. 2001. Disordered networks generated by recursive searches. Europhy. Lett. 54, 4, 430--435.]]Google ScholarGoogle Scholar
  49. Vazquez, A. 2003. Growing networks with local rules: Preferential attachment, clustering hierarchy and degree correlations. Physical Review E 67, 056104.]]Google ScholarGoogle ScholarCross RefCross Ref
  50. Watts, D. J., Dodds, P. S., and Newman, M. E. J. 1998. Collective dynamics of ‘small-world’ networks. Nature 393, 440--442.]]Google ScholarGoogle ScholarCross RefCross Ref
  51. Watts, D. J., Dodds, P. S., and Newman, M. E. J. 2002. Identity and search in social networks. Science 296, 1302--1305.]]Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Graph evolution: Densification and shrinking diameters

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Knowledge Discovery from Data
      ACM Transactions on Knowledge Discovery from Data  Volume 1, Issue 1
      March 2007
      161 pages
      ISSN:1556-4681
      EISSN:1556-472X
      DOI:10.1145/1217299
      Issue’s Table of Contents

      Copyright © 2007 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 March 2007
      Published in tkdd Volume 1, Issue 1

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader