DOI: 10.1145/2783258.2789987
tutorial

Dense Subgraph Discovery: KDD 2015 tutorial

Published: 10 August 2015

ABSTRACT

Finding dense subgraphs is a fundamental graph-theoretic problem that lies at the heart of numerous graph-mining applications, ranging from finding communities in social networks, to detecting regulatory motifs in DNA, to identifying real-time stories in news. The problem has been studied extensively in theoretical computer science and, owing to its relevance to real-world applications, has recently attracted considerable attention in the data-mining community.

In this tutorial we aim to provide a comprehensive overview of (i) major algorithmic techniques for finding dense subgraphs in large graphs and (ii) graph-mining applications that rely on dense subgraph extraction. We will present fundamental concepts and algorithms that date back to the 1980s, as well as the latest advances in the area, from both theoretical and practical points of view. We will motivate the problem of finding dense subgraphs by discussing how it can be used in real-world applications. We will discuss different density definitions and the complexity of the corresponding optimization problems. We will also present efficient algorithms for different density measures and under different computational models. Specifically, we will focus on scalable streaming, distributed, and MapReduce algorithms. Finally, we will discuss problem variants and extensions, and provide pointers to future research directions.
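
To make one of the density definitions concrete, the following is a minimal sketch of greedy peeling for the densest-subgraph problem, assuming the common average-degree density |E(S)|/|S| on an undirected simple graph. It repeatedly removes a minimum-degree node and remembers the densest intermediate subgraph; this style of greedy algorithm is known to return a subgraph whose density is at least half the optimum. The function name `greedy_densest_subgraph` and the edge-list input format are illustrative assumptions, not something taken from the tutorial, and the simple minimum search here is quadratic rather than tuned for large graphs.

```python
from collections import defaultdict


def greedy_densest_subgraph(edges):
    """Greedy peeling sketch (hypothetical helper, not from the tutorial).

    edges: iterable of (u, v) pairs of an undirected simple graph.
    Returns (best_density, best_nodes), where density = |E(S)| / |S|.
    """
    # Build adjacency sets; self-loops are ignored, duplicate edges collapse.
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:
            adj[u].add(v)
            adj[v].add(u)

    nodes = set(adj)
    num_edges = sum(len(nbrs) for nbrs in adj.values()) // 2
    best_density = num_edges / len(nodes) if nodes else 0.0
    best_nodes = set(nodes)

    while nodes:
        # Peel a node of minimum degree in the current subgraph.
        u = min(nodes, key=lambda x: len(adj[x]))
        num_edges -= len(adj[u])
        for v in adj[u]:
            adj[v].discard(u)
        del adj[u]
        nodes.remove(u)

        # Track the densest subgraph seen along the peeling order.
        if nodes:
            density = num_edges / len(nodes)
            if density > best_density:
                best_density = density
                best_nodes = set(nodes)

    return best_density, best_nodes


# Usage: a 4-clique on {0, 1, 2, 3} with one pendant node 4 attached to node 3.
example_edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3), (3, 4)]
density, subgraph = greedy_densest_subgraph(example_edges)
print(density, sorted(subgraph))  # prints: 1.5 [0, 1, 2, 3]
```

In this small example the whole graph has density 7/5 = 1.4, while peeling the pendant node leaves the 4-clique with density 6/4 = 1.5, which is what the sketch reports.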


Supplemental Material

p23131.m4v (m4v, 2.5 GB)

p23132.m4v (m4v, 2.5 GB)


    • Published in

      KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
      August 2015
      2378 pages
      ISBN:9781450336642
      DOI:10.1145/2783258

      Copyright © 2015 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Acceptance Rates

      KDD '15 Paper Acceptance Rate: 160 of 819 submissions, 20%. Overall Acceptance Rate: 1,133 of 8,635 submissions, 13%.
