skip to main content
10.1145/1989323.1989413acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Graph cube: on warehousing and OLAP multidimensional networks

Authors Info & Claims
Published:12 June 2011Publication History

ABSTRACT

We consider extending decision support facilities toward large sophisticated networks, upon which multidimensional attributes are associated with network entities, thereby forming the so-called multidimensional networks. Data warehouses and OLAP (Online Analytical Processing) technology have proven to be effective tools for decision support on relational data. However, they are not well-equipped to handle the new yet important multidimensional networks. In this paper, we introduce Graph Cube, a new data warehousing model that supports OLAP queries effectively on large multidimensional networks. By taking account of both attribute aggregation and structure summarization of the networks, Graph Cube goes beyond the traditional data cube model involved solely with numeric value based group-by's, thus resulting in a more insightful and structure-enriched aggregate network within every possible multidimensional space. Besides traditional cuboid queries, a new class of OLAP queries, crossboid, is introduced that is uniquely useful in multidimensional networks and has not been studied before. We implement Graph Cube by combining special characteristics of multidimensional networks with the existing well-studied data cube techniques. We perform extensive experimental studies on a series of real world data sets and Graph Cube is shown to be a powerful and efficient tool for decision support on large multidimensional networks.

References

  1. C. C. Aggarwal and H. Wang. Managing and Mining Graph Data. Springer, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. T. Baird. The Truth About Facebook. Emereo Pty Limited, 2009.Google ScholarGoogle Scholar
  3. D. Burdick, A. Doan, R. Ramakrishnan, and S. Vaithyanathan. OLAP over imprecise data with domain constraints. In VLDB, pages 39--50, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Chaudhuri and U. Dayal. An overview of data warehousing and OLAP technology. SIGMOD Rec., 26(1):65--74, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. Chen, X. Yan, F. Zhu, J. Han, and P. S. Yu. Graph OLAP: Towards online analytical processing on graphs. In ICDM, pages 103--112, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Gibson, R. Kumar, and A. Tomkins. Discovering large dense subgraphs in massive graphs. In VLDB, pages 721--732, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Gray, S. Chaudhuri, A. Bosworth, A. Layman, D. Reichart, M. Venkatrao, F. Pellow, and H. Pirahesh. Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals. Data Min. Knowl. Discov., 1(1):29--53, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. P. J. Haas, J. F. Naughton, S. Seshadri, and L. Stokes. Sampling-based estimation of the number of distinct values of an attribute. In VLDB, pages 311--322, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Han, Y. Chen, G. Dong, J. Pei, B. W. Wah, J. Wang, and Y. D. Cai. Stream cube: An architecture for multi-dimensional analysis of data streams. Distrib. Parallel Databases, 18(2):173--197, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. V. Harinarayan, A. Rajaraman, and J. D. Ullman. Implementing data cubes efficiently. In SIGMOD, pages 205--216, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. H. Karloff and M. Mihail. On the complexity of the view-selection problem. In PODS, pages 167--173, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. LeFevre and E. Terzi. GraSS: Graph structure summarization. In SDM, pages 454--465, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  13. J. Leskovec and C. Faloutsos. Sampling from large graphs. In KDD, pages 631--636, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. X. Li, J. Han, and H. Gonzalez. High-dimensional OLAP: a minimal cubing approach. In VLDB, pages 528--539, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. X. Lin, B. Ding, J. Han, F. Zhu, and B. Zhao. Text cube: Computing IR measures for multidimensional text database analysis. In ICDM, pages 905--910, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. E. Lo, B. Kao, W.-S. Ho, S. D. Lee, C. K. Chui, and D. W. Cheung. OLAP on sequence data. In SIGMOD, pages 649--660, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. K. Morfonios, S. Konakas, Y. Ioannidis, and N. Kotsis. ROLAP implementations of the data cube. ACM Comput. Surv., 39(4):12, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Navlakha, R. Rastogi, and N. Shrivastava. Graph summarization with bounded error. In SIGMOD, pages 419--432, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. S. Navlakha, M. C. Schatz, and C. Kingsford. Revealing biological modules via graph summarization. Journal of Computational Biology, 16:253--264, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  20. M. Newman. Networks: An Introduction. Oxford University Press, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Y. Qi, K. S. Candan, J. Tatemura, S. Chen, and F. Liao. Supporting OLAP operations over imperfectly integrated taxonomies. In SIGMOD, pages 875--888, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. H. Thomases. Twitter Marketing: An Hour a Day. Wiley, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Y. Tian, R. A. Hankins, and J. Patel. Efficient aggregation for graph summarization. In SIGMOD, pages 567--580, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. M. Wattenberg. Visual exploration of multivariate graphs. In CHI, pages 811--819, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. N. Zhang, Y. Tian, and J. M. Patel. Discovery-driven graph summarization. In ICDE, pages 880--891, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  26. Y. Zhou, H. Cheng, and J. X. Yu. Graph clustering based on structural/attribute similarities. PVLDB, 2(1):718--729, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Graph cube: on warehousing and OLAP multidimensional networks

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
          June 2011
          1364 pages
          ISBN:9781450306614
          DOI:10.1145/1989323

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 12 June 2011

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate785of4,003submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader