ABSTRACT
As information continues to grow at an explosive rate, more and more heterogeneous network data sources are coming into being. While OLAP (On-Line Analytical Processing) techniques have been proven effective for analyzing and mining structured data, unfortunately, to our best knowledge, there are no OLAP tools available that are able to analyze multi-dimensional heterogeneous networks from different perspectives and with multiple granularities. Therefore, we have developed a novel HMGraph OLAP (Heterogeneous and Multi-dimensional Graph OLAP) framework for the purpose of providing more dimensions and operations to mine multi-dimensional heterogeneous information network. After information dimensions and topological dimensions, we have been the first to propose entity dimensions, which represent an important dimension for heterogeneous network analysis. On the basis of this notion, we designed HMGraph OLAP operations named (Rotate and Stretch for entity dimensions, which are able to mine relationships between different entities. We then proposed the HMGraph Cube, which is an efficient data warehousing model for HMGraph OLAP. In addition, through comparison with common strategies, we have shown that the optimizations we have proposed deliver better performance. Finally, we have implemented a HMGraph OLAP prototype, LiterMiner, which has proven effective for the analysis of multi-dimensional heterogeneous networks.
- K. S. Beyer and R. Ramakrishnan. Bottom-up computation of sparse and iceberg cubes. In SIGMOD, pages 359--370, 199. Google ScholarDigital Library
- D. Burdick, A. Doan, R. Ramakrishnan, and S. Vaithyanathan. Olap over imprecise data with domain constraints. In VLDB, pages 39--50, 2007. Google ScholarDigital Library
- S. Chaudhuri and U. Dayal. An overview of data warehousing and olap technology. 26(1):65--74, March 1997. Google ScholarDigital Library
- C. Chen, X. Yan, Z. Feida, J. Han, and P. S.Yu. Graph olap: Towards online analytical processing on graphs. In ICDM'08, pages 103--112, Dec 2008. Google ScholarDigital Library
- C. Chen, X. Yan, Z. Feida, J. Han, and P. S. Yu. Grapholap: a multi-dimentional framework for graph data analysis. Knowledge and Information System, 21(1):41--63, 2009. Google ScholarDigital Library
- V. Harinarayan, A. Rajaraman, and J. D. Ullman. Implementing data cubes efficiently. In SIGMOD'96, volume 25, pages 205--216, June 1996. Google ScholarDigital Library
- C. Li, P. S.Yu, L. Zhao, Y. Xie, and W. Lin. Infonetolaper : Integrating infonetwarehouse and infonetcube with infonetolap. In VLDB'11, volume 4, pages 1422--1425, 2011.Google ScholarDigital Library
- C. Li, L. Zhao, C. Tang, Y. Chen, J. Li, X. Zhao, and X. Liu. Modeling, design and implementation of graph olaping. Journal of Software, 22 (2):258--268, 2011.Google ScholarCross Ref
- X. Li, J. Han, and H. Gonzalez. High-dimensional olap: a minimal cubing approach. In VLDB, pages 528--539, 2004. Google ScholarDigital Library
- X. Li, J. Han, and H. Gonzalez. High-dimensional olap: A minimal cubing approach. In VLDB'04, volume 30, pages 528--539, 2004. Google ScholarDigital Library
- K. Morfonios, S. Konakas, Y. Ioannidis, and N. Kotsis. Rolap implementations of the data cube. In ACM Computing Surveys, volume 4, 2007. Google ScholarDigital Library
- Q. Qu, F. Zhu, X. Yan, J. Han, P. S. Yu, and H. Li. Efficient topological olap on information networks. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, pages 389--403, 2011. Google ScholarDigital Library
- Y. Sun, J. Han, X. Yan, P. S. Yu, and T. Wu. Pathsim: Meta path-based top-k similarity search in heterogeneous information. In VLDB'11, 2011.Google ScholarDigital Library
- Y. Sun, B. Norick, J. Han, X. Yan, P. S. Yu, and X. Yu. Integrating meta-path selection with user-guided object clustering in heterogeneous information networks. In KDD'12, 2012. Google ScholarDigital Library
- Y. Sun, T. Wu, Z. Yin, H. Cheng, J. Han, and X. Yin. Bibnetminer: Mining bibliographic information networks. In SIGMOD'08, pages 1341--1344, June 2008. Google ScholarDigital Library
- W. wei. Complex network virtualization and link olap. 2007.Google Scholar
- P. Zhao, X. Li, D. Xin, and J. Han. Graph cube: On warehousing and olap multidimensional networks. In SIGMOD'11, pages 12--16, 2011. Google ScholarDigital Library
Index Terms
- HMGraph OLAP: a novel framework for multi-dimensional heterogeneous network analysis
Recommendations
Graph cube: on warehousing and OLAP multidimensional networks
SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of dataWe consider extending decision support facilities toward large sophisticated networks, upon which multidimensional attributes are associated with network entities, thereby forming the so-called multidimensional networks. Data warehouses and OLAP (Online ...
Graph OLAP: a multi-dimensional framework for graph data analysis
Databases and data warehouse systems have been evolving from handling normalized spreadsheets stored in relational databases, to managing and analyzing diverse application-oriented data with complex interconnecting structures. Responding to this emerging ...
Graph OLAP: a multi-dimensional framework for graph data analysis
Databases and data warehouse systems have been evolving from handling normalized spreadsheets stored in relational databases, to managing and analyzing diverse application-oriented data with complex interconnecting structures. Responding to this ...
Comments