skip to main content
research-article

Attributed Collaboration Network Embedding for Academic Relationship Mining

Authors Info & Claims
Published:24 November 2020Publication History
Skip Abstract Section

Abstract

Finding both efficient and effective quantitative representations for scholars in scientific digital libraries has been a focal point of research. The unprecedented amounts of scholarly datasets, combined with contemporary machine learning and big data techniques, have enabled intelligent and automatic profiling of scholars from this vast and ever-increasing pool of scholarly data. Meanwhile, recent advance in network embedding techniques enables us to mitigate the challenges of large scale and sparsity of academic collaboration networks. In real-world academic social networks, scholars are accompanied with various attributes or features, such as co-authorship and publication records, which result in attributed collaboration networks. It has been observed that both network topology and scholar attributes are important in academic relationship mining. However, previous studies mainly focus on network topology, whereas scholar attributes are overlooked. Moreover, the influence of different scholar attributes are unclear. To bridge this gap, in this work, we present a novel framework of Attributed Collaboration Network Embedding (ACNE) for academic relationship mining. ACNE extracts four types of scholar attributes based on the proposed scholar profiling model, including demographics, research, influence, and sociability. ACNE can learn a low-dimensional representation of scholars considering both scholar attributes and network topology simultaneously. We demonstrate the effectiveness and potentials of ACNE in academic relationship mining by performing collaborator recommendation on two real-world datasets and the contribution and importance of each scholar attribute on scientific collaborator recommendation is investigated. Our work may shed light on academic relationship mining by taking advantage of attributed collaboration network embedding.

References

  1. Uchenna Akujuobi, Han Yufei, Qiannan Zhang, and Xiangliang Zhang. 2019. Collaborative graph walk for semi-supervised multi-label node classification. Retrieved from https://Arxiv:1910.1910.09706.Google ScholarGoogle Scholar
  2. Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, et al. 2018. Relational inductive biases, deep learning, and graph networks. Retrieved from https://Arxiv:1806.01261.Google ScholarGoogle Scholar
  3. H. Cai, V. W. Zheng, and K. C. Chang. 2018. A comprehensive survey of graph embedding: Problems, techniques, and applications. IEEE Trans. Knowl. Data Eng. 30, 9 (2018), 1616--1637. DOI:https://doi.org/10.1109/TKDE.2018.2807452Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Hung-Hsuan Chen, Liang Gou, Xiaolong Zhang, and Clyde Lee Giles. 2011. Collabseer: A search engine for collaboration discovery. In Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries. ACM, 231--240.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Yankai Chen, Jie Zhang, Yixiang Fang, Xin Cao, and Irwin King. 2020. Efficient community search over large directed graph: An augmented index-based approach. In Proceedings of the International Joint Conference on Artificial Inteeligence (IJCAI’20). 3544--3550.Google ScholarGoogle ScholarCross RefCross Ref
  6. Yuxiao Dong, Nitesh V. Chawla, and Ananthram Swami. 2017. metapath2vec: Scalable representation learning for heterogeneous networks. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 135--144.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Yuxiao Dong, Nitesh V. Chawla, Jie Tang, Yang Yang, and Yang Yang. 2017. User modeling on demographic attributes in big mobile social networks. ACM Trans. Info. Syst. 35, 4, Article 35 (July 2017), 33 pages. DOI:https://doi.org/10.1145/3057278Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Yuxiao Dong, Reid A. Johnson, and Nitesh V. Chawla. 2016. Can scientific impact be predicted? IEEE Trans. Big Data 2, 1 (2016), 18--30.Google ScholarGoogle ScholarCross RefCross Ref
  9. Lun Du, Yun Wang, Guojie Song, Zhicong Lu, and Junshan Wang. 2018. Dynamic network embedding: An extended approach for skip-gram based network embedding. In Proceedings of the International Joint Conference on Artificial Inteeligence (IJCAI’18). 2086--2092.Google ScholarGoogle ScholarCross RefCross Ref
  10. Xinyu Fu, Jiani Zhang, Ziqiao Meng, and Irwin King. 2020. MAGNN: Metapath aggregated graph neural network for heterogeneous graph embedding. In Proceedings of the World Wide Web Conference (WWW). 2331--2341.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Hongchang Gao and Heng Huang. 2018. Deep attributed network embedding. In Proceedings of the International Joint Conference on Artificial Inteeligence (IJCAI’18). 3364--3370.Google ScholarGoogle ScholarCross RefCross Ref
  12. Palash Goyal, Sujit Rokka Chhetri, and Arquimedes Canedo. 2020. dyngraph2vec: Capturing network dynamics using dynamic graph representation learning. Knowl.-Based Syst. 187 (January 2020), 104816. DOI:https://doi.org/10.1016/j.knosys.2019.06.024Google ScholarGoogle Scholar
  13. Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 855--864.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems. MIT Press, 1024--1034.Google ScholarGoogle Scholar
  15. R. Hong, Y. He, L. Wu, Y. Ge, and X. Wu. 2019. Deep attributed network embedding by preserving structure and attribute information. IEEE Trans. Syst. Man Cybernet.: Syst. (2019), 1--12. DOI:https://doi.org/10.1109/TSMC.2019.2897152Google ScholarGoogle Scholar
  16. Xiao Huang, Jundong Li, and Xia Hu. 2017. Accelerated attributed network embedding. In Proceedings of the SIAM International Conference on Data Mining. SIAM, 633--641.Google ScholarGoogle ScholarCross RefCross Ref
  17. Ming Ji, Yizhou Sun, Marina Danilevsky, Jiawei Han, and Jing Gao. 2010. Graph regularized transductive classification on heterogeneous information networks. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 570--586.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Yantao Jia, Yuanzhuo Wang, Xiaolong Jin, Hailun Lin, and Xueqi Cheng. 2018. Knowledge graph embedding: A locally and temporally adaptive translation-based approach. ACM Trans. Web 12, 2 (2018), 8.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Zhuoren Jiang, Yue Yin, Liangcai Gao, Yao Lu, and Xiaozhong Liu. 2018. Cross-language citation recommendation via hierarchical representation learning on heterogeneous graph. In Proceedings of the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 635--644.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Madian Khabsa and C. Lee Giles. 2014. The number of scholarly documents on the public web. PloS ONE 9, 5 (2014), e93949.Google ScholarGoogle ScholarCross RefCross Ref
  21. Samiya Khan, Xiufeng Liu, Kashish A. Shakil, and Mansaf Alam. 2017. A survey on scholarly data: From big data perspective. Info. Process. Manage. 53, 4 (2017), 923--944.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Junghwan Kim, Haekyu Park, Ji-Eun Lee, and U. Kang. 2018. Side: Representation learning in signed directed networks. In Proceedings of the World Wide Web Conference. International World Wide Web Conferences Steering Committee, 509--518.Google ScholarGoogle Scholar
  23. Thomas N. Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. Retrieved from https://Arxiv:1609.02907.Google ScholarGoogle Scholar
  24. Xiangjie Kong, Huizhen Jiang, Teshome Megersa Bekele, Wei Wang, and Zhenzhen Xu. 2017. Random walk-based beneficial collaborators recommendation exploiting dynamic research interests and academic influence. In Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 1371--1377.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Ronald N. Kostoff, J. Antonio del Rio, James A. Humenik, Esther Ofilia Garcia, and Ana Maria Ramirez. 2001. Citation mining: Integrating text mining and bibliometrics for research user profiling. J. Amer. Soc. Info. Sci. Technol. 52, 13 (2001), 1148--1156.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Jianxin Li, Taotao Cai, Ke Deng, Xinjue Wang, Timos Sellis, and Feng Xia. 2020. Community-diversified influence maximization in social networks. Info. Syst. 92 (September 2020), 101522. DOI:https://doi.org/10.1016/j.is.2020.101522Google ScholarGoogle Scholar
  27. Jundong Li, Harsh Dani, Xia Hu, Jiliang Tang, Yi Chang, and Huan Liu. 2017. Attributed network embedding for learning in a dynamic environment. In Proceedings of the ACM on Conference on Information and Knowledge Management. ACM, 387--396.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Yusheng Li, Yilun Shang, and Yiting Yang. 2017. Clustering coefficients of large networks. Info. Sci. 382 (2017), 350--358.Google ScholarGoogle Scholar
  29. Lizi Liao, Xiangnan He, Hanwang Zhang, and Tat-Seng Chua. 2018. Attributed social network embedding. IEEE Trans. Knowl. Data Eng. 30, 12 (2018), 2257--2270.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Han Liu, Xianchao Zhang, and Xiaotong Zhang. 2018. Possible world based consistency learning model for clustering and classifying uncertain data. Neural Netw. 102 (2018), 48--66.Google ScholarGoogle ScholarCross RefCross Ref
  31. Jiaying Liu, Feng Xia, Lei Wang, Bo Xu, Xiangjie Kong, Hanghang Tong, and Irwin King. 2019. Shifu2: A network representation learning based model for advisor-advisee relationship mining. IEEE Trans. Knowl. Data Eng. (2019). DOI:https://doi.org/10.1109/TKDE.2019.2946825Google ScholarGoogle Scholar
  32. Zheng Liu, Xing Xie, and Lei Chen. 2018. Context-aware academic collaborator recommendation. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1870--1879.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Chunyu Lu, Pengfei Jiao, Hongtao Liu, Yaping Wang, Hongyan Xu, and Wenjun Wang. 2019. SSNE: Status signed network embedding. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 81--93.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Linyuan Lü and Tao Zhou. 2011. Link prediction in complex networks: A survey. Physica A: Stat. Mech. Appl. 390, 6 (2011), 1150--1170.Google ScholarGoogle ScholarCross RefCross Ref
  35. Zaiqiao Meng, Shangsong Liang, Hongyan Bao, and Xiangliang Zhang. 2019. Co-embedding attributed networks. In Proceedings of the 12th ACM International Conference on Web Search and Data Mining. 393--401.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Stuart E. Middleton, Nigel R. Shadbolt, and David C. De Roure. 2004. Ontological user profiling in recommender systems. ACM Trans. Info. Syst. 22, 1 (2004), 54--88.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 701--710.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Alexander Michael Petersen. 2015. Quantifying the impact of weak, strong, and super ties in scientific careers. Proc. Natl. Acad. Sci. U.S.A. 112, 34 (2015), E4671--E4680.Google ScholarGoogle ScholarCross RefCross Ref
  39. Dominic Seyler, Praveen Chandar, and Matthew Davis. 2018. An information retrieval framework for contextual suggestion based on heterogeneous information network embeddings. In Proceedings of the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 953--956.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Yu Shi, Qi Zhu, Fang Guo, Chao Zhang, and Jiawei Han. 2018. Easing embedding learning by comprehensive transcription of heterogeneous information networks. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2190--2199.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Roberta Sinatra, Dashun Wang, Pierre Deville, Chaoming Song, and Albert-László Barabási. 2016. Quantifying the evolution of individual scientific impact. Science 354, 6312 (2016), aaf5239.Google ScholarGoogle Scholar
  42. Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. Line: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 1067--1077.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Jie Tang, Limin Yao, Duo Zhang, and Jing Zhang. 2010. A combination approach to web user profiling. ACM Trans. Knowl. Discov. Data 5, 1 (2010), 2.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. 2008. Arnetminer: Extraction and mining of academic social networks. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 990--998.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Cunchao Tu, Han Liu, Zhiyuan Liu, and Maosong Sun. 2017. CANE: Context-aware network embedding for relation modeling. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Vol. 1. 1722--1731.Google ScholarGoogle ScholarCross RefCross Ref
  46. Chong Wang and David M. Blei. 2011. Collaborative topic modeling for recommending scientific articles. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 448--456.Google ScholarGoogle Scholar
  47. Chi Wang, Jiawei Han, Yuntao Jia, Jie Tang, Duo Zhang, Yintao Yu, and Jingyi Guo. 2010. Mining advisor-advisee relationships from research publication networks. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 203--212.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Suhang Wang, Charu Aggarwal, Jiliang Tang, and Huan Liu. 2017. Attributed signed network embedding. In Proceedings of the ACM Conference on Information and Knowledge Management. ACM, 137--146.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Wei Wang, Jiaying Liu, Feng Xia, Irwin King, and Hanghang Tong. 2017. Shifu: Deep learning based advisor-advisee relationship mining in scholarly big data. In Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 303--310.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Wei Wang, Jiaying Liu, Zhuo Yang, Xiangjie Kong, and Feng Xia. 2019. Sustainable collaborator recommendation based on conference closure. IEEE Trans. Comput. Soc. Syst. 6, 2 (2019), 311--322.Google ScholarGoogle ScholarCross RefCross Ref
  51. Wei Wang, Shuo Yu, Teshome Megersa Bekele, Xiangjie Kong, and Feng Xia. 2017. Scientific collaboration patterns vary with scholars’ academic ages. Scientometrics 112, 1 (2017), 329--343.Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Yueyang Wang, Ziheng Duan, Binbing Liao, Fei Wu, and Yueting Zhuang. 2019. Heterogeneous attributed network embedding with graph convolutional networks. Methods 25, 50 (2019), 75.Google ScholarGoogle ScholarCross RefCross Ref
  53. Kyle Williams, Jian Wu, Sagnik Ray Choudhury, Madian Khabsa, and C. Lee Giles. 2014. Scholarly big data information extraction and integration in the citeseer χ digital library. In Proceedings of the IEEE 30th International Conference on Data Engineering Workshops. IEEE, 68--73.Google ScholarGoogle Scholar
  54. Wei Wu, Bin Li, Ling Chen, and Chengqi Zhang. 2018. Efficient attributed etwork embedding via recursive randomized hashing. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI’18). 2861--2867.Google ScholarGoogle Scholar
  55. Feng Xia, Zhen Chen, Wei Wang, Jing Li, and Laurence T. Yang. 2014. Mvcwalker: Random walk-based most valuable collaborators recommendation exploiting academic factors. IEEE Trans. Emerg. Top. Comput. 2, 3 (2014), 364--375.Google ScholarGoogle ScholarCross RefCross Ref
  56. Feng Xia, Wei Wang, Teshome Megersa Bekele, and Huan Liu. 2017. Big scholarly data: A survey. IEEE Trans. Big Data 3, 1 (2017), 18--35.Google ScholarGoogle ScholarCross RefCross Ref
  57. Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, and Edward Y. Chang. 2015. Network representation learning with rich text information. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI’15). 2111--2117.Google ScholarGoogle Scholar
  58. Shuo Yu, Feng Xia, and Huan Liu. 2019. Academic team formulation based on liebig’s barrel: Discovery of anticask effect. IEEE Trans. Comput. Soc. Syst. 6, 5 (Oct. 2019), 1083--1094. DOI:https://doi.org/10.1109/TCSS.2019.2913460Google ScholarGoogle ScholarCross RefCross Ref
  59. Shuhan Yuan, Xintao Wu, and Yang Xiang. 2017. SNE: Signed network embedding. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 183--195.Google ScholarGoogle ScholarCross RefCross Ref
  60. Chenwei Zhang, Yi Bu, Ying Ding, and Jian Xu. 2018. Understanding scientific collaboration: Homophily, transitivity, and preferential attachment. J. Assoc. Info. Sci. Technol. 69, 1 (2018), 72--86.Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Daokun Zhang, Jie Yin, Xingquan Zhu, and Chengqi Zhang. 2020. Network representation learning: A survey. IEEE Trans. Big Data 6, 1 (March 2020), 3--28. DOI:https://doi.org/10.1109/TBDATA.2018.2850013Google ScholarGoogle ScholarCross RefCross Ref
  62. Zhen Zhang, Hongxia Yang, Jiajun Bu, Sheng Zhou, Pinggang Yu, Jianwei Zhang, Martin Ester, and Can Wang. 2018. ANRL: Attributed network representation learning via deep neural networks. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI’18). 3155--3161.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Attributed Collaboration Network Embedding for Academic Relationship Mining

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on the Web
        ACM Transactions on the Web  Volume 15, Issue 1
        February 2021
        142 pages
        ISSN:1559-1131
        EISSN:1559-114X
        DOI:10.1145/3432274
        Issue’s Table of Contents

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 24 November 2020
        • Accepted: 1 July 2020
        • Revised: 1 June 2020
        • Received: 1 November 2019
        Published in tweb Volume 15, Issue 1

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format