Abstract
Finding both efficient and effective quantitative representations for scholars in scientific digital libraries has been a focal point of research. The unprecedented amounts of scholarly datasets, combined with contemporary machine learning and big data techniques, have enabled intelligent and automatic profiling of scholars from this vast and ever-increasing pool of scholarly data. Meanwhile, recent advance in network embedding techniques enables us to mitigate the challenges of large scale and sparsity of academic collaboration networks. In real-world academic social networks, scholars are accompanied with various attributes or features, such as co-authorship and publication records, which result in attributed collaboration networks. It has been observed that both network topology and scholar attributes are important in academic relationship mining. However, previous studies mainly focus on network topology, whereas scholar attributes are overlooked. Moreover, the influence of different scholar attributes are unclear. To bridge this gap, in this work, we present a novel framework of Attributed Collaboration Network Embedding (ACNE) for academic relationship mining. ACNE extracts four types of scholar attributes based on the proposed scholar profiling model, including demographics, research, influence, and sociability. ACNE can learn a low-dimensional representation of scholars considering both scholar attributes and network topology simultaneously. We demonstrate the effectiveness and potentials of ACNE in academic relationship mining by performing collaborator recommendation on two real-world datasets and the contribution and importance of each scholar attribute on scientific collaborator recommendation is investigated. Our work may shed light on academic relationship mining by taking advantage of attributed collaboration network embedding.
- Uchenna Akujuobi, Han Yufei, Qiannan Zhang, and Xiangliang Zhang. 2019. Collaborative graph walk for semi-supervised multi-label node classification. Retrieved from https://Arxiv:1910.1910.09706.Google Scholar
- Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, et al. 2018. Relational inductive biases, deep learning, and graph networks. Retrieved from https://Arxiv:1806.01261.Google Scholar
- H. Cai, V. W. Zheng, and K. C. Chang. 2018. A comprehensive survey of graph embedding: Problems, techniques, and applications. IEEE Trans. Knowl. Data Eng. 30, 9 (2018), 1616--1637. DOI:https://doi.org/10.1109/TKDE.2018.2807452Google ScholarDigital Library
- Hung-Hsuan Chen, Liang Gou, Xiaolong Zhang, and Clyde Lee Giles. 2011. Collabseer: A search engine for collaboration discovery. In Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries. ACM, 231--240.Google ScholarDigital Library
- Yankai Chen, Jie Zhang, Yixiang Fang, Xin Cao, and Irwin King. 2020. Efficient community search over large directed graph: An augmented index-based approach. In Proceedings of the International Joint Conference on Artificial Inteeligence (IJCAI’20). 3544--3550.Google ScholarCross Ref
- Yuxiao Dong, Nitesh V. Chawla, and Ananthram Swami. 2017. metapath2vec: Scalable representation learning for heterogeneous networks. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 135--144.Google ScholarDigital Library
- Yuxiao Dong, Nitesh V. Chawla, Jie Tang, Yang Yang, and Yang Yang. 2017. User modeling on demographic attributes in big mobile social networks. ACM Trans. Info. Syst. 35, 4, Article 35 (July 2017), 33 pages. DOI:https://doi.org/10.1145/3057278Google ScholarDigital Library
- Yuxiao Dong, Reid A. Johnson, and Nitesh V. Chawla. 2016. Can scientific impact be predicted? IEEE Trans. Big Data 2, 1 (2016), 18--30.Google ScholarCross Ref
- Lun Du, Yun Wang, Guojie Song, Zhicong Lu, and Junshan Wang. 2018. Dynamic network embedding: An extended approach for skip-gram based network embedding. In Proceedings of the International Joint Conference on Artificial Inteeligence (IJCAI’18). 2086--2092.Google ScholarCross Ref
- Xinyu Fu, Jiani Zhang, Ziqiao Meng, and Irwin King. 2020. MAGNN: Metapath aggregated graph neural network for heterogeneous graph embedding. In Proceedings of the World Wide Web Conference (WWW). 2331--2341.Google ScholarDigital Library
- Hongchang Gao and Heng Huang. 2018. Deep attributed network embedding. In Proceedings of the International Joint Conference on Artificial Inteeligence (IJCAI’18). 3364--3370.Google ScholarCross Ref
- Palash Goyal, Sujit Rokka Chhetri, and Arquimedes Canedo. 2020. dyngraph2vec: Capturing network dynamics using dynamic graph representation learning. Knowl.-Based Syst. 187 (January 2020), 104816. DOI:https://doi.org/10.1016/j.knosys.2019.06.024Google Scholar
- Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 855--864.Google ScholarDigital Library
- Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems. MIT Press, 1024--1034.Google Scholar
- R. Hong, Y. He, L. Wu, Y. Ge, and X. Wu. 2019. Deep attributed network embedding by preserving structure and attribute information. IEEE Trans. Syst. Man Cybernet.: Syst. (2019), 1--12. DOI:https://doi.org/10.1109/TSMC.2019.2897152Google Scholar
- Xiao Huang, Jundong Li, and Xia Hu. 2017. Accelerated attributed network embedding. In Proceedings of the SIAM International Conference on Data Mining. SIAM, 633--641.Google ScholarCross Ref
- Ming Ji, Yizhou Sun, Marina Danilevsky, Jiawei Han, and Jing Gao. 2010. Graph regularized transductive classification on heterogeneous information networks. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 570--586.Google ScholarDigital Library
- Yantao Jia, Yuanzhuo Wang, Xiaolong Jin, Hailun Lin, and Xueqi Cheng. 2018. Knowledge graph embedding: A locally and temporally adaptive translation-based approach. ACM Trans. Web 12, 2 (2018), 8.Google ScholarDigital Library
- Zhuoren Jiang, Yue Yin, Liangcai Gao, Yao Lu, and Xiaozhong Liu. 2018. Cross-language citation recommendation via hierarchical representation learning on heterogeneous graph. In Proceedings of the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 635--644.Google ScholarDigital Library
- Madian Khabsa and C. Lee Giles. 2014. The number of scholarly documents on the public web. PloS ONE 9, 5 (2014), e93949.Google ScholarCross Ref
- Samiya Khan, Xiufeng Liu, Kashish A. Shakil, and Mansaf Alam. 2017. A survey on scholarly data: From big data perspective. Info. Process. Manage. 53, 4 (2017), 923--944.Google ScholarDigital Library
- Junghwan Kim, Haekyu Park, Ji-Eun Lee, and U. Kang. 2018. Side: Representation learning in signed directed networks. In Proceedings of the World Wide Web Conference. International World Wide Web Conferences Steering Committee, 509--518.Google Scholar
- Thomas N. Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. Retrieved from https://Arxiv:1609.02907.Google Scholar
- Xiangjie Kong, Huizhen Jiang, Teshome Megersa Bekele, Wei Wang, and Zhenzhen Xu. 2017. Random walk-based beneficial collaborators recommendation exploiting dynamic research interests and academic influence. In Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 1371--1377.Google ScholarDigital Library
- Ronald N. Kostoff, J. Antonio del Rio, James A. Humenik, Esther Ofilia Garcia, and Ana Maria Ramirez. 2001. Citation mining: Integrating text mining and bibliometrics for research user profiling. J. Amer. Soc. Info. Sci. Technol. 52, 13 (2001), 1148--1156.Google ScholarDigital Library
- Jianxin Li, Taotao Cai, Ke Deng, Xinjue Wang, Timos Sellis, and Feng Xia. 2020. Community-diversified influence maximization in social networks. Info. Syst. 92 (September 2020), 101522. DOI:https://doi.org/10.1016/j.is.2020.101522Google Scholar
- Jundong Li, Harsh Dani, Xia Hu, Jiliang Tang, Yi Chang, and Huan Liu. 2017. Attributed network embedding for learning in a dynamic environment. In Proceedings of the ACM on Conference on Information and Knowledge Management. ACM, 387--396.Google ScholarDigital Library
- Yusheng Li, Yilun Shang, and Yiting Yang. 2017. Clustering coefficients of large networks. Info. Sci. 382 (2017), 350--358.Google Scholar
- Lizi Liao, Xiangnan He, Hanwang Zhang, and Tat-Seng Chua. 2018. Attributed social network embedding. IEEE Trans. Knowl. Data Eng. 30, 12 (2018), 2257--2270.Google ScholarDigital Library
- Han Liu, Xianchao Zhang, and Xiaotong Zhang. 2018. Possible world based consistency learning model for clustering and classifying uncertain data. Neural Netw. 102 (2018), 48--66.Google ScholarCross Ref
- Jiaying Liu, Feng Xia, Lei Wang, Bo Xu, Xiangjie Kong, Hanghang Tong, and Irwin King. 2019. Shifu2: A network representation learning based model for advisor-advisee relationship mining. IEEE Trans. Knowl. Data Eng. (2019). DOI:https://doi.org/10.1109/TKDE.2019.2946825Google Scholar
- Zheng Liu, Xing Xie, and Lei Chen. 2018. Context-aware academic collaborator recommendation. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1870--1879.Google ScholarDigital Library
- Chunyu Lu, Pengfei Jiao, Hongtao Liu, Yaping Wang, Hongyan Xu, and Wenjun Wang. 2019. SSNE: Status signed network embedding. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 81--93.Google ScholarDigital Library
- Linyuan Lü and Tao Zhou. 2011. Link prediction in complex networks: A survey. Physica A: Stat. Mech. Appl. 390, 6 (2011), 1150--1170.Google ScholarCross Ref
- Zaiqiao Meng, Shangsong Liang, Hongyan Bao, and Xiangliang Zhang. 2019. Co-embedding attributed networks. In Proceedings of the 12th ACM International Conference on Web Search and Data Mining. 393--401.Google ScholarDigital Library
- Stuart E. Middleton, Nigel R. Shadbolt, and David C. De Roure. 2004. Ontological user profiling in recommender systems. ACM Trans. Info. Syst. 22, 1 (2004), 54--88.Google ScholarDigital Library
- Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 701--710.Google ScholarDigital Library
- Alexander Michael Petersen. 2015. Quantifying the impact of weak, strong, and super ties in scientific careers. Proc. Natl. Acad. Sci. U.S.A. 112, 34 (2015), E4671--E4680.Google ScholarCross Ref
- Dominic Seyler, Praveen Chandar, and Matthew Davis. 2018. An information retrieval framework for contextual suggestion based on heterogeneous information network embeddings. In Proceedings of the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 953--956.Google ScholarDigital Library
- Yu Shi, Qi Zhu, Fang Guo, Chao Zhang, and Jiawei Han. 2018. Easing embedding learning by comprehensive transcription of heterogeneous information networks. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2190--2199.Google ScholarDigital Library
- Roberta Sinatra, Dashun Wang, Pierre Deville, Chaoming Song, and Albert-László Barabási. 2016. Quantifying the evolution of individual scientific impact. Science 354, 6312 (2016), aaf5239.Google Scholar
- Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. Line: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 1067--1077.Google ScholarDigital Library
- Jie Tang, Limin Yao, Duo Zhang, and Jing Zhang. 2010. A combination approach to web user profiling. ACM Trans. Knowl. Discov. Data 5, 1 (2010), 2.Google ScholarDigital Library
- Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. 2008. Arnetminer: Extraction and mining of academic social networks. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 990--998.Google ScholarDigital Library
- Cunchao Tu, Han Liu, Zhiyuan Liu, and Maosong Sun. 2017. CANE: Context-aware network embedding for relation modeling. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Vol. 1. 1722--1731.Google ScholarCross Ref
- Chong Wang and David M. Blei. 2011. Collaborative topic modeling for recommending scientific articles. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 448--456.Google Scholar
- Chi Wang, Jiawei Han, Yuntao Jia, Jie Tang, Duo Zhang, Yintao Yu, and Jingyi Guo. 2010. Mining advisor-advisee relationships from research publication networks. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 203--212.Google ScholarDigital Library
- Suhang Wang, Charu Aggarwal, Jiliang Tang, and Huan Liu. 2017. Attributed signed network embedding. In Proceedings of the ACM Conference on Information and Knowledge Management. ACM, 137--146.Google ScholarDigital Library
- Wei Wang, Jiaying Liu, Feng Xia, Irwin King, and Hanghang Tong. 2017. Shifu: Deep learning based advisor-advisee relationship mining in scholarly big data. In Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 303--310.Google ScholarDigital Library
- Wei Wang, Jiaying Liu, Zhuo Yang, Xiangjie Kong, and Feng Xia. 2019. Sustainable collaborator recommendation based on conference closure. IEEE Trans. Comput. Soc. Syst. 6, 2 (2019), 311--322.Google ScholarCross Ref
- Wei Wang, Shuo Yu, Teshome Megersa Bekele, Xiangjie Kong, and Feng Xia. 2017. Scientific collaboration patterns vary with scholars’ academic ages. Scientometrics 112, 1 (2017), 329--343.Google ScholarDigital Library
- Yueyang Wang, Ziheng Duan, Binbing Liao, Fei Wu, and Yueting Zhuang. 2019. Heterogeneous attributed network embedding with graph convolutional networks. Methods 25, 50 (2019), 75.Google ScholarCross Ref
- Kyle Williams, Jian Wu, Sagnik Ray Choudhury, Madian Khabsa, and C. Lee Giles. 2014. Scholarly big data information extraction and integration in the citeseer χ digital library. In Proceedings of the IEEE 30th International Conference on Data Engineering Workshops. IEEE, 68--73.Google Scholar
- Wei Wu, Bin Li, Ling Chen, and Chengqi Zhang. 2018. Efficient attributed etwork embedding via recursive randomized hashing. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI’18). 2861--2867.Google Scholar
- Feng Xia, Zhen Chen, Wei Wang, Jing Li, and Laurence T. Yang. 2014. Mvcwalker: Random walk-based most valuable collaborators recommendation exploiting academic factors. IEEE Trans. Emerg. Top. Comput. 2, 3 (2014), 364--375.Google ScholarCross Ref
- Feng Xia, Wei Wang, Teshome Megersa Bekele, and Huan Liu. 2017. Big scholarly data: A survey. IEEE Trans. Big Data 3, 1 (2017), 18--35.Google ScholarCross Ref
- Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, and Edward Y. Chang. 2015. Network representation learning with rich text information. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI’15). 2111--2117.Google Scholar
- Shuo Yu, Feng Xia, and Huan Liu. 2019. Academic team formulation based on liebig’s barrel: Discovery of anticask effect. IEEE Trans. Comput. Soc. Syst. 6, 5 (Oct. 2019), 1083--1094. DOI:https://doi.org/10.1109/TCSS.2019.2913460Google ScholarCross Ref
- Shuhan Yuan, Xintao Wu, and Yang Xiang. 2017. SNE: Signed network embedding. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 183--195.Google ScholarCross Ref
- Chenwei Zhang, Yi Bu, Ying Ding, and Jian Xu. 2018. Understanding scientific collaboration: Homophily, transitivity, and preferential attachment. J. Assoc. Info. Sci. Technol. 69, 1 (2018), 72--86.Google ScholarDigital Library
- Daokun Zhang, Jie Yin, Xingquan Zhu, and Chengqi Zhang. 2020. Network representation learning: A survey. IEEE Trans. Big Data 6, 1 (March 2020), 3--28. DOI:https://doi.org/10.1109/TBDATA.2018.2850013Google ScholarCross Ref
- Zhen Zhang, Hongxia Yang, Jiajun Bu, Sheng Zhou, Pinggang Yu, Jianwei Zhang, Martin Ester, and Can Wang. 2018. ANRL: Attributed network representation learning via deep neural networks. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI’18). 3155--3161.Google ScholarCross Ref
Index Terms
- Attributed Collaboration Network Embedding for Academic Relationship Mining
Recommendations
Scholar2vec: Vector Representation of Scholars for Lifetime Collaborator Prediction
While scientific collaboration is critical for a scholar, some collaborators can be more significant than others, e.g., lifetime collaborators. It has been shown that lifetime collaborators are more influential on a scholar’s academic performance. However,...
Research on Academic Discourse Power of Scientific Collaboration on COVID-19 Pandemic
ICDEL '22: Proceedings of the 7th International Conference on Distance Education and LearningThe sudden outbreak of COVID-19 pandemic at the beginning of 2020 poses a significant threat to the health and safety of people worldwide. Given the speed and scope of the COVID-19 pandemic, countries around the world have carried out scientific ...
Scientific collaboration patterns vary with scholars' academic ages
Scientists may encounter many collaborators of different academic ages throughout their careers. Thus, they are required to make essential decisions to commence or end a creative partnership. This process can be influenced by strategic motivations ...
Comments