research-article

Personalised Reranking of Paper Recommendations Using Paper Content and User Behavior

Published: 16 March 2019

Abstract

Academic search engines are widely used to access academic papers, with users’ information needs explicitly represented as search queries. Some modern recommender systems go one step further by predicting users’ information needs without an explicit query. In this article, we examine an academic paper recommender that sends out paper recommendations in email newsletters, based on the users’ browsing history on the academic search engine. Specifically, we look at users who regularly browse papers on the search engine and who sign up for the recommendation newsletters for the first time. We address the task of reranking the recommendation candidates that are generated by a production system for such users.

We face the challenge that the users on whom we focus have not interacted with the recommender system before, which is a common scenario that every recommender system encounters when new users sign up. We propose an approach to reranking candidate recommendations that utilizes both paper content and user behavior. The approach is designed to suit the characteristics unique to our academic recommendation setting. For instance, content similarity measures can be used to find the closest match between candidate recommendations and the papers previously browsed by the user. To this end, we use a knowledge graph derived from paper metadata to compare entity similarities (papers, authors, and journals) in the embedding space. Since the users on whom we focus have no prior interactions with the recommender system, we propose a model to learn a mapping from users’ browsed articles to user clicks on the recommendations. We combine both content and behavior into a hybrid reranking model that outperforms the production baseline significantly, providing a relative 13% increase in Mean Average Precision and 28% in Precision@1.
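The two reported metrics can be illustrated with a minimal sketch. It assumes binary click labels per ranked recommendation list and normalises Average Precision by the number of clicked items in the list; the function names are hypothetical and the article's exact evaluation protocol may differ.

```python
from typing import List, Sequence


def precision_at_1(clicks: Sequence[int]) -> float:
    """clicks[i] is 1 if the item at rank i + 1 was clicked, else 0."""
    return float(clicks[0]) if clicks else 0.0


def average_precision(clicks: Sequence[int]) -> float:
    """Average of precision@k over the ranks k that hold a clicked item.

    Assumption: every relevant (clicked) item is present in the list,
    which holds when a fixed candidate set is only reordered.
    """
    hits = 0
    total = 0.0
    for rank, clicked in enumerate(clicks, start=1):
        if clicked:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def mean_average_precision(lists: List[Sequence[int]]) -> float:
    """Mean of average_precision over all users' ranked lists."""
    if not lists:
        return 0.0
    return sum(average_precision(c) for c in lists) / len(lists)


def relative_increase(new: float, baseline: float) -> float:
    """Relative gain of a new score over a (non-zero) baseline score."""
    return (new - baseline) / baseline
```

A "relative 13% increase" then means `relative_increase(new_map, baseline_map)` is about 0.13, not that MAP rose by 13 percentage points.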

Moreover, we provide a detailed analysis of the model components, highlighting where the performance boost comes from. The resulting insights, such as the utility of graph embedding similarity, reveal useful components for the reranking process and can be generalized to other academic recommendation settings. We also find that papers recently browsed by users provide stronger evidence for recommendation than older ones.
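As a rough illustration of the content side only, the sketch below scores each candidate by its cosine similarity to the user's browsed papers in an embedding space, giving recently browsed papers more weight. The exponential decay, the `rerank` function, and the toy vectors are assumptions made for this example; this is not the article's hybrid model, which also learns from user clicks.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def cosine(u: Vector, v: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rerank(candidates: Dict[str, Vector],
           browsed: List[Vector],
           decay: float = 0.5) -> List[str]:
    """Order candidate ids by recency-weighted similarity to browsed papers.

    `browsed` is ordered oldest to newest. The newest paper gets weight 1
    and each older paper is down-weighted by a factor of `decay`
    (a hypothetical weighting scheme chosen for illustration).
    """
    n = len(browsed)
    weights = [decay ** (n - 1 - i) for i in range(n)]

    def score(vec: Vector) -> float:
        if not browsed:
            return 0.0
        weighted = sum(w * cosine(vec, b) for w, b in zip(weights, browsed))
        return weighted / sum(weights)

    return sorted(candidates, key=lambda pid: score(candidates[pid]),
                  reverse=True)


# Toy usage: the user browsed a paper near [1, 0] first, then one near [0, 1].
ranking = rerank({"a": [1.0, 0.0], "b": [0.0, 1.0]},
                 browsed=[[1.0, 0.0], [0.0, 1.0]])
```

With these toy vectors, candidate "b" ranks first because it matches the most recently browsed paper.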

    Published in

      ACM Transactions on Information Systems, Volume 37, Issue 3
      July 2019
      335 pages
      ISSN: 1046-8188
      EISSN: 1558-2868
      DOI: 10.1145/3320115

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 16 March 2019
      • Revised: 1 February 2019
      • Accepted: 1 February 2019
      • Received: 1 July 2018
      Qualifiers

      • research-article
      • Research
      • Refereed
