ABSTRACT
Capturing the temporal dynamics of user preferences over items is important for recommendation. Existing methods mainly assume that all time steps in user-item interaction history are equally relevant to recommendation, which however does not apply in real-world scenarios where user-item interactions can often happen accidentally. More importantly, they learn user and item dynamics separately, thus failing to capture their joint effects on user-item interactions. To better model user and item dynamics, we present the Interacting Attention-gated Recurrent Network (IARN) which adopts the attention model to measure the relevance of each time step. In particular, we propose a novel attention scheme to learn the attention scores of user and item history in an interacting way, thus to account for the dependencies between user and item dynamics in shaping user-item interactions. By doing so, IARN can selectively memorize different time steps of a user's history when predicting her preferences over different items. Our model can therefore provide meaningful interpretations for recommendation results, which could be further enhanced by auxiliary features. Extensive validation on real-world datasets shows that IARN consistently outperforms state-of-the-art methods.
- Oren Barkan and Noam Koenigstein. 2016. Item2Vec: Neural item embedding for collaborative filtering. IEEE Workshop on MLSP (2016).Google ScholarCross Ref
- Yoshua Bengio, Nicolas Boulanger-Lewandowski, and Razvan Pascanu. 2013. Advances in optimizing recurrent networks.. In ICASSP. IEEE, 8624--8628.Google Scholar
- James Bennett, Stan Lanning, and others. 2007. The Netflix Prize KDD Cup Workshop, Vol. Vol. 2007. New York, NY, USA, 35.Google Scholar
- Jane Bromley, James W. Bentz, Léon Bottou, Isabelle Guyon, Yann LeCun, Cliff Moore, Eduard S"ackinger, and Roopak Shah. 1993. Signature verification using a "Siamese" time delay neural network. IJPRAI, Vol. 7, 4 (1993), 669--688.Google ScholarCross Ref
- Huiji Gao, Jiliang Tang, Xia Hu, and Huan Liu. 2013. Exploring temporal effects for location recommendation on location-based social networks. In RecSys. ACM, 93--100. Google ScholarDigital Library
- Mihajlo Grbovic, Vladan Radosavljevic, Nemanja Djuric, Narayan Bhamidipati, Jaikit Savla, Varun Bhagwan, and Doug Sharp. 2015. E-commerce in your inbox: rroduct recommendations at scale KDD. ACM, 1809--1818. Google ScholarDigital Library
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In ICCV. 1026--1034. Google ScholarDigital Library
- Ruining He, Chunbin Lin, Jianguo Wang, and Julian McAuley. 2016. Sherlock: Sparse hierarchical embeddings for visually-aware one-class collaborative Filtering. In IJCAI. 3740--3746. Google ScholarDigital Library
- Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In WWW. ACM, 173--182. Google ScholarDigital Library
- Balázs Hidasi, Alexandros Karatzoglou, Linas Baltrunas, and Domonkos Tikk. 2015. Session-based recommendations with recurrent neural networks. arXiv preprint arXiv:1511.06939 (2015).Google Scholar
- Geoffrey E Hinton and Ruslan R Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. Science, Vol. 313, 5786 (2006), 504--507.Google Scholar
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Computation, Vol. 9, 8 (1997), 1735--1780. Google ScholarDigital Library
- Seyed Abbas Hosseini, Keivan Alizadeh, Ali Khodadadi, Ali Arabzadeh, Mehrdad Farajtabar, Hongyuan Zha, and Hamid R Rabiee. 2017. Recurrent Poisson factorization for temporal recommendation. arXiv preprint arXiv:1703.01442 (2017).Google Scholar
- How Jing and Alexander J Smola. 2017. Neural survival recommender. In WSDM. ACM, 515--524. Google ScholarDigital Library
- Noam Koenigstein, Gideon Dror, and Yehuda Koren. 2011. Yahoo! Music recommendations: modeling music ratings with temporal dynamics and item taxonomy. In RecSys. ACM, 165--172. Google ScholarDigital Library
- Yehuda Koren. 2008. Factorization meets the neighborhood: a multifaceted collaborative filtering model KDD. ACM, 426--434. Google ScholarDigital Library
- Yehuda Koren. 2009. Collaborative filtering with temporal dynamics. In KDD. ACM, 447--456. Google ScholarDigital Library
- Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer, Vol. 42, 8 (2009). Google ScholarDigital Library
- Julian McAuley, Christopher Targett, Qinfeng Shi, and Anton Van Den Hengel. 2015. Image-based recommendations on styles and substitutes SIGIR. ACM, 43--52. Google ScholarDigital Library
- Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013 a. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).Google Scholar
- Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013 b. Distributed representations of words and phrases and their compositionality NIPS. 3111--3119. Google ScholarDigital Library
- Vinod Nair and Geoffrey E Hinton. 2010. Rectified linear units improve restricted boltzmann machines ICML. 807--814. Google ScholarDigital Library
- Wenjie Pei, Tadas Baltrusaitis, David M.J. Tax, and Louis-Philippe Morency. 2017. Temporal attention-gated model for robust sequence classification CVPR.Google Scholar
- Jeffrey Pennington, Richard Socher, and Christopher D Manning. 2014. Glove: Global vectors for word representation.. In EMNLP, Vol. Vol. 14. 1532--1543.Google ScholarCross Ref
- Tuan-Anh Nguyen Pham, Xutao Li, Gao Cong, and Zhenjie Zhang. 2015. A general graph-based model for recommendation in event-based social networks ICDE. IEEE, 567--578.Google Scholar
- Steffen Rendle. 2010. Factorization machines. In ICDM. IEEE, 995--1000. Google ScholarDigital Library
- Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian personalized ranking from implicit feedback UAI. AUAI Press, 452--461. Google ScholarDigital Library
- David E Rumelhart, Geoffrey E Hinton, and Ronald J Williams. 1988. Learning representations by back-propagating errors. Cognitive Modeling, Vol. 5, 3 (1988), 1.Google Scholar
- Mike Schuster and Kuldip K Paliwal. 1997. Bidirectional recurrent neural networks. IEEE Transactions on Signal Processing Vol. 45, 11 (1997), 2673--2681. Google ScholarDigital Library
- Suvash Sedhain, Aditya Krishna Menon, Scott Sanner, and Lexing Xie. 2015. Autorec: Autoencoders meet collaborative filtering WWW. ACM, 111--112. Google ScholarDigital Library
- Yue Shi, Martha Larson, and Alan Hanjalic. 2014. Collaborative filtering beyond the user-item matrix: A survey of the state of the art and future challenges. CSUR, Vol. 47, 1 (2014), 3. Google ScholarDigital Library
- Ajit P Singh and Geoffrey J Gordon. 2008. Relational learning via collective matrix factorization KDD. ACM, 650--658. Google ScholarDigital Library
- Nitish Srivastava, Geoffrey E Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: a simple way to prevent neural networks from overfitting. JMLR, Vol. 15, 1 (2014), 1929--1958. Google ScholarDigital Library
- Xiaoyuan Su and Taghi M Khoshgoftaar. 2009. A survey of collaborative filtering techniques. Advances in Artificial Intelligence Vol. 2009 (2009), 4. Google ScholarDigital Library
- Zhu Sun, Jie Yang, Jie Zhang, and Alessandro Bozzon. 2017. Exploiting both vertical and horizontal dimensions of feature hierarchy for effective recommendation. In AAAI. 189--195.Google Scholar
- Tijmen Tieleman and Geoffrey Hinton. 2012. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural networks for machine learning, Vol. 4, 2 (2012).Google Scholar
- Flavian Vasile, Elena Smirnova, and Alexis Conneau. 2016. Meta-Prod2Vec: Product embeddings using side-Information for recommendation RecSys. ACM, 225--232. Google ScholarDigital Library
- Paul J Werbos. 1988. Generalization of backpropagation with application to a recurrent gas market model. Neural Networks, Vol. 1, 4 (1988), 339--356.Google ScholarCross Ref
- Chao-Yuan Wu, Amr Ahmed, Alex Beutel, Alexander J Smola, and How Jing. 2017. Recurrent recommender networks. In WSDM. ACM, 495--503. Google ScholarDigital Library
- Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron C Courville, Ruslan Salakhutdinov, Richard S Zemel, and Yoshua Bengio. 2015. Show, attend and tell: neural image caption generation with visual attention. ICML, Vol. Vol. 14. 77--81. Google ScholarDigital Library
- Jie Yang, Zhu Sun, Alessandro Bozzon, and Jie Zhang. 2016 a. Learning hierarchical feature influence for recommendation by recursive regularization RecSys. ACM, 51--58. Google ScholarDigital Library
- Zichao Yang, Diyi Yang, Chris Dyer, Xiaodong He, Alex Smola, and Eduard Hovy. 2016 b. Hierarchical attention networks for document classification NACAL-HLT. 1480--1489.Google Scholar
- Quan Yuan, Gao Cong, Zongyang Ma, Aixin Sun, and Nadia Magnenat Thalmann. 2013. Time-aware point-of-interest recommendation. In SIGIR. ACM, 363--372. Google ScholarDigital Library
- Wojciech Zaremba, Ilya Sutskever, and Oriol Vinyals. 2014. Recurrent neural network regularization. arXiv preprint arXiv:1409.2329 (2014).Google Scholar
Index Terms
- Interacting Attention-gated Recurrent Networks for Recommendation
Recommendations
Gated recurrent units based neural network for time heterogeneous feedback recommendation
Nowadays, recommender systems face the problem of time heterogeneous feedback recommendation, in which items are recommended according to several kinds of user feedback with time stamps. Previously proposed recurrent neural network based recommendation ...
Interactive Sequential Basket Recommendation by Learning Basket Couplings and Positive/Negative Feedback
Sequential recommendation, such as next-basket recommender systems (NBRS), which model users’ sequential behaviors and the relevant context/session, has recently attracted much attention from the research community. Existing session-based NBRS involve ...
Dynamic slate recommendation with gated recurrent units and Thompson sampling
AbstractWe consider the problem of recommending relevant content to users of an internet platform in the form of lists of items, called slates. We introduce a variational Bayesian Recurrent Neural Net recommender system that acts on time series of ...
Comments