ABSTRACT
Modern recommender systems usually employ collaborative filtering with rating information to recommend items to users due to its successful performance. However, because of the drawbacks of collaborative-based methods such as sparsity, cold start, etc., more attention has been drawn to hybrid methods that consider both the rating and content information. Most of the previous works in this area cannot learn a good representation from content for recommendation task or consider only text modality of the content, thus their methods are very limited in current multimedia scenario. This paper proposes a Bayesian generative model called collaborative variational autoencoder (CVAE) that considers both rating and content for recommendation in multimedia scenario. The model learns deep latent representations from content data in an unsupervised manner and also learns implicit relationships between items and users from both content and rating. Unlike previous works with denoising criteria, the proposed CVAE learns a latent distribution for content in latent space instead of observation space through an inference network and can be easily extended to other multimedia modalities other than text. Experiments show that CVAE is able to significantly outperform the state-of-the-art recommendation methods with more robust performance.
Supplemental Material
- Gediminas Adomavicius and Alexander Tuzhilin 2005. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, Vol. 17, 6 (2005), 734--749. Google ScholarDigital Library
- Deepak Agarwal and Bee-Chung Chen 2009. Regression-based latent factor models. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 19--28. Google ScholarDigital Library
- Daniel Billsus and Michael J Pazzani 1998. Learning collaborative information filters.. In Proceedings of the 15th International Conference on Machine Learning, Vol. Vol. 98. 46--54.Google Scholar
- David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of Machine Learning Research Vol. 3, Jan (2003), 993--1022.Google ScholarDigital Library
- Samuel R Bowman, Luke Vilnis, Oriol Vinyals, Andrew M Dai, Rafal Jozefowicz, and Samy Bengio. 2016. Generating sentences from a continuous space. In CONLL. 10--21.Google Scholar
- Mark Claypool, Anuja Gokhale, Tim Miranda, Pavel Murnikov, Dmitry Netes, and Matthew Sartin. 1999. Combining content-based and collaborative filters in an online newspaper Proceedings of ACM SIGIR workshop on recommender systems, Vol. Vol. 60. Citeseer.Google Scholar
- Kostadin Georgiev and Preslav Nakov 2013. A non-IID framework for collaborative filtering with restricted Boltzmann machines. Proceedings of the 30th International Conference on Machine Learning. 1148--1156.Google Scholar
- Alex Graves, Santiago Fernández, Faustino Gomez, and Jürgen Schmidhuber 2006. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In Proceedings of the 23rd International Conference on Machine Learning. ACM, 369--376. Google ScholarDigital Library
- Diederik P Kingma and Max Welling 2013. Auto-encoding variational bayes. In Proceedings of the 2nd International Conference on Learning Representations.Google Scholar
- Daphne Koller and Nir Friedman 2009. Probabilistic graphical models: principles and techniques. MIT press.Google ScholarDigital Library
- Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks Advances in Neural Information Processing Systems. 1097--1105.Google Scholar
- Ken Lang 1995. Newsweeder: Learning to filter netnews. In Proceedings of the 12th International Conference on Machine Learning. 331--339. Google ScholarCross Ref
- Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature, Vol. 521, 7553 (2015), 436--444. Google Scholar
- Wu-Jun Li, Dit-Yan Yeung, and Zhihua Zhang 2011. Generalized latent factor models for social network analysis Proceedings of the 22nd International Joint Conference on Artificial Intelligence, Barcelona, Spain. 1705.Google Scholar
- Xiaopeng Li, Ming Cheung, and James She 2016. Connection Discovery using Shared Images by Gaussian Relational Topic Model International Conference on Big Data. IEEE, 931--936.Google Scholar
- Kevin P Murphy. 2012. Machine learning: a probabilistic perspective. MIT press.Google ScholarDigital Library
- Michael Pazzani and Daniel Billsus 1997. Learning and revising user profiles: The identification of interesting web sites. Machine Learning, Vol. 27, 3 (1997), 313--331. Google ScholarDigital Library
- Michael J Pazzani. 1999. A framework for collaborative, content-based and demographic filtering. Artificial Intelligence Review Vol. 13, 5--6 (1999), 393--408.Google ScholarDigital Library
- Yunchen Pu, Zhe Gan, Ricardo Henao, Xin Yuan, Chunyuan Li, Andrew Stevens, and Lawrence Carin 2016natexlaba. Variational autoencoder for deep learning of images, labels and captions Advances in Neural Information Processing Systems. 2352--2360.Google Scholar
- Yunchen Pu, Xin Yuan, Andrew Stevens, Chunyuan Li, and Lawrence Carin 2016. A deep generative deconvolutional image model. In Proceedings of the 19th International Conference on Artificial Intelligence and Statistics. 741--750.Google Scholar
- Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme 2009. BPR: Bayesian personalized ranking from implicit feedback Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence. AUAI Press, 452--461.Google Scholar
- Ruslan Salakhutdinov and Andriy Mnih 2007. Probabilistic matrix factorization.. In Advances in Neural Information Processing Systems, Vol. Vol. 1. 2--1.Google ScholarDigital Library
- Ruslan Salakhutdinov, Andriy Mnih, and Geoffrey Hinton. 2007. Restricted Boltzmann machines for collaborative filtering Proceedings of the 24th International Conference on Machine Learning. ACM, 791--798.Google Scholar
- Ajit P Singh and Geoffrey J Gordon 2008. Relational learning via collective matrix factorization Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 650--658.Google Scholar
- Aaron Van den Oord, Sander Dieleman, and Benjamin Schrauwen. 2013. Deep content-based music recommendation. In Advances in Neural Information Processing Systems. 2643--2651.Google Scholar
- Pascal Vincent, Hugo Larochelle, Isabelle Lajoie, Yoshua Bengio, and Pierre-Antoine Manzagol. 2010. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. Journal of Machine Learning Research Vol. 11, Dec (2010), 3371--3408.Google ScholarDigital Library
- Chong Wang and David M Blei 2011. Collaborative topic modeling for recommending scientific articles Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 448--456.Google Scholar
- Hao Wang, Binyi Chen, and Wu-Jun Li 2013. Collaborative topic regression with social regularization for tag recommendation. International Joint Conference on Artificial Intelligence.Google Scholar
- Hao Wang, Naiyan Wang, and Dit-Yan Yeung 2015. Collaborative deep learning for recommender systems Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1235--1244.Google Scholar
- Hao Wang, SHI Xingjian, and Dit-Yan Yeung 2016. Collaborative recurrent autoencoder: Recommend while learning to fill in the blanks Advances in Neural Information Processing Systems. 415--423.Google Scholar
- Xinxi Wang and Ye Wang 2014. Improving content-based and hybrid music recommendation using deep learning Proceedings of the 22nd ACM International Conference on Multimedia. ACM, 627--636.Google Scholar
Index Terms
- Collaborative Variational Autoencoder for Recommender Systems
Recommendations
Mutually-Regularized Dual Collaborative Variational Auto-encoder for Recommendation Systems
WWW '22: Proceedings of the ACM Web Conference 2022Recently, user-oriented auto-encoders (UAEs) have been widely used in recommender systems to learn semantic representations of users based on their historical ratings. However, since latent item variables are not modeled in UAE, it is difficult to ...
Collaborative Multi-Auxiliary Information Variational Autoencoder for Recommender Systems
ICMLC '19: Proceedings of the 2019 11th International Conference on Machine Learning and ComputingCollaborative filtering is widely used in recommendation systems. Hybrid approach has been proposed since the collaborative-based method is susceptible to problems such as sparsity and cold start. Recently, the related methods have pointed out that it ...
A survey of autoencoder-based recommender systems
AbstractIn the past decade, recommender systems have been widely used to provide users with personalized products and services. However, most traditional recommender systems are still facing a challenge in dealing with the huge volume, complexity, and ...
Comments