ABSTRACT
With the fast development of web 2.0, user-centric publishing and knowledge management platforms, such as Wiki, Blogs, and Q & A systems attract a large number of users. Given the availability of the huge amount of meaningful user generated content, incremental model based recommendation techniques can be employed to improve users' experience using automatic recommendations. In this paper, we propose an incremental recommendation algorithm based on Probabilistic Latent Semantic Analysis (PLSA). The proposed algorithm can consider not only the users' long-term and short-term interests, but also users' negative and positive feedback. We compare the proposed method with several baseline methods using a real-world Question & Answer website called Wenda. Experiments demonstrate both the effectiveness and the efficiency of the proposed methods.
- Thomas Hofmann. Unsupervised Learning by Probabilistic Latent Semantic Analysis. Maching Learning Journal, Vol. 42, No. 1-2, pp. 177--196, 2001. Google ScholarDigital Library
- M. Girolami and A. Kaban. On an Equivalence Between PLSI and LDA. In: Proceedings of SIGIR, pp. 433--434, 2003. Google ScholarDigital Library
- Dempster A. P., Laird N. M., and Rubin D. B. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, Series B, Vol. 39, No. 1, pp. 1--38, 1977.Google Scholar
- Christophe G. Carrier. A Note on the Utility of Incremental Learning. AI Communications, Vol. 13, No. 4, pp. 215--223, 2000. Google ScholarDigital Library
- T. C. Chou and M.C Chen. Using Incremental PLSA for Threshold Resilient Online Event Anlysis. IEEE Transaction on Knowledge and Data Engineering, Vol. 20, No. 3, pp. 289--299, 2008. Google ScholarDigital Library
- T. Hofmann. Latent Semantic Models for Collaborative Filtering. ACM Transaction Information System, Vol. 22, No. 1, pp.89--115, 2004. Google ScholarDigital Library
- L. Zhang and C. Li, etc. An Efficient Solution to Factor Drifting Problem in the PLSA Model. In: Proceedings of the The Fifth International Conference on Computer and Information Technology, pp.175--181, 2005. Google ScholarDigital Library
- J. Zhang, Z. Ghahramani, and Y. Yang. A Probabilistic Model for Online Document Clustering with Applications to Novelty Detection. In Proceedings of NIPS, pp. 1617--1624, 2005.Google Scholar
- Arun C. Surendran and Suvrit Sra. Incremental Aspect Models for Mining Document Streams. 10th European Conferences on Principles and Practice of Knowledge Discovery, pp. 633--640, 2006. Google ScholarDigital Library
- J. T. Chien and M. S. Wu. Adaptive Bayesian Latent Semantic Analysis. IEEE Transactions on Audio, Speech, and Language Processing, Vol. 16, No. 1, pp. 198--207, 2008. Google ScholarDigital Library
- B. Marlin. Collaborative Filtering: A Machine Learning Perspective. Master's thesis, University of Toronto, 2004.Google Scholar
- Das A., Datar M., Garg A. and Rajaram S. Google News Personalization: Scalable Online Collaborative Filtering. In: Proc. of the 16th Int. Conf. on World Wide Web, pp. 270--280, 2007. Google ScholarDigital Library
- D. M. Blei, A. Ng, and M. I. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, 3, 993--1022, 2003. Google ScholarDigital Library
- R. M. Neal and G. E. Hinton. A View of the EM Algorithm that Justifies Incremental, Sparse, and other Variants. In Learning in Graphical Models. Kluwer Academic Press, pp. 355--368, 1998. Google ScholarCross Ref
- Arindam Banerjee and Sugato Basu. Topic Models over Text Streams: A Study of Batch and Online Unsupervised Learning. In: Proceedings of the SIAM International Conference on Data Mining (SDM), pp.437--442, 2007.Google ScholarCross Ref
- Asela Gunawardana, William Byrne. Convergence Theorems for Generalized Alternating Minimization Procedures. The Journal of Machine Learning Research, Vol. 6, pp. 2049--2073, 2005. Google ScholarDigital Library
- Y. B. Cao, H. Z. Duan, C. Y. Lin, Y. Yu, and H. W. Hon. Recommending Questions Using the MDL-based Tree Cut Model. In: Proc. of the 17th Int. Conf. on World Wide Web, 2008. Google ScholarDigital Library
- Lada, A. Adamic, J. Zhang, and etc. Knowledge Sharing and Yahoo Answers: Everyone Knows Something. In: Proc. of the 17th Int. Conf. on World Wide Web, pp. 665--674, 2008. Google ScholarDigital Library
- Z. Gyöngyi, G. Koutrika, etc. Questioning Yahoo! Answers. First WWW Workshop on Question Answering on the Web, 2008.Google Scholar
Index Terms
- Incremental probabilistic latent semantic analysis for automatic question recommendation
Recommendations
Probabilistic question recommendation for question answering communities
WWW '09: Proceedings of the 18th international conference on World wide webUser-Interactive Question Answering (QA) communities such as Yahoo! Answers are growing in popularity. However, as these QA sites always have thousands of new questions posted daily, it is difficult for users to find the questions that are of interest ...
Collaborative recommendation algorithm based on probabilistic matrix factorization in probabilistic latent semantic analysis
In order to effectively solve the problem of new items and obviously improve the accuracy of the recommended results, we proposed a collaborative recommendation algorithm based on improved probabilistic latent semantic model in this paper, which ...
Incremental probabilistic Latent Semantic Analysis for video retrieval
Recent research trends in Content-based Video Retrieval have shown topic models as an effective tool to deal with the semantic gap challenge. In this scenario, this paper has a dual target: (1) it is aimed at studying how the use of different topic ...
Comments