ABSTRACT
With product reviews growing in depth and becoming more numerous, it is growing challenge to acquire a comprehensive understanding of their contents, for both customers and product manufacturers. We built a system that automatically summarizes a large collection of product reviews to generate a concise summary. Importantly, our system not only extracts the review sentiments but also the underlying justification for their opinion. We solve this problem through a novel application of clustering and validate our approach through an empirical study, obtaining good performance as judged by F-measure (the harmonic mean of purity and inverse purity).
- R. Agrawal and R. Srikant. Fast Algorithms for Mining Association Rules. In Proc. of 20th International Conference on Very Large Data Bases (VLDB'94), pages 487--499, 1994. Google ScholarDigital Library
- J. Carbonell and J. Goldstein. The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries. In Proc. of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '98), pages 335--336, 1998. Google ScholarDigital Library
- X. Ding, B. Liu, and P. S. Yu. A Holistic Lexicon-based Approach to Opinion Mining. In Proc. of the International Conference on Web Search and Web Data Mining (WSDM'08), pages 231--240, 2008. Google ScholarDigital Library
- A. Hotho, A. Nürnberger, and G. Paaß. A Brief Survey of Text Mining. GLDV-Journal for Computational Linguistics and Language Technology, 20(1):19--62, 2005.Google Scholar
- M. Hu and B. Liu. Mining and Summarizing Customer Reviews. In Proc. of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'04), pages 168--177, 2004. Google ScholarDigital Library
- M. Hu and B. Liu. Mining Opinion Features in Customer Reviews. In Proc. of the 19th National Conference on Artificial Intelligence (AAAI-2004), pages 755--760, 2004. Google ScholarDigital Library
- S.-M. Kim and E. Hovy. Determining the Sentiment of Opinions. In Proc. of the 20th International Conference on Computational Linguistics (COLING 2004), pages 1367--1374, 2004. Google ScholarDigital Library
- S.-M. Kim and E. Hovy. Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text. In Proc. of the Workshop on Sentiment and Subjectivity in Text co-located with the 44th Annual Meeting of the Association for Computational Linguistics (ACL 2006), pages 1--8, 2006. Google ScholarDigital Library
- B. Liu, M. Hu, and J. Cheng. Opinion Observer: Analyzing and Comparing Opinions on the Web. In Proc. of the 14th International World Wide Web Conference (WWW2005), pages 342--351, 2005. Google ScholarDigital Library
- G. A. Miller. WordNet: A Lexical Database for English. Communications of the ACM, 38(11):39--41, 1995. Google ScholarDigital Library
- A.-M. Popescu and O. Etzioni. Extracting Product Features and Opinions from Reviews. In Proc. of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP 2005), pages 339--346, 2005. Google ScholarDigital Library
- D. R. Radev and K. R. McKeown. Generating Natural Language Summaries from Multiple On-line Sources. Computational Linguistics, 24(3):470--500, 1998. Google ScholarDigital Library
- H. Spath. The Cluster Dissection and Analysis Theory FORTRAN Programs Examples. Prentice-Hall, Inc., 1985. Google ScholarDigital Library
Index Terms
- Product review summarization from a deeper perspective
Recommendations
Movie review mining and summarization
CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge managementWith the flourish of the Web, online review is becoming a more and more useful and important information resource for people. As a result, automatic review mining and summarization has become a hot research topic recently. Different from traditional ...
Brand-Related Events Detection, Classification and Summarization on Twitter
WI-IAT '12: Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01The huge and ever increasing amount of text generated by Twitter users everyday embeds a wealth of information, in particular, about themes that become suddenly relevant to many users as well as about the sentiment polarity that users tend to associate ...
Query-relevant sentiment summarization based on facet identification and sentence clustering
SAC '22: Proceedings of the 37th ACM/SIGAPP Symposium on Applied ComputingMulti-document sentiment analysis is an important natural language processing problem. Summaries generated by these analyzers can greatly reduce the time necessary to read a collection of topically-related documents to locate the desired information ...
Comments