ABSTRACT
Sentiment analysis has being used in several applications including the analysis of the repercussion of events in online social networks (OSNs), as well as to summarize public perception about products and brands on discussions on those systems. There are multiple methods to measure sentiments, varying from lexical-based approaches to machine learning methods. Despite the wide use and popularity of some those methods, it is unclear which method is better for identifying the polarity (i.e. positive or negative) of a message, as the current literature does not provide a comparison among existing methods. This comparison is crucial to allow us to understand the potential limitations, advantages, and disadvantages of popular methods in the context of OSNs messages. This work aims at filling this gap by presenting a comparison between 8 popular sentiment analysis methods. Our analysis compares these methods in terms of coverage and in terms of correct sentiment identification. We also develop a new method that combines existing approaches in order to provide the best coverage results with competitive accuracy. Finally, we present iFeel, a Web service which provides an open API for accessing and comparing results across different sentiment methods for a given text.
- Msn messenger emoticons. http://messenger.msn.com/Resource/Emoticons.aspx.Google Scholar
- Omg! oxford english dictionary grows a heart: Graphic symbol for love (and that exclamation) are added as words. tinyurl.com/klv36p.Google Scholar
- Sentistrength 2.0. http://sentistrength.wlv.ac.uk/Download.Google Scholar
- Yahoo messenger emoticons. http://messenger.yahoo.com/features/emoticons.Google Scholar
- Amazon. Amazon mechanical turk. https://www.mturk.com/. Accessed June 17, 2013.Google Scholar
- F. Benevenuto, G. Magno, T. Rodrigues, and V. Almeida. Detecting spammers on twitter. In Collaboration, Electronic messaging, Anti-Abuse and Spam Conference (CEAS), 2010.Google Scholar
- J. Bollen, A. Pepe, and H. Mao. Modeling public mood and emotion: Twitter sentiment and socio-economic phenomena. CoRR, abs/0911.1583, 2009.Google Scholar
- M. M. Bradley and P. J. Lang. Affective norms for english words (ANEW): Stimuli, instruction manual, and affective ratings. Technical report, Center for Research in Psychophysiology, University of Florida, Gainesville, Florida, 1999.Google Scholar
- E. Cambria, A. Hussain, C. Havasi, C. Eckl, and J. Munro. Towards crowd validation of the uk national health service. In ACM Web Science Conference (WebSci), 2010.Google Scholar
- E. Cambria, R. Speer, C. Havasi, and A. Hussain. Senticnet: A publicly available semantic resource for opinion mining. In AAAI Fall Symposium Series, 2010.Google Scholar
- M. Cha, H. Haddadi, F. Benevenuto, and K. P. Gummadi. Measuring User Influence in Twitter: The Million Follower Fallacy. In Int'l AAAI Conference on Weblogs and Social Media (ICWSM), 2010.Google Scholar
- P. S. Dodds and C. M. Danforth. Measuring the happiness of large-scale written expression: songs, blogs, and presidents. Journal of Happiness Studies, 11(4):441--456, 2009.Google ScholarCross Ref
- Esuli and Sebastiani. Sentwordnet: A publicly available lexical resource for opinion mining. In In Conference on Language Resources and Evaluation, 2006.Google Scholar
- P. Goncalves and F. Benevenuto. O que tweets contendo emoticons podem revelar sobre sentimentos coletivos? In II Brazilian Workshop on Social Network Analysis and Mining (BraSNAM), 2013.Google Scholar
- P. Goncalves, W. Dores, and F. Benevenuto. Panas-t: Uma escala psicometrica para analise de sentimentos no twitter. In I Brazilian Workshop on Social Network Analysis and Mining (BraSNAM), 2012.Google Scholar
- A. Hannak, E. Anderson, L. F. Barrett, S. Lehmann, A. Mislove, and M. Riedewald. Tweetin' in the rain: Exploring societal-scale effects of weather on mood. In Int'l AAAI Conference on Weblogs and Social Media (ICWSM), 2012.Google Scholar
- G. A. Miller. Wordnet: a lexical database for english. Communications of the ACM, 38(11):39--41, 1995. Google ScholarDigital Library
- J. Park, V. Barash, C. Fink, and M. Cha. Emoticon style: Interpreting differences in emoticons across cultures. In Int'l AAAI Conference on Weblogs and Social Media (ICWSM), 2013.Google Scholar
- J. Read. Using emoticons to reduce dependency in machine learning techniques for sentiment classification. In ACL Student Research Workshop, pages 43--48, 2005. Google ScholarDigital Library
- S. Somasundaran, J. Wiebe, and J. Ruppenhofer. Discourse level opinion interpretation. In Int'l Conference on Computational Linguistics (COLING), pages 801--808, 2008. Google ScholarDigital Library
- Y. R. Tausczik and J. W. Pennebaker. The psychological meaning of words: Liwc and computerized text analysis methods. Journal of Language and Social Psychology, 29(1):24--54, 2010.Google ScholarCross Ref
- M. Thelwall. Heart and soul: Sentiment strength detection in the social web with sentistrength. http://migre.me/fHgj9.Google Scholar
- H. Wang, D. Can, A. Kazemzadeh, F. Bar, and S. Narayanan. A system for real-time twitter sentiment analysis of 2012 u.s. presidential election cycle. In ACL System Demonstrations, 2012. Google ScholarDigital Library
- D. Watson and L. Clark. Development and validation of brief measures of positive and negative affect: the panas scales. Journal of Personality and Social Psychology, 54(1):1063--1070, 1985.Google Scholar
- K. Wickre. Celebrating twitter7. http://migre.me/fHgjA.Google Scholar
- T. Wilson, P. Hoffmann, S. Somasundaran, J. Kessler, J. Wiebe, Y. Choi, C. Cardie, E. Riloff, and S. Patwardhan. Opinionfinder: a system for subjectivity analysis. In HLT/EMNLP on Interactive Demonstrations, 2005. Google ScholarDigital Library
Index Terms
- Measuring sentiments in online social networks
Recommendations
Twitter Opinion Topic Model: Extracting Product Opinions from Tweets by Leveraging Hashtags and Sentiment Lexicon
CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge ManagementAspect-based opinion mining is widely applied to review data to aggregate or summarize opinions of a product, and the current state-of-the-art is achieved with Latent Dirichlet Allocation (LDA)-based model. Although social media data like tweets are ...
A: ) Is Worth a Thousand Words: How People Attach Sentiment to Emoticons and Words in Tweets
SOCIALCOM '13: Proceedings of the 2013 International Conference on Social ComputingEmoticons are widely used to express positive or negative sentiment on Twitter. We report on a study with live users to determine whether emoticons are used to merely emphasize the sentiment of tweets, or whether they are the main elements carrying the ...
Online Forums vs. Social Networks: Two Case Studies to Support eGovernment with Topic Opinion Analysis
EGOV 2013: Proceedings of the 12th IFIP WG 8.5 International Conference on Electronic Government - Volume 8074This paper suggests how eGovernment and public services can apply "topic-opinion" analysis developed in the EC IST FP7 WeGov project on citizens' opinions on the Internet. In many cases, discussion tracks on the Internet become quite long and complex. ...
Comments