ABSTRACT
Twitter is a globally used micro-blogging platform on which hundreds of millions of tweets are sent every day. Researchers have explored Twitter analytics across a wide range of areas, including topic modeling, sentiment analysis and event detection, as well as domain-specific applications such as disaster management. One area that has not been explored is how changes in sentiment can be used to identify events. In this paper we present a scalable Cloud-based platform for harvesting, processing, analyzing and visualizing large-scale Twitter data, with a particular focus on how changes in sentiment can be used to identify events in given contexts. The novelty is that the detected events do not depend explicitly on the topic of any given tweet, but entirely on the change in sentiment. To illustrate the approach, we present case studies of sporting events identified entirely through changing sentiment, focusing on the 2014 FIFA World Cup of Soccer and the 2015 World Cup of Cricket.
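The idea of flagging events purely from a change in sentiment can be illustrated with a minimal sketch. This is not the platform described in the paper: it assumes each tweet has already been assigned a polarity score in [-1, 1] by some classifier and a time bucket (for example, the minute it was posted). The function names and the threshold value are hypothetical, chosen only for illustration.

```python
from collections import defaultdict
from statistics import mean


def mean_sentiment_per_bucket(tweets):
    """Average the polarity of tweets falling in the same time bucket.

    tweets: iterable of (bucket, polarity) pairs (assumed input format).
    Returns a list of (bucket, mean_polarity) sorted by bucket.
    """
    buckets = defaultdict(list)
    for bucket, polarity in tweets:
        buckets[bucket].append(polarity)
    return [(b, mean(values)) for b, values in sorted(buckets.items())]


def detect_sentiment_shifts(series, threshold=0.5):
    """Flag buckets whose mean polarity differs from the previous
    bucket by at least `threshold` (an arbitrary illustrative value).

    No tweet text or topic is inspected; only the sentiment change.
    """
    events = []
    for (_, previous), (bucket, current) in zip(series, series[1:]):
        change = current - previous
        if abs(change) >= threshold:
            events.append((bucket, change))
    return events


# Toy data: sentiment is neutral, jumps positive, then drops sharply.
tweets = [(0, 0.0), (0, 0.0), (1, 0.0), (2, 1.0), (2, 1.0), (3, 1.0), (4, -1.0)]
series = mean_sentiment_per_bucket(tweets)
events = detect_sentiment_shifts(series)
```

In this toy run, buckets 2 and 4 are flagged because the mean polarity changes sharply there, regardless of what the tweets were about. A real system would need to handle sparse buckets and noisy scores, which this sketch ignores.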