skip to main content
10.1145/2996890.2996898acmotherconferencesArticle/Chapter ViewAbstractPublication PagesuccConference Proceedingsconference-collections
research-article

My ever changing moods: sentiment-based event detection on the cloud

Authors Info & Claims
Published:06 December 2016Publication History

ABSTRACT

Twitter is a globally used micro-blogging platform with hundreds of millions of tweets sent every day. Many researchers have explored Twitter analytics across a wide range of areas such as topic modeling, sentiment analysis, event detection, as well as the application of Twitter for a variety of domain-specific application areas, e.g. disaster management. One area that has not been explored is how changes in sentiment can be used to identify events. In this paper we present a scalable Cloud-based platform for harvesting, processing, analyzing and visualizing large-scale Twitter data. We focus especially on how changes in sentiment can be used to identify events in given contexts. What is novel is that the events that are detected are not dependent explicitly on the topic of any given tweet, but entirely on the change in sentiment. This offers new capabilities for event detection that have hitherto not been explored. To illustrate the approach, we present case studies related to sporting events identified entirely through changing sentiment with specific focus on the 2014 FIFA World Cup of Soccer and the 2015 World Cup of Cricket. (Abstract)

References

  1. S. Rosenthal, A. Ritter, P. Nakov, and V. Stoyanov (2009). Sentiment analysis in twitter. In Proceedings of the 8th International Workshop on Semantic Evaluation, http://www.aclweb.org/anthology/S14-2009.Google ScholarGoogle Scholar
  2. F. Atefeh, W. Khreich, (2015). A survey of techniques for event detection in twitter. Computational Intelligence, 31(1), pp.132--164. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R.O. Sinnott, S. Yin, (2015). Accident Black Spot Identification, Verification and Prediction through Social Media, IEEE International Conference on Data Science and Data Intensive Systems, Sydney, Australia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Cataldi, L. Di Caro, C. Schifanella, (2010). Emerging topic detection on twitter based on temporal and social terms evaluation. In Proceedings of the Tenth International Workshop on Multimedia Data Mining (p. 4). ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Bollen, H. Mao, X. Zeng, (2011). Twitter mood predicts the stock market. Journal of Computational Science, 2(1), pp.1--8.Google ScholarGoogle ScholarCross RefCross Ref
  6. M. Gaurav, A. Srivastava, A. Kumar, S. Miller, (2013). Leveraging candidate popularity on Twitter to predict election outcome. In Proceedings of the 7th Workshop on Social Network Mining and Analysis (p. 7). ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Feldman, (2013). Techniques and applications for sentiment analysis. Communications of the ACM, 56(4), 82--89. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Kaur, V. Gupta, (2013). A Survey on Sentiment Analysis and Opinion Mining Techniques. Journal of Emerging Technologies in Web Intelligence, 5(4), 367--371Google ScholarGoogle ScholarCross RefCross Ref
  9. U. Pavalanathan, J. Eisenstein, (2015). Emoticons vs. emojis on Twitter: A causal inference approach. arXiv preprint arXiv:1510.08480.Google ScholarGoogle Scholar
  10. K. Gimpel, N. Schneider, B. O'Connor, D. Das, D. Mills, J. Eisenstein, M. Heilman, D. Yogatama, J. Flanigan, N. Smith (2011). Part-of-speech tagging for twitter: Annotation, features, and experiments. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers-Volume 2 (pp. 42--47). Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Pak, P. Paroubek, (2010). Twitter as a Corpus for Sentiment Analysis and Opinion Mining. In LREc (Vol. 10, pp. 1320--1326).Google ScholarGoogle Scholar
  12. E. Kouloumpis, T. Wilson, J. Moore, (2011). Twitter sentiment analysis: The good the bad and the omg!. ICWSM, 11, pp.538--541.Google ScholarGoogle Scholar
  13. D. Manning, C. Surdeanu, M. Bauer, J. Finkel, J. Bethard, D. McClosky, The Stanford CoreNLP Natural Language Processing Toolkit, In ACL (System Demonstrations), pp. 55--60.Google ScholarGoogle Scholar
  14. TextBlob, https://pypi.python.org/pypi/textblobGoogle ScholarGoogle Scholar
  15. X. Hu, et al. (2013) "Unsupervised sentiment analysis with emotional signals." Proceedings of the 22nd international conference on World Wide Web. International World Wide Web Conferences Steering Committee, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. G. Paltoglou M. Thelwall, (2012). "Twitter, MySpace, Digg: Unsupervised sentiment analysis in social media." ACM Transactions on Intelligent Systems and Technology (TIST) 3.4 (2012): 66. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. F.N. Ribeiro, M. Araújo, P. Gonçalves, M.A. Gonçalves, F. Benevenuto, F. (2016). SentiBench-a benchmark comparison of state-of-the-practice sentiment analysis methods. EPJ Data Science, 5(1), pp.1--29.Google ScholarGoogle ScholarCross RefCross Ref
  18. A. Giachanou, F. Crestani (2016). Like It or Not: A Survey of Twitter Sentiment Analysis Methods. ACM Computing Surveys (CSUR), 49(2), p.28. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R.O. Sinnott, S. Cui, (2016). Benchmarking Sentiment Analysis Approaches on the Cloud, submitted to 22nd IEEE International Conference on Parallel and Distributed Systems (ICPADS 2016), Wuhan, China.Google ScholarGoogle Scholar
  20. Y. Yang, T. Pierce, J. Carbonell. (1998) A Study on Retrospective and On-Line Event Detection, Proceedings of the ACM SIGIR conference, Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. Maeireizo, D. Litman. (2004) Co-training for predicting emotions with spoken dialogue data. Association for Computational Linguistics Stroudsburg, Article No. 28, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. D. Koller, S. Tong. Support vector machine active learning with applications to text classification. The Journal of Machine Learning Research archive, 2: 45{66, January 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. A. Rajaraman; J.D Ullman (2011). Introduction to sentiment analysis. Data Mining: Mining of Massive Datasets, ISBN 9781139058452:1--17.Google ScholarGoogle Scholar
  24. J. Kleinberg. (2002) Bursty and hierarchical structure in streams. In KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 91--101, New York, NY, USA, 2002. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. J. Weng, B.S. Lee, (2011). Event Detection in Twitter. ICWSM, 11, pp.401--408.Google ScholarGoogle Scholar
  26. T. Sakaki, M. Okazaki, Y. Matsuo. (2010). Earthquake shakes Twitter users: real-time event detection by social sensors. In Proceedings of the 19th international conference on World wide web (pp. 851--860). ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Apache CouchDb http://couchdb.apache.org/.Google ScholarGoogle Scholar
  28. Boto, https://pypi.python.org/pypi/botoGoogle ScholarGoogle Scholar
  29. Ansible, https://pypi.python.org/pypi/ansibleGoogle ScholarGoogle Scholar
  30. Zdenek Zabokrtsky. Feature engineering in machine learning. http://ufal.mff.cuni.cz/~zabokrtsky/courses/npfl104/html/feature_engineering.pdfGoogle ScholarGoogle Scholar
  31. A.G. Jivani. A comparative study of stemming algorithms. Int. J. Comp. Tech. Appl, Vol 2 (6):1930--1938.Google ScholarGoogle Scholar
  32. S. Asur, B. A. Huberman (2010). Predicting the future with social media. In Proceedings of the International Conference on Web Intelligence and Intelligent Agent Technology Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. M. Hofmann, R. Klinkenberg (2013). RapidMiner: Data mining use cases and business analytics applications. CRC Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, I. Witten (2009). The WEKA data mining software: an update. ACM SIGKDD explorations newsletter, 11(1), pp.10--18. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. L.A. Smith, T.J. Monk, R..S. Mitchell, G. Holme (1994). Geometric comparison of classifcations and rule sets. Workshop on Knowledge Discovery in Databases. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. I. Rish. An empirical study of the Naive Bayes Classifier. RC 22230 (W0111-014).Google ScholarGoogle Scholar
  37. Tom M. Mitchell. (1997), Lecture slides for textbook machine learning. http://www.cs.cmu.edu/afs/cs/project/theo-20/www/mlbook/ch3.pdf.Google ScholarGoogle Scholar
  38. Hadoop, http://hadoop.apache.orgGoogle ScholarGoogle Scholar
  39. Apache Spark, http://spark.apache.orgGoogle ScholarGoogle Scholar
  40. S. Wu, L. Morandini, R.O. Sinnott, (2015) SMASH: A Cloud-based Architecture for Big Data Processing and Visualization of Traffic Data, IEEE International Conference on Data Science and Data Intensive Systems, Sydney, Australia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. M. Nino-Ruiz, C. Bayliss, G. Galang, G. Grazioli, R. Rabanal, R.O. Sinnott, M. Tomko, (2014) Elastic scaling of e-Infrastructures to Support Data-intensive Research Collaborations, International Conference on e-Science 2014, Sao Paolo, Brazil. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    UCC '16: Proceedings of the 9th International Conference on Utility and Cloud Computing
    December 2016
    549 pages
    ISBN:9781450346160
    DOI:10.1145/2996890

    Copyright © 2016 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 6 December 2016

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    Overall Acceptance Rate38of125submissions,30%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader