Abstract
The recent outbreak of coronavirus disease (COVID-19) has vividly proven to be a global outbreak that has caused millions of lives and left the world in distress. Many research agencies and government agencies are still trying to find the cause and origin. The various measures and policies were taken care of by the government to assure the minimal effect on the spread of the virus. However, a major challenge was to determine the mental status of the patient while he was abandoned by the rest of the world. Further, the basic concern is to relate the trivial features of sudden changes in their daily lives, lockdown rules, and overall economy rate. In this paper, we propose a topic extraction and sentiment framework for Twitter and CORD-19 data to analyze themes across text corpus. The collected tweet dataset is used to gain insights into people’s emotions and how they are responding to measures taken during the pandemic crisis.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
World Health Organization: Coronavirus disease (covid-19) pandemic (2020), https://www.who.int/emergencies/diseases/novel-coronavirus-2019, [Online; accessed 2020–09-17]
Blei, D. M., & Lafferty, J. D. (2006, June). Dynamic topic models. In Proceedings of the 23rd international conference on Machine learning (pp. 113-120).
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of machine Learning research, 3(Jan), 993-1022.
Chen, E., Lerman, K., & Ferrara, E. (2020). Covid-19: The first public coronavirus twitter dataset. arXiv preprint arXiv:2003.07372.
https://en.wikipedia.org/wiki/Template:COVID-19_pandemic_data
Cinelli, M., Quattrociocchi, W., Galeazzi, A., Valensise, C. M., Brugnoli, E., Schmidt, A. L.,& Scala, A. (2020). The covid-19 social media infodemic. arXiv preprint arXiv:2003.05004.
Yin, H., Yang, S., & Li, J. (2020). Detecting Topic and Sentiment Dynamics Due to COVID-19 Pandemic Using Social Media. arXiv preprint arXiv:2007.02304.
Prabhakar Kaila, D., & Prasad, D. A. (2020). Informational flow on Twitter–Corona virus outbreak–topic modelling approach. International Journal of Advanced Research in Engineering and Technology (IJARET), 11(3).
Huang, B., & Carley, K. M. (2020). Disinformation and Misinformation on Twitter during the Novel Coronavirus Outbreak. arXiv preprint arXiv:2006.04278.
Zhou, J., Yang, S., Xiao, C., & Chen, F. (2020). Examination of community sentiment dynamics due to covid-19 pandemic: a case study from Australia. arXiv preprint arXiv:2006.12185.
Venigalla, A. S. M., Vagavolu, D., & Chimalakonda, S. (2020). Mood of India During Covid-19–An Interactive Web Portal Based on Emotion Analysis of Twitter Data. arXiv preprint arXiv:2005.02955.
Hou, Z., Du, F., Jiang, H., Zhou, X., & Lin, L. (2020). Assessment of public attention, risk perception, emotional and behavioural responses to the COVID-19 outbreak: social media surveillance in China. Risk Perception, Emotional and Behavioural Responses to the COVID-19 Outbreak: Social Media Surveillance in China (3/6/2020).
Mathur, A., Kubde, P., & Vaidya, S. (2020, June). Emotional Analysis using Twitter Data during Pandemic Situation: COVID-19. In 2020 5th International Conference on Communication and Electronics Systems (ICCES) (pp. 845–848). IEEE.
Doanvo, A. L., Qian, X., Ramjee, D., Piontkivska, H., Desai, A. N., & Majumder, M. S. (2020). Machine Learning Maps Research Needs in COVID-19 Literature. bioRxiv.
Samuel, J., Ali, G. G., Rahman, M., Esawi, E., & Samuel, Y. (2020). Covid-19 public sentiment insights and machine learning for tweets classification. Information, 11(6), 314.
Nwankwo, E., Okolo, C., & Habonimana, C. Topic Modeling Approaches for Understanding COVID-19 Misinformation Spread in Sub-Saharan Africa.
Do, H. J., Lim, C. G., Kim, Y. J., & Choi, H. J. (2016, January). Analyzing emotions in twitter during a crisis: A case study of the 2015 Middle East Respiratory Syndrome outbreak in Korea. In 2016 international conference on big data and smart computing (BigComp) (pp. 415–418). IEEE.
Queiroz, M. M., Ivanov, D., Dolgui, A., & Wamba, S. F. (2020). Impacts of epidemic outbreaks on supply chains: mapping a research agenda amid the COVID-19 pandemic through a structured literature review. Annals of Operations Research, 1–38.
Shuja, J., Alanazi, E., Alasmary, W., & Alashaikh, A. (2020). Covid-19 open-source data sets: A comprehensive survey. Applied Intelligence, 1–30.
Tweet dataset at https://www.kaggle.com/sandhyaavasthi/covid19-tweetsjuly2020december2020
Roesslein, J. (2009). tweepy Documentation. Online], . readthedocs. io/en/v3, 5.
Wang, L. L., Lo, K., Chandrasekhar, Y., Reas, R., Yang, J., Eide, D., & Kohlmeier, S. (2020). Cord-19: The covid-19 open research dataset. ArXiv.
Avasthi, S., Chauhan, R., & Acharjya, D. P. (2020). Techniques, Applications, and Issues in Mining Large-Scale Text Databases. Advances in Information Communication Technology and Computing: Proceedings of AICTC, 2019, 385.
Avasthi, S., Chauhan, R., & Acharjya, D. P. (2021) Processing Large Text Corpus using N-gram language modeling and smoothing. In International Conference on ‘Information Management & Machine Intelligence’, Springer.
Rehurek, R., & Sojka, P. (2010). Software framework for topic modelling with large corpora. In In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks.
Kumar, A., & Garg, G. (2019). Sentiment analysis of multimodal twitter data. Multimedia Tools and Applications, 78(17), 24103–24119.
Carchiolo, V., Longheu, A., & Malgeri, M. (2015, September). Using twitter data and sentiment analysis to study diseases dynamics. In International Conference on Information Technology in Bio-and Medical Informatics (pp. 16–24). Springer, Cham.
Hutto, C.J. & Gilbert, E.E. (2014). VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text. Eighth International Conference on Weblogs and Social Media (ICWSM-14). Ann Arbor, MI, June 2014.
Elbagir, S., & Yang, J. (2019). Twitter Sentiment Analysis Using Natural Language Toolkit and VADER Sentiment. In Proceedings of the International MultiConference of Engineers and Computer Scientists (pp. 122–16).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Avasthi, S., Chauhan, R., Acharjya, D.P. (2022). Information Extraction and Sentiment Analysis to Gain Insight into the COVID-19 Crisis. In: Khanna, A., Gupta, D., Bhattacharyya, S., Hassanien, A.E., Anand, S., Jaiswal, A. (eds) International Conference on Innovative Computing and Communications. Advances in Intelligent Systems and Computing, vol 1387. Springer, Singapore. https://doi.org/10.1007/978-981-16-2594-7_28
Download citation
DOI: https://doi.org/10.1007/978-981-16-2594-7_28
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-2593-0
Online ISBN: 978-981-16-2594-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)