skip to main content
10.1145/3357384.3358060acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

#suicidal - A Multipronged Approach to Identify and Explore Suicidal Ideation in Twitter

Published:03 November 2019Publication History

ABSTRACT

Technological advancements have led to the creation of social media platforms like Twitter, where people have started voicing their views over rarely discussed and socially stigmatizing issues. Twitter, is increasingly being used for studying psycho-linguistic phenomenon spanning from expressions of adverse drug reactions, depressions, to suicidality. In this work we focus on identifying suicidal posts from Twitter. Towards this objective we take a multipronged approach and implement different neural network models such assequential models andgraph convolutional networks, that are trained on textual content shared in Twitter, the historical tweeting activity of the users and social network formed between different users posting about suicidality. We train a stacked ensemble of classifiers representing different aspects of suicidal tweeting activity, and achieve state-of-the-art results on a new manually annotated dataset developed by us, that contains textual as well as network information of suicidal tweets. We further investigate into the trained models and perform qualitative analysis showing how historical tweeting activity and rich information embedded in the homophily networks amongst users in Twitter, aids in accurately identifying tweets expressing suicidal intent.

References

  1. Silvio Amir, Byron C Wallace, Hao Lyu, and Paula Carvalho Mário J Silva. 2016. Modelling context with user embeddings for sarcasm detection in social media. arXiv preprint arXiv:1607.00976 (2016).Google ScholarGoogle Scholar
  2. Pinkesh Badjatiya, Shashank Gupta, Manish Gupta, and Vasudeva Varma. 2017. Deep learning for hate speech detection in tweets. In Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 759--760.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Sairam Balani and Munmun De Choudhury. 2015. Detecting and characterizing mental health related self-disclosure in social media. In Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems. ACM, 1373--1378.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Adrian Benton, Margaret Mitchell, and Dirk Hovy. 2017. Multi-task learning for mental health using social media text. arXiv preprint arXiv:1712.03538 (2017).Google ScholarGoogle Scholar
  5. Steven Bird and Edward Loper. 2004. NLTK: the natural language toolkit. In Proceedings of the ACL 2004 on Interactive poster and demonstration sessions. Association for Computational Linguistics, 31.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Louise Brådvik, Cecilia Mattisson, Mats Bogren, and Per Nettelbladt. 2008. Long-term suicide risk of depression in the Lundby cohort 1947--1997--severity and gender. Acta Psychiatrica Scandinavica , Vol. 117, 3 (2008), 185--191.Google ScholarGoogle ScholarCross RefCross Ref
  7. Pete Burnap, Walter Colombo, and Jonathan Scourfield. 2015. Machine classification and analysis of suicide-related communication on twitter. In Proceedings of the 26th ACM conference on hypertext & social media. ACM, 75--84.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Patricia A Cavazos-Rehg, Melissa J Krauss, Shaina Sowles, Sarah Connolly, Carlos Rosas, Meghana Bharadwaj, and Laura J Bierut. 2016. A content analysis of depression-related tweets. Computers in human behavior , Vol. 54 (2016), 351--357.Google ScholarGoogle Scholar
  9. Stevie Chancellor, Michael L Birnbaum, Eric D Caine, Vincent Silenzio, and Munmun De Choudhury. 2019. A Taxonomy of Ethical Tensions in Inferring Mental Health States from Social Media. In Proceedings of the 2nd ACM Conference on Fairness, Accountability, and Transparency (Atlanta GA .Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Gualtiero B Colombo, Pete Burnap, Andrei Hodorog, and Jonathan Scourfield. 2016. Analysing the connectivity and communication of suicidal users on twitter. Computer communications , Vol. 73 (2016), 291--300.Google ScholarGoogle Scholar
  11. Munmun De Choudhury, Michael Gamon, Scott Counts, and Eric Horvitz. 2013. Predicting depression via social media. ICWSM , Vol. 13 (2013), 1--10.Google ScholarGoogle Scholar
  12. Janez Demvs ar. 2006. Statistical comparisons of classifiers over multiple data sets. Journal of Machine learning research , Vol. 7, Jan (2006), 1--30.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Jasper Friedrichs, Debanjan Mahata, and Shubham Gupta. 2018. InfyNLP at SMM4H task 2: stacked ensemble of shallow convolutional neural networks for identifying personal medication intake from Twitter. arXiv preprint arXiv:1803.07718 (2018).Google ScholarGoogle Scholar
  14. Madelyn Gould, Patrick Jamieson, and Daniel Romer. 2003. Media contagion and suicide among the young. American Behavioral Scientist , Vol. 46, 9 (2003), 1269--1284.Google ScholarGoogle ScholarCross RefCross Ref
  15. Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 855--864.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Sharath Chandra Guntuku, David B Yaden, Margaret L Kern, Lyle H Ungar, and Johannes C Eichstaedt. 2017. Detecting depression and mental illness on social media: an integrative review. Current Opinion in Behavioral Sciences , Vol. 18 (2017), 43--49.Google ScholarGoogle ScholarCross RefCross Ref
  17. Nina Jacob, Jonathan Scourfield, and Rhiannon Evans. 2014. Suicide prevention via the internet. Crisis (2014).Google ScholarGoogle Scholar
  18. Jared Jashinsky, Scott H Burton, Carl L Hanson, Josh West, Christophe Giraud-Carrier, Michael D Barnes, and Trenton Argyle. 2014. Tracking suicide risk factors through Twitter in the US. Crisis (2014).Google ScholarGoogle Scholar
  19. Abhinav Khattar, Karan Dabas, Kshitij Gupta, Shaan Chopra, and Ponnurangam Kumaraguru. 2018. White or Blue, the Whale gets its Vengeance: A Social Media Analysis of the Blue Whale Challenge. arXiv preprint arXiv:1801.05588 (2018).Google ScholarGoogle Scholar
  20. Yoon Kim. 2014. Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882 (2014).Google ScholarGoogle Scholar
  21. Thomas N Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).Google ScholarGoogle Scholar
  22. Depeng Liang and Yongdong Zhang. 2016. AC-BLSTM: Asymmetric Convolutional Bidirectional LSTM Networks for Text Classification. arXiv preprint arXiv:1611.01884 (2016).Google ScholarGoogle Scholar
  23. Debanjan Mahata, Jasper Friedrichs, Rajiv Ratn Shah, et almbox. 2018. # phramacovigilance-Exploring Deep Learning Techniques for Identifying Mentions of Medication Intake from Twitter. arXiv preprint arXiv:1805.06375 (2018).Google ScholarGoogle Scholar
  24. Debanjan Mahata, John R Talburt, and Vivek Kumar Singh. 2015. From chirps to whistles: discovering event-specific informative content from Twitter. In Proceedings of the ACM web science conference. ACM, 17.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Rada Mihalcea and Paul Tarau. 2004. Textrank: Bringing order into text. In Proceedings of the 2004 conference on empirical methods in natural language processing .Google ScholarGoogle Scholar
  26. Pushkar Mishra, Marco Del Tredici, Helen Yannakoudakis, and Ekaterina Shutova. 2018. Author profiling for abuse detection. In Proceedings of the 27th International Conference on Computational Linguistics. 1088--1098.Google ScholarGoogle Scholar
  27. Bridianne O'Dea, Melinda R Achilles, Mark E Larsen, Philip J Batterham, Alison L Calear, and Helen Christensen. 2018. The rate of reply and nature of responses to suicide-related posts on Twitter. Internet interventions , Vol. 13 (2018), 105--107.Google ScholarGoogle Scholar
  28. Bridianne O'Dea, Stephen Wan, Philip J Batterham, Alison L Calear, Cecile Paris, and Helen Christensen. 2015. Detecting suicidality on Twitter. Internet Interventions , Vol. 2, 2 (2015), 183--188.Google ScholarGoogle ScholarCross RefCross Ref
  29. Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining . ACM, 701--710.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Jing Qian, Mai ElSherief, Elizabeth M Belding, and William Yang Wang. 2018. Leveraging Intra-User and Inter-User Representation Learning for Automated Hate Speech Detection. arXiv preprint arXiv:1804.03124 (2018).Google ScholarGoogle Scholar
  31. J Ross Quinlan et almbox. 1996. Bagging, boosting, and C4. 5. In AAAI/IAAI, Vol. 1. 725--730.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Abeed Sarker, Rachel Ginn, Azadeh Nikfarjam, Karen O'Connor, Karen Smith, Swetha Jayaraman, Tejaswi Upadhaya, and Graciela Gonzalez. 2015. Utilizing social media data for pharmacovigilance: a review. Journal of biomedical informatics , Vol. 54 (2015), 202--212.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Ramit Sawhney, Prachi Manchanda, Puneet Mathur, Rajiv Shah, and Raj Singh. 2018a. Exploring and Learning Suicidal Ideation Connotations on Social Media with Deep Learning. In Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis . 167--175.Google ScholarGoogle ScholarCross RefCross Ref
  34. Ramit Sawhney, Prachi Manchanda, Raj Singh, and Swati Aggarwal. 2018b. A computational approach to feature extraction for identification of suicidal ideation in tweets. In Proceedings of ACL 2018, Student Research Workshop. 91--98.Google ScholarGoogle ScholarCross RefCross Ref
  35. Elizabeth M Seabrook, Margaret L Kern, Ben D Fulcher, and Nikki S Rickard. 2018. Predicting depression from language-based emotion dynamics: longitudinal analysis of Facebook and Twitter status updates. Journal of medical Internet research , Vol. 20, 5 (2018), e168.Google ScholarGoogle ScholarCross RefCross Ref
  36. Zeerak Waseem and Dirk Hovy. 2016. Hateful symbols or hateful people? predictive features for hate speech detection on twitter. In Proceedings of the NAACL student research workshop. 88--93.Google ScholarGoogle ScholarCross RefCross Ref
  37. David H Wolpert. 1992. Stacked generalization. Neural networks , Vol. 5, 2 (1992), 241--259.Google ScholarGoogle Scholar
  38. RF Woolson. 2007. Wilcoxon signed-rank test. Wiley encyclopedia of clinical trials (2007), 1--3.Google ScholarGoogle Scholar
  39. Yi Yang and Jacob Eisenstein. 2017. Overcoming language variation in sentiment analysis with social attention. Transactions of the Association for Computational Linguistics , Vol. 5 (2017), 295--307.Google ScholarGoogle ScholarCross RefCross Ref
  40. Yang Yu, Xiaojun Wan, and Xinjie Zhou. 2016. User embedding for scholarly microblog recommendation. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Vol. 2. 449--453.Google ScholarGoogle ScholarCross RefCross Ref
  41. Reza Zafarani, Mohammad Ali Abbasi, and Huan Liu. 2014. Social media mining: an introduction .Cambridge University Press.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Ziqi Zhang and Lei Luo. 2018. Hate Speech Detection: A Solved Problem? The Challenging Case of Long Tail on Twitter. arXiv preprint arXiv:1803.03662 (2018).Google ScholarGoogle Scholar
  43. Qingyuan Zhao, Murat A Erdogdu, Hera Y He, Anand Rajaraman, and Jure Leskovec. 2015. Seismic: A self-exciting point process model for predicting tweet popularity. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1513--1522.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Peng Zhou, Zhenyu Qi, Suncong Zheng, Jiaming Xu, Hongyun Bao, and Bo Xu. 2016. Text classification improved by integrating bidirectional LSTM with two-dimensional max pooling. arXiv preprint arXiv:1611.06639 (2016).Google ScholarGoogle Scholar

Index Terms

  1. #suicidal - A Multipronged Approach to Identify and Explore Suicidal Ideation in Twitter

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management
      November 2019
      3373 pages
      ISBN:9781450369763
      DOI:10.1145/3357384

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 November 2019

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      CIKM '19 Paper Acceptance Rate202of1,031submissions,20%Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader