ABSTRACT
Technological advancements have led to the creation of social media platforms like Twitter, where people have started voicing their views on rarely discussed and socially stigmatized issues. Twitter is increasingly being used to study psycholinguistic phenomena ranging from expressions of adverse drug reactions and depression to suicidality. In this work, we focus on identifying suicidal posts on Twitter. Toward this objective, we take a multipronged approach and implement different neural network models, such as sequential models and graph convolutional networks, that are trained on the textual content shared on Twitter, the historical tweeting activity of the users, and the social network formed between different users posting about suicidality. We train a stacked ensemble of classifiers representing different aspects of suicidal tweeting activity and achieve state-of-the-art results on a new, manually annotated dataset that we developed, which contains textual as well as network information of suicidal tweets. We further investigate the trained models and perform a qualitative analysis showing how historical tweeting activity and the rich information embedded in the homophily networks among Twitter users aid in accurately identifying tweets expressing suicidal intent.
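To make the idea of a stacked ensemble concrete, the following is a minimal sketch of stacked generalization, not the implementation used in this work. It assumes three hypothetical base models (one per view: text, tweeting history, and user network) whose held-out probability outputs are already available, and it fits a simple logistic-regression meta-classifier on those outputs. The toy numbers, the choice of meta-classifier, and all function names are illustrative assumptions.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real score to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def train_meta_classifier(base_probs, labels, epochs=2000, lr=0.5):
    """Fit a logistic-regression meta-learner with batch gradient descent.

    base_probs: one row per tweet, one column per base model's probability.
    labels: 1 for a tweet annotated as suicidal, 0 otherwise.
    """
    n_features = len(base_probs[0])
    n_samples = len(labels)
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(base_probs, labels):
            p = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
            err = p - y  # gradient of the log loss w.r.t. the score
            for j, xi in enumerate(x):
                grad_w[j] += err * xi
            grad_b += err
        weights = [w - lr * g / n_samples for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n_samples
    return weights, bias


def predict(weights, bias, x):
    """Combine the base-model probabilities into one ensemble probability."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


# Hypothetical held-out predictions: [text model, history model, network model].
# These values are invented for illustration only.
base_probs = [
    [0.9, 0.8, 0.7],
    [0.8, 0.6, 0.9],
    [0.7, 0.9, 0.6],
    [0.6, 0.7, 0.8],
    [0.2, 0.3, 0.1],
    [0.3, 0.1, 0.2],
    [0.1, 0.2, 0.4],
    [0.4, 0.2, 0.3],
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

weights, bias = train_meta_classifier(base_probs, labels)
print(predict(weights, bias, [0.85, 0.75, 0.8]))
print(predict(weights, bias, [0.15, 0.2, 0.25]))
```

In practice the meta-classifier should be trained on base-model predictions for examples the base models did not see during their own training (for example, out-of-fold predictions), so that the ensemble does not simply learn to trust overfitted base models.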
Index Terms
- #suicidal - A Multipronged Approach to Identify and Explore Suicidal Ideation in Twitter