research-article

#suicidal - A Multipronged Approach to Identify and Explore Suicidal Ideation in Twitter

Authors:
Pradyumna Prakhar Sinha, Delhi Technological University, New Delhi, India
Rohan Mishra, Delhi Technological University, New Delhi, India
Ramit Sawhney, Netaji Subhash Institute of Technology, New Delhi, India
Debanjan Mahata, Bloomberg & IIIT-Delhi, New York, NY, USA
Rajiv Ratn Shah, IIIT-Delhi, New Delhi, India
Huan Liu, Arizona State University, Tempe, AZ, USA

CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, November 2019, Pages 941–950
https://doi.org/10.1145/3357384.3358060

Published: 03 November 2019

ABSTRACT

Technological advancements have led to the creation of social media platforms like Twitter, where people have started voicing their views on rarely discussed and socially stigmatizing issues. Twitter is increasingly being used to study psycho-linguistic phenomena ranging from expressions of adverse drug reactions and depression to suicidality. In this work, we focus on identifying suicidal posts on Twitter. Towards this objective, we take a multipronged approach and implement different neural network models, such as sequential models and graph convolutional networks, that are trained on the textual content shared on Twitter, the historical tweeting activity of the users, and the social network formed between different users posting about suicidality. We train a stacked ensemble of classifiers representing different aspects of suicidal tweeting activity, and achieve state-of-the-art results on a new manually annotated dataset that we developed, which contains textual as well as network information of suicidal tweets. We further investigate the trained models and perform a qualitative analysis showing how historical tweeting activity and the rich information embedded in the homophily networks among Twitter users aid in accurately identifying tweets expressing suicidal intent.
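
The stacked ensemble mentioned in the abstract can be illustrated with a minimal sketch of stacked generalization. This is not the authors' implementation: the paper's base models are neural (sequential models and graph convolutional networks), whereas this sketch substitutes a small logistic regression for every model and uses synthetic data. The three view names (`text`, `history`, `network`), the noise levels, and all function names are assumptions made purely for illustration.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(X, y, epochs=500, lr=0.5):
    """Fit logistic regression by batch gradient descent; returns (weights, bias)."""
    n_features = len(X[0])
    w, b = [0.0] * n_features, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_features, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / len(X) for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / len(X)
    return w, b


def predict_proba(model, x):
    w, b = model
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


def make_toy_views(n=120, seed=0):
    """Synthetic stand-in for three feature views of the same labelled tweets."""
    rng = random.Random(seed)
    y = [i % 2 for i in range(n)]
    views = {}
    # Hypothetical noise levels: each view carries the label signal with different quality.
    for name, noise in (("text", 1.0), ("history", 1.5), ("network", 2.0)):
        views[name] = [[(2 * yi - 1) + rng.gauss(0, noise), rng.gauss(0, 1)] for yi in y]
    return views, y


def out_of_fold_predictions(X, y, k=5):
    """Predict each example with a base model that never saw it during training."""
    oof = [0.0] * len(X)
    for fold in range(k):
        train = [i for i in range(len(X)) if i % k != fold]
        model = train_logistic([X[i] for i in train], [y[i] for i in train])
        for i in range(fold, len(X), k):
            oof[i] = predict_proba(model, X[i])
    return oof


def fit_stacked_ensemble(views, y, k=5):
    names = sorted(views)
    # Level 0: one classifier per view; its out-of-fold outputs become meta-features.
    oof_columns = [out_of_fold_predictions(views[n], y, k) for n in names]
    meta_X = [list(row) for row in zip(*oof_columns)]
    # Level 1: the meta-classifier learns how much to trust each view.
    meta_model = train_logistic(meta_X, y)
    # Refit the base models on all data for use at prediction time.
    base_models = {n: train_logistic(views[n], y) for n in names}
    return names, base_models, meta_model


def stacked_predict(ensemble, example):
    names, base_models, meta_model = ensemble
    meta_x = [predict_proba(base_models[n], example[n]) for n in names]
    return predict_proba(meta_model, meta_x)


if __name__ == "__main__":
    views, y = make_toy_views()
    ensemble = fit_stacked_ensemble(views, y)
    first = {name: views[name][0] for name in views}
    print("label:", y[0], "stacked probability:", round(stacked_predict(ensemble, first), 3))
```

Training the meta-classifier on out-of-fold predictions, rather than on predictions for examples the base models were fitted on, keeps it from simply rewarding whichever base model overfits the most.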


Index Terms

1. Computing methodologies
  1. Machine learning
    1. Learning paradigms


Published in
CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management
November 2019
3373 pages
ISBN: 9781450369763
DOI: 10.1145/3357384
General Chairs: Wenwu Zhu (Tsinghua University, China), Dacheng Tao (University of Massachusetts, USA), Xueqi Cheng (Institute of Computing Technology, CAS, China)
Program Chairs: Peng Cui (Tsinghua University, China), Elke Rundensteiner (Worcester Polytechnic Institute, USA), David Carmel (Amazon Research, USA), Qi He (LinkedIn, USA), Jeffrey Xu Yu (Chinese University of Hong Kong, China)
Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
health informatics
social media mining
suicidal ideation
Acceptance Rates
CIKM '19 Paper Acceptance Rate: 202 of 1,031 submissions, 20%
Overall Acceptance Rate: 1,861 of 8,427 submissions, 22%