ABSTRACT
Understanding the relationships among environment, behavior, and health is a core concern of public health researchers. While a number of recent studies have investigated the use of social media to track infectious diseases such as influenza, little work has been done to determine if other health concerns can be inferred. In this paper, we present a large-scale study of 27 health-related statistics, including obesity, health insurance coverage, access to healthy foods, and teen birth rates. We perform a linguistic analysis of the Twitter activity in the top 100 most populous counties in the U.S., and find a significant correlation with 6 of the 27 health statistics. When compared to traditional models based on demographic variables alone, we find that augmenting models with Twitter-derived information improves predictive accuracy for 20 of 27 statistics, suggesting that this new methodology can complement existing approaches.
- Anselin, L. Spatial econometrics: methods and models. Kluwer Academic Publishers, Dordrecht; Boston, 1988.Google Scholar
- Chen, M. K. The effect of language on economic behavior: Evidence from savings rates, health behaviors, and retirement assets. American Economic Review 103, 2 (Apr. 2013), 690--731.Google ScholarCross Ref
- Chiswick, B. R., and Miller, P. W. The economics of language international analyses. Routledge, London; New York, 2007.Google Scholar
- Clifford, P., Richardson, S., and Hmon, D. Assessing the significance of the correlation between two spatial processes. Biometrics 45, 1 (Mar. 1989), 123--134. PMID: 2720048.Google ScholarCross Ref
- Culotta, A. Towards detecting influenza epidemics by analyzing twitter messages. In Proceedings of the First Workshop on Social Media Analytics, ACM (New York, NY, USA, 2010), 115--122. Google ScholarDigital Library
- Culotta, A. Lightweight methods to estimate influenza rates and alcohol sales volume from twitter messages. Lang. Resour. Eval. 47, 1 (Mar. 2013), 217238. Google ScholarDigital Library
- Danner, D. D., Snowdon, D. A., and Friesen, W. V. Positive emotions in early life and longevity: findings from the nun study. Journal of personality and social psychology 80, 5 (May 2001), 804--813. PMID: 11374751.Google Scholar
- De Choudhury, M., Gamon, M., Counts, S., and Horvitz, E. Predicting depression via social media. In ICWSM (2013).Google Scholar
- Dredze, M. How social media will change public health. IEEE Intelligent Systems 27, 4 (2012), 81--84. Google ScholarDigital Library
- Duggan, M., and Brenner, J. The demographics of social media users - 2012. Pew Internet & American Life Project, Feb 2013.Google Scholar
- Flores, B. E. A pragmatic view of accuracy measurement in forecasting. Omega 14, 2 (1986), 93--98.Google ScholarCross Ref
- Ghosh, D. D., and Guha, R. What are we tweeting about obesity? Mapping tweets with topic modeling and geographic information system. Cartography and Geographic Information Science 40, 2 (2013), 90--102.Google ScholarCross Ref
- Gottschalk, L. A., and Gleser, G. C. The Measurement of Psychological States Through the Content Analysis of Verbal Behavior. University of California Press, Jan. 1979.Google Scholar
- Graham, L E, n., Scherwitz, L., and Brand, R. Self-reference and coronary heart disease incidence in the western collaborative group study. Psychosomatic medicine 51, 2 (Apr. 1989), 137--144. PMID: 2710908.Google Scholar
- Hanson, C. L., Burton, S. H., Giraud-Carrier, C., West, J. H., Barnes, M. D., and Hansen, B. Tweaking and tweeting: Exploring twitter for nonmedical use of a psychostimulant drug (adderall) among college students. Journal of Medical Internet Research 15, 4 (Apr. 2013), e62.Google ScholarCross Ref
- Hecht, B., Hong, L., Suh, B., and Chi, E. H. Tweets from Justin Bieber's heart: The dynamics of the location field in user profiles. In CHI (New York, NY, USA, 2011), 237--246. Google ScholarDigital Library
- Howell, R. T., Kern, M. L., and Lyubomirsky, S. Health benefits: Meta-analytically determining the impact of well-being on objective health outcomes. Health Psychology Review 1, 1 (2007), 83--136.Google ScholarCross Ref
- James W Pennebaker, M. R. M. Psychological aspects of natural language. use: our words, our selves. Annual review of psychology 54 (2003), 547--77.Google Scholar
- Jamison-Powell, S., Linehan, C., Daley, L., Garbett, A., and Lawson, S. "I can't get no sleep": discussing #insomnia on Twitter. In CHI, ACM (New York, NY, USA, 2012), 1501--1510. Google ScholarDigital Library
- Lampos, V., De Bie, T., and Cristianini, N. Flu detector: tracking epidemics on twitter. In ECML/PKDD (2010), 599--602. Google ScholarDigital Library
- Messer, L. C. Neighborhood-level characteristics as predictors of preterm birth: Examples from wake county, north carolina. Tech. rep., North Carolina Dept. of Health and Human Services, 2005.Google Scholar
- Paul, M. J., and Dredze, M. You are what you tweet: Analyzing Twitter for public health. In ICWSM (2011).Google Scholar
- Pedregosa, F., et al. Scikit-learn: Machine learning in python. Machine Learning Research 12 (2011), 28252830. Google ScholarDigital Library
- Pennebaker, J., Francis, J., and Booth, R. Linguistic inquiry and word count: LIWC 2001. World Journal of the International Linguistic Association (2001).Google Scholar
- Qiu, L., Lin, H., Ramsay, J., and Yang, F. You are what you tweet: Personality expression and perception on twitter. Journal of Research in Personality 46, 6 (Dec. 2012), 710--718.Google ScholarCross Ref
- Rabi, D. M., et al. Association of socio-economic status with diabetes prevalence and utilization of diabetes care services. BMC Health Services Research 6, 1 (Oct. 2006), 124. PMID: 17018153.Google ScholarCross Ref
- Sadilek, A., Kautz, H., and Silenzio, V. Predicting disease transmission from geo-tagged micro-blog data. In AAAI (Dec. 2012).Google Scholar
- Schwartz, H. A., et al. Characterizing geographic variation in well-being using tweets. In Seventh International AAAI Conference on Weblogs and Social Media (ICWSM) (2013).Google Scholar
- Seligman, M. E. P. Flourish: a visionary new understanding of happiness and well-being. Free Press, New York, 2011.Google Scholar
- Signorini, A., Segre, A. M., and Polgreen, P. M. The use of Twitter to track levels of disease activity and public concern in the U.S. during the influenza a H1N1 pandemic. PLoS ONE 6, 5 (May 2011), e19467.Google ScholarCross Ref
- Sobal, J., and Stunkard, A. J. Socioeconomic status and obesity: A review of the literature. Psychological Bulletin 105, 2 (1989), 260--275.Google ScholarCross Ref
- Stewart, A., and Diaz, E. Epidemic intelligence: For the crowd, by the crowd. In Web Engineering, M. Brambilla, T. Tokuda, and R. Tolksdorf, Eds., no. 7387 in Lecture Notes in Computer Science. Springer Berlin Heidelberg, Jan. 2012, 504--505. Google ScholarDigital Library
Index Terms
- Estimating county health statistics with twitter
Recommendations
A Model for Social Network-Enhanced Health Communication
DASC '11: Proceedings of the 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure ComputingSocial networks have the potential to provide a number of capabilities for augmenting healthcare service delivery and providing new capabilities not present in traditional clinical health communication or public health communication. In this paper we ...
Mining Social Media Streams to Improve Public Health Allergy Surveillance
ASONAM '15: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015Allergies are one of the most common chronic diseases worldwide. One in five Americans suffer from either allergy or asthma symptoms. With the prevalence of social media, people sharing experiences and opinions on personal health symptoms and concerns ...
The use of Electronic Health Records to Support Population Health: A Systematic Review of the Literature
Electronic health records (EHRs) have emerged among health information technology as "meaningful use" to improve the quality and efficiency of healthcare, and health disparities in population health. In other instances, they have also shown lack of ...
Comments