
Interpretable Predictions of Clinical Outcomes with An Attention-based Recurrent Neural Network

Authors:
Ying Sha, Georgia Institute of Technology, Atlanta, GA, USA
May D. Wang, Georgia Institute of Technology, Atlanta, GA, USA

ACM-BCB '17: Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
August 2017
Pages 233–240
https://doi.org/10.1145/3107411.3107445

Published: 20 August 2017

ABSTRACT

The increasing accumulation of healthcare data provides researchers with ample opportunities to build machine learning approaches for clinical decision support and to improve the quality of health care. Several studies have developed conventional machine learning approaches that rely heavily on manual feature engineering and result in task-specific models for health care. In contrast, healthcare researchers have begun to use deep learning, which has emerged as a revolutionary machine learning technique that obviates manual feature engineering yet achieves impressive results in research fields such as image classification. However, few of these studies have addressed the lack of interpretability of deep learning models, although interpretability is essential for the successful adoption of machine learning approaches by healthcare communities. In addition, the unique characteristics of healthcare data, such as high dimensionality and temporal dependencies, pose challenges for building models on such data. To address these challenges, we develop a gated recurrent unit-based recurrent neural network with hierarchical attention for mortality prediction and evaluate it using the diagnostic codes from the Medical Information Mart for Intensive Care. We find that the prediction accuracy of the model outperforms that of baseline models, and we demonstrate the interpretability of the model through visualizations.
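The abstract does not give the model's equations, so the following is only a rough sketch of how a gated recurrent unit combined with two levels of attention could turn a sequence of visits, each a set of diagnostic codes, into a mortality risk plus inspectable attention weights. It is not the authors' implementation. The function names, dimensions, random untrained weights, toy code ids, the omission of bias terms, and the choice to pool codes directly from embeddings (with a GRU only across visits) are all assumptions made for illustration.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def matvec(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def softmax(scores):
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def rand_matrix(rng, rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def gru_step(x, h, p):
    """One GRU update (biases omitted for brevity; an assumption of this sketch)."""
    z = [sigmoid(a + b) for a, b in zip(matvec(p["Wz"], x), matvec(p["Uz"], h))]
    r = [sigmoid(a + b) for a, b in zip(matvec(p["Wr"], x), matvec(p["Ur"], h))]
    gated = [ri * hi for ri, hi in zip(r, h)]
    cand = [math.tanh(a + b) for a, b in zip(matvec(p["Wh"], x), matvec(p["Uh"], gated))]
    return [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]

def attention_pool(states, W, context):
    """Score each state against a context vector; return (weights, weighted sum)."""
    scores = []
    for h in states:
        u = [math.tanh(v) for v in matvec(W, h)]
        scores.append(sum(ui * ci for ui, ci in zip(u, context)))
    weights = softmax(scores)
    dim = len(states[0])
    pooled = [sum(a * h[k] for a, h in zip(weights, states)) for k in range(dim)]
    return weights, pooled

def predict(patient, params):
    """patient: list of visits, each a list of integer code ids (hypothetical encoding)."""
    visit_vectors, code_weights = [], []
    for visit in patient:
        embedded = [params["emb"][code] for code in visit]
        w, v = attention_pool(embedded, params["W_code"], params["c_code"])
        code_weights.append(w)   # which codes mattered within this visit
        visit_vectors.append(v)
    h = [0.0] * len(visit_vectors[0])
    states = []
    for v in visit_vectors:      # GRU over the visit sequence
        h = gru_step(v, h, params["gru"])
        states.append(h)
    visit_weights, summary = attention_pool(states, params["W_visit"], params["c_visit"])
    risk = sigmoid(sum(w * s for w, s in zip(params["w_out"], summary)))
    return risk, visit_weights, code_weights

def init_params(n_codes, dim, seed=0):
    rng = random.Random(seed)
    names = ("Wz", "Uz", "Wr", "Ur", "Wh", "Uh")
    return {
        "emb": rand_matrix(rng, n_codes, dim),
        "W_code": rand_matrix(rng, dim, dim),
        "c_code": rand_matrix(rng, 1, dim)[0],
        "W_visit": rand_matrix(rng, dim, dim),
        "c_visit": rand_matrix(rng, 1, dim)[0],
        "w_out": rand_matrix(rng, 1, dim)[0],
        "gru": {name: rand_matrix(rng, dim, dim) for name in names},
    }

params = init_params(n_codes=10, dim=4)
patient = [[0, 3, 7], [2, 5], [1, 4, 8, 9]]  # toy code ids, not real diagnostic codes
risk, visit_weights, code_weights = predict(patient, params)
```

Because each set of attention weights is a softmax output, the weights within a visit and across visits each sum to one. That property is what makes it possible to plot them as a heat map over codes and visits, which is one plausible reading of the visualizations the abstract mentions.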

Index Terms

1. Applied computing
  1. Life and medical sciences
    1. Health informatics
2. Computing methodologies
  1. Artificial intelligence

Published in
ACM-BCB '17: Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
August 2017
800 pages
ISBN: 9781450347228
DOI: 10.1145/3107411
General Chairs: Nurit Haspel (University of Massachusetts Boston, USA), Lenore J. Cowen (Tufts University, USA)
Program Chairs: Amarda Shehu (George Mason University, USA), Tamer Kahveci (University of Florida, USA), Giuseppe Pozzi (Politecnico di Milano, Italy)
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
attention
deep learning
electronic health records
health care
interpretability
recurrent neural networks
visualization
Qualifiers
- research-article
Acceptance Rates
ACM-BCB '17 Paper Acceptance Rate: 42 of 132 submissions, 32%
Overall Acceptance Rate: 254 of 885 submissions, 29%