Abstract
Chronic kidney disease (CKD) is a global public health problem, affecting approximately 10% of the population worldwide. Yet, there is little direct evidence on how CKD can be diagnosed in a systematic and automatic manner. This paper investigates how CKD can be diagnosed by using machine learning (ML) techniques. ML algorithms have been a driving force in detection of abnormalities in different physiological data, and are, with a great success, employed in different classification tasks. In the present study, a number of different ML classifiers are experimentally validated to a real data set, taken from the UCI Machine Learning Repository, and our findings are compared with the findings reported in the recent literature. The results are quantitatively and qualitatively discussed and our findings reveal that the random forest (RF) classifier achieves the near-optimal performances on the identification of CKD subjects. Hence, we show that ML algorithms serve important function in diagnosis of CKD, with satisfactory robustness, and our findings suggest that RF can also be utilized for the diagnosis of similar diseases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
P. O. Duda and P. E. Hart, Pattern classification and scene analysis, New York: Wiley, 1973.
E. Alickovic and A. Subasi, “Medical Decision Support System for Diagnosis of Heart Arrhythmia using DWT and Random Forests Classifier,” Journal of Medical Systems, vol. 40, no. 108, 2016.
Z. Masetic and A. Subasi, “Congestive heart failure detection using random forest classifier,” Computer Methods and Programs in Biomedicine, vol. 130, pp. 54-64, 2016.
E. Alickovic, J. Kevric and A. Subasi, “Performance evaluation of EMD, DWT, and WPD for automated epileptic seizure detection and prediction,” Submitted to Biomedical Signal Processing and Control, 2016.
M. J. Pérez-Sáeza, . D. Prieto-Alhambra, C. Barrios, M. Crespo, D. Redondo, X. Nogués, A. Díez-Pérez and J. Pascual, “Increased hip fracture and mortality in chronic kidney disease individuals: The importance of competing risks,” Bone, vol. 73, p. 154–159, 2015.
A. M. Cueto-Manzano, L. Cortes-Sanabria, H. R. Martinez-Ramirez, E. Rojas-Campos, B. Gomez-Navarro and M. Castillero-Manzano, “Prevalence of Chronic Kidney Disease in an Adult Population,” Archives of Medical Research, vol. 45, pp. 507-513, 2014.
A. Levin and P. E. Stevens, “Summary of KDIGO 2012 CKD Guideline: behind the scenes, need for guidance, and a framework for moving forward,” Kidney International, vol. 85, no. 1, p. 49–61, 2014.
Z. Chen, Z. Zhang, R. Zhu, Y. Xiang and P. B. Harrington, “Diagnosis of patients with chronic kidney disease by using two fuzzy classifiers,” Chemometrics and Intelligent Laboratory Systems, vol. 153, p. 140–145, 2016.
P. Muthukumara and G. S. S. Krishnan, “A similarity measure of intuitionistic fuzzy soft sets and its application in medical diagnosis,” Applied Soft Computing, vol. 41, p. 148–156, 2016.
“UCI Machine Learning Repository: Chronic Kidney Disease (CKD) Data Set,” 15 November 2016. [Online]. Available: view-source:https://archive.ics.uci.edu/ml/datasets/Chronic_Kidney_Disease#. [Accessed Nov 15 2016].
B. Widrow, D. E. Rumelhart and M. A. Lehr, “Neural networks: applications in industry, business and science,” Communications of the ACM, vol. 12, pp. 93-105, 1994.
G. P. Zhang, Data Mining and Knowledge Discovery Handbook, O. Maimon and L. Rokach, Eds., New York: Springer, 2010.
V. Vapnik, The Nature of Statistical Learning Theory, New York: Springer Verlag, 1995.
S. Armin, Data Mining and Knowledge Discovery Handbook, 2nd ed., O. Maimon and L. Rokach, Eds., New York: Springer, 2010.
L. Rokach and O. Maimon, Data Mining and Knowledge Discovery Handbook, 2nd ed., M. Oded and R. Lior, Eds., New York: Springer, 2010.
L. Breiman, J. Friedman, R. Olshen and C. Stone, Classification and Regression Trees, Wadsworth Int. Group, 1984.
J. R. Quinlan, “Induction of Decision Trees,” Machine Learning, vol. 1, pp. 81-106, 1986.
J. R. Quinlan, C4.5: Program for Machine Learning, CA, Morgan Kaufman Publishing, 1993.
L. Breiman, “Random Forests,” Machine Learning, vol. 45, p. 5–32, 2001.
E. Alickovic and A. Subasi, “Breast cancer diagnosis using GA feature selection and Rotation Forest,” Neural Computing and Applications, pp. 1-11, 2015.
R. J. V. Rijsbergen, Information Retrieval, 2nd ed., Department of Computing Science, University of Glasgow, 1979.
N. Lazarevic-McManus, J. R. Renno, D. Makris and G. A. Jones, “An object-based comparative methodology for motion detection based on the F-Measure,” Computer Vision and Image Understanding, vol. 111, p. 74–85, 2008.
A. Khazaee and A. Ebrahimzadeh, “Classification of electrocardiogram signals with support vector machines and genetic algorithms using power spectral features,” Biomedical Signal Processing and Control, vol. 5, p. 252–263, 2010.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Subasi, A., Alickovic, E., Kevric, J. (2017). Diagnosis of Chronic Kidney Disease by Using Random Forest. In: Badnjevic, A. (eds) CMBEBIH 2017. IFMBE Proceedings, vol 62. Springer, Singapore. https://doi.org/10.1007/978-981-10-4166-2_89
Download citation
DOI: https://doi.org/10.1007/978-981-10-4166-2_89
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-4165-5
Online ISBN: 978-981-10-4166-2
eBook Packages: EngineeringEngineering (R0)