Abstract
Medical case-based reasoning (CBR) systems require the handling of vague or imprecise data. The fuzzy set theory is particularly suitable for this purpose. This paper proposes a case-base preparation framework for CBR systems, which converts the electronic health record medical data into fuzzy CBR knowledge. It generates fuzzy case-base knowledge by suggesting a standard crisp entity–relationship data model for CBR case-base. The resulting data model is fuzzified using a proposed relational data model fuzzification methodology. The performances of this methodology and its resulting fuzzy case-base structure are evaluated. Diabetes diagnosis is used as a case study. A set of 60 real diabetic cases is used in the study. A fuzzy CBR system is implemented to check the diagnoses accuracy. It combines the resulting fuzzy case-base with a proposed fuzzy similarity measure. Experimental results indicate that the proposed fuzzy CBR method is superior to traditional CBR and other machine-learning methods. Our fuzzy CBR achieves an accuracy of 95%, a precision of 96%, a recall 97.96%, an f-measure of 96.97%, a specificity of 81.82%, and good robustness for dealing with vagueness. The resulting fuzzy case-base relational database enhances the representation of case-base knowledge, the performance of retrieval algorithms, and the querying capabilities of CBR systems.
Similar content being viewed by others
References
Abbasbandy S, Hajjari T (2010) Weighted trapezoidal approximation-preserving cores of a fuzzy number. Comput Math Appl 59(9):3066–3077
Adekunle Y (2015) The prediction, diagnosis, and treatment of diabetes mellitus using an intelligent decision support system framework. Int J Adv Res Comput Sci Softw Eng 5(3):1285–1292
Aibinu A, Salami M, Shafie A (2010) Application of modeling techniques to diabetes diagnosis. In: IEEE EMBS conference on biomedical engineering and sciences (IECBES), pp 194–198
Ali M, Han S, Bilal H, Lee S, Kang M, Kang B, Razzaq M, Amin M (2018a) iCBLS: an interactive case-based learning system for medical education. Int J Med Inform 109:55–69
Ali F, Islam SR, Kwak D, Khan P, Ullah N, Yoo SJ, Kwak KS (2018b) Type-2 fuzzy ontology-aided recommendation systems for IoT-based healthcare. Comput Commun
Arias-Aranda D, Castro J, Navarro M, Zurita J (2009) A CBR system for knowing the relationship between flexibility and operations strategy. Found Intell Syst 5722:463–472
Balakrishnan V, Shakouri M, Hoodeh H, Loo H (2012) Predictions using data mining and case-based reasoning: a case study for retinopathy. World Acad Sci Eng Technol 63:573–576
Barakat N, Barakat M (2010) Intelligible support vector machines for diagnosis of diabetes mellitus. IEEE Trans Inf Technol Biomed 14(4):1114: 1120
Begum S, Ahmed M, Funk P, Xiong N, Schéele V (2009) A case-based decision support system for individual stress diagnosis using fuzzy similarity matching. Int J Comput Intell 25(3):180–195
Bellazzi R, Montani S, Portinale L (1998) Retrieval in a prototype-based case library: a case study in diabetes therapy revision. Adv Case Based Reason 1488:64–75
Biswas S, Chakraborty M, Singh H et al (2017) Hybrid case-based reasoning system by cost-sensitive neural network for classification. Soft Comput 21(24):7579–7596
Brown D, Aldea A, Harrison R, Martin C, Bayley I (2018) Temporal case-based reasoning for type 1 diabetes mellitus bolus insulin decision support. Artif Intell Med 85:28–42
Çalisir D, Dogantekin E (2011) An automatic diabetes diagnosis system based on LDA-wavelet support vector machine classifier. Expert Syst Appl 38(7):8311–8315
Chang X, Lilly J (2004) Evolutionary design of a fuzzy classifier from data. IEEE Trans Syst Man Cybern B Cybern 34(4):1894–1906
Dogantekin E, Dogantekin A, Avci D, Avci L (2010) An intelligent diagnosis system for diabetes on linear discriminant analysis and adaptive network based fuzzy inference system: LDA-ANFIS. Digit Signal Proc 20(4):1248–1255
El-Sappagh SH, Elmogy M, Riad A, Badria F, Zaghlol H (2014) EHR data preparation for case based reasoning construction. Adv Mach Learn Technol Appl 488:483–497
El-Sappagh SH, Elmogy M, Riad A (2015) A CBR system for diabetes mellitus diagnosis: CB standard data model. Int J Med Eng Inform 7(3):191–208
Fan C, Chang P, Lin J, Hsieh J (2011) A hybrid model combining case-based reasoning and Fuzzy decision tree for medical data classification. Appl Soft Comput 11:632–644
Ganji M, Abadeh A (2011) A Fuzzy classification system based on ant colony optimization for diabetes disease diagnosis. Exp Syst Appl 38:14650–9
Gerstenkorn T, Man’ko J (1991) Correlation of intuitionistic fuzzy sets. Fuzzy Sets Syst 44:39–43
Godo L, Sandri S, Dutra L, Freitas C, Carvalho O, Guimarães R, Amaral R (2009) Classification of schistosomiasis prevalence using fuzzy case-based reasoning. Bio Inspired Syst Comput Ambient Intell 5517:1053–1060
Goncalves L, Vellasco M, Pacheco M, de Souza F (2006) Inverted hierarchical neuro-fuzzy BSP system: a novel neuro-fuzzy model for pattern classification and rule extraction in databases. IEEE Trans Syst Man Cybern C Appl Rev 36(2):236–248
Hyung L, Song Y, Lee K (1994) Similarity measure between fuzzy sets and between elements. Fuzzy Sets Syst 62:291–293
Jin Q, Jie H, Ying-hong P, Wei-ming W, Zhen-fei Z (2010) New weighted fuzzy case retrieval method for customer-driven product design. J Shanghai Jiaotong Univ (Sci) 15(6):641–650
Kahramanli H, Allahverdi N (2008) Design of a hybrid system for the diabetes and heart diseases. Expert Syst Appl 35(1–2):82–89
Kalaiselvi C, Nasira G (2014) A new approach for diagnosis of diabetes and prediction of cancer using ANFIS. In: IEEE conference of world congress on computing and communication technologies, pp 188–190
Kalpana M, Kumar A (2011a) Fuzzy expert system for diabetes using fuzzy verdict mechanism. Int J Adv Netw Appl 03(02):1128–1134
Kalpana M, Kumar A (2011b) Fuzzy expert system for diagnosis of diabetes using fuzzy determination mechanism. Int J Comput Sci Emerg Technol 2(6):354–361
Karatsiolis S, Schizas C (2012) Region based support vector machine algorithm for medical diagnosis on Pima Indian Diabetes dataset. In: IEEE 12th international conference on bioinformatics and bioengineering (BIBE), pp 139–144
Khanum A, Mufti M, Javed M, Shafiq M (2009) Fuzzy case-based reasoning for facial expression recognition. Fuzzy Sets Syst 160:231–250
Kumari S, Singh A (2013) A data mining approach for the diagnosis of diabetes mellitus. In: 7th International conference on intelligent systems and control (ISCO), pp 373–375
Lee C, Wang M (2011) A fuzzy expert system for diabetes decision support application. IEEE Trans Syst Man Cybern Part B Cybern 41(1):139–153
Li S, Ho H (2009) Predicting financial activity with evolutionary Fuzzy case-based reasoning. Expert Syst Appl 36:411–422
Li H, Sun J (2011) On performance of case-based reasoning in Chinese business failure prediction from sensitivity, specificity, positive and negative values. Appl Soft Comput 11(1):460–7
Marling C, Shubrook J, Schwartz F (2008) Case-based decision support for patients with type 1 diabetes on insulin pump therapy. In: Advances in case-based reasoning: 9th European Conference (ECCBR), vol 5239, pp 325–339
Marling C, Wiley M, Cooper T, Bunescu R, Shubrook J, Schwartz F (2011) The 4 diabetes support system: a case study in CBR research and development. Case Based Reason Res Dev 6880:137–150
Mohamudally N, Khan D (2011) Application of a unified medical data miner (UMDM) for prediction, classification, interpretation and visualization on medical datasets: the diabetes dataset case. In: Advances in data mining: applications and theoretical aspects, vol 6870, pp 78–95
Montani S, Bellazzi R, Portinale L, d’Annunzio G, Fiocchi S, Stefanelli M (2000) Diabetic patients management exploiting case-based reasoning techniques. Comput Methods Programs Biomed 62(3):205–218
Pappis C, Karacapilidis N (1993) A comparative assessment of measures of similarity of fuzzy values. Fuzzy Sets Syst 56:171–174
Patil B, Joshi R, Toshniwal D (2010) Hybrid prediction model for Type-2 diabetic patients. Expert Syst Appl 37(12):8102–8108
Petrovic S, Mishra N, Sundar S (2011) A novel case based reasoning approach to radiotherapy planning. Expert Syst Appl 38:10759–10769
Polat K, Güneş S (2007) An expert system approach based on principal component analysis and adaptive Neuro-fuzzy inference system to diagnosis of diabetes disease. Digit Signal Proc 17(4):702–710
Polat K, Gunes S, Aslan A (2008) A cascade learning system for classification of diabetes disease: generalized discriminant analysis and least square support vector machine. Expert Syst Appl 34(1):214–221
Portinale L, Montani S (2002) A fuzzy case retrieval approach based on SQL for implementing electronic catalogs. Adv Case Based Reason 2416:321–335
Qin Y, Lu W, Qi Q, Liu X, Huang M, Scott P, Jiang X (2018) Towards an ontology-supported case-base d reasoning approach for computer-aided tolerance specification. Knowl Based Syst 141:129–147
Radha R, Rajagopalan S (2007) Fuzzy logic approach for diagnosis of diabetics. Inf Technol J 61:96–102
Raza B, Kumar Y, Malik A, Anjum A, Faheem M (2018) Performance prediction and adaptation for database management system workload using case-based reasoning approach. Inf Syst 76:46–58
Relich M, Pawlewski P (2018) A case-based reasoning approach to cost estimation of new product development. Neurocomputing 272:40–45
Rodriguez Y, Garcia M, Baets B, Morell C, Bello R (2006) A connectionist fuzzy case-based reasoning model. In: MICAI: advances in artificial intelligence. Springer, Berlin, pp 176–185
Samant P, Agarwal R (2018) Machine learning techniques for medical diagnosis of diabetes using iris images. Comput Methods Programs Biomed 157:121–128
Sarkheyli-Hägele A, Söffker D (2017) Fuzzy SOM-based case-based reasoning for individualized situation recognition applied to supervision of human operators. Knowl Based Syst 137:42–53
Shankaracharya OD, Samanta S, Vidyarthi A (2010) Computational intelligence in early diabetes diagnosis: a review. Rev Diabet Stud 7(4):252–62
Sushmita S, Chaudhury S (2007) Hierarchical fuzzy case based reasoning with multi-criteria decision making for financial applications. Pattern Recognit Mach Intell 4815:226–234
Temurtas H, Yumusak N, Temurtas F (2009) A comparative study on diabetes disease diagnosis using neural networks. Expert Syst Appl 36(4):8610–8615
Varma K, Rao A, Lakshmi T, Rao P (2014) A computational intelligence approach for a better diagnosis of diabetic patients. Comput Electr Eng 40:1758–1765
Wang W (1997) New similarity measures on fuzzy sets and on elements. Fuzzy Sets Syst 85:305–309
Wu D, Li J, Bao C (2017) Case-based reasoning with optimized weight derived by particle swarm optimization for software effort estimation. Soft Comput 1–12
Xiaodong H, Jianwu W, Fuqian S, Haiyan C (2009) Apply fuzzy case-based reasoning to knowledge acquisition of product style. In: IEEE 10th international conference on computer-aided industrial design and conceptual design (CAID & CD), pp 383–386
Xiong N (2011) Learning fuzzy rules for similarity assessment in case-based reasoning. Expert Syst Appl 38:10780–10786
Yao B, Li S (2010) ANMM4CBR: a case-based reasoning method for gene expression data classification. Algorithms Mol Biol (AMB) 5:14
Yu F, Li X, Han X (2018) Risk response for urban water supply network using case-based reasoning during a natural disaster. Saf Sci 106:121–139
Zadeh LA (2003) From search engines to question-answering systems the need for new tools. In: The 12th IEEE international conference fuzzy systems, vol 2, pp 1107–1109
Zhang Z, Chen D, Feng Y, Yuan Z, Chen B, Qin W, Zou S, Qin S, Han J (2018) A strategy for enhancing the operational agility of petroleum refinery plant using case based fuzzy reasoning method. Comput Chem Eng 111:27–36
Acknowledgements
This work was supported by the National Research Foundation of Korea (NRF) Grant funded by the Korea government (MSIT)-NRF-2017R1A2B2012337. The authors would like to thank Dr. Farid Badria, Prof. of Pharmacognosy, Department, and head of Liver Research Lab, Mansoura University, Egypt; and Dr. Hosam Zaghloul, Prof. at Clinical Pathology Department, Faculty of Medicine, Mansoura University, Egypt, for their efforts in this work.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Additional information
Communicated by V. Loia.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Appendices
Appendix A
The performance evaluation of the proposed fuzzy CBR system
\(K=3\) | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
Query cases | Retrieved cases | FKI-CBR diagnose | Domain expert diagnose | |||||||
Diabetes diagnosis | Nephropathy | Hypercholesterolemia | Cancer | Liver | Glomerulonephritis | Splenomegaly | Confidence (%) | |||
Case 1 | Case 1-1 | D | N | N | N | N | N | N | 90 | Diabetes Diagnosis: D |
Case 1-2 | P | N | N | N | N | N | N | 88.9 | Percentage of similarity for another diagnosis: 9/10 | |
Case 1-3 | D | N | H | N | N | N | N | 88.8 | ||
Case 2 | Case 2-1 | N | N | N | N | N | N | N | 95 | Diabetes Diagnosis: N |
Case 2-2 | N | N | N | N | N | N | N | 91 | Percentage of similarity for another diagnosis: 10/10 | |
Case 2-3 | N | N | N | N | N | N | N | 90 | ||
Case 3 | Case 3-1 | P | SH | N | N | N | G | N | 88 | Diabetes Diagnosis: D |
Case 3-2 | DG | SH | N | N | F | G | N | 85 | Percentage of similarity for another diagnosis: 10/10 | |
Case 3-3 | D | N | N | N | N | N | N | 84 | ||
Case 4 | Case 4-1 | D | N | H | N | N | G | N | 94 | Diabetes Diagnosis: D |
Case 4-2 | D | N | H | N | N | G | N | 93.2 | Percentage of similarity for another diagnosis: 10/10 | |
Case 4-3 | N | NP | N | N | N | G | N | 86.5 | ||
Case 5 | Case 5-1 | D | N | N | N | N | N | N | 100 | Diabetes Diagnosis: D |
Case 5-2 | D | N | N | O | N | N | N | 88.8 | Percentage of similarity for another diagnosis: 10/10 | |
Case 5-3 | D | N | N | N | N | N | N | 86.9 | ||
Case 6 | Case 6-1 | P | SH | N | N | N | G | N | 81 | Diabetes Diagnosis: P |
Case 6-2 | D | N | N | N | N | N | N | 79 | Percentage of similarity for another diagnosis: 8/10 | |
Case 6-3 | D | N | N | HCC | HCC | N | N | 79 | ||
Case 7 | Case 7-1 | N | N | N | N | N | N | N | 92.4 | Diabetes Diagnosis: N |
Case 7-2 | D | N | N | N | N | N | N | 89.6 | Percentage of similarity for another diagnosis: 10/10 | |
Case 7-3 | N | N | N | N | N | N | N | 89.4 | ||
Case 8 | Case 8-1 | D | N | N | N | HCC | N | N | 99 | Diabetes Diagnosis: D |
Case 8-2 | N | N | H | N | N | N | N | 82.5 | Percentage of similarity for another diagnosis: 9/10 | |
Case 8-3 | P | N | N | N | N | N | N | 80 | ||
Case 9 | Case 9-1 | D | N | H | N | N | N | N | 93 | Diabetes Diagnosis: D |
Case 9-2 | DG | SH | N | N | N | G | N | 91 | Percentage of similarity for another diagnosis: 10/10 | |
Case 9-3 | D | N | N | N | HCC | N | SG | 90 | ||
Case 10 | Case 10-1 | D | N | N | N | N | N | N | 91 | Diabetes Diagnosis: D |
Case 10-2 | DG | N | N | N | N | N | N | 89.9 | Percentage of similarity for another diagnosis: 10/10 | |
Case 10-3 | P | N | H | N | N | N | N | 89.2 |
N normal, A abnormal, D diabetic, P pre-diabetic, H hypercholesterolemia, O ovarian cancer, DG gestational diabetes, SH Shrunken Kidney, G glomerulonephritis, NP nephropathy, SG splenomegaly, HCC hepatocellular carcinoma, F fatty liver, LC liver cirrhosis
Appendix B
A comparison of the proposed fuzzy CBR system and traditional CBR system
Query case | Domain expert decision | Proposed system decision | Traditional system decision |
---|---|---|---|
Case1 | Diabetes diagnosis: D | D (90%) | P (78%) |
Case 2 | Diabetes diagnosis: N | N (95%) | N (83%) |
Case 3 | Diabetes diagnosis: D | P (88%) | D (70%) |
Case 4 | Diabetes diagnosis: D | D (94%) | P (64%) |
Case 5 | Diabetes diagnosis: D | D (100%) | D (80%) |
Case 6 | Diabetes diagnosis: P | P (81%) | D (64%) |
Case 7 | Diabetes diagnosis: N | N (92.4%) | N (85%) |
Case 8 | Diabetes diagnosis: D | D (99%) | P (79%) |
Case 9 | Diabetes diagnosis: D | D (93%) | P (74%) |
Case 10 | Diabetes diagnosis: D | D (91%) | P (75%) |
Appendix C
A comparison of our FCBR and ML classifiers using our case-base data
Fold | Algorithm | Precision (%) | TPR-Recall (%) | Accuracy (%) | F-Measure (%) |
---|---|---|---|---|---|
Machine-learning algorithms | |||||
Twofold | C4.5 | 90 | 93.1 | 90 | 91.5 |
k-NN (\(k=3\)) | 80 | 69 | 66.66 | 74.1 | |
SVM | 75.8 | 86.2 | 68.33 | 80.6 | |
Naive Bayes | 86.2 | 86.2 | 75 | 86.2 | |
ANN | 70.6 | 82.8 | 65 | 76.2 | |
Threefold | C4.5 | 89.7 | 89.7 | 88.33 | 89.7 |
k-NN (\(k=3\)) | 74.1 | 69 | 60 | 71.4 | |
SVM | 78.8 | 89.7 | 71.66 | 83.9 | |
Naive Bayes | 83.9 | 89.7 | 75 | 86.7 | |
ANN | 73.5 | 86.2 | 65 | 79.4 | |
Fourfold | C4.5 | 89.7 | 89.7 | 88.33 | 89.7 |
k-NN (\(k=3\)) | 76.9 | 69 | 65 | 72.7 | |
SVM | 77.4 | 82.8 | 71.66 | 80 | |
Naive Bayes | 79.3 | 79.3 | 71.66 | 79.3 | |
ANN | 76.7 | 79.3 | 66.66 | 78 | |
Fivefold | C4.5 | 93.1 | 93.1 | 91.67 | 93.1 |
k-NN (\(k=3\)) | 73.1 | 65.5 | 63.33 | 69.1 | |
SVM | 75 | 82.8 | 66.66 | 78.7 | |
Naive Bayes | 83.3 | 69 | 66.66 | 75.5 | |
ANN | 75.8 | 86.2 | 68.33 | 80.6 | |
Sixfold | C4.5 | 89.7 | 89.7 | 88.33 | 89.7 |
k-NN (\(k=3\)) | 74.1 | 69 | 63.33 | 71.4 | |
SVM | 81.3 | 89.7 | 75 | 85.2 | |
Naive Bayes | 86.2 | 86.2 | 73.33 | 86.2 | |
ANN | 76.5 | 89.7 | 70 | 82.5 | |
Sevenfold | C4.5 | 90 | 93.1 | 90 | 91.5 |
k-NN (\(k=3\)) | 63.3 | 65.5 | 58.33 | 64.4 | |
SVM | 76.7 | 79.3 | 71.66 | 78 | |
Naive Bayes | 75.9 | 75.9 | 70 | 75.9 | |
ANN | 76.7 | 79.3 | 65 | 78 | |
Eightfold | C4.5 | 89.3 | 86.2 | 86.66 | 87.7 |
k-NN (\(k=3\)) | 78.6 | 75.9 | 66.66 | 77.2 | |
SVM | 82.1 | 79.3 | 71.66 | 80.7 | |
Naive Bayes | 78.1 | 86.2 | 73.33 | 82 | |
ANN | 77.4 | 82.8 | 71.66 | 80 | |
Ninefold | C4.5 | 89.7 | 89.7 | 88.33 | 89.7 |
k-NN (\(k=3\)) | 83.3 | 69 | 66.66 | 75.5 | |
SVM | 80.6 | 86.2 | 76.66 | 83.3 | |
Naive Bayes | 82.1 | 79.3 | 73.33 | 80.7 | |
ANN | 83.9 | 89.7 | 75 | 86.7 | |
Tenfold | C4.5 | 89.7 | 89.7 | 88.33 | 89.7 |
k-NN (\(k=3\)) | 73.1 | 65.5 | 56.67 | 69.1 | |
SVM | 80.6 | 86.2 | 75 | 83.3 | |
Naive Bayes | 83.3 | 86.2 | 75 | 84.7 | |
ANN | 76.5 | 89.7 | 70 | 82.5 | |
k-fold (\(k=60\)) \(\equiv \) LOOCV | C4.5 | 89.7 | 89.7 | 88.33 | 89.7 |
k-NN (\(k=3\)) | 73.1 | 65.5 | 58.33 | 69.1 | |
SVM | 80 | 82.8 | 75 | 81.4 | |
Naive Bayes | 85.2 | 79.3 | 73.33 | 82.1 | |
ANN | 78.8 | 89.7 | 75 | 83.9 | |
Average (%) | 80.568 | 82.086 | 73.0972 | 81.164 | |
Maximum (%) | 93.1 | 93.1 | 91.67 | 93.1 | |
Conventional CBR system | 76 | 47.5 | 55 | 58.46 | |
Proposed FCBR system | 96 | 97.96 | 95 | 96.97 |
Rights and permissions
About this article
Cite this article
El-Sappagh, S., Elmogy, M., Ali, F. et al. A case-base fuzzification process: diabetes diagnosis case study. Soft Comput 23, 5815–5834 (2019). https://doi.org/10.1007/s00500-018-3245-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-018-3245-3