Skip to main content
Log in

A case-base fuzzification process: diabetes diagnosis case study

  • Methodologies and Application
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Medical case-based reasoning (CBR) systems require the handling of vague or imprecise data. The fuzzy set theory is particularly suitable for this purpose. This paper proposes a case-base preparation framework for CBR systems, which converts the electronic health record medical data into fuzzy CBR knowledge. It generates fuzzy case-base knowledge by suggesting a standard crisp entity–relationship data model for CBR case-base. The resulting data model is fuzzified using a proposed relational data model fuzzification methodology. The performances of this methodology and its resulting fuzzy case-base structure are evaluated. Diabetes diagnosis is used as a case study. A set of 60 real diabetic cases is used in the study. A fuzzy CBR system is implemented to check the diagnoses accuracy. It combines the resulting fuzzy case-base with a proposed fuzzy similarity measure. Experimental results indicate that the proposed fuzzy CBR method is superior to traditional CBR and other machine-learning methods. Our fuzzy CBR achieves an accuracy of 95%, a precision of 96%, a recall 97.96%, an f-measure of 96.97%, a specificity of 81.82%, and good robustness for dealing with vagueness. The resulting fuzzy case-base relational database enhances the representation of case-base knowledge, the performance of retrieval algorithms, and the querying capabilities of CBR systems.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. http://archive.ics.uci.edu/ml/datasets.

  2. http://gaia.fdi.ucm.es/research/colibri/jcolibri.

References

  • Abbasbandy S, Hajjari T (2010) Weighted trapezoidal approximation-preserving cores of a fuzzy number. Comput Math Appl 59(9):3066–3077

    Article  MathSciNet  MATH  Google Scholar 

  • Adekunle Y (2015) The prediction, diagnosis, and treatment of diabetes mellitus using an intelligent decision support system framework. Int J Adv Res Comput Sci Softw Eng 5(3):1285–1292

    Google Scholar 

  • Aibinu A, Salami M, Shafie A (2010) Application of modeling techniques to diabetes diagnosis. In: IEEE EMBS conference on biomedical engineering and sciences (IECBES), pp 194–198

  • Ali M, Han S, Bilal H, Lee S, Kang M, Kang B, Razzaq M, Amin M (2018a) iCBLS: an interactive case-based learning system for medical education. Int J Med Inform 109:55–69

    Article  Google Scholar 

  • Ali F, Islam SR, Kwak D, Khan P, Ullah N, Yoo SJ, Kwak KS (2018b) Type-2 fuzzy ontology-aided recommendation systems for IoT-based healthcare. Comput Commun

  • Arias-Aranda D, Castro J, Navarro M, Zurita J (2009) A CBR system for knowing the relationship between flexibility and operations strategy. Found Intell Syst 5722:463–472

    Article  Google Scholar 

  • Balakrishnan V, Shakouri M, Hoodeh H, Loo H (2012) Predictions using data mining and case-based reasoning: a case study for retinopathy. World Acad Sci Eng Technol 63:573–576

    Google Scholar 

  • Barakat N, Barakat M (2010) Intelligible support vector machines for diagnosis of diabetes mellitus. IEEE Trans Inf Technol Biomed 14(4):1114: 1120

    Article  Google Scholar 

  • Begum S, Ahmed M, Funk P, Xiong N, Schéele V (2009) A case-based decision support system for individual stress diagnosis using fuzzy similarity matching. Int J Comput Intell 25(3):180–195

    Article  MathSciNet  Google Scholar 

  • Bellazzi R, Montani S, Portinale L (1998) Retrieval in a prototype-based case library: a case study in diabetes therapy revision. Adv Case Based Reason 1488:64–75

    Article  Google Scholar 

  • Biswas S, Chakraborty M, Singh H et al (2017) Hybrid case-based reasoning system by cost-sensitive neural network for classification. Soft Comput 21(24):7579–7596

    Article  Google Scholar 

  • Brown D, Aldea A, Harrison R, Martin C, Bayley I (2018) Temporal case-based reasoning for type 1 diabetes mellitus bolus insulin decision support. Artif Intell Med 85:28–42

    Article  Google Scholar 

  • Çalisir D, Dogantekin E (2011) An automatic diabetes diagnosis system based on LDA-wavelet support vector machine classifier. Expert Syst Appl 38(7):8311–8315

    Article  Google Scholar 

  • Chang X, Lilly J (2004) Evolutionary design of a fuzzy classifier from data. IEEE Trans Syst Man Cybern B Cybern 34(4):1894–1906

    Article  Google Scholar 

  • Dogantekin E, Dogantekin A, Avci D, Avci L (2010) An intelligent diagnosis system for diabetes on linear discriminant analysis and adaptive network based fuzzy inference system: LDA-ANFIS. Digit Signal Proc 20(4):1248–1255

    Article  Google Scholar 

  • El-Sappagh SH, Elmogy M, Riad A, Badria F, Zaghlol H (2014) EHR data preparation for case based reasoning construction. Adv Mach Learn Technol Appl 488:483–497

    Google Scholar 

  • El-Sappagh SH, Elmogy M, Riad A (2015) A CBR system for diabetes mellitus diagnosis: CB standard data model. Int J Med Eng Inform 7(3):191–208

    Article  Google Scholar 

  • Fan C, Chang P, Lin J, Hsieh J (2011) A hybrid model combining case-based reasoning and Fuzzy decision tree for medical data classification. Appl Soft Comput 11:632–644

    Article  Google Scholar 

  • Ganji M, Abadeh A (2011) A Fuzzy classification system based on ant colony optimization for diabetes disease diagnosis. Exp Syst Appl 38:14650–9

    Article  Google Scholar 

  • Gerstenkorn T, Man’ko J (1991) Correlation of intuitionistic fuzzy sets. Fuzzy Sets Syst 44:39–43

    Article  MathSciNet  MATH  Google Scholar 

  • Godo L, Sandri S, Dutra L, Freitas C, Carvalho O, Guimarães R, Amaral R (2009) Classification of schistosomiasis prevalence using fuzzy case-based reasoning. Bio Inspired Syst Comput Ambient Intell 5517:1053–1060

    Article  Google Scholar 

  • Goncalves L, Vellasco M, Pacheco M, de Souza F (2006) Inverted hierarchical neuro-fuzzy BSP system: a novel neuro-fuzzy model for pattern classification and rule extraction in databases. IEEE Trans Syst Man Cybern C Appl Rev 36(2):236–248

    Article  Google Scholar 

  • Hyung L, Song Y, Lee K (1994) Similarity measure between fuzzy sets and between elements. Fuzzy Sets Syst 62:291–293

    Article  MathSciNet  Google Scholar 

  • Jin Q, Jie H, Ying-hong P, Wei-ming W, Zhen-fei Z (2010) New weighted fuzzy case retrieval method for customer-driven product design. J Shanghai Jiaotong Univ (Sci) 15(6):641–650

    Article  Google Scholar 

  • Kahramanli H, Allahverdi N (2008) Design of a hybrid system for the diabetes and heart diseases. Expert Syst Appl 35(1–2):82–89

    Article  Google Scholar 

  • Kalaiselvi C, Nasira G (2014) A new approach for diagnosis of diabetes and prediction of cancer using ANFIS. In: IEEE conference of world congress on computing and communication technologies, pp 188–190

  • Kalpana M, Kumar A (2011a) Fuzzy expert system for diabetes using fuzzy verdict mechanism. Int J Adv Netw Appl 03(02):1128–1134

    Google Scholar 

  • Kalpana M, Kumar A (2011b) Fuzzy expert system for diagnosis of diabetes using fuzzy determination mechanism. Int J Comput Sci Emerg Technol 2(6):354–361

    Google Scholar 

  • Karatsiolis S, Schizas C (2012) Region based support vector machine algorithm for medical diagnosis on Pima Indian Diabetes dataset. In: IEEE 12th international conference on bioinformatics and bioengineering (BIBE), pp 139–144

  • Khanum A, Mufti M, Javed M, Shafiq M (2009) Fuzzy case-based reasoning for facial expression recognition. Fuzzy Sets Syst 160:231–250

    Article  MathSciNet  Google Scholar 

  • Kumari S, Singh A (2013) A data mining approach for the diagnosis of diabetes mellitus. In: 7th International conference on intelligent systems and control (ISCO), pp 373–375

  • Lee C, Wang M (2011) A fuzzy expert system for diabetes decision support application. IEEE Trans Syst Man Cybern Part B Cybern 41(1):139–153

    Article  Google Scholar 

  • Li S, Ho H (2009) Predicting financial activity with evolutionary Fuzzy case-based reasoning. Expert Syst Appl 36:411–422

    Article  Google Scholar 

  • Li H, Sun J (2011) On performance of case-based reasoning in Chinese business failure prediction from sensitivity, specificity, positive and negative values. Appl Soft Comput 11(1):460–7

    Article  Google Scholar 

  • Marling C, Shubrook J, Schwartz F (2008) Case-based decision support for patients with type 1 diabetes on insulin pump therapy. In: Advances in case-based reasoning: 9th European Conference (ECCBR), vol 5239, pp 325–339

  • Marling C, Wiley M, Cooper T, Bunescu R, Shubrook J, Schwartz F (2011) The 4 diabetes support system: a case study in CBR research and development. Case Based Reason Res Dev 6880:137–150

    Article  Google Scholar 

  • Mohamudally N, Khan D (2011) Application of a unified medical data miner (UMDM) for prediction, classification, interpretation and visualization on medical datasets: the diabetes dataset case. In: Advances in data mining: applications and theoretical aspects, vol 6870, pp 78–95

  • Montani S, Bellazzi R, Portinale L, d’Annunzio G, Fiocchi S, Stefanelli M (2000) Diabetic patients management exploiting case-based reasoning techniques. Comput Methods Programs Biomed 62(3):205–218

    Article  Google Scholar 

  • Pappis C, Karacapilidis N (1993) A comparative assessment of measures of similarity of fuzzy values. Fuzzy Sets Syst 56:171–174

    Article  MathSciNet  MATH  Google Scholar 

  • Patil B, Joshi R, Toshniwal D (2010) Hybrid prediction model for Type-2 diabetic patients. Expert Syst Appl 37(12):8102–8108

    Article  Google Scholar 

  • Petrovic S, Mishra N, Sundar S (2011) A novel case based reasoning approach to radiotherapy planning. Expert Syst Appl 38:10759–10769

    Article  Google Scholar 

  • Polat K, Güneş S (2007) An expert system approach based on principal component analysis and adaptive Neuro-fuzzy inference system to diagnosis of diabetes disease. Digit Signal Proc 17(4):702–710

    Article  Google Scholar 

  • Polat K, Gunes S, Aslan A (2008) A cascade learning system for classification of diabetes disease: generalized discriminant analysis and least square support vector machine. Expert Syst Appl 34(1):214–221

    Article  Google Scholar 

  • Portinale L, Montani S (2002) A fuzzy case retrieval approach based on SQL for implementing electronic catalogs. Adv Case Based Reason 2416:321–335

    Article  MATH  Google Scholar 

  • Qin Y, Lu W, Qi Q, Liu X, Huang M, Scott P, Jiang X (2018) Towards an ontology-supported case-base d reasoning approach for computer-aided tolerance specification. Knowl Based Syst 141:129–147

    Article  Google Scholar 

  • Radha R, Rajagopalan S (2007) Fuzzy logic approach for diagnosis of diabetics. Inf Technol J 61:96–102

    Google Scholar 

  • Raza B, Kumar Y, Malik A, Anjum A, Faheem M (2018) Performance prediction and adaptation for database management system workload using case-based reasoning approach. Inf Syst 76:46–58

    Article  Google Scholar 

  • Relich M, Pawlewski P (2018) A case-based reasoning approach to cost estimation of new product development. Neurocomputing 272:40–45

    Article  Google Scholar 

  • Rodriguez Y, Garcia M, Baets B, Morell C, Bello R (2006) A connectionist fuzzy case-based reasoning model. In: MICAI: advances in artificial intelligence. Springer, Berlin, pp 176–185

  • Samant P, Agarwal R (2018) Machine learning techniques for medical diagnosis of diabetes using iris images. Comput Methods Programs Biomed 157:121–128

    Article  Google Scholar 

  • Sarkheyli-Hägele A, Söffker D (2017) Fuzzy SOM-based case-based reasoning for individualized situation recognition applied to supervision of human operators. Knowl Based Syst 137:42–53

    Article  Google Scholar 

  • Shankaracharya OD, Samanta S, Vidyarthi A (2010) Computational intelligence in early diabetes diagnosis: a review. Rev Diabet Stud 7(4):252–62

    Article  Google Scholar 

  • Sushmita S, Chaudhury S (2007) Hierarchical fuzzy case based reasoning with multi-criteria decision making for financial applications. Pattern Recognit Mach Intell 4815:226–234

    Article  Google Scholar 

  • Temurtas H, Yumusak N, Temurtas F (2009) A comparative study on diabetes disease diagnosis using neural networks. Expert Syst Appl 36(4):8610–8615

    Article  Google Scholar 

  • Varma K, Rao A, Lakshmi T, Rao P (2014) A computational intelligence approach for a better diagnosis of diabetic patients. Comput Electr Eng 40:1758–1765

    Article  Google Scholar 

  • Wang W (1997) New similarity measures on fuzzy sets and on elements. Fuzzy Sets Syst 85:305–309

    Article  MathSciNet  MATH  Google Scholar 

  • Wu D, Li J, Bao C (2017) Case-based reasoning with optimized weight derived by particle swarm optimization for software effort estimation. Soft Comput 1–12

  • Xiaodong H, Jianwu W, Fuqian S, Haiyan C (2009) Apply fuzzy case-based reasoning to knowledge acquisition of product style. In: IEEE 10th international conference on computer-aided industrial design and conceptual design (CAID & CD), pp 383–386

  • Xiong N (2011) Learning fuzzy rules for similarity assessment in case-based reasoning. Expert Syst Appl 38:10780–10786

    Article  Google Scholar 

  • Yao B, Li S (2010) ANMM4CBR: a case-based reasoning method for gene expression data classification. Algorithms Mol Biol (AMB) 5:14

    Article  Google Scholar 

  • Yu F, Li X, Han X (2018) Risk response for urban water supply network using case-based reasoning during a natural disaster. Saf Sci 106:121–139

    Article  Google Scholar 

  • Zadeh LA (2003) From search engines to question-answering systems the need for new tools. In: The 12th IEEE international conference fuzzy systems, vol 2, pp 1107–1109

  • Zhang Z, Chen D, Feng Y, Yuan Z, Chen B, Qin W, Zou S, Qin S, Han J (2018) A strategy for enhancing the operational agility of petroleum refinery plant using case based fuzzy reasoning method. Comput Chem Eng 111:27–36

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Research Foundation of Korea (NRF) Grant funded by the Korea government (MSIT)-NRF-2017R1A2B2012337. The authors would like to thank Dr. Farid Badria, Prof. of Pharmacognosy, Department, and head of Liver Research Lab, Mansoura University, Egypt; and Dr. Hosam Zaghloul, Prof. at Clinical Pathology Department, Faculty of Medicine, Mansoura University, Egypt, for their efforts in this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammed Elmogy.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Additional information

Communicated by V. Loia.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 204 KB)

Appendices

Appendix A

The performance evaluation of the proposed fuzzy CBR system

\(K=3\)

Query cases

Retrieved cases

FKI-CBR diagnose

Domain expert diagnose

Diabetes diagnosis

Nephropathy

Hypercholesterolemia

Cancer

Liver

Glomerulonephritis

Splenomegaly

Confidence (%)

Case 1

Case 1-1

D

N

N

N

N

N

N

90

Diabetes Diagnosis: D

Case 1-2

P

N

N

N

N

N

N

88.9

Percentage of similarity for another diagnosis: 9/10

Case 1-3

D

N

H

N

N

N

N

88.8

Case 2

Case 2-1

N

N

N

N

N

N

N

95

Diabetes Diagnosis: N

Case 2-2

N

N

N

N

N

N

N

91

Percentage of similarity for another diagnosis: 10/10

Case 2-3

N

N

N

N

N

N

N

90

Case 3

Case 3-1

P

SH

N

N

N

G

N

88

Diabetes Diagnosis: D

Case 3-2

DG

SH

N

N

F

G

N

85

Percentage of similarity for another diagnosis: 10/10

Case 3-3

D

N

N

N

N

N

N

84

Case 4

Case 4-1

D

N

H

N

N

G

N

94

Diabetes Diagnosis: D

Case 4-2

D

N

H

N

N

G

N

93.2

Percentage of similarity for another diagnosis: 10/10

Case 4-3

N

NP

N

N

N

G

N

86.5

Case 5

Case 5-1

D

N

N

N

N

N

N

100

Diabetes Diagnosis: D

Case 5-2

D

N

N

O

N

N

N

88.8

Percentage of similarity for another diagnosis: 10/10

Case 5-3

D

N

N

N

N

N

N

86.9

Case 6

Case 6-1

P

SH

N

N

N

G

N

81

Diabetes Diagnosis: P

Case 6-2

D

N

N

N

N

N

N

79

Percentage of similarity for another diagnosis: 8/10

Case 6-3

D

N

N

HCC

HCC

N

N

79

Case 7

Case 7-1

N

N

N

N

N

N

N

92.4

Diabetes Diagnosis: N

Case 7-2

D

N

N

N

N

N

N

89.6

Percentage of similarity for another diagnosis: 10/10

Case 7-3

N

N

N

N

N

N

N

89.4

Case 8

Case 8-1

D

N

N

N

HCC

N

N

99

Diabetes Diagnosis: D

Case 8-2

N

N

H

N

N

N

N

82.5

Percentage of similarity for another diagnosis: 9/10

Case 8-3

P

N

N

N

N

N

N

80

Case 9

Case 9-1

D

N

H

N

N

N

N

93

Diabetes Diagnosis: D

Case 9-2

DG

SH

N

N

N

G

N

91

Percentage of similarity for another diagnosis: 10/10

Case 9-3

D

N

N

N

HCC

N

SG

90

Case 10

Case 10-1

D

N

N

N

N

N

N

91

Diabetes Diagnosis: D

Case 10-2

DG

N

N

N

N

N

N

89.9

Percentage of similarity for another diagnosis: 10/10

Case 10-3

P

N

H

N

N

N

N

89.2

N normal, A abnormal, D diabetic, P pre-diabetic, H hypercholesterolemia, O ovarian cancer, DG gestational diabetes, SH Shrunken Kidney, G glomerulonephritis, NP nephropathy, SG splenomegaly, HCC hepatocellular carcinoma, F fatty liver, LC liver cirrhosis

Appendix B

A comparison of the proposed fuzzy CBR system and traditional CBR system

Query case

Domain expert decision

Proposed system decision

Traditional system decision

Case1

Diabetes diagnosis: D

D (90%)

P (78%)

Case 2

Diabetes diagnosis: N

N (95%)

N (83%)

Case 3

Diabetes diagnosis: D

P (88%)

D (70%)

Case 4

Diabetes diagnosis: D

D (94%)

P (64%)

Case 5

Diabetes diagnosis: D

D (100%)

D (80%)

Case 6

Diabetes diagnosis: P

P (81%)

D (64%)

Case 7

Diabetes diagnosis: N

N (92.4%)

N (85%)

Case 8

Diabetes diagnosis: D

D (99%)

P (79%)

Case 9

Diabetes diagnosis: D

D (93%)

P (74%)

Case 10

Diabetes diagnosis: D

D (91%)

P (75%)

Appendix C

A comparison of our FCBR and ML classifiers using our case-base data

Fold

Algorithm

Precision (%)

TPR-Recall (%)

Accuracy (%)

F-Measure (%)

Machine-learning algorithms

Twofold

C4.5

90

93.1

90

91.5

k-NN (\(k=3\))

80

69

66.66

74.1

SVM

75.8

86.2

68.33

80.6

Naive Bayes

86.2

86.2

75

86.2

ANN

70.6

82.8

65

76.2

Threefold

C4.5

89.7

89.7

88.33

89.7

k-NN (\(k=3\))

74.1

69

60

71.4

SVM

78.8

89.7

71.66

83.9

Naive Bayes

83.9

89.7

75

86.7

ANN

73.5

86.2

65

79.4

Fourfold

C4.5

89.7

89.7

88.33

89.7

k-NN (\(k=3\))

76.9

69

65

72.7

SVM

77.4

82.8

71.66

80

Naive Bayes

79.3

79.3

71.66

79.3

ANN

76.7

79.3

66.66

78

Fivefold

C4.5

93.1

93.1

91.67

93.1

k-NN (\(k=3\))

73.1

65.5

63.33

69.1

SVM

75

82.8

66.66

78.7

Naive Bayes

83.3

69

66.66

75.5

ANN

75.8

86.2

68.33

80.6

Sixfold

C4.5

89.7

89.7

88.33

89.7

k-NN (\(k=3\))

74.1

69

63.33

71.4

SVM

81.3

89.7

75

85.2

Naive Bayes

86.2

86.2

73.33

86.2

ANN

76.5

89.7

70

82.5

Sevenfold

C4.5

90

93.1

90

91.5

k-NN (\(k=3\))

63.3

65.5

58.33

64.4

SVM

76.7

79.3

71.66

78

Naive Bayes

75.9

75.9

70

75.9

ANN

76.7

79.3

65

78

Eightfold

C4.5

89.3

86.2

86.66

87.7

k-NN (\(k=3\))

78.6

75.9

66.66

77.2

SVM

82.1

79.3

71.66

80.7

Naive Bayes

78.1

86.2

73.33

82

ANN

77.4

82.8

71.66

80

Ninefold

C4.5

89.7

89.7

88.33

89.7

k-NN (\(k=3\))

83.3

69

66.66

75.5

SVM

80.6

86.2

76.66

83.3

Naive Bayes

82.1

79.3

73.33

80.7

ANN

83.9

89.7

75

86.7

Tenfold

C4.5

89.7

89.7

88.33

89.7

k-NN (\(k=3\))

73.1

65.5

56.67

69.1

SVM

80.6

86.2

75

83.3

Naive Bayes

83.3

86.2

75

84.7

ANN

76.5

89.7

70

82.5

k-fold (\(k=60\)) \(\equiv \) LOOCV

C4.5

89.7

89.7

88.33

89.7

k-NN (\(k=3\))

73.1

65.5

58.33

69.1

SVM

80

82.8

75

81.4

Naive Bayes

85.2

79.3

73.33

82.1

ANN

78.8

89.7

75

83.9

Average (%)

80.568

82.086

73.0972

81.164

Maximum (%)

93.1

93.1

91.67

93.1

Conventional CBR system

76

47.5

55

58.46

Proposed FCBR system

96

97.96

95

96.97

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

El-Sappagh, S., Elmogy, M., Ali, F. et al. A case-base fuzzification process: diabetes diagnosis case study. Soft Comput 23, 5815–5834 (2019). https://doi.org/10.1007/s00500-018-3245-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-018-3245-3

Keywords

Navigation