Boosted K-nearest neighbor classifiers based on fuzzy granules

https://doi.org/10.1016/j.knosys.2020.105606

Abstract

K-nearest neighbor (KNN) is a classic classifier that is simple and effective. AdaBoost combines several weak classifiers into a strong classifier to improve classification performance. Both classifiers have been widely used in machine learning. In this paper, based on fuzzy information granulation, KNN and AdaBoost, we propose two classification algorithms: a fuzzy granule K-nearest neighbor (FGKNN) and a boosted fuzzy granule K-nearest neighbor (BFGKNN). By introducing granular computing, we cast problem solving as a structured, hierarchical process. This focus on structured information processing enhances both the accuracy and the robustness of data classification. First, a fuzzy set is introduced, and atomic-attribute fuzzy granulation is performed on the samples of the classification system to form fuzzy granules. A fuzzy granule vector is then built from multiple attribute fuzzy granules. We design operators and define a measure on fuzzy granule vectors in the fuzzy granule space, and we prove the monotonicity principle of the distance between fuzzy granule vectors. Furthermore, we define the concept of the K-nearest-neighbor fuzzy granule vector and present the FGKNN and BFGKNN algorithms. Finally, we compare the performance of KNN, Back Propagation Neural Network (BPNN), Support Vector Machine (SVM), Logistic Regression (LR), FGKNN and BFGKNN on UCI data sets. Theoretical analysis and experimental results show that FGKNN and BFGKNN perform better than the methods mentioned above when appropriate parameters are given.

Introduction

As early as the 1960s, Zadeh, a famous American expert in cybernetics, proposed fuzzy set theory, and in 1979 he first presented the problem of fuzzy information granulation [1]. He believed that human cognition can be summarized by three main characteristics: granulation, organization and causality [2], [3], [4], [5]. In 1985, Hobbs [6] presented the concept of granularity. Later, Yager and Filev [7] further pointed out that "people have formed a granular view of the world". From this point of view, human observation, measurement, conceptualization and reasoning are carried out in a granular sense. Granular computing was first proposed by T.Y. Lin [8], [9]. Information granules are not only reflective of the nature of the data but can also efficiently capture auxiliary domain knowledge conveyed by the user, and in this way they reflect the human-centric aspects of the investigation and enhance the actionability of the results [10].

The concept of information granularity is ubiquitous, and granular computing has promoted the development of many concepts, such as graphs [11], [12], [13], information tables [14], knowledge representation [15], association discovery and data mining [16], clustering [17], rule clustering [18], and classification [19]. Granular computing is also widely applied to time series forecasting [20], prediction tasks [21], concept learning [22], perception [23], optimization [24], credit scoring [25], etc. Many scholars have conducted extensive and in-depth research from various angles. Miao discussed the structure of granular computing from the perspective of set theory [26]. Wang analyzed uncertainty measures in granular computing and their application to big data [27], [28]. Yao proposed neighborhood systems and neighborhood granular computing [29], [30]. Hu analyzed neighborhood reduction and classification [31], [32], [33]. Chen studied feature dimension reduction and optimization from the perspective of swarm intelligence [34], [35]. These views suggest that granulation, as one of the important features of human cognition, plays an important role in modeling complex data.

The KNN algorithm was first proposed by Hart in 1968 [36]. KNN is a non-parametric statistical method for classification and regression in the field of pattern recognition [37]. It uses a vector space model in which cases of the same category have high similarity to one another, so the possible category of an unknown case can be evaluated by computing its similarity to cases of known categories. It is a simple and effective non-parametric classification method. Among its advantages, it is well suited to incremental learning, requires no prior knowledge of the sample distribution, needs no explicit rules, and achieves high classification accuracy, so it is widely used in fields such as clustering, big data, and multi-label learning [38], [39], [40], [41], [42], [43]. However, the classical KNN algorithm has high time and space complexity; giving the K neighbor samples equal weight affects classification accuracy; it is sensitive to noise and has low classification accuracy on unbalanced samples; and determining the value of K is difficult. Many scholars have proposed improvements in these respects and enhanced its performance [44], [45], [46], [47], [48], [49], [50], [51], [52].
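To make the classical procedure concrete, here is a minimal KNN sketch in Python. It is illustrative only, not code from the paper, and the function names are ours:

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=5):
    """Classify x by majority vote among its k nearest training samples."""
    # Euclidean distance from x to every training sample
    dists = np.linalg.norm(X_train - x, axis=1)
    # Indices of the k closest training samples
    nearest = np.argsort(dists)[:k]
    # Majority vote over the neighbors' labels
    return Counter(y_train[nearest]).most_common(1)[0][0]
```

Note that all k neighbors vote with equal weight, which is exactly one of the weaknesses listed above; weighting votes by inverse distance is a common remedy.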

It is difficult to construct a single classifier with high accuracy. However, it is possible to build a strong classifier with high accuracy by integrating several weak classifiers; the weak learning theorem [53] theoretically supports this possibility. How to construct weak classifiers and how to combine them is the central question of ensemble learning research. At present, the most successful ensemble learning algorithm is AdaBoost [37], first proposed by Freund and Schapire in 1995. In 1999, Schapire et al. extended AdaBoost, which produces binary judgments, to continuous AdaBoost with continuous confidence outputs, which describes classification boundaries more accurately and yields better classification results [54]. The AdaBoost algorithm is simple and widely applied in many fields, such as face recognition, water quality detection, protein prediction, pedestrian detection, EEG signal analysis, and urban rail transit [55], [56], [57], [58], [59], [60], [61], [62], [63], [64]. It has also attracted a large number of scholars to study and improve its generalization ability [65], [66], [67], [68].
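The reweighting idea behind binary AdaBoost can be sketched as follows (labels in {-1, +1}; a generic weak learner interface is assumed, since in this paper the weak classifiers are the fuzzy granule KNNs introduced later):

```python
import numpy as np

def adaboost_train(X, y, weak_fit, T=50):
    """Binary AdaBoost. weak_fit(X, y, w) must return a classifier h with
    h(X) -> array in {-1, +1}, trained under sample weights w."""
    n = len(y)
    w = np.full(n, 1.0 / n)            # uniform initial sample weights
    ensemble = []
    for _ in range(T):
        h = weak_fit(X, y, w)
        pred = h(X)
        err = np.sum(w * (pred != y))  # weighted training error
        if err >= 0.5:                 # no better than chance: stop
            break
        alpha = 0.5 * np.log((1 - err) / max(err, 1e-12))
        # Raise weights of misclassified samples, lower the rest
        w *= np.exp(-alpha * y * pred)
        w /= w.sum()
        ensemble.append((alpha, h))
    return ensemble

def adaboost_predict(ensemble, X):
    """Sign of the alpha-weighted vote of the weak classifiers."""
    return np.sign(sum(alpha * h(X) for alpha, h in ensemble))
```

Misclassified samples have their weights multiplied by e^alpha, so each subsequent weak learner concentrates on the cases its predecessors got wrong, and the final decision is the sign of the alpha-weighted vote.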

In this paper, we define the fuzzy granule vector based on fuzzy information granulation from a new perspective and design two new classification models, FGKNN and BFGKNN. On the basis of fuzzy granulation of the attributes of a classification system, we define a distance between fuzzy granule vectors, propose the concept of the K-nearest fuzzy granule vector, and transform the classification problem into a K-nearest fuzzy granule vector search problem. On this foundation we present the FGKNN classification model and, building on FGKNN, design the BFGKNN model. We employ 10-fold cross-validation to test the performance of the two algorithms on UCI data sets. Theoretical analysis and experimental results show that FGKNN and BFGKNN achieve better performance under appropriate parameters.

Section snippets

Fuzzy information granulation

In many cases, the granularity of human reasoning and concept construction is ambiguous rather than precise. Fuzzy information granulation is usually obtained through a fuzzy binary relation, and the granulation is carried out over the entire fuzzy granule space. A series of definitions is given as follows.

Definition 1

Let CS = (S, R, L) be a classification system, where S = {s1, s2, …, sn} is a sample set, R = {r1, r2, …, rm} is an attribute set, and L = {l} expresses the label (its values can be discrete or
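Definition 1 is truncated here, so the paper's exact membership construction is not shown. Purely as an illustrative assumption, atomic-attribute fuzzy granulation is often realized with a Gaussian-type membership function centered at the sample's attribute value; a minimal sketch:

```python
import numpy as np

def fuzzy_granule(value, universe, sigma=0.1):
    """One atomic-attribute fuzzy granule: a membership vector over the
    attribute's discretized value universe, peaked at the sample's own value.
    The Gaussian form is our assumption, not necessarily Definition 1's."""
    return np.exp(-((universe - value) ** 2) / (2 * sigma ** 2))

def granule_vector(sample, universes, sigma=0.1):
    """A fuzzy granule vector: one fuzzy granule per atomic attribute."""
    return [fuzzy_granule(v, u, sigma) for v, u in zip(sample, universes)]
```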

Experimental analysis

In this paper, we use 5 data sets from UCI as the data sources for experimental testing. Based on them, we modify 1% of the data to build another 5 data sets with noise (see Table 4). Since the value ranges of the data sets differ, each data set needs to be normalized. We employ the min-max method (see Eq. (17)) to ensure that all data are mapped into the range [0,1]. Samples are fuzzy granulated on each atomic attribute to form fuzzy granule vectors. We adopt
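Eq. (17) itself is not reproduced in this excerpt, but the min-max method described maps each attribute value x to (x - min)/(max - min) so that all values fall in [0, 1]; a sketch under that standard formula:

```python
import numpy as np

def min_max_normalize(X):
    """Column-wise min-max scaling into [0, 1]: (x - min) / (max - min)."""
    mn, mx = X.min(axis=0), X.max(axis=0)
    rng = np.where(mx > mn, mx - mn, 1.0)  # guard against constant columns
    return (X - mn) / rng
```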

Conclusion

Classic classifiers operate on numerical calculations and do not involve set operations. From the viewpoint of fuzzy granulation of samples, we propose two classification algorithms in fuzzy-set form, FGKNN and BFGKNN. First, the fuzzy granulation method is introduced, fuzzy granule vectors and their rules are constructed in the classification system, and the similarity and operation rules of fuzzy granule vectors are defined. Next, the concept of the K-nearest fuzzy granule vector is defined,

CRediT authorship contribution statement

Wei Li: Conceptualization, Methodology, Software, Validation, Formal analysis, Data curation, Writing - original draft, Project administration, Funding acquisition, Supervision, Writing - review & editing, Investigation, Visualization, Resources. Yumin Chen: Visualization, Writing - review & editing. Yuping Song: Resources, Writing - review & editing.

Acknowledgments

This work was supported by the Science and Technology Planning Guidance Project of Xiamen, China (No. 3502Z20179038), the Natural Science Foundation of Fujian Province, China (No. 2015J05015) and the National Natural Science Foundation of China (No. 61573297).


References (68)

• Hu, H., et al., Perception granular computing in visual haze-free task, Expert Syst. Appl. (2014)
• Martínez-Frutos, J., et al., Efficient topology optimization using GPU computing with multilevel granularity, Adv. Eng. Softw. (2017)
• Saberi, M., et al., A granular computing-based approach to credit scoring modeling, Neurocomputing (2013)
• Wang, G., et al., Monotonic uncertainty measures for attribute reduction in probabilistic rough set model, Internat. J. Approx. Reason. (2015)
• Hu, Q., et al., Neighborhood classifiers, Expert Syst. Appl. (2008)
• Hu, Q., et al., Neighborhood rough set based heterogeneous feature subset selection, Inform. Sci. (2008)
• Zhu, P., et al., Neighborhood classifiers, Inform. Sci. (2013)
• Chen, Y., et al., A rough set approach to feature selection based on ant colony optimization, Pattern Recognit. Lett. (2010)
• Chen, Y., et al., Finding rough set reducts with fish swarm algorithm, Knowl.-Based Syst. (2015)
• Freund, Y., et al., A decision-theoretic generalization of on-line learning and an application to boosting, J. Comput. Syst. Sci. (1997)
• Tan, S., An effective refinement strategy for KNN text classifier, Expert Syst. Appl. (2006)
• Gallego, A.-J., et al., Clustering-based k-nearest neighbor classification for large-scale data with neural codes representation, Pattern Recognit. (2018)
• Maillo, J., et al., kNN-IS: An iterative spark-based design of the k-nearest neighbors classifier for big data, Knowl.-Based Syst. (2017)
• Zhang, M., et al., ML-KNN: A lazy learning approach to multi-label learning, Pattern Recognit. (2007)
• Du, M., et al., Study on density peaks clustering based on k-nearest neighbors and principal component analysis, Knowl.-Based Syst. (2016)
• Liu, R., et al., Shared-nearest-neighbor-based clustering by fast search and find of density peaks, Inform. Sci. (2018)
• Bhattacharya, G., et al., Granger causality driven AHP for feature weighted kNN, Pattern Recognit. (2017)
• Kar, S., et al., Gene selection from microarray gene expression data for classification of cancer subgroups employing PSO and adaptive K-nearest neighborhood technique, Expert Syst. Appl. (2015)
• Ochs, R.A., et al., Automated classification of lung bronchovascular anatomy in CT using AdaBoost, Med. Image Anal. (2007)
• Landesa-Vázquez, I., et al., Shedding light on the asymmetric learning capability of AdaBoost, Pattern Recognit. Lett. (2012)
• Niu, B., et al., Predicting toxic action mechanisms of phenols using AdaBoost learner, Chemometr. Intell. Lab. Syst. (2009)
• Saghafi, H., et al., Modeling of CO2 solubility in MEA, DEA, TEA, and MDEA aqueous solutions using AdaBoost-decision tree and artificial neural network, Int. J. Greenh. Gas Control (2017)
• Cheng, W., et al., A self-constructing cascade classifier with AdaBoost and SVM for pedestrian detection, Eng. Appl. Artif. Intell. (2013)
• Zadeh, L.A., Fuzzy sets and information granularity (1979)

Wei Li is an associate professor and master supervisor with the School of Computer and Information Engineering at Xiamen University of Technology, Xiamen, China. He is also a member of the China Computer Federation (CCF). His research interests include Artificial Intelligence, Computer Graphics, Machine Learning and Granular Computing. He received the Ph.D. degree in Basic Theory of Artificial Intelligence from Xiamen University, China in 2013. He was also a visiting scholar with the Department of Computer Science at the University of Massachusetts Boston, USA, from June 2018 to June 2019. Contact him at [email protected]

Yumin Chen is a professor at Xiamen University of Technology and a tutor of Ph.D. students at Fuzhou University. He received his Ph.D. from Tongji University, China, in 2010. He was a post-doctor at the University of Electronic Science and Technology of China from 2014 to 2017. He is a committee member of Rough Sets and Soft Computing of China, a member of CCF, a member of CAA and a member of CAAI. His research interests include artificial intelligence, machine learning, pattern recognition and rough sets.

Yuping Song is an associate professor in the School of Mathematical Sciences at Xiamen University. Her research interests include discrete differential geometry, computer graphics, and artificial intelligence. She received a Ph.D. in Pure Mathematics from Peking University. Contact her at [email protected]

No author associated with this paper has disclosed any potential or pertinent conflicts which may be perceived to have impending conflict with this work. For full disclosure statements refer to https://doi.org/10.1016/j.knosys.2020.105606.
