Abstract

Settlement simulating in cohesion materials is a crucial issue due to complexity of cohesion soil texture. This research emphasis on the implementation of newly developed machine learning models called hybridized Adaptive Neuro-Fuzzy Inference System (ANFIS) with Particle Swarm Optimization (PSO) algorithm, Ant Colony optimizer (ACO), Differential Evolution (DE), and Genetic Algorithm (GA) as efficient approaches to predict settlement of shallow foundation over cohesion soil properties. The width of footing (B), pressure of footing (qa), geometry of footing (L/B), count of SPT blow (N), and ratio of footing embedment (Df/B) are considered as predictive variables. Nonhomogeneity and inconsistency of employed dataset is a major concern during prediction modeling. Hence, two different modeling scenarios (i) preprocessed dataset (PP) and (ii) nonprocessed (initial) dataset (NP) were inspected. To assess the accuracy of the applied hybrid models and standalone one, multiple statistical metrics were computed and analyzed over the training and testing phases. Results indicated ANFIS-PSO model exhibited an accurate and reliable prediction data intelligent and had the highest predictability performance against all employed models. In addition, results demonstrated that data preprocessing is highly essential to be performed prior to building the predictive models. Overall, ANFIS-PSO model showed a robust machine learning for settlement prediction.

1. Introduction

1.1. Research Background

In practice, three types of shallow foundation settlements are commonly encountered, and these are immediate settlement, consolidation settlement, and secondary compression settlement [1]. Immediate settlement is encountered when the load is applied immediately after initiating a structure. It is primarily a consequence of soil-grain distortion and reorientation. Consolidation settlement, on the other hand, is time-dependent and generally takes longer time to occur. It occurs due to water pressure dissipation over time. Secondary compression settlement results as a soil creep; it is a viscous flow under load with no changes in effective stress. The total settlement of a foundation is the sum of the above three components. For cohesionless soils, the only source of concern is immediate settlement, while consolidation and secondary compression settlements are the primary factors associated with cohesive soils.

Generally, sand deposits are more heterogeneous than clay deposits; hence, there is likely to be higher differential settlements in sand deposits compared to clay deposits [2]. Due to the high level of permeability of cohesionless soils, it takes a shorter time for the settlement to occur after applying load [3]. The result of such a quick settlement is a relative rapid superstructure deformation and the consequential inability to curtail the damage and prevent further deformation. Furthermore, structural failures can result from excessive settlement [4]. The design of shallow foundations is mainly controlled by two major criteria which are the settlement of the foundations and the bearing capacity of the footing. However, settlement often controls the design process rather than the bearing capacity; this is mainly the case when the width of footing is more than 1 meter (3-4 ft) [5]. Therefore, settlement prediction is an essential and important criterion during the design of shallow foundations.

On cohesionless soils, the prediction of the settlement of shallow foundations is complicated and yet to be understood because settlement is governed by many uncertain and unquantifiable factors [6]. Some of these uncertain factors include the distribution of the applied stress [7], the soils’ stress and strain properties, the compressibility of the soil, and the difficulty in obtaining undisturbed samples of cohesionless soils for laboratory testing [8].

1.2. Literature Review and Research Motivation

There are several methods in the geotechnical literature (theoretical and experimental) for the prediction of the settlement of shallow foundations on several types of soil [2, 914]. As presented in Figure 1, usually several variables are incorporated with the magnitude of soil settlement such as net applied pressure (qo), Poisson’s ratio of soil (), average modulus of elasticity of soil (Es), and foundation dimensions (B and L). Thus, the settlement behavior is a highly complex geotechnical engineering problem owing to the association with varying variability.

Due to the difficulty in obtaining undisturbed samples for cohesionless soils, many methods for settlement prediction have concentrated on the correlations between in situ investigations, such as the standard penetration test (SPT) [15], cone penetration test (CPT) [16], dilatometer test [17], plate load test [18], and screw plate load test. In most of the available methods, the problem is simplified by introducing several assumptions related to the factors that affect settlement. As a result, these available methods (ranging from purely empirical to complex nonlinear finite elements) cannot achieve a consistent and accurate settlement prediction [19]. Over the literature, several comparative studies using machine learning models demonstrated inconsistent of settlement prediction magnitudes. Hence, reliable alternative model is always the inspiration of geotechnical scientists to explore and investigate [20]. Machine learning models exhibited a new era of modeling methodologies for various engineering applications [2127]. Within the settlement determination, artificial neural network (ANN) models have been introduced to this field since about three decades ago [2832]. This was followed by several implementations over the past two decades using diverse AI models including gene expression programming [33, 34], support vector machine [35], least square support vector machine [36], and neuro-fuzzy network [37, 38]. Among several AI models reported in the literature, adaptive neural fuzzy inference system (ANFIS) model exhibited a distinguished modeling technique in the field of geotechnical engineering [39]. This is owing to its robustness in capturing the high complex nonlinearity and nonstationarity of the experimental laboratory dataset. However, the main drawback associated with standalone AI models is the internal hyperparameter tuning, and thus, the integration of these models with bioinspired optimization algorithms and produce what so-called hybrid model is the era of modeling strategy for simulating complex engineering problems [4042]. In the current research, the feasibility of four different nature-inspired algorithms (i.e., PSO, ACO, DE, and GA) are investigated to tune ANFIS model as a predictive paradigm for simulating shallow foundation settlement. The selection of those optimizers is owing to their capacity in tuning the internal parameters of AI models and that was evidenced through several established research studies over the literature [43, 44].

1.3. Research Objectives

To the best knowledge of the current research, the development of a hybridized data-intelligence model based on the integration of bioinspired optimization algorithms with ANFIS model was proposed with the aim of achieving more reliable and robust settlement prediction results. The explored hybrid model was verified for viability on the shallow foundation settlement within cohesion soil properties. The major benefit of the proposed models is the ability to simulate the nonlinearity connection between the inputs (correlated attributes) and the output (target variable) without the need for a predefined formulation. Additionally, the concept is more like the introduced settlement prediction methodologies but with the added advantage of the intelligence to capture the complex mechanism of settlement variability.

2. Applied Hybrid Models

Soft computing models are mainly used to explain complex engineering problems which are highly stochastic and nonlinear. This is the case when the traditional models are not able to be adequately applied. SC is the basic form of artificial intelligence that can be implemented to solve difficult applications using the aspect on intelligent human brain behavior. Also, the soft computing modeling is following the basic concept of black box that is required no preknowledge of the investigated problem. Furthermore, both partial truth and approximation can be considered in soft computing.

2.1. Adaptive Neuro-Fuzzy Inference System (ANFIS)

The fuzzy logic (FL) concept was conceived several years back as a data processing method which allows partial set membership [45]. The FL method of data analysis was mainly popular because people do not intend the precise numerical input. Its major advantages include the provision of an easy way to arrive at a conclusion based on noisy or imprecise input data [46, 47]. A proper knowledge of the shape and type of fuzzy rules, as well as the membership functions, is important in order to achieve the best results. However, some cases require the use of the time-consuming methods such as trial and error. The model can be trained using artificial neural networks. A hybrid combination of neural networks with fuzzy systems could produce a more robust model with numerous advantages [48, 49].

Therefore, a neuro-fuzzy system could be considered as a hybrid algorithm which can make decisions based on fuzzy and, at the same time, a modern soft computing-based method in terms of ANN. Jang introduced ANFIS in 1993 while Sugeno fuzzy system was developed based on the learning capability of ANN [50]. In a fuzzy system, the basic and most important components are the rules. The required rules will be optimized using ANN [51, 52].

The first proposed ANFIS model had five layers. The schematic structure of ANFIS is presented in Figure 2(b). The utilized rules are as follows:where A1-A2 and B1-B2 are membership functions for input and input y, respectively.

In layer 1, each node is a square node creating the membership grades. Utilizing the membership function, inputs (x and y) would be translated into linguistic terms:where is the input value to node and is the linguistic term. Furthermore, is the membership function of . Available types for membership functions are Gaussian, triangular, and trapezoidal. The Gaussian function has the following formula:where and are the antecedent parameters.

Similarly, in the second layer, each node is a circle one, and the output would be obtained using the following function:where is the weight influence of the rule. In the third layer, the nodes calculate the ratio of the rules weight divided by the sum of all weights as follows:

All nodes in the fourth layer are square having the function as follows:where is the output of the third layer. , , and are the consequent parameters.

Finally, the fifth layer consists of only one circle node (∑) as follows:

The schematic view of ANFIS modeling is presented in Figure 2.

Herein, five membership functions (Gaussian function) are considered for each input variable and five membership functions (liner function) for output. Totally, the proposed ANFIS model has 80 parameters (50 antecedent parameters and 30 consequent parameters) which were optimized by proposed nature-inspired optimization algorithms.

2.2. Particle Swarm Optimization (PSO) Algorithm

PSO algorithm was developed as an optimization method based on the behavior and movements of a community of birds, fish, and insects. It was introduced by Reynolds when he considered 3 kinds of operators designated as alignment, separation, and cohesion. The algorithm was modelled in the likeness of a group of particles flowing within a search space in a bid to achieve an optimal point. The particles within the search space adjust their movement based on their own experience and on the experience of the other particles [5254]. Similarly, they also adjust their speed based on their own experience and on the experience of the other particles. They change their position with respect to the position/velocity/distance of the best particle. The particle update rule is as follows:with,where and rand are the position, direction, the weight of local data, the weight of global data, best position of the particles, best position of the swarm, and a random value, respectively. The following equations are used to update the particles’ velocities:In the above equation, the three major terms are inertia, personal influence, and social influence, respectively. Figure 3(a) presents the flowchart of PSO.

2.3. Ant Colony Optimization (ACO) Algorithm

Dorigo introduced ant colony system some years back, but in recent years, many researchers have focused on the extended versions of the ant system [55]. Being that the ACO can solve static and dynamic problems, they are applicable in several optimization problems. Activities such as foraging (food searching), cooperative transport, labor division, and brood sorting are governed by what is commonly known as stigmergy which enables them to achieve self-organization. Although there are simple individuals in the ant colonies, it could be considered as one of the most complicated but well-organized natural structures.

The pheromone produced by the ants is trailed by the other ants in order to find the shortest path to a food source. This algorithm uses a similar procedure to arrive at the optimum point in the search space. The ants can only make forward or backward movements; hence, a stepwise decision process will be applied by the ants to find the best solution to a given problem [5658]. Figure 3(b) presents the working flowchart of ACO.

2.4. Differential Evolution (DE) Algorithm

Some of the objective functions in real engineering problems are discrete, nonlinear, or multidimensional while some may have local minima. Such cases require a population-based algorithm with stochastic features to obtain the solution. This quality is obtainable in the Differential Evolution (DE) algorithm introduced by Storn and Price in 1996 [59, 60]. To optimize a function with n real parameters, the vectors would appear as follows:where G is the generation number. A definition of the upper and lower boundaries for each parameter would give

The initial values for the parameters would be randomly selected with identical probability. The flowchart of the DE algorithm is presented in Figure 3(c).

2.5. Genetic Algorithm (GA)

GA was developed as an evolutionary search algorithm based on the Darwinian principle of natural selection for solving optimization problems [61]. The algorithm initiates with a random generation of the initial population before proceeding to the evaluation of the fitness of the individuals using the fitness function. The next stage is the selection stage using methods such as Roulette Wheel. The two operators mainly used by the algorithm to generate new offspring are crossover and mutation [62, 63]. Figure 3(d) presents the schematic presentation of the GA.

The optimized ANFIS parameters (antecedent and consequent parameters) through the PSO algorithm (superior model) are presented in Table 1.

2.6. Prediction Performance Metrics

In this research, the mean performance (MP) index was computed to evaluate the proposed hybrid data-intelligence models. The main merit of this index is the incorporation all of the performance metrics including root mean square error (RMSE), mean absolute error (MAE), Legate and McCabe’s index (LMI), correlation coefficient (CC), Willmott’s index (WI), and relative root mean square error (RRMSE) [41, 6467]. The MP metric can be expressed mathematically as follows [68]:where , , , , , and are the standardized values of the employed performance indices of model (hybridized ANFIS models) and can be computed as follows:

3. Data Description and Input Model Development

The proposed hybrid and standalone ANFIS models were constructed for the training and testing phases using field measurements of settlement of shallow foundation. The obtained data are corresponding to the foundation dimensions and soil properties with 188 observations [69]. Based on the geotechnical engineering prospect, the settlement is associated with several variabilities such as footing width (B), soil compressibility within the effective influencing depth, and the net footing pressure (qa). Other variables such as footing embedment ratio (Df/B), footing geometry length to width ration (L/B), and count of SPT blow () are the related variables considered to build the input attribute matrix of the settlement prediction. The details of employed datasets over training and testing phases are presented in Table 2. The dataset represents a wide range of variation. The nonprocessed dataset was divided into 133–55 observations for the training-testing phases, whereas the preprocessed dataset was divided into 129–50 observations for the training-testing phases. The optimal data division was attained based on trial and error procedure in accordance with the obtained predictability performance.

4. Application, Analysis, and Discussion

The main motivation of the current research is to investigate the viability of the different versions of hybrid ANFIS models for predicting shallow foundation settlement. The limitations of the empirical formulation to simulate the exact relationship between the soil settlement and those various foundation loading and soil properties emphasize the implementation of data-intelligence models where more robust and reliable predictive models can be explored to configure the internal mechanism between the dependent and independent variables. Indeed, proposing such a reliable data-intelligence predictive model can contribute to the possibility of reliable foundation design in addition to various geotechnical engineering perspectives. The developed hybrid ANFIS models are appraised in different comparisons in accordance with statistical metrics, diagnostic plots, and error distributions between the laboratory measures and computed settlement values over the training and testing phases.

Using numerical validation, Tables 3 and 4 report the performance metrics (RMSE, MSE, LMI, CC, WI, and SRMSE) for the classical ANFIS, ANFIS-PSO, ANFIS-ACO, ANFIS-DE, and ANFIS-GA and for the two investigated scenarios (i.e., PP and NP). Generally, the execution of the developed models over the training phase showed a superior predictability in comparison with the testing phase and for both scenarios. In both scenarios and over the training phase, the hybrid ANFIS-PSO model exhibited the best results of settlement prediction with minimum absolute error measures (RMSE = 6.1 mm and MAE = 3.5 mm) for the preprocessed dataset and (RMSE = 8.09 mm and MAE = 4.92 mm) for the nonprocessed dataset. This can be best explained due to the potential of the particle swarm optimizer in tuning the membership function parameters of the ANFIS predictive model. This is in synchronization with various other applied engineering problems [7072]. On the other aspect, the data preprocessing demonstrated an excellent procedure where to produce more meaningful information for the predictive model. In other words, data preprocessing provided in more detectable manner information for the learning process of the established intelligence predictive models. Over the testing phase, ANFIS-PSO model displayed the best prediction results of settlement with minimum absolute error measures (RMSE = 9.02 mm and MAE = 6.5 mm) for the preprocessed dataset and (RMSE = 14.1 mm and MAE = 9.88 mm) for the nonprocessed dataset.

Figures 4 and 5display the mean performance (MP) index of all computed prediction measures metrics in addition to the heat map for both inspected modeling scenarios PP and NP, respectively. Figures 4(a) and 5(a) present the models training phase while Figures 4(b) and 5(b) report the testing phase. In harmony with revealed numerical metrics of Tables 3 and 4, the hybrid ANFIS-PSO models showed the accurate prediction for the shallow foundation settlement with superior MP index value.

The performance of the applied hybrid models was examined using scatter plot graphs. The scatter plot is an excellent graphical evaluation method to depict the variance between the measured settlement and the computer model’s computation (see Figure 6). Based on the variance around the fit line and the magnitude of the correlation coefficient, the best predictive model can be determined. Figure 6 exhibits the best hybrid predictive model (i.e., ANFIS-PSO) for both investigated scenarios and over both training and testing phases. On the one hand and over the training phase (Figure 6(a)), the PP-ANFIS-PSO attained R20.934 while NP-ANFIS-PSO attained R20.904. On the other hand, testing phase demonstrated PP-ANFIS-PSO attained R20.865 while NP-ANFIS-PSO attained R20.741. This is normal learning process as usually testing phase performs slightly worse than the training phase. This is might be due to the absence of the valuable information not comprehended perfectly.

Taylor diagrams were computed for the applied ANFIS, ANFIS-PSO, ANFIS-ACO, ANFIS-DE, and ANFIS-GA predictive models for both modeling phases (training and testing) and for both studied scenarios (see Figures 7 and 8), respectively. Taylor diagram is a summary of various statistical metrics including correlation, standard deviation, and root mean square. In accordance with the coordination of the predictive models from the benchmark (observed settlement records), the models are evaluated. In this way, Taylor map delivers an excellent graphical representation of the models’ accuracy. Figures 8(a) and 8(b) (preprocessed dataset scenario) indicate the hybrid ANFIS-PSO coordinated in the nearest position to the observed settlement measurements with correlation value over 0.95 for training phase and 0.92 for the testing phase, whereas ANFIS-ACO and ANFIS-DE displayed the largest distance from the observed benchmark record which signifies the worst prediction efficiency.

Among all the investigated hybrid intelligence models, the capacity of the ANFIS-PSO was demonstrated the superior potential over the other models. This is clearly an indication to the performance of the Particle Swarm Optimization algorithm for tuning the internal parameters of the ANFIS model and particularly for simulating the investigated geotechnical problem “i.e., shallow foundation settlement.” It is worth to highlight that the ability of the PSO algorithm was approved for optimizing ANFIS model over multiple engineering applications such as channel sediment transport, basin bank shape optimization, compressive strength of intact roach, friction capacity ration of driven piles, oil flocculated asphaltene weight percentage, and several others [7377].

Based on the attained modeling results, it is even better to highlight some valid critical observations to be established in future research. Settlement analysis is influenced by a certain level of uncertainty related to the variables influencing settlement behavior. Most of the existing methodologies conducted for settlement prediction on cohesionless soils do not consider the uncertainty component in the simulation. Hence, incorporating Monte Carlo model with the proposed hybrid ANFIS-PSO model can provide practical design tools for geotechnical engineering field where the inspection of the uncertainties associated with the variables affecting the settlement prediction is investigated. This is a very useful probabilistic approach in the sense that it can overcome the problems of the deterministic techniques and provide some guidance to the geotechnical engineers about the level of risk (i.e., degree of uncertainty) associated with the predicted settlement. Exploring other nature-inspired algorithms for optimizing ANFIS model can be further studied for better prediction accuracy [7880].

5. Conclusion and Remarks

This research was emphasized on the implementation of newly developed hybrid intelligence model based on the integration of ANFIS model with various bioinspired optimization algorithms (e.g., PSO, ACO, DE, and GA). The main enthusiasm of the current investigation is to attain an accurate intelligent predictive model for shallow foundation settlement. The developed predictive models were inspected on large field measurements gathered from the open sources’ literature. Two different modeling scenarios based on data processing were performed. Among the four hybrid models, ANFIS-PSO demonstrated an accurate prediction result with highest correlation value R20.865 and minimum absolute error measures (RMSE = 9.02 mm and MAE = 6.5 mm) for the testing phase and within preprocessing data scenario. The proposition of the hybrid intelligent model evidenced an excellent alternative for the empirical formulation. In addition, the modeling scenarios proved the necessity to examine the potential of the raw data applicability for establishing predictive model. It is clearly verified that eliminating some irrelevant dataset has boosted the prediction accuracy remarkably.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors have no conflicts of interest to publish this research.