Introduction

Compared to conventional optimization algorithms, evolutionary algorithms (EAs) are better suited to handling many complex problems that arise in real-world applications [1, 2]. EAs have therefore been widely applied in practice, including drug design [3], control engineering [4], and wing configuration design [5]. However, EAs generally require thousands of fitness evaluations to reach a satisfactory candidate solution. In many engineering optimization tasks, a single numerical simulation can take minutes, hours, or even days to complete; in computational fluid dynamics (CFD), for example, evaluating a single candidate design generally requires several hours of simulation. Furthermore, the number of required fitness evaluations grows with the dimension of the optimization problem, so running hundreds or thousands of evaluations incurs a high computational cost. To solve such expensive optimization problems, surrogate model-based EAs, in which a surrogate model (also called a meta-model) is used in place of the expensive original function, are often employed.

Over the last few decades, a variety of surrogate-assisted evolutionary algorithms (SAEAs) have been proposed. Existing strategies for employing surrogate models in SAEAs can generally be divided into single-surrogate and multi-surrogate model-based strategies, depending on the number of surrogate models used. Although many generic machine learning methods have been used to build surrogate models, no specific rule has been established for determining which type of model is most suitable as a surrogate [6]. In general, single-surrogate model-based EAs employ the Gaussian process (GP) model, most likely because it predicts a candidate solution's fitness while also providing an estimate of the error of that prediction. Several infill sampling criteria that exploit the GP's prediction and error estimate, including the expected improvement criterion [7, 8] and the lower confidence bound criterion [9, 10], have been proposed as guides toward promising solutions. Most non-GP models, including polynomial regression surface models [11, 12], artificial neural networks [13, 14], radial basis functions (RBFs) [15,16,17], and many others [18], can only provide predictions without error estimates. Because of this limitation, multiple-model-based EAs, which produce multiple predictions from multiple models, are applied to avoid cases in which the algorithm runs into local optima.

For SAEAs, the core problem is how to use surrogate models to guide the optimization process reasonably. When an expensive problem has a high-dimensional decision space, it becomes more challenging for SAEAs to employ surrogate models effectively. First, the number of training samples required by the surrogate model grows exponentially as the problem dimension increases [19]. This implies that more evaluations of the expensive function are required, which is often infeasible in real applications. Given the scarcity of samples on high-dimensional expensive problems, it is difficult to construct a single surrogate model with high accuracy [20], and it is well known that inaccurate surrogates can mislead the optimization process. Second, building a surrogate model takes more time as the problem dimension increases. For example, with the Gaussian process (GP) model, global optimization of high-dimensional acquisition functions is intrinsically hard and can be prohibitively expensive [21]. Generally speaking, existing research on extending SAEAs to high-dimensional expensive problems can be roughly classified into three categories:

The first strategy is to deal with the lack of samples through data processing and data generation methods. DDEA-PES [22] used data perturbation to generate diverse datasets. SAEO [23] trained and activated the surrogate model only after enough data samples were collected. ESAO [24] randomly projected training samples into a set of low-dimensional sub-spaces rather than training in the original high-dimensional space.

The second strategy is to improve the performance of surrogate models. In [25], a GP model was combined with the partial least squares method to solve high-dimensional problems with up to 50 design variables. In our previous work, we proposed a multi-objective infill criterion [26] for GP model management. TR-SADEA [27] employs a self-adaptive GP model for antenna design. An RBF-assisted approach based on granulation was proposed in [28, 29]. In [30], a radial basis function network (RBFN) with a trust-region approach was used as a local model for solving 20-dimensional problems. Wang and Jin [31] employed three widely used models (i.e., PR, RBF, and GP) to construct a global ensemble model and a local ensemble model. Li et al. [32] employed two criteria to balance exploitation and convergence on medium-scale computationally expensive problems. MS-RV [33] transferred knowledge from the coarse surrogate to the fine surrogate in offline data-driven optimization.

The third strategy is to improve optimization efficiency through multiple swarms. Multiple swarms were used in SA-COSO [34] for solving high-dimensional problems ranging from 30 to 200 dimensions. Pan et al. [35] proposed an efficient surrogate-assisted hybrid optimization (SAHO) algorithm that combines two EAs (TLBO and DE) as the basic optimizer for 100-dimensional problems.

This paper proposes a variable surrogate model-based particle swarm optimization (VSMPSO) algorithm for high-dimensional expensive problems. To the best of our knowledge, VSMPSO is the first attempt to extend a single surrogate-assisted EA to 200-dimensional problems. The main contributions of this paper are as follows:

  • The proposed VSMPSO does not focus on improving the accuracy of surrogate models; rather, relying on the blessing of uncertainty [36], it employs only one RBF model as a single surrogate, in combination with the proposed variable surrogate model strategy, to explore different promising areas in different generations and thus avoid model misdirection throughout the optimization process.

  • The prediction ability of the surrogate model is not used solely to predict the current population. To mine the surrogate's prediction information more deeply, the most promising point of the surrogate model is identified and transferred into the optimizer population to accelerate the optimization.

  • The proposed VSMPSO framework can be applied in any surrogate-assisted evolutionary algorithm, irrespective of the surrogate model used.

The remainder of this paper is organized as follows: the next section gives a brief overview of the related techniques used in this paper. The main framework of the proposed algorithm is then presented in the subsequent section. The penultimate section compares several state-of-the-art algorithms on widely used benchmark problems with 30, 50, 100, and 200 dimensions. The final section concludes the paper.

Related techniques in VSMPSO

Particle swarm optimization (PSO)

The canonical PSO, developed by Eberhart and Kennedy in 1995, is a population- or swarm-based intelligent optimization algorithm inspired by the social behaviors of groups of organisms such as flocking birds or schooling fish [37]. Eqs. (1) and (2) describe the evolution of \(x_j\) (the position of the jth individual at generation \((t+1)\)) along the dth dimension in the canonical PSO:

$$\begin{aligned} x_{j}^{d}(t+1) = x_{j}^{d}(t)+\varDelta x_{j}^{d}(t+1) \end{aligned}$$
(1)
$$\begin{aligned} \varDelta x_{j}^{d}(t+1) = \omega \varDelta x_{j}^{d}(t) + c_{1} r_{1} \cdot \left( Pbest_{j}^{d}(t)-x_{j}^{d}(t)\right) + c_{2} r_{2} \cdot \left( Gbest^{d}(t)-x_{j}^{d}(t)\right) . \end{aligned}$$
(2)
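For reference, the update in Eqs. (1) and (2) can be sketched in NumPy as follows; the inertia and acceleration coefficients (`omega`, `c1`, `c2`) are typical default values, not values prescribed by this paper.

```python
import numpy as np

def pso_step(x, v, pbest, gbest, omega=0.7, c1=1.5, c2=1.5, rng=None):
    """One velocity/position update of the whole swarm (Eqs. (1)-(2)).

    x, v  : (N, D) arrays of positions and velocities
    pbest : (N, D) personal best positions
    gbest : (D,)   global best position of the swarm
    """
    rng = np.random.default_rng() if rng is None else rng
    r1 = rng.random(x.shape)  # element-wise random factors in [0, 1)
    r2 = rng.random(x.shape)
    v_new = omega * v + c1 * r1 * (pbest - x) + c2 * r2 * (gbest - x)  # Eq. (2)
    x_new = x + v_new                                                  # Eq. (1)
    return x_new, v_new
```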

The feature distinguishing the canonical PSO from other EAs, such as the genetic algorithm (GA) or differential evolution (DE), is that it converges rapidly but easily falls into local optima. To prevent premature convergence, a variety of modified PSOs have been proposed, including the comprehensive learning PSO [38], distance-based locally informed PSO [39], social learning PSO [40], and competitive swarm optimizer (CSO) [41]. Based on the effective performance of social learning particle swarm optimization (SLPSO), we propose a simplified SLPSO to generate candidate solutions; its primary structure is similar to the SLPSO algorithm proposed by Cheng and Jin. In this simplified SLPSO, individuals \(x_j\) are updated using the following formulas:

$$\begin{aligned} x_{j}^{d}(t+1) = \left\{ \begin{array}{ll} x_{j}^{d}(t)+\varDelta x_{j}^{d}(t+1) &{} \text{ if } p_{j}(t) \leqslant P_{j}^{L} \\ x_{j}^{d}(t) &{} \text{ otherwise } \end{array}\right. \end{aligned}$$
(3)
$$\begin{aligned} \varDelta x_{j}^{d}(t+1) = r_{1} \cdot \varDelta x_{j}^{d}(t) + r_{2} \cdot \left( x_{k}^{d}(t)-x_{j}^{d}(t)\right) + r_{3} \cdot \varepsilon \cdot \left( \overline{x}_{d}(t)-x_{j}^{d}(t)\right) , \end{aligned}$$
(4)

where \(1\leqslant j<N\), N is the population size, \(1\leqslant d\leqslant D\), and D is the dimension of the search space. In each generation, the population is sorted by fitness from worst to best, with \(x_1\) and \(x_N\) representing the worst and best solutions, respectively, at the current generation. \(x_k\) is a randomly chosen demonstrator for \(x_j\), with \(j<k\leqslant N\), and \(x_{k}^{d}\left( t \right) \) represents the dth element of \(x_k\); note that a demonstrator is chosen for each element of \(x_j\). \(P_{j}^{L}\) is the learning probability, which is inversely proportional to the fitness of \(x_j\); \(p_j\left( t \right) \) is a randomly generated probability for \(x_j\); \(r_1\), \(r_2\), and \(r_3\) are random numbers in the range \(\left[ 0,1 \right] \); and \(\varepsilon \) is the social influence factor that controls the influence of \(\bar{x}_d\left( t \right) \), the mean position of the population along the dth dimension at generation t in Eq. (4). If a uniform sampling method such as Latin hypercube sampling (LHS) is used for initialization, \(\bar{x}_d\left( t \right) \) can be quite close to the \(1\times D\) zero vector \(o=\left[ 0,\ldots ,0 \right] \). For a test function whose global optimum lies at the zero vector o, the mean-position term could then coincidentally pull the population toward the optimum; to avoid this coincidence, we set the parameter \(r_3\) to zero, removing the effect of \({{\bar{x}}_{d}}(t)\). This simplifies Eq. (4) to

$$\begin{aligned} \varDelta x_{j}^{d}\left( t+1 \right) = r_1\cdot \varDelta x_{j}^{d}\left( t \right) + r_2\cdot \left( x_{k}^{d}\left( t \right) -x_{j}^{d}\left( t \right) \right) . \end{aligned}$$
(5)

In this study, we generate new swarms using Eqs. (3) and (5); in the following sections, this SLPSO variant is referred to as 'PSO' for brevity.
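As an illustration, the simplified SLPSO update of Eqs. (3) and (5) can be sketched as below. The learning-probability formula `p_learn` used here is a simplified placeholder (the paper follows the SLPSO setting of [40]), so treat this as a sketch of the update rule, not the exact parameterization.

```python
import numpy as np

def slpso_step(x, dx, fitness, rng=None):
    """One update of the simplified SLPSO (Eqs. (3) and (5)), minimisation.

    x, dx   : (N, D) positions and position increments
    fitness : (N,) fitness values of the particles
    """
    rng = np.random.default_rng() if rng is None else rng
    N, D = x.shape
    order = np.argsort(-fitness)          # sort from worst to best
    x, dx = x[order], dx[order]
    x_new, dx_new = x.copy(), dx.copy()
    for j in range(N - 1):                # the best particle (index N-1) is not updated
        p_learn = 1.0 - j / N             # placeholder: worse particles learn more often
        if rng.random() <= p_learn:       # Eq. (3): update with probability P_j^L
            k = rng.integers(j + 1, N, size=D)  # one demonstrator per dimension
            r1, r2 = rng.random(D), rng.random(D)
            dx_new[j] = r1 * dx[j] + r2 * (x[k, np.arange(D)] - x[j])  # Eq. (5)
            x_new[j] = x[j] + dx_new[j]
    return x_new, dx_new
```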

RBF network

The functionality of an RBF network, a type of neural network, is described in detail in [42]; its output can be represented in the following form:

$$\begin{aligned} \hat{f}(\textrm{x})=\sum _{i=1}^{M} \omega _{i} \phi \left( \left\| \textrm{x}-x_{i}\right\| \right) , \end{aligned}$$
(6)

where \(x\in \mathbb {R}^D\) is an input vector, \(\phi \) is the basis function of the RBF network, \(\Vert \cdot \Vert \) is the 2-norm (also called the Euclidean norm), \(\omega _i\) are the weights, and M is both the number of input units in the RBF input layer and the number of samples used to build the RBF model. Because the basis function \(\phi \) is one of the key factors affecting model performance, many forms of \(\phi \) have been developed, including multi-quadric, thin plate spline, Gaussian, and cubic forms. A comparison of different choices of \(\phi \) in [43] revealed that the thin plate spline, linear, and cubic RBFs theoretically perform better than either the multi-quadric or Gaussian RBF. Additionally, numerical investigations have demonstrated that the cubic RBF can outperform the thin plate spline and multi-quadric RBFs [44]. Furthermore, cubic RBF-assisted EAs have been successfully used for local function approximation. Based on these previous studies, the proposed method employs the cubic basis function to construct the RBF network, a common machine learning technique for fitness approximation [2, 8]. The cubic basis function employed in this study is \(\phi \left( \Vert x-x_i \Vert \right) =\left( \Vert x-x_i \Vert \right) ^3\).
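A minimal sketch of cubic-RBF interpolation per Eq. (6) with \(\phi(r)=r^3\) is shown below. Practical implementations (e.g., the MATSuMoTo toolbox used later in this paper) typically add a polynomial tail and regularisation; those refinements are omitted here for clarity.

```python
import numpy as np

def rbf_fit(X, y):
    """Solve for weights w_i in f_hat(x) = sum_i w_i * ||x - x_i||^3 (Eq. (6))."""
    r = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)  # pairwise distances
    Phi = r ** 3                                                # cubic basis matrix
    return np.linalg.solve(Phi, y)                              # interpolation weights

def rbf_predict(Xq, X, w):
    """Evaluate the fitted cubic RBF surrogate at query points Xq."""
    r = np.linalg.norm(Xq[:, None, :] - X[None, :, :], axis=-1)
    return (r ** 3) @ w
```

By construction the surrogate interpolates the training data exactly, which is the behavior expected of an RBF model built from a small sample set.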

Proposed VSMPSO algorithm

VSMPSO framework

Figure 1 presents the overall framework of VSMPSO. The solid black arrow lines represent the control flow of the algorithm, and the green dotted arrow lines mark the data flow. At the beginning of VSMPSO, the initial individuals are generated, all of which are evaluated and stored in the database DB. In each generation, the variable model management strategy decides how to construct the surrogate model for fitness estimation and how to select promising solutions for fitness evaluation. In Part I of the variable model management strategy in Fig. 1, simple random sampling is used to select samples from the database DB for building an RBF model. The RBF model is then searched to find its most promising point in the global search space, followed by knowledge transfer from the RBF model to the current population. Then, in Part II of the variable model management strategy in Fig. 1, the infill criterion used in VSMPSO selects potential optimum points to be evaluated and placed into DB.

The main components of VSMPSO are detailed in Algorithm 1. The optimizer used in VSMPSO is a variant of SLPSO. Steps 2–4 of Algorithm 1 generate the initial population P(1) by Latin hypercube sampling (LHS) and then create the database DB from all initial individuals in P(1) together with their real fitness values. As shown in steps 6–11, the main optimization loop contains the two main components of the proposed algorithm, described in detail in Algorithms 2 and 3. In steps 8 and 10, the variable model management strategy governs model construction and the infill criterion, as described in the "Variable model management strategy" section. In step 9, knowledge from the RBF model is transferred to P(t), the population at generation t. Finally, the program outputs the most satisfactory solution with its real fitness value and ends. Overall, VSMPSO contains two nested loops: the outer loop uses the variant SLPSO as the optimizer (Algorithm 1), and the inner loop uses the canonical PSO as the optimizer (Algorithm 3). In Algorithm 3, the current RBF model serves as the objective function driving the canonical PSO iterations, and the global best solution found by the canonical PSO is then transferred to the variant SLPSO. The following subsections detail the two main components of the proposed algorithm, i.e., the variable model management strategy and the knowledge transfer from model to population.
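The LHS initialization in steps 2–4 can be sketched as follows; `expensive_f` in the usage comment stands in for the real expensive objective and is purely illustrative.

```python
import numpy as np

def lhs_init(n, lb, ub, rng=None):
    """Draw n points by Latin hypercube sampling in the box [lb, ub]."""
    rng = np.random.default_rng() if rng is None else rng
    lb, ub = np.asarray(lb, float), np.asarray(ub, float)
    samples = np.empty((n, lb.size))
    for d in range(lb.size):
        # one point per stratum: permute the n strata, then jitter within each
        strata = (rng.permutation(n) + rng.random(n)) / n
        samples[:, d] = lb[d] + strata * (ub[d] - lb[d])
    return samples

# Usage: P1 = lhs_init(N, lb, ub); DB = [(x, expensive_f(x)) for x in P1]
# i.e., every initial individual is evaluated once and archived with its fitness.
```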

Fig. 1

VSMPSO flow diagram

(Algorithm 1 pseudocode figure)

Variable model management strategy

The key issues influencing surrogate performance are model selection and model management. First, regarding model selection: according to previous work, constructing GP models on high-dimensional problems is time-consuming, whereas RBF has been shown to perform better with small samples than other common surrogate models [45]; we therefore choose RBF as the surrogate model. Second, regarding model management: as mentioned in the "Introduction" section, the scarcity of samples on high-dimensional expensive problems makes it difficult to construct a single surrogate model with high accuracy. Given the blessing of uncertainty and the multiple local optima of the original expensive problems, an accurate surrogate model is not always necessary for optimization. Thus, unlike previous work focusing on improving model accuracy, this work trains only one global surrogate model. Furthermore, because different samples may yield models oriented toward different promising areas, a global model management strategy inspired by simple random sampling is proposed to enhance the diversity of the single model between generations and thus avoid persistently misleading the search in a wrong direction throughout the optimization process.

(Algorithm 2 pseudocode figure)

As seen in the pseudo-code of Algorithm 1, the RBF model is updated in every generation. In step 2 of Algorithm 2, \(\lambda =\left[ M\times 80\% \right] \) samples are selected using simple random sampling; the number of selected samples \(\lambda \) accounts for 80% of the total sample size, a common empirical value used in K-fold cross-validation [46] with \(K=5\). The effectiveness of this strategy and sample size is further verified in the "Effects of variable model management strategy" and "Parameter sensitivity analysis" sections. After the fitness of P(t) is estimated by the current RBF model in step 4, the population P(t) is sorted by estimated fitness from worst to best in step 5, with \(x_1\) and \(x_N\) representing the worst solution PGworst and the best solution PGbest, respectively. As shown in steps 6–7, two potential optimum points are selected for evaluation by the original expensive function: PGbest, the global optimum individual of the current population, and MGbest, the most promising point of the current surrogate model, obtained from Algorithm 3.
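A hedged sketch of this model management step follows: a random 80% of the archive trains the model, and the population is ranked by predicted fitness to identify PGworst and PGbest. `fit_rbf` is a placeholder for any RBF trainer returning a callable predictor, not an API from the paper.

```python
import numpy as np

def model_management_step(db_X, db_y, pop, fit_rbf, frac=0.8, rng=None):
    """One pass of Algorithm 2 (sketch): rebuild surrogate, rank population."""
    rng = np.random.default_rng() if rng is None else rng
    M = len(db_y)
    lam = int(round(M * frac))                    # lambda = [M x 80%]
    idx = rng.choice(M, size=lam, replace=False)  # simple random sampling
    model = fit_rbf(db_X[idx], db_y[idx])         # surrogate rebuilt each generation
    pred = np.array([model(p) for p in pop])      # estimated fitness of P(t)
    order = np.argsort(-pred)                     # sort worst -> best (minimisation)
    pop = pop[order]
    pg_worst, pg_best = pop[0], pop[-1]           # PGworst and PGbest
    return pop, pg_worst, pg_best
```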

Knowledge transfer strategy

As described in Algorithm 2, in each iteration of the optimization loop, simple random sampling selects different samples to construct the RBF model, and the resulting RBF models may point toward different promising areas in different generations. Thus, in step 3 of Algorithm 3, to mine the RBF model further, the RBF model is treated as an objective function, defined as follows:

$$\begin{aligned} \min \quad&F_{\textrm{RBF}}\left( \textbf{x} \right) \\ \text {s.t. } \quad&{{\textbf{x}}_{l}}\le \textbf{x}\le {{\textbf{x}}_{u}}, \end{aligned}$$
(7)

where \(\textbf{x}\in {{\mathbb {R}}^{D}}\) is the decision vector, whose feasible region is the same search space as that of the original expensive problem, \(F_{\textrm{RBF}}\left( \textbf{x} \right) \) is the objective function, and \({{\textbf{x}}_{l}}\) and \({{\textbf{x}}_{u}}\) are the lower and upper bounds of the decision variables. In steps 4–9, a canonical PSO is employed to find the global optimum of the current RBF model. In step 6, the canonical PSO updates individuals using Eqs. (1) and (2). In step 8, Pbest (the personal best of each particle) and Gbest (the global best of the swarm) are updated in each generation. After the iterations are completed, in step 11, Gbest, the final best solution of the RBF model found by the canonical PSO, is assigned to MGbest, which is considered the most promising point of the RBF model. In the following steps, by substituting MGbest for PGworst in \(P\left( t \right) \), the knowledge transferred from the RBF model to the population is expected to enhance the search performance of VSMPSO by reducing the likelihood of getting stuck in a local optimum. The effectiveness of the knowledge transfer strategy is further verified in the "Effects of variable model management strategy" section by comparing VSMPSO with SMPSO.
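The inner loop of Algorithm 3 can be sketched as below: a canonical PSO minimizes the current surrogate over the box in Eq. (7) and returns Gbest, which VSMPSO assigns to MGbest. Here `f_hat` is an arbitrary callable standing in for the RBF model, and the swarm size, iteration count, and PSO coefficients are assumed defaults, not the paper's settings.

```python
import numpy as np

def surrogate_minimum(f_hat, lb, ub, n=20, iters=50,
                      omega=0.7, c1=1.5, c2=1.5, rng=None):
    """Canonical PSO search of the surrogate f_hat; returns Gbest (= MGbest)."""
    rng = np.random.default_rng() if rng is None else rng
    lb, ub = np.asarray(lb, float), np.asarray(ub, float)
    x = rng.uniform(lb, ub, size=(n, lb.size))
    v = np.zeros_like(x)
    pbest = x.copy()
    pbest_f = np.apply_along_axis(f_hat, 1, x)
    gbest = pbest[np.argmin(pbest_f)].copy()
    for _ in range(iters):
        r1, r2 = rng.random(x.shape), rng.random(x.shape)
        v = omega * v + c1 * r1 * (pbest - x) + c2 * r2 * (gbest - x)  # Eq. (2)
        x = np.clip(x + v, lb, ub)                                     # Eq. (1), in bounds
        f = np.apply_along_axis(f_hat, 1, x)
        better = f < pbest_f                     # update personal bests (step 8)
        pbest[better], pbest_f[better] = x[better], f[better]
        gbest = pbest[np.argmin(pbest_f)].copy() # update global best (step 8)
    return gbest
```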

Table 1 Description of benchmark functions
Fig. 2

Convergence profiles of MGP-SLPSO, CAL-SAPSO, SAHO, SACOSO, VSMPSO(GP) and VSMPSO(RBF) on 30D F1–F7 with 330 FEs

Table 2 Statistical results of MGP-SLPSO, CAL-SAPSO, SAHO, SACOSO and VSMPSO on 30D F1–F7 with 330 FEs
Fig. 3

Distribution of solutions of MGP-SLPSO, CAL-SAPSO, SAHO, SACOSO and VSMPSO on 30D F1–F7 with 330 FEs

Experimental studies

A series of empirical studies on eleven commonly used benchmark functions (see Table 1 for details) is designed to verify the effectiveness of the proposed VSMPSO. These eleven functions have different characteristics, including unimodal, multimodal, and very complex multimodal landscapes; most are very difficult to optimize. Functions F1–F7 are commonly used by other SAEAs [26, 31]. Functions F8–F11 are adopted from the CEC 2017 test suite [48], which was recently proposed and is comparatively complex. These functions are tested in dimensions from 10 to 200; the CEC 2017 functions are tested only with dimension 100 because that is the highest dimension available in this suite. Statistical results of the compared algorithms are given on 30-D, 50-D, 100-D, and 200-D problems, with the best results highlighted in bold. The proposed VSMPSO is compared with several popular and recently proposed SAEAs, SAHO [35], SACOSO [34], CAL-SAPSO [31], and MGP-SLPSO [26], under different dimensions. Moreover, we treat the best fitness value and the running time as a bi-objective problem to assess the performance of VSMPSO.

Table 3 Statistical results (best fitness and cost time) obtained by MGP-SLPSO, CAL-SAPSO, SAHO, SACOSO, VSMPSO(GP) and VSMPSO(RBF) on 30D F1–F7 with 330 FEs
Table 4 Statistical results (best fitness and cost time) obtained by MGP-SLPSO, SAHO, SACOSO, VSMPSO(GP) and VSMPSO(RBF) on 50D F1–F7 with 550 FEs

Experimental setup in details

All experiments are implemented on a high-performance server with an Intel Xeon E5-2620 v4 processor and 64 GB of RAM. Each algorithm was run 30 times in MATLAB 2020b to eliminate the effect of statistical variation. We applied Friedman's test, using MATLAB's Statistics Toolbox, to determine whether there were significant differences in the best fitness values obtained by the algorithms. The p values were obtained from Friedman's test on pairwise comparisons between VSMPSO and each comparison algorithm. The conventional threshold for statistical significance is 0.05 [49]: \(p\geqslant 0.05\) indicates no significant difference, whereas \(p<0.05\) indicates a significant difference. "\({+}\)" indicates that the labeled algorithm significantly outperformed the others in mean best value. The RBF model used in VSMPSO and the other algorithms was implemented with the MATSuMoTo toolbox [50]. The parameters of the SLPSO optimizer were set as recommended in [40]. For all compared algorithms, the termination condition was the number of consumed function evaluations (FEs); the computational budget was \(11 \cdot D\) function evaluations [31, 32], i.e., the number of fitness evaluations was limited to 11 times the problem dimension.
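The significance-testing protocol can be illustrated with SciPy's Friedman test on per-run best fitness values; the data below are made up for illustration only (the paper's experiments use MATLAB's Statistics Toolbox).

```python
from scipy.stats import friedmanchisquare

# best fitness from 5 runs of three hypothetical algorithms on one problem
alg_a = [0.12, 0.10, 0.15, 0.11, 0.13]
alg_b = [0.45, 0.50, 0.42, 0.47, 0.49]
alg_c = [0.30, 0.28, 0.33, 0.29, 0.31]

# Friedman's test ranks the algorithms within each run, then tests whether
# the mean ranks differ across algorithms.
stat, p = friedmanchisquare(alg_a, alg_b, alg_c)
significant = p < 0.05  # the paper's significance threshold
```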

We describe algorithm performance through two chart types showing convergence profiles and the distribution of solutions, as in Figs. 2 and 3. The original authors provide the source code of SA-COSO and SAHO, and the source codes of MGP-SLPSO and CAL-SAPSO are available online. In [31], CAL-SAPSO is examined only on problems of up to 30 dimensions. We therefore compare statistical results with MGP-SLPSO, CAL-SAPSO, SAHO, and SACOSO on 30-D problems, and with MGP-SLPSO, SAHO, and SACOSO on 50- and 100-D problems. Considering time consumption, we compare statistical results with MGP-SLPSO and SACOSO on 200-D problems.

Numerical results on 30- and 50-D F1–F7 functions

The proposed VSMPSO framework can be applied in any surrogate-assisted evolutionary algorithm irrespective of the surrogate model used. In this paper, we used two widely adopted models, GP and RBF, to predict fitness values; the resulting variants are referred to as VSMPSO(GP) and VSMPSO(RBF). Both are based on the VSMPSO framework of Algorithms 1, 2, and 3, differing only in the surrogate model used.

From Table 2, MGP-SLPSO was the best on F1, VSMPSO(GP) was the best on F4 and F7, CAL-SAPSO was the best on F5, and SAHO was the best on F2, F3, and F6. SAHO thus achieved better optimization results than the other SAEAs; however, as Table 3 shows, SAHO and CAL-SAPSO took hundreds of times longer than the other SAEAs. We therefore treat the optimization result and the running time as a bi-objective problem. In the first row of Table 3, "Best fitness" denotes the average of the best fitness values obtained over 30 independent runs of each algorithm, and "Time" denotes the corresponding mean running time. Treating "Best fitness" and "Time" as two objectives, each result is either a non-dominated solution (NDS) or a dominated solution (DS). We sorted the results of the compared algorithms and marked all NDSs on the Pareto front with a black hollow pentagram in Fig. 3. From Table 3, VSMPSO(RBF) achieved an NDS on F1, F4, F5, F6, and F7, similar to MGP-SLPSO. From Fig. 3, on F1 and F3, the DS achieved by VSMPSO(RBF) is the closest to an NDS. In Fig. 2, the convergence rate of VSMPSO(RBF) is second only to SAHO on F3, and VSMPSO(RBF) has the fastest convergence curve on F7. MGP-SLPSO had the shortest running time because the Matérn32 function was used as its kernel. In our previous work [51], when the commonly used squared exponential (SE) kernel was employed instead of Matérn32, the running time of GP model-assisted SLPSO increased more than tenfold. The SE kernel is the most commonly adopted covariance function in GP-assisted optimization algorithms, for example, in [9, 15, 52, 53].
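Identifying the NDSs used in these comparisons amounts to Pareto filtering of (best fitness, time) pairs, both minimized. A minimal sketch, with made-up example values:

```python
def nondominated(points):
    """Return indices of non-dominated points when minimising both objectives
    (here: mean best fitness and mean running time)."""
    nds = []
    for i, (f_i, t_i) in enumerate(points):
        dominated = any(
            f_j <= f_i and t_j <= t_i and (f_j < f_i or t_j < t_i)
            for j, (f_j, t_j) in enumerate(points) if j != i
        )
        if not dominated:
            nds.append(i)
    return nds

# illustrative (fitness, time) pairs for four hypothetical algorithms
results = [(1.0, 10.0), (2.0, 5.0), (3.0, 20.0), (0.5, 50.0)]
front = nondominated(results)  # the third point is dominated by the first
```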

We compared VSMPSO with MGP-SLPSO, SAHO, and SACOSO on 50D functions. In Table 5, MGP-SLPSO achieved the best mean value on F1 and F4, and SAHO achieved the best mean value on F2, F3, F5, F6, and F7. VSMPSO(RBF) achieved performance similar to SAHO on F7. From Table 4 and Fig. 5, VSMPSO(RBF) obtained NDSs on F3, F5, F6, and F7, one fewer than MGP-SLPSO and SAHO. From Fig. 4, the convergence curve of VSMPSO(RBF) is very close to that of SAHO on F6 and F7. From Fig. 5, the DS achieved by VSMPSO(RBF) is the closest to an NDS on F1, F2, and F4. In Table 4, the running time of VSMPSO(GP) is dozens of times that of VSMPSO(RBF) on F1, F4, and F6, and more than 100 times higher on F2, F3, F5, and F7. This low computational efficiency makes VSMPSO(GP) impractical on higher dimensions. Furthermore, from Table 4 and Fig. 5, VSMPSO(GP) obtained no NDS on the 50D functions, whereas VSMPSO(RBF) obtained four. The overall performance of VSMPSO(RBF) is thus better than that of VSMPSO(GP); hence, in the subsequent comparisons we use the VSMPSO(RBF) algorithm, abbreviated as VSMPSO.

Numerical results and analysis on 50D F1–F7 functions with 2000 FEs

To further observe the optimization ability of VSMPSO, we extended the computational budget to 2000 FEs. As analysed above, VSMPSO(GP) was time-consuming and its optimization performance was inferior to that of VSMPSO(RBF); therefore, in the following comparisons we use only VSMPSO(RBF), referred to as VSMPSO. Although VSMPSO did not achieve strong optimization performance on the 50D problems with 550 FEs, it performs better with 2000 FEs. In Table 6, MGP-SLPSO achieved the best mean value on F1; SAHO achieved the best mean value on F2, F3, F4, and F7. VSMPSO achieved the best mean value on F5 and F6, and performed similarly to SAHO, with no significant difference, on F7. Furthermore, VSMPSO achieved the second-best mean value behind SAHO on F2 and F3, with a shorter running time than SAHO. The MGP-SLPSO algorithm had the shortest running time. In Table 7, MGP-SLPSO achieved 7 NDSs owing to its short running time; VSMPSO achieved 5 NDSs, the same as SACOSO, and SAHO achieved 4. VSMPSO did not achieve an NDS on F1 or F2, but its DS is the closest to the NDS in Fig. 7. The solutions achieved by VSMPSO, MGP-SLPSO, and SAHO are very close to the abscissa (i.e., the minimum fitness value) in Fig. 7. In Fig. 6, the performance of VSMPSO is stable across functions, and its convergence curves achieve good results on F5, F6, and F7.

Fig. 4

Convergence profiles of MGP-SLPSO, CAL-SAPSO, SAHO, SACOSO, VSMPSO(GP) and VSMPSO(RBF) on 50D F1–F7 with 550 FEs

Fig. 5

Distribution of solutions of MGP-SLPSO, CAL-SAPSO, SAHO, SACOSO and VSMPSO on 50D F1–F7 with 550 FEs

Fig. 6

Convergence profiles of MGP-SLPSO, SAHO, SACOSO and VSMPSO on 50D F1–F7 with 2000 FEs

Fig. 7

Distribution of solutions of MGP-SLPSO, SAHO, SACOSO and VSMPSO on 50D F1–F7 with 2000 FEs

Table 5 Statistical results of MGP-SLPSO, SAHO, SACOSO, VSMPSO(GP) and VSMPSO(RBF) on 50D F1–F7 with 550 FEs
Table 6 Statistical results of MGP-SLPSO, SAHO, SACOSO and VSMPSO on 50D F1–F7 with 2000 FEs
Table 7 Statistical results (best fitness and cost time) obtained by MGP-SLPSO, SAHO, SACOSO and VSMPSO on 50D F1–F7 With 2000 FEs

Numerical results on higher dimensional problems

Next, we compared algorithm performance on higher-dimensional problems. The convergence profiles of VSMPSO, SACOSO, MGP-SLPSO, and SAHO on F1–F7 with 100D and 200D are shown in Figs. 8 and 10. VSMPSO is compared with MGP-SLPSO, SAHO, and SACOSO on 100D. In Table 8, MGP-SLPSO obtains the best mean value on F1 and F4; SAHO obtains the best mean value on F2, F3, F5, and F7; and VSMPSO performs best only on F6. However, on F7, the results of VSMPSO are not significantly different from those of SAHO, and its standard deviation is smaller. In Table 9, VSMPSO obtains 5 NDSs, the highest count; SAHO achieves 4 NDSs, MGP-SLPSO achieves 3, and SACOSO achieves 1. On F1 and F4, VSMPSO did not achieve an NDS, but its DS is the closest to the NDS achieved by MGP-SLPSO in Fig. 9.

Because SAHO takes more than a week to obtain optimization results for a single function on 200D problems, comparing VSMPSO with SAHO on 200D is too expensive; VSMPSO is therefore compared with MGP-SLPSO and SACOSO on 200D problems. In Table 10, VSMPSO obtains the best mean value on F2, F3, and F5, and performs similarly to SACOSO on F7. SACOSO obtains the best mean values on F6 and F7. MGP-SLPSO obtains the best mean value on F1 and F4, with VSMPSO second only to MGP-SLPSO on both. There is no order-of-magnitude difference in running time between VSMPSO and MGP-SLPSO, and on F5 VSMPSO even takes less time than MGP-SLPSO. Thus, VSMPSO achieves a better balance between optimization performance and time consumption and obtains better optimization results on higher-dimensional problems (Table 11 and Fig. 11).

Table 8 Statistical results of MGP-SLPSO, SAHO, SACOSO and VSMPSO on 100D F1–F7 with 1000 FEs
Fig. 8

Convergence profiles of MGP-SLPSO, SAHO, SACOSO and VSMPSO on 100D F1–F7 with 1000 FEs

Table 9 Statistical results (best fitness and cost time) obtained by MGP-SLPSO, SAHO, SACOSO and VSMPSO on 100D F1–F7 with 1000 FEs
Fig. 9

Distribution of solutions of MGP-SLPSO, SAHO, SACOSO and VSMPSO on 100D F1–F7 with 1000 FEs

Table 10 Statistical results of MGP-SLPSO, SACOSO and VSMPSO on 200D F1–F7 with 2000 FEs
Fig. 10

Convergence profiles of MGP-SLPSO, SACOSO and VSMPSO on 200-D F1–F7 with 2000 FEs

Table 11 Statistical results (best fitness and cost time) obtained by MGP-SLPSO, SACOSO and VSMPSO on 200-D F1–F7 with 2000 FEs
Fig. 11

Distribution of solutions of MGP-SLPSO, SACOSO and VSMPSO on 200-D F1–F7 with 2000 FEs

Effects of variable model management strategy

To verify the effectiveness of the sample selection strategy in VSMPSO, four different sample selection strategies are compared in empirical studies. To this end, we design a framework of surrogate model-based PSO (SMPSO), in which the RBF model is used as the single surrogate model and SLPSO as the optimizer. SMPSO is described in detail in Algorithm 3. The difference between SMPSO and VSMPSO is the model management strategy: SMPSO only searches the current population for the most promising solution to evaluate, whereas VSMPSO uses the proposed variable model management strategy to search for the most promising solutions from both the current population and the current RBF model. SMPSO-RS, SMPSO-AS, SMPSO-FS1 and SMPSO-FS2 are all based on the SMPSO framework; they differ only in the sample selection strategy in step 7. The different sample selection strategies are as follows:

  1. SMPSO-RS: The sample selection strategy is the same as in VSMPSO; simple random sampling (RS) is used to select \(\lambda \) random samples from DB.

  2. SMPSO-AS: All samples (AS) in DB are chosen for training the model, the same as in [32].

  3. SMPSO-FS1: Fixed selection (FS) of the newest \(\lambda \) samples in DB, the same as in [9].

  4. SMPSO-FS2: Fixed selection of the top \(\lambda \) samples in DB, the same as CAL-SAPSO uses for training its local model.
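The four strategies are easy to express as archive-selection routines. The following Python sketch is illustrative only: the function names and the NumPy layout of DB (rows appended in evaluation order, with fitness vector `y`) are our assumptions, not the authors' implementation.

```python
import numpy as np

def select_rs(X, y, lam, rng):
    """SMPSO-RS: simple random sampling of lam points from the archive DB."""
    idx = rng.choice(len(X), size=lam, replace=False)
    return X[idx], y[idx]

def select_as(X, y, lam=None):
    """SMPSO-AS: all samples in DB are used for training."""
    return X, y

def select_fs1(X, y, lam):
    """SMPSO-FS1: fixed selection of the lam newest samples (DB is append-only)."""
    return X[-lam:], y[-lam:]

def select_fs2(X, y, lam):
    """SMPSO-FS2: fixed selection of the lam best (smallest-fitness) samples."""
    idx = np.argsort(y)[:lam]
    return X[idx], y[idx]
```

In every case the selected pairs are then used to fit the RBF surrogate for the current generation; only the choice of training subset differs.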


SMPSO-FS1 and SMPSO-FS2 use fixed sampling (FS) as the sample selection strategy, with a fixed sample number of \(\lambda \), the same as the sample number in VSMPSO and SMPSO-RS. As noted in “Proposed VSMPSO algorithm” section, the crucial element affecting VSMPSO is the variable model management strategy, which has two steps. In Step 1, a simple random sampling (RS) strategy is used to select \(\lambda \) random samples from DB to construct the RBF model in each generation. In Step 2, the current RBF model information is deeply mined by finding the minimum of the surrogate model. Two potential optimum points (PGbest and MGbest) are considered in the variable model management strategy. PGworst, the individual with the worst predicted fitness value in the current population, is then replaced by MGbest. The following experimental results further demonstrate that this is an effective method to improve model diversity and avoid local optima without using multiple models.
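The two steps above can be sketched as follows. This is a minimal illustration, assuming a SciPy RBF surrogate and L-BFGS-B for the model-minimum search; the function name and archive layout are hypothetical, and the paper's actual optimizer (SLPSO) is not reproduced here.

```python
import numpy as np
from scipy.interpolate import RBFInterpolator
from scipy.optimize import minimize

def variable_model_management(db_x, db_y, population, lam, bounds, rng):
    # Step 1: simple random sampling of lam points from DB to build the RBF model.
    idx = rng.choice(len(db_x), size=lam, replace=False)
    model = RBFInterpolator(db_x[idx], db_y[idx])

    # Step 2: mine the current model.
    pred = model(population)
    pgbest = population[np.argmin(pred)]   # PGbest: best predicted individual
    worst = np.argmax(pred)                # PGworst: worst predicted individual

    # Search the surrogate itself for its minimum, starting from PGbest.
    res = minimize(lambda x: float(model(x[None, :])[0]), pgbest, bounds=bounds)
    mgbest = res.x                         # MGbest: minimum of the current model

    new_pop = population.copy()
    new_pop[worst] = mgbest                # replace PGworst with MGbest
    return pgbest, mgbest, new_pop
```

Both PGbest and MGbest are candidates for exact evaluation; re-drawing the training subset each generation is what makes the model (and hence MGbest) vary between generations.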

Fig. 12

Convergence profiles of SMPSO-RS, SMPSO-AS, SMPSO-FS1, SMPSO-FS2, VSMPSO on 30-D F1–F7 with 330 FEs

Fig. 13

Distribution of solutions of SMPSO-RS, SMPSO-AS, SMPSO-FS1, SMPSO-FS2 and VSMPSO on 30-D F1–F7 with 330 FEs

Table 12 Statistical results of SMPSO-RS, SMPSO-AS, SMPSO-FS1, SMPSO-FS2 and VSMPSO on 30-D F1–F7 with 330 FEs
Table 13 Statistical results of SMPSO-RS, SMPSO-AS, SMPSO-FS1, SMPSO-FS2 and VSMPSO on 50-D F1–F7 with 550 FEs
Table 14 Statistical results (best fitness and cost time) obtained by SMPSO-RS, SMPSO-AS, SMPSO-FS1, SMPSO-FS2 and VSMPSO on 30-, 50-, 100-D F1–F7 with \(11 \cdot D\) FEs
Fig. 14

Convergence profiles of SMPSO-RS, SMPSO-AS, SMPSO-FS1, SMPSO-FS2 and VSMPSO on 50-D F1–F7 with 550 FEs

Fig. 15

Distribution of solutions of SMPSO-RS, SMPSO-AS, SMPSO-FS1, SMPSO-FS2 and VSMPSO on 50-D F1–F7 with 550 FEs

Table 15 Statistical results of SMPSO-RS, SMPSO-AS, SMPSO-FS1, SMPSO-FS2 and VSMPSO on 100-D F1–F7 with 1100 FEs
Fig. 16

Convergence profiles of SMPSO-RS, SMPSO-AS, SMPSO-FS1, SMPSO-FS2 and VSMPSO on 100-D F1–F7 with 1100 FEs

Fig. 17

Distribution of solutions of SMPSO-RS, SMPSO-AS, SMPSO-FS1, SMPSO-FS2 and VSMPSO on 100-D F1–F7 with 1100 FEs

The contribution of the simple random sampling strategy in VSMPSO can be inferred by comparing the results of SMPSO-RS with those of SMPSO-AS, SMPSO-FS1, and SMPSO-FS2. From Table 12, on 30D functions, SMPSO-RS achieved better optimization results on F3 and F7 than SMPSO-AS, SMPSO-FS1, and SMPSO-FS2, and its results on F2, F4, F5, and F6 were second only to SMPSO-AS. From Fig. 12, the trends of the convergence curves are consistent, except that the F1 and F3 convergence curves are slightly dispersed. From Table 13, on 50D functions, SMPSO-RS achieved better optimization results on F2, F4, F6, and F7 than SMPSO-AS, SMPSO-FS1, and SMPSO-FS2, and its results on F1, F3, and F5 were slightly worse than those of SMPSO-AS but better than those of SMPSO-FS1 and SMPSO-FS2. In addition, from Fig. 14, the convergence curves of SMPSO-RS are very similar to those of SMPSO-AS on most functions.

From Table 15, SMPSO-RS achieved better optimization results on F3, F6, and F7 than SMPSO-AS, SMPSO-FS1, and SMPSO-FS2, and its results on F1, F2, F4, and F5 were slightly worse than those of SMPSO-AS but better than those of SMPSO-FS1 and SMPSO-FS2. Moreover, from Figs. 13, 15 and Table 14, SMPSO-RS achieved non-dominated solutions on both 30D and 50D problems. From Fig. 17, the dominated solution obtained by SMPSO-RS is closest to the non-dominated solution in abscissa value (that is, the minimum fitness value) on F1, F2, F4, and F5. By contrast, SMPSO-FS1 and SMPSO-FS2 have similar results in Tables 12, 13 and 15, and show similar convergence curves in Figs. 12, 14 and 16. Moreover, both SMPSO-FS1 and SMPSO-FS2 perform worse than SMPSO-RS and SMPSO-AS; hence, fixed sampling is inferior to both the all-samples method and the simple random sampling method. The performance of simple random sampling (RS) is significantly better than that of fixed sampling (FS), and SMPSO-RS performs better on complex problems. However, SMPSO-AS takes more time than SMPSO-RS on 30D and 50D functions: using all samples for modeling makes SMPSO-AS time-consuming.

By comparing VSMPSO with SMPSO-RS, SMPSO-AS, SMPSO-FS1 and SMPSO-FS2, we can see the contribution of Step 2. In Step 2, VSMPSO takes advantage of the variable model management strategy to search for the most promising solutions for evaluation from both the current population and the current RBF model. This differs from the strategy of all SMPSO variants, which search only the current population for the most promising solution to evaluate. From Tables 12, 13 and 15, the results of VSMPSO are significantly different from those of the other algorithms. On 30D functions, VSMPSO obtained the best results on F1, F2, F3 and F7, while obtaining slightly worse results than SMPSO-RS on F4, F5 and F6. On 50D functions, VSMPSO achieved better optimization results on F1, F2, F3 and F7, and was slightly worse than SMPSO-RS on F4 and F5. Furthermore, on 100D functions, VSMPSO achieved the best optimization results on F1, F2, F3, F4, F5 and F7, while obtaining slightly worse results than SMPSO-AS on F6. VSMPSO shows clearer advantages on 100D than on 50D and 30D; the improvement becomes more remarkable as the dimension D of the decision space increases.

From the above results, we conclude that the two main innovations of the proposed VSMPSO described in “Proposed VSMPSO algorithm” jointly improve the performance of the proposed algorithm. First, the comparison of SMPSO-RS with the three other algorithms (SMPSO-AS, SMPSO-FS1, and SMPSO-FS2) shows that the random sample selection method, which is the same as the one used in VSMPSO, eliminates the weaknesses of fixed sampling. Second, the comparison of SMPSO-RS and VSMPSO shows that searching for the most promising solution from two different angles, especially from the current RBF model, speeds up convergence.

Parameter sensitivity analysis

The parameters in VSMPSO, such as the parameters of the optimizer SLPSO and the number of consumed function evaluations (FEs), may significantly influence the proposed algorithm’s performance. For a fair comparison, the parameters of the optimizer SLPSO are set to the values recommended in [40]. For all the algorithms compared in this paper, the termination condition depends on the number of consumed FEs. The computational budget is \(11\cdot D\) function evaluations [32]; that is, the number of fitness evaluations is limited to 11 times the dimension of the problem. In VSMPSO, the value of \(\lambda \) is 80% of the total sample number. We therefore only need to analyze the parameter \(\lambda \) of VSMPSO to explore the influence of different sample sizes on optimization performance. In the next experiment, we compared the performance of VSMPSO for different sample sizes. The comparison algorithms VSMPSO-50, VSMPSO-60, VSMPSO-70, VSMPSO-80, VSMPSO-90, and VSMPSO-100 represent VSMPSO with the number of selected samples \(\lambda \) accounting for 50%, 60%, 70%, 80%, 90% and 100% of the total sample size, respectively; VSMPSO-100 uses all samples for modeling. Furthermore, all parameters in VSMPSO-80 are the same as those in VSMPSO, so the results of VSMPSO-80 are identical to those reported for VSMPSO in the experiments above.
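As a minimal worked example of these settings (the helper names are ours, not from the paper), the budget rule and the training-sample count \(\lambda \) used in these ablation experiments can be written as:

```python
def evaluation_budget(dim):
    """Budget rule stated above: 11 * D exact fitness evaluations."""
    return 11 * dim

def lambda_samples(db_size, fraction=0.8):
    """Number of training samples: a fraction (80% in VSMPSO-80) of the archive size."""
    return int(round(fraction * db_size))

# e.g. D = 30 -> 330 FEs, D = 50 -> 550 FEs, D = 100 -> 1100 FEs
```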

Table 16 Statistical results of VSMPSO with different samples on 30-D F1–F7 with 330 FEs
Table 17 Statistical results of VSMPSO with different samples on 50-D F1–F7 with 550 FEs
Table 18 Statistical results of VSMPSO with different samples on 100-D F1–F7 with 1100 FEs
Fig. 18

Convergence profiles of VSMPSO with different samples on 30-D F1–F7 with 330 FEs

Fig. 19

Distribution of solutions for VSMPSO with different samples on 30-D F1–F7 with 330 FEs

Fig. 20

Convergence profiles of VSMPSO with different samples on 50-D F1–F7 with 550 FEs

Fig. 21

Distribution of solutions for VSMPSO with different samples on 50-D F1–F7 with 550 FEs

Fig. 22

Convergence profiles of VSMPSO with different samples on 100-D F1–F7 with 1100 FEs

Fig. 23

Distribution of solutions of VSMPSO with different samples on 100-D F1–F7 with 1100 FEs

Table 19 Obtained solutions of VSMPSO with different samples on 30-, 50-, 100-D F1–F7 with \(11 \cdot D\) FEs
Fig. 24

Convergence profiles of SAHO and VSMPSO on 100-D F8–F11 with 1100 FEs

Fig. 25

Distribution of solutions of SAHO and VSMPSO on 100-D F8–F11 with 1100 FEs

Table 20 Obtained solutions for VSMPSO and SAHO on 100-D F8–F11 with 1100 FEs

From Figs. 18, 19, 20, 21, 22, 23 and Tables 16, 17, 18, VSMPSO-80 performs best on 30D F1 and F3 and on 50D F3, and shows no significant difference from VSMPSO-100 on 100D functions. Table 19 shows that VSMPSO-80 obtained 3 non-dominated solutions on 30D functions, 4 on 50D functions, and 5 on 100D functions. From Figs. 19, 21 and 23, most non-dominated solutions of VSMPSO-80 are essentially knee points. Based on these results, in general, the larger the sample size taken by VSMPSO, the better the algorithm performance, except on F6. However, in terms of the time spent by VSMPSO with different sample sizes, the larger the sample size, the longer the running time. As a compromise between optimization performance and time consumption, we select 80% of the total sample number as the number of training samples for modeling in VSMPSO.

Numerical results on complex problems

To further compare algorithm performance, we compared VSMPSO with SAHO, which achieved the best optimization results in the previous comparison experiments, on the recently proposed and relatively complex CEC 2017 test suite [48]. As can be seen from the convergence curves in Fig. 24, the performance of VSMPSO on F9 and F11 with 50D was slightly worse than that of SAHO; on the other functions, especially on all 100D functions, VSMPSO obtained convergence curves similar to those of SAHO. However, from Fig. 25 and Table 20, the average time spent by SAHO is dozens or even hundreds of times that of VSMPSO, while there is minimal difference between the optimal solutions obtained by the two algorithms. It follows that, even on the more complex CEC 2017 benchmark functions, VSMPSO achieves a better balance between optimization quality and time consumption and obtains good optimization results on high-dimensional problems.

Conclusions

In this paper, a single surrogate-assisted evolutionary algorithm, called VSMPSO, has been proposed for high-dimensional expensive optimization problems. We have considered both optimization results and optimization time consumption as bi-objectives when comparing algorithm performance. The proposed VSMPSO has shown promising performance on high-dimensional test problems with dimensions up to 200. It overcomes a key shortcoming of using a single model in SAEAs, namely easily becoming trapped in local optima, and saves model training time while improving performance. Experimental results show that VSMPSO performs well on high-dimensional problems. In the future, we are interested in improving the performance of the proposed algorithm by considering the relationship between candidate solutions and surrogate-management strategies, and then extending it to higher-dimensional or multi-objective optimization problems.