Skip to main content
Log in

Adaptive in situ model refinement for surrogate-augmented population-based optimization

  • Research Paper
  • Published:
Structural and Multidisciplinary Optimization Aims and scope Submit manuscript

Abstract

In surrogate-based optimization (SBO), the deception issues associated with the low fidelity of the surrogate model can be dealt with in situ model refinement that uses infill points during optimization. However, there is a lack of model refinement methods that are both independent of the choice of surrogate model (neural networks, radial basis functions, Kriging, etc.) and provides a methodical approach to preserve the fidelity of the search dynamics, especially in the case of population-based heuristic optimization processes. This paper presents an adaptive model refinement (AMR) approach to fill this important gap. Therein, the question of when to refine the surrogate model is answered by a novel hypothesis testing concept that compares the distribution of model error and distribution of function improvement over iterations. These distributions are respectively computed via a probabilistic cross-validation approach and by leveraging the probabilistic improvement information uniquely afforded by population-based algorithms such as particle swarm optimization. Moreover, the AMR method identifies the size of the batch of infill points needed for refinement. Numerical experiments performed on multiple benchmark functions and an optimal (building energy) planning problem demonstrate AMR’s ability to preserve computational efficiency of the SBO process while providing solutions of more attractive fidelity than those provisioned by a standard SBO approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Notes

  1. A Matlab implementation based on Jones et al. (1998)

References

  • Alexandrov N, Lewis R, Gumbert C, Green L, Newman P (1999) Optimization with variable-fidelity models applied to wing design. Tech. rep., ICASE, Institute for Computer Applications in Science and Engineering NASA Langley Research Center, Hampton, Virginia

  • Alexandrov N M, Dennis J E, Lewis R M, Torczon V (1998) A trust-region framework for managing the use of approximation models in optimization. Struct Optim 15(1):16–23

    Google Scholar 

  • Ascione F, Bianco N, Stasio C D, Mauro G M, Vanoli G P (2017) Artificial neural networks to predict energy performance and retrofit scenarios for any member of a building category: a novel approach. Energy 26(118):999–1017

    Google Scholar 

  • Audet C, Dennis J E, Moore D W, Booker A, Frank P D (2000) A surrogate-model-based method for constrained optimization. In: 8th symposium on multidisciplinary analysis and optimization. Long Beach, CA

  • Bichon B J, Eldred M S, Mahadevan S, McFarland J M (2013) Efficient global surrogate modeling for reliability-based design optimization. J Mech Des 135(1):011009

    Google Scholar 

  • Booker A J, Dennis J E, Frank P D, Serafini D B, Torczon V, Trosset M (1999) Rigorous framework for optimization of expensive functions by surrogates. Struct Optim 17:1–13

    Google Scholar 

  • Booker A J, Dennis J E, Frank P D, Serafini D B, Torczon V, Trosset M W (1999b) A rigorous framework for optimization of expensive functions by surrogates. Struct Optim 17(1):1–13

  • Chen X, Yang H, Sun K (2017) Developing a meta-model for sensitivity analyses and prediction of building performance for passively designed high-rise residential buildings. Applied energy 194:422–439

    Google Scholar 

  • Chen Y, Hong T (2018) Impacts of building geometry modeling methods on the simulation results of urban building energy models. Appl Energy 215:717–735

    Google Scholar 

  • Cheng G H, Younis A, Haji Hajikolaei K, Gary Wang G (2015) Trust region based mode pursuing sampling method for global optimization of high dimensional design problems. J Mech Des 137(2):021407

    Google Scholar 

  • Choi K, Youn BD, Yang RJ (2001) Moving least square method for reliability-based design optimization. Proc 4th world cong structural & multidisciplinary optimization

  • Chowdhury S, Tong W, Messac A, Zhang J (2013) A mixed-discrete particle swarm optimization algorithm with explicit diversity-preservation. Struct Multidiscip Optim 47(3):367–388

    MathSciNet  MATH  Google Scholar 

  • Chowdhury S, Mehmani A, Tong W, Messac A (2016) Adaptive model refinement in surrogate-based multiobjective optimization. In: 57th AIAA/ASCE/AHS/ASC structures, structural dynamics, and materials conference, p X00000. pp 0417

  • Clarke S M, Griebsch JH, Simpson TW (2005) Analysis of support vector regression for approximation of complex engineering analyses. Journal of Mechanical Design 127(6):1077–1087

    Google Scholar 

  • Corbin C D, Henze G P, May-Ostendorp P (2013) A model predictive control optimization environment for real-time commercial building application. J Build Perform Simul 5(3):159–174

    Google Scholar 

  • Deru M, Field K, Studer D, Benne K, Griffith B, Torcellini P, Liu B (2011) US Department of Energy commercial reference building models of the national building stock. Tech. rep., Department of Energy

  • Dixon L C W, Price R (1989) Truncated newton method for sparse unconstrained optimization using automatic differentiation. J Optim Theory Appl 60(2):261–275

    MathSciNet  MATH  Google Scholar 

  • DOE (2017) Commercial prototype building models. energycodes.gov/development/commercial

  • Duan Q, Sorooshian S, Gupta V (1992) Effective and efficient global optimization for conceptual rainfall-runoff models. Water Res Res 28(4):1015–1031

    Google Scholar 

  • Duong T, Hazelton M (2003) Plug-in bandwidth matrices for bivariate kernel density estimation. Nonparametr Stat 15(1):17–30

    MathSciNet  MATH  Google Scholar 

  • Epanechnikov V A (1969) Non-parametric estimation of a multivariate probability density. Theory Prob Appli 14(1):153–158

    MathSciNet  Google Scholar 

  • Fernández-Godino MG, Park C, Kim NH, Haftka RT (2016) Review of multi-fidelity models. arXiv:160907196

  • Forrester A, Keane A (2009) Recent advances in surrogate-based optimization. Prog Aerosp Sci 45(1-3):50–79

    Google Scholar 

  • Forrester A, Sobester A, Keane A (2008) Engineering design via surrogate modelling: a practical guide. Wiley, New York

    Google Scholar 

  • Ghassemi P, Zhu K, Chowdhury S (2017) Optimal surrogate and neural network modeling for day-ahead forecasting of the hourly energy consumption of university buildings. In: ASME 2017 international design engineering technical conferences and computers and information in engineering conference, American Society of Mechanical Engineers, pp V02BT03A026–V02BT03A026

  • Glantz SA, Slinker BK, Neilands TB (1990) Primer of applied regression and analysis of variance, vol 309. McGraw-Hill , New York

    Google Scholar 

  • Gräning L, Jin Y, Sendhoff B (2007) Individual-based management of meta-models for evolutionary optimization with application to three-dimensional blade optimization. In: Evolutionary computation in dynamic and uncertain environments, Springer, pp 225–250

  • Griewank A O (1981) Generalized descent for global optimization. J Optim Theory Appl 34(1):11–39

    MathSciNet  MATH  Google Scholar 

  • Hansen N (2006) The cma evolution strategy: a comparing review. In: Towards a new evolutionary computation, Springer, pp 75–102

  • Hardy R L (1971) Multiquadric equations of topography and other irregular surfaces. J Geophys Res 76:1905–1915

    Google Scholar 

  • Hennig P, Schuler C J (2012) Entropy search for information-efficient global optimization. J Mach Learn Res 13:1809–1837

    MathSciNet  MATH  Google Scholar 

  • Tyler Hoyt AE, Zhang H (2015) Extending air temperature setpoints: simulated energy savings and design considerations for new and retrofit buildings. Build Environ 88:89–96

    Google Scholar 

  • Jakobsson S, Patriksson M, Rudholm J, Wojciechowski A (2010) A method for simulation based optimization using radial basis functions. Optim Eng 11(4):501–532

    MathSciNet  MATH  Google Scholar 

  • Jin R, Chen W, Simpson T W (2001) Comparative studies of metamodelling techniques under multiple modelling criteria. Struct Multidiscip Optim 23(1):1–13

    Google Scholar 

  • Jin Y (2005) A comprehensive survey of fitness approximation in evolutionary computation. Soft Comput 9 (1):3–12

    Google Scholar 

  • Jin Y (2011) Surrogate-assisted evolutionary computation: recent advances and future challenges. Swarm Evol Comput 1(2):61–70

    Google Scholar 

  • Jin Y, Sendhoff B (2004) Reducing fitness evaluations using clustering techniques and neural network ensembles. In: Genetic and evolutionary computation, GECCO, vol 2004, pp 688–699

  • Jin Y, Olhofer M, Sendhoff B (2002) A framework for evolutionary optimization with approximate fitness functions. IEEE Trans Evol Comput 6(5):481–494

    Google Scholar 

  • Jones D, Schonlau M, Welch W (1998) Efficient global optimization of expensive black-box functions. J Glob Optim 13(4):455–492

    MathSciNet  MATH  Google Scholar 

  • Keane A, Nair P (2005) Computational approaches for aerospace design: the pursuit of excellence. Wiley, New York

    Google Scholar 

  • Keane A J (2006) Statistical improvement criteria for use in multiobjective design optimization. AIAA J 44 (4):879–891

    Google Scholar 

  • Kennedy J, Eberhart RC (1995) Particle swarm optimization. In: IEEE international conference on neural networks, , vol IV. IEEE, Piscataway, pp 1942–1948

  • Kleijnen J P, Beers W V, Nieuwenhuyse I V (2012) Expected improvement in efficient global optimization through bootstrapped kriging. J Global Optim 54(1):59–73

    MathSciNet  MATH  Google Scholar 

  • Kourakos G, Mantoglou A (2009) Pumping optimization of coastal aquifers based on evolutionary algorithms and surrogate modular neural network models. Adv Water Resour 32(4):507–521

    Google Scholar 

  • Liu Y, Ghassemi P, Chowdhury S, Zhang J (2018) Surrogate based multi-objective optimization of j-type battery thermal management system. In: ASME 2018 international design engineering technical conferences and computers and information in engineering conference, American society of mechanical engineers digital collection

  • Lulekar S, Ghassemi P, Chowdhury S (2018) Cfd-based analysis and surrogate-based optimization of bio-inspired surface riblets for aerodynamic efficiency. In: 2018 Multidisciplinary Analysis and Optimization Conference, p 3107

  • March A, Willcox K, Wang Q (2011) Gradient-based multifidelity optimisation for aircraft design using Bayesian model calibration. Aeronaut J 115(1174):729–738

    Google Scholar 

  • Marduel X, Tribes C, Trepanier J Y (2006) Variable-fidelity optimization: efficiency and robustness. Optim Eng 7(4):479– 500

    MathSciNet  MATH  Google Scholar 

  • Marmin S, Chevalier C, Ginsbourger D (2015) Differentiating the multipoint expected improvement for optimal batch design. In: International workshop on machine learning, Optimization and big data. Springer, pp 37–48

  • McKay M, Conover W, Beckman R (1979) A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics 21(2):239–245

    MathSciNet  MATH  Google Scholar 

  • Meeker W, Hahn G, Escobar L (2017) Statistical intervals: a guide for practitioners and researchers. Wiley Series in Probability and Statistics, Wiley. https://books.google.com/books?id=y3o0DgAAQBAJ

  • Mehmani A, Chowdhury S, Messac A (2015a) Adaptive switching of variable-fidelity models in population-based optimization algorithms. In: 16th AIAA/ISSMO multidisciplinary analysis and optimization conference, p 3233

  • Mehmani A, Chowdhury S, Messac A (2015) Predictive quantification of surrogate model fidelity based on modal variations with sample density. Struct Multidiscip Optim 52(2):353–373

    Google Scholar 

  • Mehmani A, Chowdhury S, Messac A (2016) Variable-fidelity optimization with in-situ surrogate model refinement. In: ASME 2015 international design engineering technical conferences and computers and information in engineering conference, American society of mechanical engineers digital collection

  • Mehmani A, Chowdhury S, Meinrenken C, Messac A (2018) Concurrent surrogate model selection (cosmos): optimizing model type, kernel function, and hyper-parameters. Struct Multidiscip Optim 57(3):1093–1114

    MathSciNet  Google Scholar 

  • Molga M, Smutnicki C (2005) Test functions for optimization needs. Apr, 101

  • Moore R A, Romero D A, Paredis C J (2011) A rational design approach to gaussian process modeling for variable fidelity models. In: ASME 2011 International design engineering technical conferences (IDETC). Washington, DC

  • Pelikan M (2005) Hierarchical Bayesian optimization algorithm. In: Hierarchical Bayesian optimization algorithm, Springer, pp 105–129

  • Peng L, Liu L, Long T, Guo X (2014) Sequential rbf surrogate-based efficient optimization method for engineering design problems with expensive black-box functions. Chin J Mech Eng 27(6):1099–1111

    Google Scholar 

  • Rai R (2006) Qualitative and quantitative sequential sampling,. PhD thesis, University of Texas Texas, Austin, USA

  • Regis R G (2014) Constrained optimization by radial basis function interpolation for high-dimensional expensive black-box problems with infeasible initial points. Eng Optim 46(2):218–243

    MathSciNet  Google Scholar 

  • Regis R G, Shoemaker C A (2013) Combining radial basis function surrogates and dynamic coordinate search in high-dimensional expensive black-box optimization. Eng Optim 45(5):529–555

    MathSciNet  Google Scholar 

  • Robinson T D, Eldred M S, Willcox K E, Haimes R (2008) Surrogate-based optimization using multifidelity models with variable parameterization and corrected space mapping. AIAA J 46(11):2814–2822

    Google Scholar 

  • Rodriguez J F, Perez V M, Padmanabhan D, Renaud J E (2001) Sequential approximate optimization using variable fidelity response surface approximations. Struct Multidiscip Optim 22(1):24–34

    Google Scholar 

  • Sharif S A, Hammad A (2019) Developing surrogate ann for selecting near-optimal building energy renovation methods considering energy consumption, lcc and lca. J Buil Eng 25:100790

    Google Scholar 

  • Simpson T, Korte J, Mauery T, Mistree F (2001) Kriging models for global approximation in simulation-based multidisciplinary design optimization. AIAA J 39(12):2233–2241

    Google Scholar 

  • Simpson T, Toropov V, Balabanov V, Viana F (2008) Design and analysis of computer experiments in multidisciplinary design optimization: a review of how far we have come-or not. In: 12th AIAA/ISSMO multidisciplinary analysis and optimization conference, p 5802

  • Snoek J, Larochelle H, Adams RP (2012) Practical bayesian optimization of machine learning algorithms. In: Advances in neural information processing systems, pp 2951–2959

  • Sugiyama M (2006) Active learning in approximately linear regression based on conditional expectation of generalization error. J Mach Learn Res 7:141–166

    MathSciNet  MATH  Google Scholar 

  • Sun M, Chang CL, Zhang J, Mehmani A, Culligan P (2018) Break-even analysis of battery energy storage in buildings considering time-of-use rates. In: IEEE green technologies conference (GreenTech), pp 95–99

  • Tajbakhsh S D, del Castillo E, Rosenberger J L (2013) A fully Bayesian approach to the efficient global optimization algorithm. PhD thesis, Pennsylvania State University Working Paper

  • Tanabe R, Fukunaga A S (2014) Improving the search performance of shade using linear population size reduction. In: 2014 IEEE congress on evolutionary computation (CEC). IEEE, pp 1658–1665

  • Tian W (2013) A review of sensitivity analysis methods in building energy analysis. Renew Sust Energ Rev 20:411–419

    Google Scholar 

  • Tong W, Chowdhury S, Messac A (2016) A multi-objective mixed-discrete particle swarm optimization with multi-domain diversity preservation. Struct Multidiscip Optim 53(3):471–488

    MathSciNet  Google Scholar 

  • Toropov VV, Schramm U, Sahai A, Jones RD, Zeguer T (2005) Design optimization and stochastic analysis based on the moving least squares method. 6th World Congress of Structural and Multidisciplinary Optimization

  • Ulmer H, Streichert F, Zell A (2004) Evolution strategies with controlled model assistance. In: Congress on evolutionary computation, 2004. CEC2004. IEEE, vol 2, pp 1569–1576

  • Viana F A, Haftka R T, Watson L T (2013) Efficient global optimization algorithm assisted by multiple surrogate techniques. J Glob Optim 56(2):669–689

    MATH  Google Scholar 

  • Wang Y, Song Z, De Angelis V, Srivastava S (2018) Battery life-cycle optimization and runtime control for commercial buildings demand side management: a New York City case study. Energy 165:782–791

    Google Scholar 

  • Wild S M, Regis R G, Shoemaker C A (2008) Orbit: optimization by radial basis function interpolation in trust-regions. SIAM J Sci Comput 30(6):3197–3219

    MathSciNet  MATH  Google Scholar 

  • Williams B, Loeppky J L, Moore L M, Macklem M S (2011) Batch sequential design to achieve predictive maturity with calibrated computer models. Reliab Eng Syst Saf 96(9):1208–1219

    Google Scholar 

  • Yao W, Chen X, Huang Y, van Tooren M (2014) A surrogate-based optimization method with rbf neural network enhanced by linear interpolation and hybrid infill strategy. Optim Methods Softw 29(2):406–429

    MathSciNet  MATH  Google Scholar 

  • Yegnanarayana B (2004) Artificial neural networks. PHI Learning Pvt. Ltd.

  • Zhang J, Chowdhury S, Messac A (2012) An adaptive hybrid surrogate model. Struct Multidiscip Optim 46(2):223–238

    Google Scholar 

Download references

Funding

Support from the National Science Foundation (NSF) Award CMMI-1642340 is gratefully acknowledged.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Souma Chowdhury.

Ethics declarations

Any opinions, findings, conclusions, or recommendations expressed in this paper are those of the authors and do not necessarily reflect the views of the NSF.

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Responsible Editor: Nathalie Bartoli

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Replication of results

To aid the replication of results, data and trained models associated with the numerical experiments presented in this paper have been made available through the following public repository: https://github.com/adamslab-ub/amr-samples-metamodels-package.

Appendices

Appendix A: Kernel density estimation (KDE)

KDE is a non-parametric model to estimate the probability density function of random variables. Here, it is assumed that \(\varDelta f = (\varDelta f_{1}, \varDelta f_{2}, ..., \varDelta f_{N_{\text {pop}}})\) is an independent and identically distributed sample drawn from a distribution with an unknown density ΘΔf. The kernel density estimator can then be used to determine ΘΔf, as given by:

$$ \hat{\varTheta}_{\varDelta f}(x; H) = \frac{1}{N_{\text{pop}}} \sum\limits_{i=1}^{N_{\text{pop}}} K_{H}(x-x_{i}) $$
(12)

Here, the kernel K(x) is a symmetric probability density function, H is the bandwidth matrix which is symmetric and positive-definite, and KH(x) = |H|− 1/2K(H− 1/2x). The choice of K is not as crucial as the choice of the H estimator for the accuracy of the KDE (Epanechnikov 1969). In this article, we consider \(K(x) = (2\pi )^{-d/2}\exp (-0.5x^{T}x)\), the standard normal throughout. The mean integrated squared error (MISE) method is used as a criterion for selecting the bandwidth matrix H (Duong and Hazelton 2003) as follows:

$$ \text{MISE}(H) = \mathbb{E}\Big[\int\Big(\varTheta_{\varDelta f}(x; H)-\hat{\varTheta}_{\varDelta f}(x; H)\Big)^{2} \Big] $$
(13)

Appendix B: Particle Swarm Optimization

Particle swarm optimization (PSO) is a population-based optimization method introduced by Kennedy and Eberhart (1995). In this method, each particle’s movement is described in terms of its velocity (vi(t)) and its location (xi(t)), where i denotes the i th particle and t denotes the t th iteration. Here, we specifically exploit the MDPSO algorithm developed by Chowdhury et al. (2013), which includes explicit diversity preservation in addition to the standard PSO dynamics, in order to provide greater robustness. In MDPSO, the velocity and location of particles are updated as follows:

$$ \mathbf{x}_{i}(t+1) = \mathbf{x}_{i}(t) + \mathbf{v}_{i}(t+1) $$
(14)
$$ \begin{array}{@{}rcl@{}} \mathbf{v}_{i}(t+1) &=& \omega \mathbf{v}_{i}(t) + r_{1}C_{1}(\mathbf{P}_{i}^{l}(t)-\mathbf{x}_{i}(t))\\ &&+ r_{2}C_{2}(\mathbf{P}^{g}(t)-\mathbf{x}_{i}(t))+r_{3}\gamma_{c}\hat{\nu}_{i}(t) \end{array} $$
(15)

Here, xi(t) and vi(t) respectively denote the position and the velocity of particle i at the tth iteration; ω, C1, and C2 represent the inertial weight, the individual search, and the global search coefficients, respectively; these are used to balance the local search (exploitation) and the global search (exploration); \(\mathbf {P}_{i}^{l}(t)\) is the local leader of particle i at the tth iteration, which represents the best local solution found in the motion history of particle i; Pg(t) is the global leader of the entire swarm at the tth iteration, which is determined by comparing the local leaders of all particles; γc is the coefficient used to weigh the explicit diversity preservation component; \(\hat {\nu }_{i}(t)\) is the explicit diversity preservation vector; and r1, r2, and r3 are random real numbers between 0 and 1.

Appendix C: Predictive estimation of model fidelity

Predictive estimation of model fidelity (PEMF) method (Mehmani et al. 2015) can be perceived as a novel sequential implementation of k-fold cross-validation, with carefully constructed error measures that are significantly less sensitive to outliers and the DoE (compared with mean or root mean square error measures). The PEMF method predicts the error by capturing the variation of the surrogate model error with an increasing density of training points (without investing any additional test points). Algorithm 1 summarizes the PEMF method.

figure a

Appendix D: The settings of COSMOS and MDPSO

Table 5 The COSMOS and MDPSO settings for the analytical problem and application problem

Appendix E: Description of six-hump camel back function

Figure 12 shows the six-hump camel back function, a well-known 2D test function used for benchmarking global optimization and surrogate modeling methods (Molga and Smutnicki 2005). It has six local minima, with two of them being the global minima (Molga and Smutnicki 2005). The global minima are located at (− 0.0898, 0.7126) and (0.0898,− 0.7126), and gives a minimum function value of f(x) = − 1.0316.

Fig. 12
figure 12

The response surface of the six-hump camel back function

$$ f(x_{1}, x_{2}) = \left( \frac{{x_{1}^{4}}}{3} - 2.1 {x_{1}^{2}} + 4\right){x_{1}^{2}} + x_{1} x_{2} + 4\left( {x_{2}^{2}} - 1\right){x_{2}^{2}} $$
(16)

where − 3 ≤ x1 ≤ 3 and − 2 ≤ x2 ≤ 2.

Appendix F: Further performance illustration on the six-hump problem

The following illustrations in Fig. 13 respectively represent the best individual run of PSO-AMR, PSO-AMR-Local, PSO-SBO-k1, and PSO-SBO-k2.

Fig. 13
figure 13

The selected best run of each optimization approach, for the six-hump camel back function. Left figures show filled contours and dashed line contours, respectively, representing the true function and the surrogate model (SM) approximation; in the PSO-AMR and PSO-AMR-Local cases, there are two SM contours, corresponding to the models before and after refinement. Right figures show the convergence and SM error histories over the optimization processes

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ghassemi, P., Mehmani, A. & Chowdhury, S. Adaptive in situ model refinement for surrogate-augmented population-based optimization. Struct Multidisc Optim 62, 2011–2034 (2020). https://doi.org/10.1007/s00158-020-02592-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00158-020-02592-6

Keywords

Navigation