Adaptive in situ model refinement for surrogate-augmented population-based optimization

Ghassemi, Payam; Mehmani, Ali; Chowdhury, Souma

doi:10.1007/s00158-020-02592-6

Adaptive in situ model refinement for surrogate-augmented population-based optimization

Research Paper
Published: 26 May 2020

Volume 62, pages 2011–2034, (2020)
Cite this article

Structural and Multidisciplinary Optimization Aims and scope Submit manuscript

Payam Ghassemi¹,
Ali Mehmani² &
Souma Chowdhury¹

418 Accesses
5 Citations
Explore all metrics

Abstract

In surrogate-based optimization (SBO), the deception issues associated with the low fidelity of the surrogate model can be dealt with in situ model refinement that uses infill points during optimization. However, there is a lack of model refinement methods that are both independent of the choice of surrogate model (neural networks, radial basis functions, Kriging, etc.) and provides a methodical approach to preserve the fidelity of the search dynamics, especially in the case of population-based heuristic optimization processes. This paper presents an adaptive model refinement (AMR) approach to fill this important gap. Therein, the question of when to refine the surrogate model is answered by a novel hypothesis testing concept that compares the distribution of model error and distribution of function improvement over iterations. These distributions are respectively computed via a probabilistic cross-validation approach and by leveraging the probabilistic improvement information uniquely afforded by population-based algorithms such as particle swarm optimization. Moreover, the AMR method identifies the size of the batch of infill points needed for refinement. Numerical experiments performed on multiple benchmark functions and an optimal (building energy) planning problem demonstrate AMR’s ability to preserve computational efficiency of the SBO process while providing solutions of more attractive fidelity than those provisioned by a standard SBO approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-surrogate-based global optimization using a score-based infill criterion

Article 07 September 2018

Effectiveness of approximation strategy in surrogate-assisted fireworks algorithm

Article 14 July 2015

Study of the Influence of the Initial a Priori Training Dataset Size in the Efficiency and Convergence of Surrogate-Based Evolutionary Optimization

Notes

A Matlab implementation based on Jones et al. (1998)

References

Alexandrov N, Lewis R, Gumbert C, Green L, Newman P (1999) Optimization with variable-fidelity models applied to wing design. Tech. rep., ICASE, Institute for Computer Applications in Science and Engineering NASA Langley Research Center, Hampton, Virginia
Alexandrov N M, Dennis J E, Lewis R M, Torczon V (1998) A trust-region framework for managing the use of approximation models in optimization. Struct Optim 15(1):16–23
Google Scholar
Ascione F, Bianco N, Stasio C D, Mauro G M, Vanoli G P (2017) Artificial neural networks to predict energy performance and retrofit scenarios for any member of a building category: a novel approach. Energy 26(118):999–1017
Google Scholar
Audet C, Dennis J E, Moore D W, Booker A, Frank P D (2000) A surrogate-model-based method for constrained optimization. In: 8th symposium on multidisciplinary analysis and optimization. Long Beach, CA
Bichon B J, Eldred M S, Mahadevan S, McFarland J M (2013) Efficient global surrogate modeling for reliability-based design optimization. J Mech Des 135(1):011009
Google Scholar
Booker A J, Dennis J E, Frank P D, Serafini D B, Torczon V, Trosset M (1999) Rigorous framework for optimization of expensive functions by surrogates. Struct Optim 17:1–13
Google Scholar
Booker A J, Dennis J E, Frank P D, Serafini D B, Torczon V, Trosset M W (1999b) A rigorous framework for optimization of expensive functions by surrogates. Struct Optim 17(1):1–13
Chen X, Yang H, Sun K (2017) Developing a meta-model for sensitivity analyses and prediction of building performance for passively designed high-rise residential buildings. Applied energy 194:422–439
Google Scholar
Chen Y, Hong T (2018) Impacts of building geometry modeling methods on the simulation results of urban building energy models. Appl Energy 215:717–735
Google Scholar
Cheng G H, Younis A, Haji Hajikolaei K, Gary Wang G (2015) Trust region based mode pursuing sampling method for global optimization of high dimensional design problems. J Mech Des 137(2):021407
Google Scholar
Choi K, Youn BD, Yang RJ (2001) Moving least square method for reliability-based design optimization. Proc 4th world cong structural & multidisciplinary optimization
Chowdhury S, Tong W, Messac A, Zhang J (2013) A mixed-discrete particle swarm optimization algorithm with explicit diversity-preservation. Struct Multidiscip Optim 47(3):367–388
MathSciNet MATH Google Scholar
Chowdhury S, Mehmani A, Tong W, Messac A (2016) Adaptive model refinement in surrogate-based multiobjective optimization. In: 57th AIAA/ASCE/AHS/ASC structures, structural dynamics, and materials conference, p X00000. pp 0417
Clarke S M, Griebsch JH, Simpson TW (2005) Analysis of support vector regression for approximation of complex engineering analyses. Journal of Mechanical Design 127(6):1077–1087
Google Scholar
Corbin C D, Henze G P, May-Ostendorp P (2013) A model predictive control optimization environment for real-time commercial building application. J Build Perform Simul 5(3):159–174
Google Scholar
Deru M, Field K, Studer D, Benne K, Griffith B, Torcellini P, Liu B (2011) US Department of Energy commercial reference building models of the national building stock. Tech. rep., Department of Energy
Dixon L C W, Price R (1989) Truncated newton method for sparse unconstrained optimization using automatic differentiation. J Optim Theory Appl 60(2):261–275
MathSciNet MATH Google Scholar
DOE (2017) Commercial prototype building models. energycodes.gov/development/commercial
Duan Q, Sorooshian S, Gupta V (1992) Effective and efficient global optimization for conceptual rainfall-runoff models. Water Res Res 28(4):1015–1031
Google Scholar
Duong T, Hazelton M (2003) Plug-in bandwidth matrices for bivariate kernel density estimation. Nonparametr Stat 15(1):17–30
MathSciNet MATH Google Scholar
Epanechnikov V A (1969) Non-parametric estimation of a multivariate probability density. Theory Prob Appli 14(1):153–158
MathSciNet Google Scholar
Fernández-Godino MG, Park C, Kim NH, Haftka RT (2016) Review of multi-fidelity models. arXiv:160907196
Forrester A, Keane A (2009) Recent advances in surrogate-based optimization. Prog Aerosp Sci 45(1-3):50–79
Google Scholar
Forrester A, Sobester A, Keane A (2008) Engineering design via surrogate modelling: a practical guide. Wiley, New York
Google Scholar
Ghassemi P, Zhu K, Chowdhury S (2017) Optimal surrogate and neural network modeling for day-ahead forecasting of the hourly energy consumption of university buildings. In: ASME 2017 international design engineering technical conferences and computers and information in engineering conference, American Society of Mechanical Engineers, pp V02BT03A026–V02BT03A026
Glantz SA, Slinker BK, Neilands TB (1990) Primer of applied regression and analysis of variance, vol 309. McGraw-Hill , New York
Google Scholar
Gräning L, Jin Y, Sendhoff B (2007) Individual-based management of meta-models for evolutionary optimization with application to three-dimensional blade optimization. In: Evolutionary computation in dynamic and uncertain environments, Springer, pp 225–250
Griewank A O (1981) Generalized descent for global optimization. J Optim Theory Appl 34(1):11–39
MathSciNet MATH Google Scholar
Hansen N (2006) The cma evolution strategy: a comparing review. In: Towards a new evolutionary computation, Springer, pp 75–102
Hardy R L (1971) Multiquadric equations of topography and other irregular surfaces. J Geophys Res 76:1905–1915
Google Scholar
Hennig P, Schuler C J (2012) Entropy search for information-efficient global optimization. J Mach Learn Res 13:1809–1837
MathSciNet MATH Google Scholar
Tyler Hoyt AE, Zhang H (2015) Extending air temperature setpoints: simulated energy savings and design considerations for new and retrofit buildings. Build Environ 88:89–96
Google Scholar
Jakobsson S, Patriksson M, Rudholm J, Wojciechowski A (2010) A method for simulation based optimization using radial basis functions. Optim Eng 11(4):501–532
MathSciNet MATH Google Scholar
Jin R, Chen W, Simpson T W (2001) Comparative studies of metamodelling techniques under multiple modelling criteria. Struct Multidiscip Optim 23(1):1–13
Google Scholar
Jin Y (2005) A comprehensive survey of fitness approximation in evolutionary computation. Soft Comput 9 (1):3–12
Google Scholar
Jin Y (2011) Surrogate-assisted evolutionary computation: recent advances and future challenges. Swarm Evol Comput 1(2):61–70
Google Scholar
Jin Y, Sendhoff B (2004) Reducing fitness evaluations using clustering techniques and neural network ensembles. In: Genetic and evolutionary computation, GECCO, vol 2004, pp 688–699
Jin Y, Olhofer M, Sendhoff B (2002) A framework for evolutionary optimization with approximate fitness functions. IEEE Trans Evol Comput 6(5):481–494
Google Scholar
Jones D, Schonlau M, Welch W (1998) Efficient global optimization of expensive black-box functions. J Glob Optim 13(4):455–492
MathSciNet MATH Google Scholar
Keane A, Nair P (2005) Computational approaches for aerospace design: the pursuit of excellence. Wiley, New York
Google Scholar
Keane A J (2006) Statistical improvement criteria for use in multiobjective design optimization. AIAA J 44 (4):879–891
Google Scholar
Kennedy J, Eberhart RC (1995) Particle swarm optimization. In: IEEE international conference on neural networks, , vol IV. IEEE, Piscataway, pp 1942–1948
Kleijnen J P, Beers W V, Nieuwenhuyse I V (2012) Expected improvement in efficient global optimization through bootstrapped kriging. J Global Optim 54(1):59–73
MathSciNet MATH Google Scholar
Kourakos G, Mantoglou A (2009) Pumping optimization of coastal aquifers based on evolutionary algorithms and surrogate modular neural network models. Adv Water Resour 32(4):507–521
Google Scholar
Liu Y, Ghassemi P, Chowdhury S, Zhang J (2018) Surrogate based multi-objective optimization of j-type battery thermal management system. In: ASME 2018 international design engineering technical conferences and computers and information in engineering conference, American society of mechanical engineers digital collection
Lulekar S, Ghassemi P, Chowdhury S (2018) Cfd-based analysis and surrogate-based optimization of bio-inspired surface riblets for aerodynamic efficiency. In: 2018 Multidisciplinary Analysis and Optimization Conference, p 3107
March A, Willcox K, Wang Q (2011) Gradient-based multifidelity optimisation for aircraft design using Bayesian model calibration. Aeronaut J 115(1174):729–738
Google Scholar
Marduel X, Tribes C, Trepanier J Y (2006) Variable-fidelity optimization: efficiency and robustness. Optim Eng 7(4):479– 500
MathSciNet MATH Google Scholar
Marmin S, Chevalier C, Ginsbourger D (2015) Differentiating the multipoint expected improvement for optimal batch design. In: International workshop on machine learning, Optimization and big data. Springer, pp 37–48
McKay M, Conover W, Beckman R (1979) A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics 21(2):239–245
MathSciNet MATH Google Scholar
Meeker W, Hahn G, Escobar L (2017) Statistical intervals: a guide for practitioners and researchers. Wiley Series in Probability and Statistics, Wiley. https://books.google.com/books?id=y3o0DgAAQBAJ
Mehmani A, Chowdhury S, Messac A (2015a) Adaptive switching of variable-fidelity models in population-based optimization algorithms. In: 16th AIAA/ISSMO multidisciplinary analysis and optimization conference, p 3233
Mehmani A, Chowdhury S, Messac A (2015) Predictive quantification of surrogate model fidelity based on modal variations with sample density. Struct Multidiscip Optim 52(2):353–373
Google Scholar
Mehmani A, Chowdhury S, Messac A (2016) Variable-fidelity optimization with in-situ surrogate model refinement. In: ASME 2015 international design engineering technical conferences and computers and information in engineering conference, American society of mechanical engineers digital collection
Mehmani A, Chowdhury S, Meinrenken C, Messac A (2018) Concurrent surrogate model selection (cosmos): optimizing model type, kernel function, and hyper-parameters. Struct Multidiscip Optim 57(3):1093–1114
MathSciNet Google Scholar
Molga M, Smutnicki C (2005) Test functions for optimization needs. Apr, 101
Moore R A, Romero D A, Paredis C J (2011) A rational design approach to gaussian process modeling for variable fidelity models. In: ASME 2011 International design engineering technical conferences (IDETC). Washington, DC
Pelikan M (2005) Hierarchical Bayesian optimization algorithm. In: Hierarchical Bayesian optimization algorithm, Springer, pp 105–129
Peng L, Liu L, Long T, Guo X (2014) Sequential rbf surrogate-based efficient optimization method for engineering design problems with expensive black-box functions. Chin J Mech Eng 27(6):1099–1111
Google Scholar
Rai R (2006) Qualitative and quantitative sequential sampling,. PhD thesis, University of Texas Texas, Austin, USA
Regis R G (2014) Constrained optimization by radial basis function interpolation for high-dimensional expensive black-box problems with infeasible initial points. Eng Optim 46(2):218–243
MathSciNet Google Scholar
Regis R G, Shoemaker C A (2013) Combining radial basis function surrogates and dynamic coordinate search in high-dimensional expensive black-box optimization. Eng Optim 45(5):529–555
MathSciNet Google Scholar
Robinson T D, Eldred M S, Willcox K E, Haimes R (2008) Surrogate-based optimization using multifidelity models with variable parameterization and corrected space mapping. AIAA J 46(11):2814–2822
Google Scholar
Rodriguez J F, Perez V M, Padmanabhan D, Renaud J E (2001) Sequential approximate optimization using variable fidelity response surface approximations. Struct Multidiscip Optim 22(1):24–34
Google Scholar
Sharif S A, Hammad A (2019) Developing surrogate ann for selecting near-optimal building energy renovation methods considering energy consumption, lcc and lca. J Buil Eng 25:100790
Google Scholar
Simpson T, Korte J, Mauery T, Mistree F (2001) Kriging models for global approximation in simulation-based multidisciplinary design optimization. AIAA J 39(12):2233–2241
Google Scholar
Simpson T, Toropov V, Balabanov V, Viana F (2008) Design and analysis of computer experiments in multidisciplinary design optimization: a review of how far we have come-or not. In: 12th AIAA/ISSMO multidisciplinary analysis and optimization conference, p 5802
Snoek J, Larochelle H, Adams RP (2012) Practical bayesian optimization of machine learning algorithms. In: Advances in neural information processing systems, pp 2951–2959
Sugiyama M (2006) Active learning in approximately linear regression based on conditional expectation of generalization error. J Mach Learn Res 7:141–166
MathSciNet MATH Google Scholar
Sun M, Chang CL, Zhang J, Mehmani A, Culligan P (2018) Break-even analysis of battery energy storage in buildings considering time-of-use rates. In: IEEE green technologies conference (GreenTech), pp 95–99
Tajbakhsh S D, del Castillo E, Rosenberger J L (2013) A fully Bayesian approach to the efficient global optimization algorithm. PhD thesis, Pennsylvania State University Working Paper
Tanabe R, Fukunaga A S (2014) Improving the search performance of shade using linear population size reduction. In: 2014 IEEE congress on evolutionary computation (CEC). IEEE, pp 1658–1665
Tian W (2013) A review of sensitivity analysis methods in building energy analysis. Renew Sust Energ Rev 20:411–419
Google Scholar
Tong W, Chowdhury S, Messac A (2016) A multi-objective mixed-discrete particle swarm optimization with multi-domain diversity preservation. Struct Multidiscip Optim 53(3):471–488
MathSciNet Google Scholar
Toropov VV, Schramm U, Sahai A, Jones RD, Zeguer T (2005) Design optimization and stochastic analysis based on the moving least squares method. 6th World Congress of Structural and Multidisciplinary Optimization
Ulmer H, Streichert F, Zell A (2004) Evolution strategies with controlled model assistance. In: Congress on evolutionary computation, 2004. CEC2004. IEEE, vol 2, pp 1569–1576
Viana F A, Haftka R T, Watson L T (2013) Efficient global optimization algorithm assisted by multiple surrogate techniques. J Glob Optim 56(2):669–689
MATH Google Scholar
Wang Y, Song Z, De Angelis V, Srivastava S (2018) Battery life-cycle optimization and runtime control for commercial buildings demand side management: a New York City case study. Energy 165:782–791
Google Scholar
Wild S M, Regis R G, Shoemaker C A (2008) Orbit: optimization by radial basis function interpolation in trust-regions. SIAM J Sci Comput 30(6):3197–3219
MathSciNet MATH Google Scholar
Williams B, Loeppky J L, Moore L M, Macklem M S (2011) Batch sequential design to achieve predictive maturity with calibrated computer models. Reliab Eng Syst Saf 96(9):1208–1219
Google Scholar
Yao W, Chen X, Huang Y, van Tooren M (2014) A surrogate-based optimization method with rbf neural network enhanced by linear interpolation and hybrid infill strategy. Optim Methods Softw 29(2):406–429
MathSciNet MATH Google Scholar
Yegnanarayana B (2004) Artificial neural networks. PHI Learning Pvt. Ltd.
Zhang J, Chowdhury S, Messac A (2012) An adaptive hybrid surrogate model. Struct Multidiscip Optim 46(2):223–238
Google Scholar

Download references

Funding

Support from the National Science Foundation (NSF) Award CMMI-1642340 is gratefully acknowledged.

Author information

Authors and Affiliations

Department of Mechanical and Aerospace Engineering, University at Buffalo, Buffalo, NY, 14260, USA
Payam Ghassemi & Souma Chowdhury
Data Science Institute, Columbia University, New York, NY, 10027, USA
Ali Mehmani

Authors

Payam Ghassemi
View author publications
You can also search for this author in PubMed Google Scholar
Ali Mehmani
View author publications
You can also search for this author in PubMed Google Scholar
Souma Chowdhury
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Souma Chowdhury.

Ethics declarations

Any opinions, findings, conclusions, or recommendations expressed in this paper are those of the authors and do not necessarily reflect the views of the NSF.

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Responsible Editor: Nathalie Bartoli

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Replication of results

To aid the replication of results, data and trained models associated with the numerical experiments presented in this paper have been made available through the following public repository: https://github.com/adamslab-ub/amr-samples-metamodels-package.

Appendices

Appendix A: Kernel density estimation (KDE)

KDE is a non-parametric model to estimate the probability density function of random variables. Here, it is assumed that $\varDelta f = (\varDelta f_{1}, \varDelta f_{2}, ..., \varDelta f_{N_{\text {pop}}})$ is an independent and identically distributed sample drawn from a distribution with an unknown density Θ_Δf. The kernel density estimator can then be used to determine Θ_Δf, as given by:

$$ \hat{\varTheta}_{\varDelta f}(x; H) = \frac{1}{N_{\text{pop}}} \sum\limits_{i=1}^{N_{\text{pop}}} K_{H}(x-x_{i}) $$

(12)

Here, the kernel K(x) is a symmetric probability density function, H is the bandwidth matrix which is symmetric and positive-definite, and K_H(x) = |H|^− 1/2K(H^− 1/2x). The choice of K is not as crucial as the choice of the H estimator for the accuracy of the KDE (Epanechnikov 1969). In this article, we consider $K(x) = (2\pi )^{-d/2}\exp (-0.5x^{T}x)$, the standard normal throughout. The mean integrated squared error (MISE) method is used as a criterion for selecting the bandwidth matrix H (Duong and Hazelton 2003) as follows:

$$ \text{MISE}(H) = \mathbb{E}\Big[\int\Big(\varTheta_{\varDelta f}(x; H)-\hat{\varTheta}_{\varDelta f}(x; H)\Big)^{2} \Big] $$

(13)

Appendix B: Particle Swarm Optimization

Particle swarm optimization (PSO) is a population-based optimization method introduced by Kennedy and Eberhart (1995). In this method, each particle’s movement is described in terms of its velocity (v_i(t)) and its location (x_i(t)), where i denotes the i th particle and t denotes the t th iteration. Here, we specifically exploit the MDPSO algorithm developed by Chowdhury et al. (2013), which includes explicit diversity preservation in addition to the standard PSO dynamics, in order to provide greater robustness. In MDPSO, the velocity and location of particles are updated as follows:

$$ \mathbf{x}_{i}(t+1) = \mathbf{x}_{i}(t) + \mathbf{v}_{i}(t+1) $$

(14)

$$ \begin{array}{@{}rcl@{}} \mathbf{v}_{i}(t+1) &=& \omega \mathbf{v}_{i}(t) + r_{1}C_{1}(\mathbf{P}_{i}^{l}(t)-\mathbf{x}_{i}(t))\\ &&+ r_{2}C_{2}(\mathbf{P}^{g}(t)-\mathbf{x}_{i}(t))+r_{3}\gamma_{c}\hat{\nu}_{i}(t) \end{array} $$

(15)

Here, x_i(t) and v_i(t) respectively denote the position and the velocity of particle i at the t^th iteration; ω, C₁, and C₂ represent the inertial weight, the individual search, and the global search coefficients, respectively; these are used to balance the local search (exploitation) and the global search (exploration); $\mathbf {P}_{i}^{l}(t)$ is the local leader of particle i at the t^th iteration, which represents the best local solution found in the motion history of particle i; P^g(t) is the global leader of the entire swarm at the t^th iteration, which is determined by comparing the local leaders of all particles; γ_c is the coefficient used to weigh the explicit diversity preservation component; $\hat {\nu }_{i}(t)$ is the explicit diversity preservation vector; and r₁, r₂, and r₃ are random real numbers between 0 and 1.

Appendix C: Predictive estimation of model fidelity

Predictive estimation of model fidelity (PEMF) method (Mehmani et al. 2015) can be perceived as a novel sequential implementation of k-fold cross-validation, with carefully constructed error measures that are significantly less sensitive to outliers and the DoE (compared with mean or root mean square error measures). The PEMF method predicts the error by capturing the variation of the surrogate model error with an increasing density of training points (without investing any additional test points). Algorithm 1 summarizes the PEMF method.

Appendix D: The settings of COSMOS and MDPSO

Table 5 The COSMOS and MDPSO settings for the analytical problem and application problem

Full size table

Appendix E: Description of six-hump camel back function

Figure 12 shows the six-hump camel back function, a well-known 2D test function used for benchmarking global optimization and surrogate modeling methods (Molga and Smutnicki 2005). It has six local minima, with two of them being the global minima (Molga and Smutnicki 2005). The global minima are located at (− 0.0898, 0.7126) and (0.0898,− 0.7126), and gives a minimum function value of f(x^∗) = − 1.0316.

$$ f(x_{1}, x_{2}) = \left( \frac{{x_{1}^{4}}}{3} - 2.1 {x_{1}^{2}} + 4\right){x_{1}^{2}} + x_{1} x_{2} + 4\left( {x_{2}^{2}} - 1\right){x_{2}^{2}} $$

(16)

where − 3 ≤ x₁ ≤ 3 and − 2 ≤ x₂ ≤ 2.

Appendix F: Further performance illustration on the six-hump problem

The following illustrations in Fig. 13 respectively represent the best individual run of PSO-AMR, PSO-AMR-Local, PSO-SBO-k1, and PSO-SBO-k2.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ghassemi, P., Mehmani, A. & Chowdhury, S. Adaptive in situ model refinement for surrogate-augmented population-based optimization. Struct Multidisc Optim 62, 2011–2034 (2020). https://doi.org/10.1007/s00158-020-02592-6

Download citation

Received: 16 October 2019
Revised: 09 March 2020
Accepted: 29 March 2020
Published: 26 May 2020
Issue Date: October 2020
DOI: https://doi.org/10.1007/s00158-020-02592-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Adaptive in situ model refinement for surrogate-augmented population-based optimization

Abstract

Access this article

Similar content being viewed by others

Multi-surrogate-based global optimization using a score-based infill criterion

Effectiveness of approximation strategy in surrogate-assisted fireworks algorithm

Study of the Influence of the Initial a Priori Training Dataset Size in the Efficiency and Convergence of Surrogate-Based Evolutionary Optimization

Notes

References

Funding