Abstract
In surrogate-based optimization (SBO), the deception issues associated with the low fidelity of the surrogate model can be dealt with in situ model refinement that uses infill points during optimization. However, there is a lack of model refinement methods that are both independent of the choice of surrogate model (neural networks, radial basis functions, Kriging, etc.) and provides a methodical approach to preserve the fidelity of the search dynamics, especially in the case of population-based heuristic optimization processes. This paper presents an adaptive model refinement (AMR) approach to fill this important gap. Therein, the question of when to refine the surrogate model is answered by a novel hypothesis testing concept that compares the distribution of model error and distribution of function improvement over iterations. These distributions are respectively computed via a probabilistic cross-validation approach and by leveraging the probabilistic improvement information uniquely afforded by population-based algorithms such as particle swarm optimization. Moreover, the AMR method identifies the size of the batch of infill points needed for refinement. Numerical experiments performed on multiple benchmark functions and an optimal (building energy) planning problem demonstrate AMR’s ability to preserve computational efficiency of the SBO process while providing solutions of more attractive fidelity than those provisioned by a standard SBO approach.
Similar content being viewed by others
Notes
A Matlab implementation based on Jones et al. (1998)
References
Alexandrov N, Lewis R, Gumbert C, Green L, Newman P (1999) Optimization with variable-fidelity models applied to wing design. Tech. rep., ICASE, Institute for Computer Applications in Science and Engineering NASA Langley Research Center, Hampton, Virginia
Alexandrov N M, Dennis J E, Lewis R M, Torczon V (1998) A trust-region framework for managing the use of approximation models in optimization. Struct Optim 15(1):16–23
Ascione F, Bianco N, Stasio C D, Mauro G M, Vanoli G P (2017) Artificial neural networks to predict energy performance and retrofit scenarios for any member of a building category: a novel approach. Energy 26(118):999–1017
Audet C, Dennis J E, Moore D W, Booker A, Frank P D (2000) A surrogate-model-based method for constrained optimization. In: 8th symposium on multidisciplinary analysis and optimization. Long Beach, CA
Bichon B J, Eldred M S, Mahadevan S, McFarland J M (2013) Efficient global surrogate modeling for reliability-based design optimization. J Mech Des 135(1):011009
Booker A J, Dennis J E, Frank P D, Serafini D B, Torczon V, Trosset M (1999) Rigorous framework for optimization of expensive functions by surrogates. Struct Optim 17:1–13
Booker A J, Dennis J E, Frank P D, Serafini D B, Torczon V, Trosset M W (1999b) A rigorous framework for optimization of expensive functions by surrogates. Struct Optim 17(1):1–13
Chen X, Yang H, Sun K (2017) Developing a meta-model for sensitivity analyses and prediction of building performance for passively designed high-rise residential buildings. Applied energy 194:422–439
Chen Y, Hong T (2018) Impacts of building geometry modeling methods on the simulation results of urban building energy models. Appl Energy 215:717–735
Cheng G H, Younis A, Haji Hajikolaei K, Gary Wang G (2015) Trust region based mode pursuing sampling method for global optimization of high dimensional design problems. J Mech Des 137(2):021407
Choi K, Youn BD, Yang RJ (2001) Moving least square method for reliability-based design optimization. Proc 4th world cong structural & multidisciplinary optimization
Chowdhury S, Tong W, Messac A, Zhang J (2013) A mixed-discrete particle swarm optimization algorithm with explicit diversity-preservation. Struct Multidiscip Optim 47(3):367–388
Chowdhury S, Mehmani A, Tong W, Messac A (2016) Adaptive model refinement in surrogate-based multiobjective optimization. In: 57th AIAA/ASCE/AHS/ASC structures, structural dynamics, and materials conference, p X00000. pp 0417
Clarke S M, Griebsch JH, Simpson TW (2005) Analysis of support vector regression for approximation of complex engineering analyses. Journal of Mechanical Design 127(6):1077–1087
Corbin C D, Henze G P, May-Ostendorp P (2013) A model predictive control optimization environment for real-time commercial building application. J Build Perform Simul 5(3):159–174
Deru M, Field K, Studer D, Benne K, Griffith B, Torcellini P, Liu B (2011) US Department of Energy commercial reference building models of the national building stock. Tech. rep., Department of Energy
Dixon L C W, Price R (1989) Truncated newton method for sparse unconstrained optimization using automatic differentiation. J Optim Theory Appl 60(2):261–275
DOE (2017) Commercial prototype building models. energycodes.gov/development/commercial
Duan Q, Sorooshian S, Gupta V (1992) Effective and efficient global optimization for conceptual rainfall-runoff models. Water Res Res 28(4):1015–1031
Duong T, Hazelton M (2003) Plug-in bandwidth matrices for bivariate kernel density estimation. Nonparametr Stat 15(1):17–30
Epanechnikov V A (1969) Non-parametric estimation of a multivariate probability density. Theory Prob Appli 14(1):153–158
Fernández-Godino MG, Park C, Kim NH, Haftka RT (2016) Review of multi-fidelity models. arXiv:160907196
Forrester A, Keane A (2009) Recent advances in surrogate-based optimization. Prog Aerosp Sci 45(1-3):50–79
Forrester A, Sobester A, Keane A (2008) Engineering design via surrogate modelling: a practical guide. Wiley, New York
Ghassemi P, Zhu K, Chowdhury S (2017) Optimal surrogate and neural network modeling for day-ahead forecasting of the hourly energy consumption of university buildings. In: ASME 2017 international design engineering technical conferences and computers and information in engineering conference, American Society of Mechanical Engineers, pp V02BT03A026–V02BT03A026
Glantz SA, Slinker BK, Neilands TB (1990) Primer of applied regression and analysis of variance, vol 309. McGraw-Hill , New York
Gräning L, Jin Y, Sendhoff B (2007) Individual-based management of meta-models for evolutionary optimization with application to three-dimensional blade optimization. In: Evolutionary computation in dynamic and uncertain environments, Springer, pp 225–250
Griewank A O (1981) Generalized descent for global optimization. J Optim Theory Appl 34(1):11–39
Hansen N (2006) The cma evolution strategy: a comparing review. In: Towards a new evolutionary computation, Springer, pp 75–102
Hardy R L (1971) Multiquadric equations of topography and other irregular surfaces. J Geophys Res 76:1905–1915
Hennig P, Schuler C J (2012) Entropy search for information-efficient global optimization. J Mach Learn Res 13:1809–1837
Tyler Hoyt AE, Zhang H (2015) Extending air temperature setpoints: simulated energy savings and design considerations for new and retrofit buildings. Build Environ 88:89–96
Jakobsson S, Patriksson M, Rudholm J, Wojciechowski A (2010) A method for simulation based optimization using radial basis functions. Optim Eng 11(4):501–532
Jin R, Chen W, Simpson T W (2001) Comparative studies of metamodelling techniques under multiple modelling criteria. Struct Multidiscip Optim 23(1):1–13
Jin Y (2005) A comprehensive survey of fitness approximation in evolutionary computation. Soft Comput 9 (1):3–12
Jin Y (2011) Surrogate-assisted evolutionary computation: recent advances and future challenges. Swarm Evol Comput 1(2):61–70
Jin Y, Sendhoff B (2004) Reducing fitness evaluations using clustering techniques and neural network ensembles. In: Genetic and evolutionary computation, GECCO, vol 2004, pp 688–699
Jin Y, Olhofer M, Sendhoff B (2002) A framework for evolutionary optimization with approximate fitness functions. IEEE Trans Evol Comput 6(5):481–494
Jones D, Schonlau M, Welch W (1998) Efficient global optimization of expensive black-box functions. J Glob Optim 13(4):455–492
Keane A, Nair P (2005) Computational approaches for aerospace design: the pursuit of excellence. Wiley, New York
Keane A J (2006) Statistical improvement criteria for use in multiobjective design optimization. AIAA J 44 (4):879–891
Kennedy J, Eberhart RC (1995) Particle swarm optimization. In: IEEE international conference on neural networks, , vol IV. IEEE, Piscataway, pp 1942–1948
Kleijnen J P, Beers W V, Nieuwenhuyse I V (2012) Expected improvement in efficient global optimization through bootstrapped kriging. J Global Optim 54(1):59–73
Kourakos G, Mantoglou A (2009) Pumping optimization of coastal aquifers based on evolutionary algorithms and surrogate modular neural network models. Adv Water Resour 32(4):507–521
Liu Y, Ghassemi P, Chowdhury S, Zhang J (2018) Surrogate based multi-objective optimization of j-type battery thermal management system. In: ASME 2018 international design engineering technical conferences and computers and information in engineering conference, American society of mechanical engineers digital collection
Lulekar S, Ghassemi P, Chowdhury S (2018) Cfd-based analysis and surrogate-based optimization of bio-inspired surface riblets for aerodynamic efficiency. In: 2018 Multidisciplinary Analysis and Optimization Conference, p 3107
March A, Willcox K, Wang Q (2011) Gradient-based multifidelity optimisation for aircraft design using Bayesian model calibration. Aeronaut J 115(1174):729–738
Marduel X, Tribes C, Trepanier J Y (2006) Variable-fidelity optimization: efficiency and robustness. Optim Eng 7(4):479– 500
Marmin S, Chevalier C, Ginsbourger D (2015) Differentiating the multipoint expected improvement for optimal batch design. In: International workshop on machine learning, Optimization and big data. Springer, pp 37–48
McKay M, Conover W, Beckman R (1979) A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics 21(2):239–245
Meeker W, Hahn G, Escobar L (2017) Statistical intervals: a guide for practitioners and researchers. Wiley Series in Probability and Statistics, Wiley. https://books.google.com/books?id=y3o0DgAAQBAJ
Mehmani A, Chowdhury S, Messac A (2015a) Adaptive switching of variable-fidelity models in population-based optimization algorithms. In: 16th AIAA/ISSMO multidisciplinary analysis and optimization conference, p 3233
Mehmani A, Chowdhury S, Messac A (2015) Predictive quantification of surrogate model fidelity based on modal variations with sample density. Struct Multidiscip Optim 52(2):353–373
Mehmani A, Chowdhury S, Messac A (2016) Variable-fidelity optimization with in-situ surrogate model refinement. In: ASME 2015 international design engineering technical conferences and computers and information in engineering conference, American society of mechanical engineers digital collection
Mehmani A, Chowdhury S, Meinrenken C, Messac A (2018) Concurrent surrogate model selection (cosmos): optimizing model type, kernel function, and hyper-parameters. Struct Multidiscip Optim 57(3):1093–1114
Molga M, Smutnicki C (2005) Test functions for optimization needs. Apr, 101
Moore R A, Romero D A, Paredis C J (2011) A rational design approach to gaussian process modeling for variable fidelity models. In: ASME 2011 International design engineering technical conferences (IDETC). Washington, DC
Pelikan M (2005) Hierarchical Bayesian optimization algorithm. In: Hierarchical Bayesian optimization algorithm, Springer, pp 105–129
Peng L, Liu L, Long T, Guo X (2014) Sequential rbf surrogate-based efficient optimization method for engineering design problems with expensive black-box functions. Chin J Mech Eng 27(6):1099–1111
Rai R (2006) Qualitative and quantitative sequential sampling,. PhD thesis, University of Texas Texas, Austin, USA
Regis R G (2014) Constrained optimization by radial basis function interpolation for high-dimensional expensive black-box problems with infeasible initial points. Eng Optim 46(2):218–243
Regis R G, Shoemaker C A (2013) Combining radial basis function surrogates and dynamic coordinate search in high-dimensional expensive black-box optimization. Eng Optim 45(5):529–555
Robinson T D, Eldred M S, Willcox K E, Haimes R (2008) Surrogate-based optimization using multifidelity models with variable parameterization and corrected space mapping. AIAA J 46(11):2814–2822
Rodriguez J F, Perez V M, Padmanabhan D, Renaud J E (2001) Sequential approximate optimization using variable fidelity response surface approximations. Struct Multidiscip Optim 22(1):24–34
Sharif S A, Hammad A (2019) Developing surrogate ann for selecting near-optimal building energy renovation methods considering energy consumption, lcc and lca. J Buil Eng 25:100790
Simpson T, Korte J, Mauery T, Mistree F (2001) Kriging models for global approximation in simulation-based multidisciplinary design optimization. AIAA J 39(12):2233–2241
Simpson T, Toropov V, Balabanov V, Viana F (2008) Design and analysis of computer experiments in multidisciplinary design optimization: a review of how far we have come-or not. In: 12th AIAA/ISSMO multidisciplinary analysis and optimization conference, p 5802
Snoek J, Larochelle H, Adams RP (2012) Practical bayesian optimization of machine learning algorithms. In: Advances in neural information processing systems, pp 2951–2959
Sugiyama M (2006) Active learning in approximately linear regression based on conditional expectation of generalization error. J Mach Learn Res 7:141–166
Sun M, Chang CL, Zhang J, Mehmani A, Culligan P (2018) Break-even analysis of battery energy storage in buildings considering time-of-use rates. In: IEEE green technologies conference (GreenTech), pp 95–99
Tajbakhsh S D, del Castillo E, Rosenberger J L (2013) A fully Bayesian approach to the efficient global optimization algorithm. PhD thesis, Pennsylvania State University Working Paper
Tanabe R, Fukunaga A S (2014) Improving the search performance of shade using linear population size reduction. In: 2014 IEEE congress on evolutionary computation (CEC). IEEE, pp 1658–1665
Tian W (2013) A review of sensitivity analysis methods in building energy analysis. Renew Sust Energ Rev 20:411–419
Tong W, Chowdhury S, Messac A (2016) A multi-objective mixed-discrete particle swarm optimization with multi-domain diversity preservation. Struct Multidiscip Optim 53(3):471–488
Toropov VV, Schramm U, Sahai A, Jones RD, Zeguer T (2005) Design optimization and stochastic analysis based on the moving least squares method. 6th World Congress of Structural and Multidisciplinary Optimization
Ulmer H, Streichert F, Zell A (2004) Evolution strategies with controlled model assistance. In: Congress on evolutionary computation, 2004. CEC2004. IEEE, vol 2, pp 1569–1576
Viana F A, Haftka R T, Watson L T (2013) Efficient global optimization algorithm assisted by multiple surrogate techniques. J Glob Optim 56(2):669–689
Wang Y, Song Z, De Angelis V, Srivastava S (2018) Battery life-cycle optimization and runtime control for commercial buildings demand side management: a New York City case study. Energy 165:782–791
Wild S M, Regis R G, Shoemaker C A (2008) Orbit: optimization by radial basis function interpolation in trust-regions. SIAM J Sci Comput 30(6):3197–3219
Williams B, Loeppky J L, Moore L M, Macklem M S (2011) Batch sequential design to achieve predictive maturity with calibrated computer models. Reliab Eng Syst Saf 96(9):1208–1219
Yao W, Chen X, Huang Y, van Tooren M (2014) A surrogate-based optimization method with rbf neural network enhanced by linear interpolation and hybrid infill strategy. Optim Methods Softw 29(2):406–429
Yegnanarayana B (2004) Artificial neural networks. PHI Learning Pvt. Ltd.
Zhang J, Chowdhury S, Messac A (2012) An adaptive hybrid surrogate model. Struct Multidiscip Optim 46(2):223–238
Funding
Support from the National Science Foundation (NSF) Award CMMI-1642340 is gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Any opinions, findings, conclusions, or recommendations expressed in this paper are those of the authors and do not necessarily reflect the views of the NSF.
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Responsible Editor: Nathalie Bartoli
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Replication of results
To aid the replication of results, data and trained models associated with the numerical experiments presented in this paper have been made available through the following public repository: https://github.com/adamslab-ub/amr-samples-metamodels-package.
Appendices
Appendix A: Kernel density estimation (KDE)
KDE is a non-parametric model to estimate the probability density function of random variables. Here, it is assumed that \(\varDelta f = (\varDelta f_{1}, \varDelta f_{2}, ..., \varDelta f_{N_{\text {pop}}})\) is an independent and identically distributed sample drawn from a distribution with an unknown density ΘΔf. The kernel density estimator can then be used to determine ΘΔf, as given by:
Here, the kernel K(x) is a symmetric probability density function, H is the bandwidth matrix which is symmetric and positive-definite, and KH(x) = |H|− 1/2K(H− 1/2x). The choice of K is not as crucial as the choice of the H estimator for the accuracy of the KDE (Epanechnikov 1969). In this article, we consider \(K(x) = (2\pi )^{-d/2}\exp (-0.5x^{T}x)\), the standard normal throughout. The mean integrated squared error (MISE) method is used as a criterion for selecting the bandwidth matrix H (Duong and Hazelton 2003) as follows:
Appendix B: Particle Swarm Optimization
Particle swarm optimization (PSO) is a population-based optimization method introduced by Kennedy and Eberhart (1995). In this method, each particle’s movement is described in terms of its velocity (vi(t)) and its location (xi(t)), where i denotes the i th particle and t denotes the t th iteration. Here, we specifically exploit the MDPSO algorithm developed by Chowdhury et al. (2013), which includes explicit diversity preservation in addition to the standard PSO dynamics, in order to provide greater robustness. In MDPSO, the velocity and location of particles are updated as follows:
Here, xi(t) and vi(t) respectively denote the position and the velocity of particle i at the tth iteration; ω, C1, and C2 represent the inertial weight, the individual search, and the global search coefficients, respectively; these are used to balance the local search (exploitation) and the global search (exploration); \(\mathbf {P}_{i}^{l}(t)\) is the local leader of particle i at the tth iteration, which represents the best local solution found in the motion history of particle i; Pg(t) is the global leader of the entire swarm at the tth iteration, which is determined by comparing the local leaders of all particles; γc is the coefficient used to weigh the explicit diversity preservation component; \(\hat {\nu }_{i}(t)\) is the explicit diversity preservation vector; and r1, r2, and r3 are random real numbers between 0 and 1.
Appendix C: Predictive estimation of model fidelity
Predictive estimation of model fidelity (PEMF) method (Mehmani et al. 2015) can be perceived as a novel sequential implementation of k-fold cross-validation, with carefully constructed error measures that are significantly less sensitive to outliers and the DoE (compared with mean or root mean square error measures). The PEMF method predicts the error by capturing the variation of the surrogate model error with an increasing density of training points (without investing any additional test points). Algorithm 1 summarizes the PEMF method.
Appendix D: The settings of COSMOS and MDPSO
Appendix E: Description of six-hump camel back function
Figure 12 shows the six-hump camel back function, a well-known 2D test function used for benchmarking global optimization and surrogate modeling methods (Molga and Smutnicki 2005). It has six local minima, with two of them being the global minima (Molga and Smutnicki 2005). The global minima are located at (− 0.0898, 0.7126) and (0.0898,− 0.7126), and gives a minimum function value of f(x∗) = − 1.0316.
where − 3 ≤ x1 ≤ 3 and − 2 ≤ x2 ≤ 2.
Appendix F: Further performance illustration on the six-hump problem
The following illustrations in Fig. 13 respectively represent the best individual run of PSO-AMR, PSO-AMR-Local, PSO-SBO-k1, and PSO-SBO-k2.
Rights and permissions
About this article
Cite this article
Ghassemi, P., Mehmani, A. & Chowdhury, S. Adaptive in situ model refinement for surrogate-augmented population-based optimization. Struct Multidisc Optim 62, 2011–2034 (2020). https://doi.org/10.1007/s00158-020-02592-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00158-020-02592-6