Abstract
This paper extends the considerations of the works (Darbon and Osher, Res Math Sci 3:19, 2016; Chow et al., 2017, arxiv.org/abs/1704.02524) regarding curse-of-dimensionality-free numerical approaches to solve certain types of Hamilton–Jacobi equations arising in optimal control problems, differential games and elsewhere. A rigorous formulation and justification for the extended Hopf–Lax formula of(Chow et al., 2017, arxiv.org/abs/1704.02524) is provided together with novel theoretical and practical discussions including useful recommendations. By using the method of characteristics, the solutions of some problem classes under convexity/concavity conditions on Hamiltonians (in particular, the solutions of Hamilton–Jacobi–Bellman equations in optimal control problems) are evaluated separately at different initial positions. This allows for the avoidance of the curse of dimensionality, as well as for choosing arbitrary computational regions. The corresponding feedback control strategies are obtained at selected positions without approximating the partial derivatives of the solutions. The results of numerical simulations demonstrate the high potential of the proposed techniques. It is also pointed out that, despite the indicated advantages, the related approaches still have a limited range of applicability, and their extensions to Hamilton–Jacobi–Isaacs equations in zero-sum two-player differential games are currently developed only for sufficiently narrow classes of control systems. These extensions require further investigation.
Similar content being viewed by others
References
Darbon, J., Osher, S.: Algorithms for overcoming the curse of dimensionality for certain Hamilton-Jacobi equations arising in control theory and elsewhere. Res. Math. Sci. 3, 19 (2016)
Chow, Y.T., Darbon, J., Osher, S., Yin, W.: Algorithm for overcoming the curse of dimensionality for state-dependent Hamilton–Jacobi equations. 2017. arxiv.org/abs/1704.02524
Subbotin, A.I.: Generalized Solutions of First-Order PDEs: The Dynamical Optimization Perspective. Birkhauser, Boston (1995)
Melikyan, A.A.: Generalized Characteristics of First Order PDEs: Application in Optimal Control and Differential Games. Birkhauser, Boston (1998)
Yong, J., Zhou, XYu.: Stochastic Controls: Hamiltonian Systems and HJB Equations. Springer, New York (1999)
Fleming, W.H., Soner, H.M.: Controlled Markov Processes and Viscosity Solutions. Springer, New York (2006)
Bardi, M., Capuzzo-Dolcetta, I.: Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations. Birkhauser, Boston (2008)
Yong, J.: Differential Games: A Concise Introduction. World Scientific Publishing, Singapore (2015)
Crandall, M.G., Lions, P.-L.: Two approximations of solutions of Hamilton-Jacobi equations. Math. Comput. 43, 1–19 (1984)
Osher, S., Shu, C.-W.: High order essentially non-oscillatory schemes for Hamilton-Jacobi equations. SIAM J. Numer. Anal. 28(4), 907–922 (1991)
Jiang, G., Peng, D.P.: Weighted ENO schemes for Hamilton-Jacobi equations. SIAM J. Sci. Comput. 21(6), 2126–2143 (2000)
Zhang, Y.-T., Shu, C.-W.: High-order WENO schemes for Hamilton-Jacobi equations on triangular meshes. SIAM J. Sci. Comput. 24(3), 1005–1030 (2003)
Bokanowski, O., Forcadel, N., Zidani, H.: Reachability and minimal times for state constrained nonlinear problems without any controllability assumption. SIAM J. Control Optim. 48, 4292–4316 (2010)
Bokanowski, O., Cristiani, E., Zidani, H.: An efficient data structure and accurate scheme to solve front propagation problems. J. Sci. Comput. 42(2), 251–273 (2010)
Bokanowski, O., Desilles, A., Zidani, H., Zhao, J.: User’s guide for the ROC-HJ solver: Reachability, Optimal Control, and Hamilton–Jacobi equations. Version 2.3 (10 May 2017). http://uma.ensta-paristech.fr/soft/ROC-HJ/
Falcone, M., Ferretti, R.: Convergence analysis for a class of high-order semi-Lagrangian advection schemes. SIAM J. Numer. Anal. 35(3), 909–940 (1998)
Falcone, M.: Numerical methods for differential games based on partial differential equations. Int. Game Theory Rev. 8, 231–272 (2006)
Cristiani, E., Falcone, M.: Fast semi-Lagrangian schemes for the Eikonal equation and applications. SIAM J. Numer. Anal. 45, 1979–2011 (2007)
Osher, S., Sethian, J.: Fronts propagating with curvature-dependent speed: algorithms based on Hamilton-Jacobi formulations. J. Comput. Phys. 79, 12–49 (1988)
Osher, S.: A level set formulation for the solution of the Dirichlet problem for Hamilton-Jacobi equations. SIAM J. Math. Anal. 24(5), 1145–1152 (1993)
Sethian, J.: Level Set Methods and Fast Marching Methods. Cambridge University Press, New York (1999)
Osher, S., Fedkiw, R.: Level Set Methods and Dynamic Implicit Surfaces. Springer, New York (2003)
Mitchell, I., Bayen, A., Tomlin, C.: Computing reachable sets for continuous dynamic games using level set methods. 2002. http://hybrid.stanford.edu/~bayen/publications.html
Mitchell, I., Bayen, A., Tomlin, C.: A time-dependent Hamilton–Jacobi formulation of reachable sets for continuous dynamic games. IEEE Trans. Autom. Control 50(7), 947–957 (2005)
Mitchell, I.: A Toolbox of Level Set Methods. Department of Computer Science, University of British Columbia (2012). http://www.cs.ubc.ca/~mitchell/ToolboxLS
Bellman, R.: Dynamic Programming. Princeton University Press, Princeton (1957)
Bellman, R.: Adaptive Control Processes: A Guided Tour. Princeton University Press, Princeton (1961)
McEneaney, W.M.: Max-Plus Methods in Nonlinear Control and Estimation. Birkhauser, Boston (2006)
McEneaney, W.M.: A curse-of-dimensionality-free numerical method for solution of certain HJB PDEs. SIAM J. Control Optim. 46(4), 1239–1276 (2007)
McEneaney, W.M., Deshpande, A., Gaubert, S.: Curse-of-complexity attenuation in the curse-of-dimensionality-free method for HJB PDEs. In: Proceedings of the 2008 American Control Conference, pp. 4684–4690 (2008)
McEneaney, W.M., Kluberg, J.: Convergence rate for a curse-of-dimensionality-free method for a class of HJB PDEs. SIAM J. Control Optim. 48(5), 3052–3079 (2009)
Gaubert, S., McEneaney, W.M., Qu, Z.: Curse of dimensionality reduction in max-plus based approximation methods: theoretical estimates and improved pruning algorithms. In: Proceedings of the 50th IEEE Conference on Decision and Control, pp. 1054–1061 (2011)
Akian, M., Gaubert, S., Lakhoua, A.: The max-plus finite element method for solving deterministic optimal control problems: basic properties and convergence analysis. SIAM J. Control Optim. 47(2), 817–848 (2008)
Kaise, H., McEneaney, W.M.: Idempotent expansions for continuous-time stochastic control: compact control space. In: Proceedings of the 49th IEEE Conference on Decision and Control, pp. 7015–7020 (2010)
Akian, M., Fodjo, E.: A probabilistic max-plus numerical method for solving stochastic control problems. In: Proceedings of the 55th IEEE Conference on Decision and Control (2016). https://doi.org/10.1109/CDC.2016.7799411
Kang, W., Wilcox, L.C.: Mitigating the curse of dimensionality: sparse grid characteristics method for optimal feedback control and HJB equations. Comput. Optim. Appl. 68(2), 289–315 (2017)
Hopf, E.: Generalized solutions of nonlinear equations of first order. J. Math. Mech. 14, 951–973 (1965)
Evans, L.C.: Partial Differential Equations. Graduate Studies in Mathematics, vol. 19. American Mathematical Society (1998)
Rublev, I.V.: Generalized Hopf formulas for the nonautonomous Hamilton-Jacobi equation. Comput. Math. Model. 11(4), 391–400 (2000)
Evans, L.C.: Envelopes and nonconvex Hamilton-Jacobi equations. Calc. Var. Partial Differ. Equ. 50(1–2), 257–282 (2014)
Mirică, S.: Extending Cauchy’s method of characteristics for Hamilton-Jacobi equations. Stud. Cercet. Mat. 37(6), 555–565 (1985)
Subbotina, N.N.: Method of Cauchy characteristics and generalized solutions of Hamilton-Jacobi-Bellman equations. Dokl. AN SSSR 320, 556–561 (1991). (in Russian)
Subbotina, N.N.: Necessary and Sufficient Optimality Conditions in Terms of Characteristics of the Hamilton–Jacobi–Bellman Equation. Report 393. Institut für Angewandte Mathematik und Statistic, Universität Würzburg, Würzburg (1992)
Subbotina, N.N.: The method of characteristics for Hamilton-Jacobi equations and applications to dynamical optimization. J. Math. Sci. 135(3), 2955–3091 (2006)
Subbotina, N.N., Tokmantsev, T.B.: On the efficiency of optimal grid synthesis in optimal control problems with fixed terminal time. Differ. Equ. 45, 1686–1697 (2009)
Subbotina, N.N., Tokmantsev, T.B.: Estimating error of the optimal grid design in the problems of nonlinear optimal control of prescribed duration. Autom. Remote Control 70(9), 1565–1578 (2009)
Subbotina, N.N., Kolpakova, E.A.: On the structure of locally Lipschitz minimax solutions of the Hamilton-Jacobi-Bellman equation in terms of classical characteristics. Proc. Steklov Inst. Math. 268(suppl. 1), 222–239 (2010)
Rockafellar, R.T.: Convex Analysis. Princeton University Press, Princeton (1970)
Rockafellar, R.T.: Conjugate Duality and Optimization. SIAM Publications, Philadelphia (1974)
Pontryagin, L.S., Boltyansky, V.G., Gamkrelidze, R.V., Mishchenko, E.F.: The Mathematical Theory of Optimal Processes. Macmillan, New York (1964)
Markley, N.G.: Principles of Differential Equations. Wiley, Hoboken (2004)
Blagodatskikh, V.I., Filippov, A.F.: Differential inclusions and optimal control. Proc. Steklov Inst. Math. 169, 199–259 (1986)
Krasovskii, N.N., Subbotin, A.I.: Positional Differential Games. Nauka, Moscow (1974). (in Russian)
Subbotin, A.I., Chentsov, A.G.: Optimization of Guaranteed Result in Control Problems. Nauka, Moscow (1981). (in Russian)
Krasovskii, N.N., Subbotin, A.I.: Game-Theoretical Control Problems. Springer, New York (1988)
Berkovitz, L.D.: The existence of value and saddle point in games of fixed duration. SIAM J. Control Optim. 23(2), 172–196 (1985)
Başar, T., Olsder, G.J.: Dynamic Noncooperative Game Theory. Academic, New York (1995)
Bernhard, P.: Singular surfaces in differential games: an introduction. In: Hagedorn, P., Knobloch, H.W., Olsder, G.J. (eds.) Differential Games and Applications, Series Lecture Notes in Control and Information Sciences, vol. 3, pp. 1–33. Springer, Berlin (1977)
Bernhard, P.: Pursuit-evasion games and zero-sum two-person differential games. In: Encyclopedia of Systems and Control, pp. 1–7 (2014). https://doi.org/10.1007/978-1-4471-5102-9_270-1
Melikyan, A., Bernhard, P.: Geometry of optimal trajectories around a focal singular surface in differential games. Appl. Math. Optim. 52, 23–37 (2005)
Press, W.H., Teukolsky, S.A., Vetterling, W.T., Flannery, B.P.: Numerical Recipes: The Art of Scientific Computing. Cambridge University Press, New York (2007)
Wang, L.: Model Predictive Control System Design and Implementation Using MATLAB. Springer, London (2009)
Kloeden, P.E., Platen, E.: Numerical Solution of Stochastic Differential Equations. Springer, Berlin (1995)
Carletti, M.: Numerical solution of stochastic differential problems in the biosciences. J. Comput. Appl. Math. 185(2), 422–440 (2006)
Acknowledgements
This work was supported in part by AFOSR/AOARD Grant FA2386-16-1-4066.
Author information
Authors and Affiliations
Corresponding author
Additional information
I. Yegorov—Also known as Egorov.
Appendix
Appendix
Let us introduce one more class of differential games for which a single programmed iteration is enough to reach the closed-loop game value. The corresponding result was derived in [54, §V.2].
Consider the linear differential game (92), (90) under Assumptions 4.2, 4.6. In addition to the notations presented in the introduction and Sect. 4, adopt that
for all \( j \in \mathbb {N} ,\) \( y = (y_1,\, y_2, \ldots , y_j) \in \mathbb {R}^j \) and \( \varepsilon \geqslant 0 .\) Several more conditions are imposed.
Assumption A.1
There exist a vector \( y_0 \in \mathbb {R}^k ,\) numbers \( r \in [0, \,+\infty ) ,\) \( \bar{r} \in [0,\, +\infty ) \) and continuous functions
such that the set \( \mathcal {M} \) from Assumption 4.6 is determined by
and, for all \( t \in [0,\, T] ,\) \( l \in L_k, \) one has
Denote also
The sought-after result can now be formulated.
Theorem A.2
[54, §V.2] Under Assumptions 4.2, 4.6, A.1, the closed-loop game value for (92), (90) at any position \( (t_0,\, x_0) \in [0,\, T] \times \mathbb {R}^n \) is represented as
where \( V^* \) is the programmed maximin function specified in Proposition 4.7.
Theorem A.2 can be applied in the two subsequent examples.
Example A.3
[54, §V.2] Consider the problem that appears from the game of Example 4.10 just by replacing \( U_1 = \mathcal {B}_2(O_2,\, a_1) \) with \( U_1 = \mathcal {B}^0_2(O_2,\, a_1) .\) Then the closed-loop game value function is also represented as in (113), but the programmed maximin function \( V^* \) is now different. \(\square \)
Example A.4
[54, §V.2] For the game
Theorem A.2 leads to the representation
where \( R_{\alpha } \) is defined as in (116) and \( V^* \) is determined according to Proposition 4.7. \(\square \)
Rights and permissions
About this article
Cite this article
Yegorov, I., Dower, P.M. Perspectives on Characteristics Based Curse-of-Dimensionality-Free Numerical Approaches for Solving Hamilton–Jacobi Equations. Appl Math Optim 83, 1–49 (2021). https://doi.org/10.1007/s00245-018-9509-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00245-018-9509-6
Keywords
- Optimal control
- Differential games
- Feedback strategies
- Hamilton–Jacobi equations
- Method of characteristics
- Curse of dimensionality