
Risk Sensitive Markov Decision Processes

  • Conference paper in Systems and Control in the Twenty-First Century

Abstract

Risk-sensitive control is an area of significant current interest in stochastic control theory. It generalizes the classical, risk-neutral approach: instead of minimizing the expected sum of costs, we minimize the expectation of an exponential of the sum of costs, a criterion that depends not only on the expected cost but on higher-order moments as well.
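To make the criterion concrete, one standard way to write the exponential-of-sum cost is sketched below; the notation is ours for illustration and is not taken from the paper. For a policy π, stage costs c(x_t, u_t), horizon N, and risk parameter θ > 0,

\[
J_\theta(\pi) \;=\; \frac{1}{\theta}\,\log \mathbb{E}^{\pi}\!\left[\exp\Big(\theta \sum_{t=0}^{N-1} c(x_t,u_t)\Big)\right]
\;=\; \mathbb{E}^{\pi}\Big[\sum_{t=0}^{N-1} c(x_t,u_t)\Big]
\;+\; \frac{\theta}{2}\,\mathrm{Var}^{\pi}\Big(\sum_{t=0}^{N-1} c(x_t,u_t)\Big)
\;+\; O(\theta^{2}).
\]

Letting θ → 0 recovers the risk-neutral expected total cost, while θ > 0 adds a penalty on the variance (and, through the higher-order terms, on all higher moments) of the accumulated cost.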

Research supported in part by the National Science Foundation under grant EEC 9402384.

Research supported in part by a grant from the University of Arizona Foundation and the Office of the Vice President for Research; and in part by the National Science Foundation under grant NSF-INT 9201430.





Copyright information

© 1997 Springer Science+Business Media New York

About this paper

Cite this paper

Marcus, S.I., Fernández-Gaucherand, E., Hernández-Hernández, D., Coraluppi, S., Fard, P. (1997). Risk Sensitive Markov Decision Processes. In: Byrnes, C.I., Datta, B.N., Martin, C.F., Gilliam, D.S. (eds) Systems and Control in the Twenty-First Century. Systems & Control: Foundations & Applications, vol 22. Birkhäuser, Boston, MA. https://doi.org/10.1007/978-1-4612-4120-1_14

  • DOI: https://doi.org/10.1007/978-1-4612-4120-1_14

  • Publisher Name: Birkhäuser, Boston, MA

  • Print ISBN: 978-1-4612-8662-2

  • Online ISBN: 978-1-4612-4120-1

