Average cost semi-Markov decision processes

Sheldon M. Ross

doi:10.2307/3211944

Abstract

The semi-Markov decision model is considered under the criterion of long-run average cost. A new criterion is introduced: for any policy, it takes the limit of the expected cost incurred during the first n transitions divided by the expected length of the first n transitions. Conditions guaranteeing that an optimal stationary (non-randomized) policy exists are then presented. It is also shown that, under certain conditions, the new criterion is equivalent to the usual one.
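
The two criteria mentioned in the abstract can be written out as a rough sketch. The notation here is an assumption introduced for illustration and is not taken from the paper: X_0 is the initial state, Z_j and \tau_j are the cost incurred during and the length of the j-th transition, and Z(t) is the total cost incurred up to time t. Where a limit might fail to exist, a lim sup could be substituted; the abstract does not say which convention the paper adopts.

```latex
% Usual long-run average cost of policy \pi from initial state i
% (symbols are illustrative assumptions, not the paper's notation):
\phi^{1}_{\pi}(i) \;=\; \lim_{t \to \infty}
  \frac{E_{\pi}\!\left[\, Z(t) \mid X_0 = i \,\right]}{t}

% Ratio criterion described in the abstract: expected cost over the
% first n transitions divided by their expected total length:
\phi^{2}_{\pi}(i) \;=\; \lim_{n \to \infty}
  \frac{E_{\pi}\!\left[\, \sum_{j=1}^{n} Z_j \mid X_0 = i \,\right]}
       {E_{\pi}\!\left[\, \sum_{j=1}^{n} \tau_j \mid X_0 = i \,\right]}
```

In this notation, the equivalence stated in the abstract would read \phi^{1}_{\pi}(i) = \phi^{2}_{\pi}(i) under the paper's conditions, which the abstract does not spell out.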

Crossref Citations

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Lippman, Steven A. and Ross, Sheldon M. 1971. The Streetwalker’s Dilemma: A Job Shop Model. SIAM Journal on Applied Mathematics, Vol. 20, Issue. 3, p. 336.

Morton, R. 1973. Optimal control of stationary Markov processes. Stochastic Processes and their Applications, Vol. 1, Issue. 3, p. 237.

Gubenko, L. G. and Shtatland, É. S. 1974. Controlled semi-Markov processes. Cybernetics, Vol. 8, Issue. 2, p. 200.

Korolyuk, V. S. Brodi, S. M. and Turbin, A. F. 1975. Semi-markov processes and their applications. Journal of Soviet Mathematics, Vol. 4, Issue. 3, p. 244.

Bather, John 1976. Optimal stationary policies for denumerable Markov chains in continuous time. Advances in Applied Probability, Vol. 8, Issue. 1, p. 144.

Talman, A. J. J. 1979. A simple proof of the optimality of the best N‐policy in the M/G/1 queueing control problem with removable server. Statistica Neerlandica, Vol. 33, Issue. 3, p. 143.

Gallisch, Eckhardt 1979. On monotone optimal policies in a queueing model of M/G/1 type with controllable service time distribution. Advances in Applied Probability, Vol. 11, Issue. 4, p. 870.

Doshi, Bharat T. 1979. Generalized semi-Markov decision processes. Journal of Applied Probability, Vol. 16, Issue. 03, p. 618.

Sherif, Y. S. and Smith, M. L. 1981. Optimal maintenance models for systems subject to failure–A Review. Naval Research Logistics Quarterly, Vol. 28, Issue. 1, p. 47.

Yuškevič, A. A. 1981. Stochastic Differential Systems. Vol. 36, p. 235.

Sherif, Yosef S. 1982. Reliability analysis: Optimal inspection and maintenance schedules of failing systems. Microelectronics Reliability, Vol. 22, Issue. 1, p. 59.

Yushkevich, A. A. 1982. On Semi-Markov Controlled Models with an Average Reward Criterion. Theory of Probability & Its Applications, Vol. 26, Issue. 4, p. 796.

Doshi, Bharat T. and Lipper, Edward H. 1982. Applied Probability-Computer Science: The Interface. p. 269.

Rosberg, Zvi 1982. Semi-Markov decision processes with polynomial reward. Journal of Applied Probability, Vol. 19, Issue. 02, p. 301.

Kitaev, M. Yu. 1982. The Existence of Optimal Homogeneous Strategies of Controllable Semicontinuous Semi-Markov Models with Respect to the Average Cost Criterion. Theory of Probability & Its Applications, Vol. 26, Issue. 3, p. 614.

Hernández-Lerma, Onésimo and Marcus, Steven I 1983. Adaptive control of service in queueing systems. Systems & Control Letters, Vol. 3, Issue. 5, p. 283.

Robin, Maurice 1983. Long-term average cost control problems for continuous time Markov processes: A survey. Acta Applicandae Mathematicae, Vol. 1, Issue. 3, p. 281.

Hordijk, Arie and Van Der Duyn Schouten, Frank A. 1983. Average optimal policies in Markov decision drift processes with applications to a queueing and a replacement model. Advances in Applied Probability, Vol. 15, Issue. 2, p. 274.

Hernández-Lerma, Onésimo and Marcus, Steven I. 1984. Optimal adaptive control of priority assignment in queueing systems. Systems & Control Letters, Vol. 4, Issue. 2, p. 65.

Lai, Hang-Chin and Tanaka, Kensuke 1984. A noncooperative n-person semi-Markov game with a separable metric state space. Applied Mathematics & Optimization, Vol. 11, Issue. 1, p. 23.
