Abstract
A finite semi-Markov decision process is studied with the objective of maximizing the expected average reward. The semi-Markov kernel of the process depends on an unknown parameter taking values in a subset [a, b] of ℝ^S. A controller, modelled as a learning automaton, sequentially updates the probabilities of generating decisions on the basis of the observed decisions, states, and jump times. Convergence results are stated in the form of theorems, and some examples are given.
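The abstract's controller can be illustrated with a minimal sketch. The class below implements a linear reward-inaction (L_R-I) update, a standard learning-automaton scheme; the paper's actual update rule, reward normalization by jump times, and state dependence are not specified in this preview, so everything here is an assumption chosen only to show the general mechanism of sequentially updating decision probabilities.

```python
import random

class LearningAutomaton:
    """Sketch of a learning automaton over a finite decision set.

    Uses a linear reward-inaction (L_R-I) rule as a stand-in: after a
    favourable response, the probability of the chosen decision is
    reinforced and the others are renormalized; the paper's own update
    scheme may differ.
    """

    def __init__(self, n_actions, step=0.1, rng=None):
        # Start from the uniform distribution over decisions.
        self.p = [1.0 / n_actions] * n_actions
        self.step = step                     # learning rate (assumed)
        self.rng = rng or random.Random(0)

    def choose(self):
        """Sample a decision according to the current probabilities."""
        u, acc = self.rng.random(), 0.0
        for a, pa in enumerate(self.p):
            acc += pa
            if u <= acc:
                return a
        return len(self.p) - 1

    def update(self, action, reward):
        """Reinforce `action` in proportion to a reward scaled to [0, 1].

        In a semi-Markov setting the reward fed in here would typically
        be a per-unit-time quantity, e.g. accrued reward divided by the
        observed sojourn (jump) time.
        """
        for a in range(len(self.p)):
            if a == action:
                self.p[a] += self.step * reward * (1.0 - self.p[a])
            else:
                self.p[a] -= self.step * reward * self.p[a]
```

As a usage illustration, repeatedly choosing between two decisions whose (hypothetical) average rewards differ drives the probability mass toward the better decision, which is the convergence behaviour the theorems in the chapter concern.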
© 1983 Springer-Verlag New York Inc.
About this paper
Cite this paper
El-Fattah, Y.M. (1983). Learning Automaton for Finite Semi-Markov Decision Processes. In: Herkenrath, U., Kalin, D., Vogel, W. (eds) Mathematical Learning Models — Theory and Algorithms. Lecture Notes in Statistics, vol 20. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-5612-0_4
DOI: https://doi.org/10.1007/978-1-4612-5612-0_4
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-90913-4
Online ISBN: 978-1-4612-5612-0