Algorithms for stochastic games — A survey

Raghavan, T. E. S.; Filar, J. A.

doi:10.1007/BF01415989

Algorithms for stochastic games — A survey

Published: November 1991

Volume 35, pages 437–472, (1991)
Cite this article

Zeitschrift für Operations Research Aims and scope Submit manuscript

T. E. S. Raghavan¹ &
J. A. Filar²

1007 Accesses
59 Citations
Explore all metrics

Abstract

We consider finite state, finite action, stochastic games over an infinite time horizon. We survey algorithms for the computation of minimax optimal stationary strategies in the zerosum case, and of Nash equilibria in stationary strategies in the nonzerosum case. We also survey those theoretical results that pave the way towards future development of algorithms.

Zusammenfassung

In dieser Arbeit werden unendlichstufige stochastische Spiele mit endlichen ZuStands- und Aktionenräumen untersucht. Es wird ein Überblick gegeben über Algorithmen zur Berechnung von optimalen stationären Minimax-Strategien in Nullsummen-Spielen und von stationären Nash-Gleichgewichtsstrategien in Nicht-Nullsummen-Spielen. Einige theoretische Ergebnisse werden vorgestellt, die für die weitere Entwicklung von Algorithmen nützlich sind.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Monte Carlo Tree Search: a review of recent modifications and applications

Article Open access 19 July 2022

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Existence and Uniqueness of Quasi-stationary Distributions for Symmetric Markov Processes with Tightness Property

Article 17 January 2019

References

Aumann RJ (1964) Mixed and Behaviour Strategies in Infinite Extensive Games. In: Dresher M, Shapley LS (eds) Advances in Game Theory. Princeton University Press. Annals of Mathematics Studies 52
Baykal-Gursoy M, Ross KW (1989) A Sample Path Approach to Stochastic Games. Techn Rep, Rutgers University
Bewley T, Kohlberg E (1976) The asymptotic theory of stochastic games. Math Oper Res 1:197–208
Google Scholar
Bewley T, Kohlberg E (1978) On Stochastic Games with Stationary Optimal Strategies. Math Oper Res 3:104–125
Google Scholar
Blackwell D, Ferguson T (1968) The Big Match. Ann Math Statistics 39:159–163
Google Scholar
Breton M (1987) Equilibre pour des Jeux Sequentiel. PhD Thesis, University of Montreal
Breton M, Filar JA, Haurie A, Shultz TA (1985) On the Computation of Equilibria in Discounted Stochastic Games. In: Basar T (ed) Dynamic Games and Applications in Economics. Springer, Lecture Notes in Economics and Mathematical Systems 265
Brown GW (1951) Iterative Solutions of Games by Fictitious Play. In: Koopmans TC (ed) Activity Analysis of Production and Allocation. Wiley
Charnes A, Schroeder R (1967) On Some Tactical Antisubmarine Games. Naval Res Log Qtly 14:291–311
Google Scholar
Derman C (1970) Finite State Markovian Decision Processes. Academic Press, New York
Google Scholar
Federgruen A (1980) Successive Approximation Methods in Undiscounted Stochastic Games. Oper Res 28:794–810
Google Scholar
Federgruen A (1983) Markovian Control Problems. Mathematical Centre Tracts 97, Amsterdam
Filar JA (1980) Algorithms for Solving Some Undiscounted Stochastic Games. PhD Thesis, University of Illinois at Chicago, Chicago, Illinois
Google Scholar
Filar JA Ordered Field Property for Stochastic Games When the Player Who Controls Transitions Changes from State to State. J Optim Theory Appl 34:503–513
Filar JA (1985) Player Aggregation in the Traveling Inspector Model. IEEE Trans on Aut Control AC-30:723–729
Google Scholar
Filar JA (1986) Quadratic Programming and the Single-Controller Stochastic Game. J Math Anal Appl 113:136–147
Google Scholar
Filar JA, Raghavan TES (1980) Two Remarks Concerning Two Undiscounted Stochastic Games. Tech Rep 392. John Hopkins University, Department of Mathematical Sciences
Filar JA, Raghavan TES (1984) A Matrix Game Solution of the Single-Controller Stochastic Game. Math Oper Res 9:356–362
Google Scholar
Filar JA, Shultz TA (1986) Nonlinear Programming and Stationary Strategies in Stochastic Games. Math Progr 35:243–247
Google Scholar
Filar JA, Shultz TA (1987) Bilinear Programming and Structured Stochastic Games. J Optim Theory Appl 53:85–104
Google Scholar
Filar JA, Shultz TA, Thuijsman F, Vrieze OJ (1987) Nonlinear Programming and Stationary Equilibria in Stochastic Games. Math Program (to appear)
Filar JA, Tolwinski B (1988) On the Algorithm of Pollatschek and Avi-Itzhak. In: Ferguson T et al. (eds) Stochastic Games and Related Topics, Shapley Honor volume. Kluwer, Dordrecht, The Netherlands (to appear)
Google Scholar
Fink AM (1964) Equilibrium in a Stochastic N-Person Game. J Sci in Hiroshima Univ, Series A-I. 28:89–93
Google Scholar
Gilette D (1957) Stochastic Games with Zero Stop Probabilities. In: Dresher AWTM, Wolfe P (eds) Contributions to the Theory of Games. Princeton University Press, Annals of Mathematics Studies 39
Gill PE, Murray W, Saunders MA, Wright MH (1983) User's Guide for SOL/NPSOL: A Fortran Package for Nonlinear Programming. Tech Rep SOL 83-12. Stanford University, Stanford, California
Google Scholar
Hoffman AJ, Karp RM (1966) On Non-terminating Stochastic Games. Management Sci 12:359–370
Google Scholar
Hordijk A, Kallenberg LGM (1981) Linear Programming and Markov Games I, II. In: Moeschlin O, Pallaschke D (eds) Game Theory and Mathematical Economics. North Holland
Hordijk A, Tijms HC (1975) A Modified Form of the Iterative Method of Dynamic Programming. Annals of Stat 3:203–208
Google Scholar
Lemke CE (1965) Bimatrix Equilibrium Points and Mathematical Programming. Management Sci 12:413–423
Google Scholar
Lemke CE, Howson JT (1964) Equilibrium Points of Bimatrix Games. J Soc Indust Appl Math 12:413–423
Google Scholar
Liggett T, Lipman S (1969) Stochastic Games with Perfect Information and Time Average Payoff. SIAM Rev 11:604–607
Google Scholar
Mertens JF, Neyman A (1981) Stochastic Games. International J Game Theory 10:53–56
Google Scholar
Mizuno N (1986) A New Algorithm for Non-Zerosum Markov Games. Tech Rep, New York University, Graduate School of Business
Mizuno N (1987) A Necessary Condition for the Existence of Average Reward Equilibrium Points for Finite N-Person Markov Games. Tech Rep, New York University, Graduate School of Business
Mohan SR, Raghavan TES (1987) An Algorithm for Discounted Switching Control Games. OR Spectrum 9:41–45
Google Scholar
Monash CA (1979) Stochastic Games. The Minimax Theorem. PhD Thesis, Harvard University
von Neumann J (1928) Zur Theorie der Gesellschaftsspiele. Math Annal 100:295–320
Google Scholar
Nowak A, Raghavan TES (1989) A Finite Step Algorithm via a Bimatrix Garne to a Single-Controller Non-zerosum Stochastic Game. Tech Rep 89-1. The University of Illinois at Chicago
Google Scholar
Parthasarathy T, Raghavan TES (1981) An Orderfield Property for Stochastic Garnes when One Player Controls Transition Probabilities. J Optim Theory Appl 33:375–392
Google Scholar
Parthasarathy T, Stern M (1977) Markov Games: A Survey. In: Roxin PLE, Sternberg R (eds) Differential Games and Control Theory. Marcel Dekker
Parthasarathy T, Tijs SH, Vrieze OJ (1984) Stochastic Games with State Independent Transitions and Separable Rewards. In: Hammer G, Pallaschke D (eds) Selected Topics in OR and Mathematical Economics. Springer, Lecture Notes Series 226
Pollatschek M, Avi-Itzhak B (1969) Algorithms for Stochastic Games with Geometrical Interpretation. Management Sci 15:399–415
Google Scholar
Raghavan TES, Tijs SH, Vrieze OJ (1985) On Stochastic Games with Additive Reward and Transition Structure. J Optim Theory Appl 47:451–464
Google Scholar
Rao S, Chandrasekaran R, Nair K (1973) Algorithms for Discounted Stochastic Games. J Optim Theory Appl 11:627–637
Google Scholar
Robinson J (1950) An Iterative Model of Solving a Game. Ann Math 54:296–301
Google Scholar
Rogers PD (1969) Non-zerosum Stochastic Games. PhD Thesis, University of California at Berkeley, Berkeley, California
Google Scholar
Rothblum UG (1978) Solving Stopping Stochastic Games by Maximizing a Linear Function Subject to Quadratic Constraints. In: Moeschlin O, Pallaschke D (eds) Game Theory and Related Topics. North Holland
Schweitzer PJ (1968) Perturbation theory and finite Markov chains. J Appl Prob 5:401–413
Google Scholar
Shapley LS (1953) Stochastic Games. Proc Nat Acad Sci USA 39:1095–1100
Google Scholar
Shapley LS (1964) Some Topics in Two Person Games. In: Dresher LSSM, Tucker AW (eds) Advances in Game Theory. Princeton University Press
Sinha S (1989) A Contribution to the Theory of Stochastic Games. PhD Thesis, Indian Statistical Institute, New Delhi
Google Scholar
Sobel MJ (1971) Non-cooperative Stochastic Games. Ann Math Stat 42:1930–1935
Google Scholar
Sobel MJ (1981) Myopic Solutions of Markov Decision Processes and Stochastic Games. Operat Res 29:995–1009
Google Scholar
Sobel MJ (1981) Stochastic Fishery Games with Myotopic Equilibria. In: Mirman LJ, Spulber D (eds) The Economics of Renewable Resources. Elsevier-North-Holland
Stern M (1975) On Stochastic Games with Limiting Average Payoff. PhD Thesis, University of Illinois at Chicago
Google Scholar
Takahashi M (1964) Equilibrium Points of Stochastic Non-cooperative n-Person Games. J Sci Hiroshima University, Series A-I, 28:95–99
Google Scholar
Thuijsman F (1989) Optimality and Equilibria in Stochastic Games. PhD Thesis, Rijksuniversiteit Limburg, Maastricht
Google Scholar
Vrieze OJ (1981) Linear Programming and Undiscounted Stochastic Games. OR Spectrum 3:29–35
Google Scholar
Vrieze OJ (1987) Stochastic Games with Finite State and Action Spaces. CWI Tracts 33, Amsterdam
Vrieze OJ, Thuijsman F (1986) On Equilibria in Repeated Games with Absorbing States. Tech Rep 8535. Catholic University, Nijmegen, Department of Mathematics
Google Scholar
Vrieze OJ, Tijs SH (1980) Fictitious Play Applied to Sequence of Games and Discounted Stochastic Games. Intern J Game Theory 11:71–85
Google Scholar
Vrieze OJ, Tijs SH, Raghavan TES, Filar JA (1983) A Finite Algorithm for the Switching Controller Stochastic Game. OR Spectrum 5:15–24
Google Scholar
van der Wal J (1981) Stochastic Dynamic Programming. Math Center Tracts 139, Amsterdam
van der Wal J (1977) Discounted Markov Games: Successive Approximation and Stopping Times. Intern J Game Theory 6:11–22
Google Scholar
Winston W (1978) A Stochastic Game Model of a Weapons Development Competition. SIAM J Control Optim 16:411–419
Google Scholar
Winston WL, Cabot AV (1984) A Stochastic Game Model of Football Play Selection. Tech Rep, Indiana University, Paper presented at the TIMS/ORSA joint National meeting in Dallas

Download references

Author information

Authors and Affiliations

Department of Mathematics, Statistics and Computer Science, University of Illinois at Chicago, 60680, Chicago, IL
T. E. S. Raghavan
Department of Mathematics and Statistics, University of Maryland Baltimore County, 21288, Baltimore, MD
J. A. Filar

Authors

T. E. S. Raghavan
View author publications
You can also search for this author in PubMed Google Scholar
J. A. Filar
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

This paper is based on the invited lectures given by the authors at the 12th Symposium for Operations Research in Passau, 1987. We are indebted to M. Abbad, Evangelista Fe, F. Thuijsman and O. J. Vrieze for valuable comments and discussion. Any remaining errors of either misinterpretation or of omission are the authors' alone.

Supported in part by the NSF under the grant # DMS-82601403.

Supported in part by the AFOSR and the NSF under the grant # ECS-8704954.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Raghavan, T.E.S., Filar, J.A. Algorithms for stochastic games — A survey. ZOR - Methods and Models of Operations Research 35, 437–472 (1991). https://doi.org/10.1007/BF01415989

Download citation

Received: 15 March 1988
Revised: 15 August 1989
Issue Date: November 1991
DOI: https://doi.org/10.1007/BF01415989

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Algorithms for stochastic games — A survey

Abstract

Zusammenfassung

Access this article

Similar content being viewed by others

Monte Carlo Tree Search: a review of recent modifications and applications

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Existence and Uniqueness of Quasi-stationary Distributions for Symmetric Markov Processes with Tightness Property

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Algorithms for stochastic games — A survey

Abstract

Zusammenfassung

Access this article

Similar content being viewed by others

Monte Carlo Tree Search: a review of recent modifications and applications

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Existence and Uniqueness of Quasi-stationary Distributions for Symmetric Markov Processes with Tightness Property

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation