Abstract
We briefly survey some results on the orderfield property of 2-player and multi-player stochastic games and their mixtures. Some of these classes of stochastic games can be solved by formulating them as a Linear Complementarity Problem (LCP) or (Generalized) Vertical Linear Complementarity Problem (VLCP). We discuss some of these results and prove that certain new subclasses and mixtures of multi-player (or n-person) stochastic games can be solved via LCP formulations.
Mohan, Neogy and Parthasarathy (in Proceedings of the International Conference on Complementarity Problems, 1997) proposed an LCP formulation of β-discounted (multi-player) polystochastic games where the transitions are controlled by one player, and proved that this LCP is processible by Lemke’s algorithm. Using this formulation repeatedly, we prove that we can solve a subclass of β-discounted switching control polystochastic games. As our proof is constructive, we have an algorithm for solving this subclass. This algorithm only involves iteratively solving different LCPs and hence, it follows that this subclass has the orderfield property, a question left open in the paper on orderfield property of mixtures of stochastic games by Krishnamurthy et al., (Indian J. Stat. 72:246–275, 2010). Furthermore, we use results from Krishnamurthy et al., Indian J. Stat. 72:246–275, (2010) to solve some mixture classes using LCP (or VLCP) formulations.
We also propose two different VLCP formulations for β-discounted zero-sum perfect information stochastic games, the underlying matrices of both formulations being R 0. As a result, we also have an alternative proof of the orderfield property of such games.
Similar content being viewed by others
References
Condon A (1992) The complexity of stochastic games. Inf Comput 96:203–224
Cottle RW, Dantzig G (1968) Complementary pivot theory of mathematical programming. Linear Algebra Appl 1:103–125
Cottle RW, Pang JS, Stone RE (1992) The linear complementarity problem. Academic Press, New York
Eaves BC (1971) The linear complementarity problem. Manag Sci 17:612–634
Filar JA (1981) Orderfield property of stochastic games when the player who controls the transition changes from state to state. J Optim Theory Appl 34:505–517
Filar JA, Schultz T, Thuijsman F, Vrieze OJ (1991) Nonlinear programming and stationary equilibria in stochastic games. Math Program 50:227–237
Filar JA, Vrieze OJ (1997) Competitive Markov decision processes. Springer, Berlin
Garcia CB (1973) Some classes of matrices in linear complementarity theory. Math Program 5:299–310
Gartner B, Rust L (2005) Simple stochastic games and P-matrix generalized linear complementarity problems. In: 15th international symposium on fundamentals of computation theory (FCT). Lecture Notes in Computer Science, vol 3623. Springer, Berlin, pp 209–220
Gillette D (1953) Representable infinite games. PhD Thesis. University of California, Berkeley
Gillette D (1957) Stochastic games with zero-stop probabilities. In: Dresher M, Tucker AW, Wolfe P (eds) Contributions to the theory of games, vol III. Ann Math Studies, vol 39. Princeton University Press, Princeton, pp 179–188
Howson JT Jr (1972) Equilibria of polymatrix games. Manag Sci 18:312–318
Janovskaya EB (1968) Equilibrium points of polymatrix games. Latvian mathematical collection (in Russian)
Kostreva MM (1976) Direct algorithms for complementarity problems. PhD Thesis, Rensselaer Polytechnic Institute, Troy, New York
Krishnamurthy N, Parthasarathy T, Ravindran G (2009–2010) Orderfield property and algorithms for stochastic games via dependency graphs. In: Proceedings of the international conference on frontiers of interface between statistics and sciences (in Honor of Prof CR Rao on Occasion of his 90th Birthday). Hyderabad, India, pp 286–297
Krishnamurthy N, Parthasarathy T, Ravindran G (2010) Orderfield property of mixtures of stochastic games. Sankhya: Indian J Stat A 72:246–275 (also available online at sankhya.isical.ac.in)
Lemke C (1965) Bimatrix equilibrium points and mathematical programming. Manag Sci 11:681–689
Mertens JF (2002) Stochastic games. In: Aumann R, Hart S (eds) Handbook of game theory with economic applications, vol 3. Elsevier, Amsterdam, pp 1809–1832. Chapter 47
Mertens JF, Neyman A (1981) Stochastic games. Int J Game Theory 10:53–66
Mohan SR, Neogy SK, Parthasarathy T (1997) Linear complementarity and discounted polystochastic game when one player controls transitions. In: Ferris MC, Pang J-S (eds) Proceedings of the international conference on complementarity problems. SIAM, Philadelphia, pp 284–294
Mohan SR, Neogy SK, Parthasarathy T (2001) Pivoting algorithms for some classes of stochastic games: A survey. Int Game Theory Rev 3(2–3):253–281
Mohan SR, Neogy SK, Parthasarathy T, Sinha S (1999) Vertical linear complementarity and discounted zero-sum stochastic games with ARAT structure. Math Program, Ser A, 86(3):637–648
Mohan SR, Neogy SK, Sridhar R (1996) The general linear complementarity problem revisited. Math Program 74:197–218
Murty KG (1997) In: Yu F-T (ed) Linear complementarity, linear and nonlinear programming. (First published in 1988 by Heldermann Verlag, Berlin, and out of print since 1994)
Nash JF (1951) Non-cooperative games. Ann Math 54:286–295
Neogy SK, Das AK, Sinha S, Gupta A (2008) On a mixture class of stochastic game with ordered field property. In: Neogy SK, Bapat RB, Das AK, Parthasarathy T (eds) Mathematical programming and game theory for decision making. ISI Platinum Jubilee Series on Statistical Science and Interdisciplinary Research, vol 1. World Scientific, Singapore, pp 451–477
Nowak AS, Raghavan TES (1993) A finite step algorithm via a bimatrix game to a single controller non-zero sum stochastic game. Math Program 59:249–259
Parthasarathy T, Raghavan TES (1981) An orderfield property for stochastic games when one player controls transition probabilities. J Optim Theory Appl 33(3):375–392
Parthasarathy T, Tijs SH, Vrieze OJ (1984) Stochastic games with state independent transitions and separable rewards. Lect Notes Econ Math Syst, 226:262–271
Raghavan TES (2003) Finite-step algorithms for single controller and perfect information stochastic games. In: Neyman A, Sorin S (eds) NATO Science series: C, Stochastic games and applications. Kluwer Academic, Norwell. Chapter 15
Raghavan TES, Filar JA (1991) Algorithms for stochastic games—A survey. Z Oper-Res, 35:437–472
Raghavan TES, Syed Z (2002) Computing stationary Nash equilibria of undiscounted single-controller stochastic games. Math Oper Res 27(2):384–400
Raghavan TES, Syed Z (2003) A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information. Math Program A 95:513–532
Raghavan TES, Tijs SJ, Vrieze OJ (1986) Stochastic games with additive rewards and additive transitions. J Optim Theory Appl 47:451–464
Schultz TA (1992) Linear complementarity and discounted switching controller stochastic games. J Optim Theory Appl 73(1):89–99
Shapley L (1953) Stochastic games. Proc Natl Acad Sci USA 39:1095–1100
Sinha S (1989) A contribution to the theory of stochastic games. PhD thesis, Indian Statistical Institute, New Delhi, India
Sobel MJ (1981) Myopic solutions of Markov decision processes and stochastic games. Oper Res 29:995–1009
Thuijsman F, Raghavan TES (1997) Perfect information stochastic games and related classes. Int J Game Theory 26:403–408
Vrieze OJ (1981) Linear programming and undiscounted stochastic game in which one player controls transitions. OR Spektrum 3:29–35
Vrieze OJ, Tijs SH, Raghavan TES, Filar JA (1983) A finite algorithm for the switching control stochastic game. OR Spektrum 5(1):15–24
Weyl H (1950) Elementary proof of a minimax theorem due to von Neumann. In: Kuhn HW, Tucker AW (eds) Contributions to the theory of games, vol I. Ann Math Studies, vol 24. Princeton University Press, Princeton, pp 19–25
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Krishnamurthy, N., Parthasarathy, T. & Ravindran, G. Solving subclasses of multi-player stochastic games via linear complementarity problem formulations—a survey and some new results. Optim Eng 13, 435–457 (2012). https://doi.org/10.1007/s11081-011-9163-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11081-011-9163-1