AVERAGE-OPTIMAL ADAPTIVE POLICIES IN SEMI-MARKOV DECISION PROCESSES INCLUDING AN UNKNOWN PARAMETER

Masami Kurano

doi:10.15807/jorsj.28.252

Journal of the Operations Research Society of Japan

Online ISSN : 2188-8299
Print ISSN : 0453-4514
ISSN-L : 0453-4514

1985 Volume 28 Issue 3 Pages 252-267

DOI https://doi.org/10.15807/jorsj.28.252

Abstract

We consider the problem of minimizing the long-run average (expected) cost per unit time in a semi-Markov decision process that includes an unknown parameter. For general state and action spaces and a compact parameter space, we construct an adaptive policy that has good properties under identifiability conditions weaker than those required for strong consistency of the estimator. As an example, we treat age replacement with an unknown failure distribution.
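
To make the age replacement example concrete, the sketch below evaluates the classical long-run average cost per unit time of replacing an item at age T or at failure, whichever comes first. By the renewal-reward argument this is the expected cost per cycle divided by the expected cycle length. The Weibull failure law, the cost names c_planned and c_failure, and the grid search are assumptions made here for illustration; they are not taken from the paper, and the parameter is treated as known, whereas the paper's adaptive policy must also estimate it.

```python
import math


def weibull_survival(t, shape, scale):
    """Survival function S(t) = exp(-(t/scale)^shape) of an assumed Weibull law."""
    return math.exp(-((t / scale) ** shape))


def average_cost_rate(T, shape, scale, c_planned, c_failure, steps=2000):
    """Long-run average cost per unit time of age replacement at age T.

    Renewal-reward ratio: (expected cost per cycle) / (expected cycle length),
    where the expected cycle length is the integral of S(t) over [0, T],
    approximated here with the trapezoidal rule.
    """
    h = T / steps
    area = 0.5 * (weibull_survival(0.0, shape, scale)
                  + weibull_survival(T, shape, scale))
    for i in range(1, steps):
        area += weibull_survival(i * h, shape, scale)
    expected_length = area * h

    s_T = weibull_survival(T, shape, scale)
    # Planned replacement if the item survives to T, failure replacement otherwise.
    expected_cost = c_planned * s_T + c_failure * (1.0 - s_T)
    return expected_cost / expected_length


def best_replacement_age(shape, scale, c_planned, c_failure, grid):
    """Return the grid point with the smallest average cost rate."""
    return min(
        grid,
        key=lambda T: average_cost_rate(T, shape, scale, c_planned, c_failure),
    )


if __name__ == "__main__":
    # Hypothetical numbers: increasing failure rate (shape 2), failures 5x as costly.
    grid = [0.1 * k for k in range(1, 51)]
    T_star = best_replacement_age(2.0, 1.0, 1.0, 5.0, grid)
    print(T_star, average_cost_rate(T_star, 2.0, 1.0, 1.0, 5.0))
```

In an adaptive setting one would, roughly, re-estimate the unknown parameter from observed lifetimes and recompute the replacement age with the current estimate; the sketch shows only the inner optimization for a fixed parameter value.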
