Policy-Gradient Algorithms for Partially Observable Markov Decision Processes
-
Altmetric Citations
Description
Partially observable Markov decision processes are interesting because of their ability to model most conceivable real-world learning problems, for example, robot navigation, driving a car, speech recognition, stock trading, and playing games. The downside of this generality is that exact algorithms are computationally intractable. Such computational complexity motivates approximate approaches. One such class of algorithms are the so-called policy-gradient methods from reinforcement learning....[Show more]
Collections | Open Access Theses |
---|---|
Date published: | 2003 |
Type: | Thesis (PhD) |
URI: | http://hdl.handle.net/1885/48180 |
DOI: | 10.25911/5d7a2b73cbb88 |
Access Rights: | Open Access |
Download
File | Description | Size | Format | Image |
---|---|---|---|---|
02whole.pdf | 1.67 MB | Adobe PDF |
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
Updated: 17 November 2022/ Responsible Officer: University Librarian/ Page Contact: Library Systems & Web Coordinator