ABSTRACT
We show that linear value-function approximation is equivalent to a form of linear model approximation. We then derive a relationship between the model-approximation error and the Bellman error, and show how this relationship can guide feature selection for model improvement and/or value-function improvement. We also show how these results give insight into the behavior of existing feature-selection algorithms.
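The equivalence claimed above can be illustrated concretely. The short NumPy sketch below is our own construction, not code from the paper: it assumes the "form of linear model approximation" in question is the least-squares projection of the transition and reward models onto the span of the feature matrix Phi, and it checks numerically that (i) exactly solving that approximate linear model recovers the linear fixed-point (LSTD) solution, and (ii) the Bellman error of that solution decomposes into a reward-approximation error plus a discounted transition-approximation error. The names P_phi, r_phi, delta_R, and delta_Phi are hypothetical labels introduced here for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n, k, gamma = 20, 5, 0.9              # states, features, discount factor

# Random MDP: row-stochastic transition matrix P and reward vector R.
P = rng.random((n, n))
P /= P.sum(axis=1, keepdims=True)
R = rng.random(n)
Phi = rng.random((n, k))              # feature matrix (full column rank w.h.p.)

# (1) Linear fixed point (LSTD): solve Phi^T (Phi - gamma P Phi) w = Phi^T R.
w_lstd = np.linalg.solve(Phi.T @ (Phi - gamma * P @ Phi), Phi.T @ R)

# (2) Least-squares linear model in feature space: project P Phi and R onto
#     span(Phi), then solve the resulting k-dimensional model exactly.
G = np.linalg.inv(Phi.T @ Phi)
P_phi = G @ Phi.T @ P @ Phi           # k x k approximate transition model
r_phi = G @ Phi.T @ R                 # approximate reward model
w_model = np.linalg.solve(np.eye(k) - gamma * P_phi, r_phi)

print(np.allclose(w_lstd, w_model))   # True: same solution either way

# Bellman error of the fixed point decomposes into reward error plus
# discounted per-feature transition error.
bellman_err = R + gamma * P @ Phi @ w_lstd - Phi @ w_lstd
delta_R = R - Phi @ r_phi             # reward-approximation error
delta_Phi = P @ Phi - Phi @ P_phi     # per-feature transition error
print(np.allclose(bellman_err, delta_R + gamma * delta_Phi @ w_lstd))  # True
```

Both checks print True under these assumptions; the second identity is what lets errors in the approximate model stand in for the Bellman error when reasoning about feature selection.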