TY - GEN
T1 - Efficient Memory-Based Dynamic Programming
AU - Peng, Jing
N1 - Publisher Copyright: © ICML 1995. All rights reserved
PY - 1995
Y1 - 1995
AB - A novel memory-based approach to dynamic programming that addresses the issue of generalization is presented. In this approach action values are represented by storing actual experiences in a memory and computed by a kind of locally weighted regression, and generalizations are made by searching the memory for relevant experience. The new approach does not require the quantization of continuous state or action spaces and can achieve arbitrarily variable resolution. By concentrating on important areas of the state space while ignoring the rest, the method represents an attempt to dodge Bellman's curse of dimensionality. This memory-based dynamic programming method has been implemented on a parallel machine, the Connection Machine, and used to successfully model and control a cart-pole system.
UR - http://www.scopus.com/inward/record.url?scp=2342514644&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:2342514644
T3 - Proceedings of the 12th International Conference on Machine Learning, ICML 1995
SP - 438
EP - 446
BT - Proceedings of the 12th International Conference on Machine Learning, ICML 1995
A2 - Prieditis, Armand
A2 - Russell, Stuart
PB - Morgan Kaufmann Publishers, Inc.
T2 - 12th International Conference on Machine Learning, ICML 1995
Y2 - 9 July 1995 through 12 July 1995
ER -