Abstract
Sutton’s Dyna framework provides a novel and computationally appealing way to integrate learning, planning, and reacting in autonomous agents. Examined here is a class of strategies designed to enhance the learning and planning power of Dyna systems by increasing their computational efficiency. The benefit of using these strategies is demonstrated on some simple abstract learning tasks.
Original language | English |
---|---|
Pages (from-to) | 437-454 |
Number of pages | 18 |
Journal | Adaptive Behavior |
Volume | 1 |
Issue number | 4 |
DOIs | |
State | Published - Mar 1993 |
Keywords
- dynamic programming
- reinforcement learning
- sequential decision problems