Abstract
Sutton’s Dyna framework provides a novel and computationally appealing way to integrate learning, planning, and reacting in autonomous agents. Examined here is a class of strategies designed to enhance the learning and planning power of Dyna systems by increasing their computational efficiency. The benefit of using these strategies is demonstrated on some simple abstract learning tasks.
| Original language | English |
|---|---|
| Pages (from-to) | 437-454 |
| Number of pages | 18 |
| Journal | Adaptive Behavior |
| Volume | 1 |
| Issue number | 4 |
| State | Published - Mar 1993 |
Keywords
- dynamic programming
- reinforcement learning
- sequential decision problems
Title: Efficient Learning and Planning Within the Dyna Framework