1996 | ||
---|---|---|
4 | Steven J. Bradtke, Andrew G. Barto: Linear Least-Squares Algorithms for Temporal Difference Learning. Machine Learning 22(1-3): 33-57 (1996) | |
1995 | ||
3 | EE | Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh: Learning to Act Using Real-Time Dynamic Programming. Artif. Intell. 72(1-2): 81-138 (1995) |
1994 | ||
2 | EE | Steven J. Bradtke, Michael O. Duff: Reinforcement Learning Methods for Continuous-Time Markov Decision Problems. NIPS 1994: 393-400 |
1992 | ||
1 | EE | Steven J. Bradtke: Reinforcement Learning Applied to Linear Quadratic Regulation. NIPS 1992: 295-302 |
1 | Andrew G. Barto | [3] [4] |
2 | Michael O. Duff | [2] |
3 | Satinder P. Singh | [3] |