1998 |
4 | EE | Mohammad A. Al-Ansari,
Ronald J. Williams:
Robust, Efficient, Globally-Optimized Reinforcement Learning with the Parti-Game Algorithm.
NIPS 1998: 961-967 |
1996 |
3 | | Jing Peng,
Ronald J. Williams:
Incremental Multi-Step Q-Learning.
Machine Learning 22(1-3): 283-290 (1996) |
1994 |
2 | | Jing Peng,
Ronald J. Williams:
Incremental Multi-Step Q-Learning.
ICML 1994: 226-232 |
1992 |
1 | | Ronald J. Williams:
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning.
Machine Learning 8: 229-256 (1992) |