| Year | # | Publication |
|---|---|---|
| 1998 | 4 | Robert H. Crites, Andrew G. Barto: Elevator Group Control Using Multiple Reinforcement Learning Agents. Machine Learning 33(2-3): 235-262 (1998) |
| 1995 | 3 | Tuomas Sandholm, Robert H. Crites: On Multiagent Q-Learning in a Semi-Competitive Domain. Adaption and Learning in Multi-Agent Systems 1995: 191-205 |
| 1995 | 2 | Robert H. Crites, Andrew G. Barto: Improving Elevator Performance Using Reinforcement Learning. NIPS 1995: 1017-1023 |
| 1994 | 1 | Robert H. Crites, Andrew G. Barto: An Actor/Critic Algorithm that is Equivalent to Q-Learning. NIPS 1994: 401-408 |

| # | Coauthor | Publications |
|---|---|---|
| 1 | Andrew G. Barto | [1] [2] [4] |
| 2 | Tuomas Sandholm | [3] |