2001 |
11 | | Nigel Tao,
Jonathan Baxter,
Lex Weaver:
A Multi-Agent Policy-Gradient Approach to Network Routing.
ICML 2001: 553-560 |
10 | EE | Lex Weaver,
Nigel Tao:
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning.
UAI 2001: 538-545 |
9 | EE | Jonathan Baxter,
Peter L. Bartlett,
Lex Weaver:
Experiments with Infinite-Horizon, Policy-Gradient Estimation.
J. Artif. Intell. Res. (JAIR) 15: 351-381 (2001) |
2000 |
8 | EE | Lex Weaver:
Design and Evaluation of Mechanisms for a Multicomputer Object Store.
CoRR cs.DC/0004010: (2000) |
7 | EE | Lex Weaver,
Andrew Lynes:
Sorting Integers on the AP1000.
CoRR cs.DC/0004013: (2000) |
6 | | Jonathan Baxter,
Andrew Tridgell,
Lex Weaver:
Learning to Play Chess Using Temporal Differences.
Machine Learning 40(3): 243-263 (2000) |
1999 |
5 | EE | Jonathan Baxter,
Andrew Tridgell,
Lex Weaver:
TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search.
CoRR cs.LG/9901001: (1999) |
4 | EE | Jonathan Baxter,
Andrew Tridgell,
Lex Weaver:
KnightCap: A chess program that learns by combining TD(lambda) with game-tree search.
CoRR cs.LG/9901002: (1999) |
1998 |
3 | | Jonathan Baxter,
Andrew Tridgell,
Lex Weaver:
KnightCap: A Chess Program That Learns by Combining TD(lambda) with Game-Tree Search.
ICML 1998: 28-36 |
2 | EE | Lex Weaver,
Chris Johnson:
Pre-fetching tree-structured data in distributed memory.
CoRR cs.DC/9810002: (1998) |
1 | EE | Lex Weaver,
Terry Bossomaier:
Evolution of Neural Networks to Play the Game of Dots-and-Boxes.
CoRR cs.NE/9809111: (1998) |