2006 |
14 | | Malcolm J. A. Strens:
Combining Stochastic Task Models with Reinforcement Learning for Dynamic Scheduling.
ICAPS 2006: 426-429 |
2005 |
13 | EE | Spiros Kapetanakis,
Daniel Kudenko,
Malcolm J. A. Strens:
Learning to Coordinate Using Commitment Sequences in Cooperative Multi-agent Systems.
Adaptive Agents and Multi-Agent Systems 2005: 106-118 |
12 | EE | Malcolm J. A. Strens:
Learning Multi-agent Search Strategies.
Adaptive Agents and Multi-Agent Systems 2005: 245-259 |
11 | EE | Malcolm J. A. Strens,
Neil Windelinckx:
Combining Planning with Reinforcement Learning for Multi-robot Task Allocation.
Adaptive Agents and Multi-Agent Systems 2005: 260-274 |
2004 |
10 | | Thomas Walker,
Daniel Kudenko,
Malcolm J. A. Strens:
Algorithms for Distributed Exploration.
ECAI 2004: 84-88 |
9 | EE | Malcolm J. A. Strens:
Efficient hierarchical MCMC for policy search.
ICML 2004 |
2003 |
8 | | Malcolm J. A. Strens:
Evolutionary MCMC Sampling and Optimization in Discrete Spaces.
ICML 2003: 736-743 |
7 | EE | Malcolm J. A. Strens,
Ian N. Gregory:
Tracking in cluttered images.
Image Vision Comput. 21(10): 891-911 (2003) |
2002 |
6 | EE | Spiros Kapetanakis,
Daniel Kudenko,
Malcolm J. A. Strens:
Reinforcement Learning Approaches to Coordination in Cooperative Multi-agent Systems.
Adaptive Agents and Multi-Agents Systems 2002: 18-32 |
5 | | Malcolm J. A. Strens,
Mark Bernhardt,
Nicholas Everett:
Markov Chain Monte Carlo Sampling using Direct Search Optimization.
ICML 2002: 602-609 |
4 | EE | Malcolm J. A. Strens,
Andrew W. Moore:
Policy Search using Paired Comparisons.
Journal of Machine Learning Research 3: 921-950 (2002) |
2001 |
3 | | Malcolm J. A. Strens,
Andrew W. Moore:
Direct Policy Search using Paired Statistical Tests.
ICML 2001: 545-552 |
2000 |
2 | | Malcolm J. A. Strens:
A Bayesian Framework for Reinforcement Learning.
ICML 2000: 943-950 |
1997 |
1 | EE | Malcolm J. A. Strens,
James F. Boyce:
Constraint Directed Learning for Unsupervised Image Sequence Segmentation.
ICIP (1) 1997: 743-746 |