2009 |
14 | EE | Alina Beygelzimer,
John Langford,
Yuri Lifshits,
Gregory B. Sorkin,
Alexander L. Strehl:
Conditional Probability Tree Estimation Analysis and Algorithms
CoRR abs/0903.4217: (2009) |
2008 |
13 | EE | Peng Dai,
Alexander L. Strehl,
Judy Goldsmith:
Expediting RL by using graphical structures.
AAMAS (3) 2008: 1325-1328 |
12 | EE | John Langford,
Alexander L. Strehl,
Jennifer Wortman:
Exploration scavenging.
ICML 2008: 528-535 |
11 | EE | Sharad Goel,
John Langford,
Alexander L. Strehl:
Predictive Indexing for Fast Search.
NIPS 2008: 505-512 |
10 | EE | Alexander L. Strehl,
Michael L. Littman:
An analysis of model-based Interval Estimation for Markov Decision Processes.
J. Comput. Syst. Sci. 74(8): 1309-1331 (2008) |
2007 |
9 | | Alexander L. Strehl,
Carlos Diuk,
Michael L. Littman:
Efficient Structure Learning in Factored-State MDPs.
AAAI 2007: 645-650 |
8 | EE | Alexander L. Strehl,
Michael L. Littman:
Online Linear Regression and Its Application to Model-Based Reinforcement Learning.
NIPS 2007 |
2006 |
7 | EE | Carlos Diuk,
Alexander L. Strehl,
Michael L. Littman:
A hierarchical approach to efficient reinforcement learning in deterministic domains.
AAMAS 2006: 313-319 |
6 | EE | Alexander L. Strehl,
Lihong Li,
Eric Wiewiora,
John Langford,
Michael L. Littman:
PAC model-free reinforcement learning.
ICML 2006: 881-888 |
5 | EE | Alexander L. Strehl,
Chris Mesterharm,
Michael L. Littman,
Haym Hirsh:
Experience-efficient learning in associative bandit problems.
ICML 2006: 889-896 |
4 | EE | Alexander L. Strehl,
Lihong Li,
Michael L. Littman:
Incremental Model-based Learners With Formal Learning-Time Guarantees.
UAI 2006 |
2005 |
3 | EE | Alexander L. Strehl,
Michael L. Littman:
A theoretical analysis of Model-Based Interval Estimation.
ICML 2005: 856-863 |
2 | EE | Bethany R. Leffler,
Michael L. Littman,
Alexander L. Strehl,
Thomas J. Walsh:
Efficient Exploration With Latent Structure.
Robotics: Science and Systems 2005: 81-88 |
2004 |
1 | EE | Alexander L. Strehl,
Michael L. Littman:
An Empirical Evaluation of Interval Estimation for Markov Decision Processes.
ICTAI 2004: 128-135 |