2008 |
16 | EE | Lihong Li:
A worst-case comparison between temporal difference and residual gradient with linear function approximation.
ICML 2008: 560-567 |
15 | EE | Lihong Li,
Michael L. Littman,
Thomas J. Walsh:
Knows what it knows: a framework for self-aware learning.
ICML 2008: 568-575 |
14 | EE | Ronald Parr,
Lihong Li,
Gavin Taylor,
Christopher Painter-Wakefield,
Michael L. Littman:
An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning.
ICML 2008: 752-759 |
13 | EE | John Langford,
Lihong Li,
Tong Zhang:
Sparse Online Learning via Truncated Gradient.
NIPS 2008: 905-912 |
12 | EE | Emma Brunskill,
Bethany R. Leffler,
Lihong Li,
Michael L. Littman,
Nicholas Roy:
CORL: A Continuous-state Offset-dynamics Reinforcement Learner.
UAI 2008: 53-61 |
11 | EE | John Langford,
Lihong Li,
Tong Zhang:
Sparse Online Learning via Truncated Gradient.
CoRR abs/0806.4686 (2008) |
2007 |
10 | EE | Thomas J. Walsh,
Ali Nouri,
Lihong Li,
Michael L. Littman:
Planning and Learning in Environments with Delayed Feedback.
ECML 2007: 442-453 |
9 | EE | Ronald Parr,
Christopher Painter-Wakefield,
Lihong Li,
Michael L. Littman:
Analyzing feature generation for value-function approximation.
ICML 2007: 737-744 |
8 | EE | Jennifer Wortman,
Yevgeniy Vorobeychik,
Lihong Li,
John Langford:
Maintaining Equilibria During Exploration in Sponsored Search Auctions.
WINE 2007: 119-130 |
7 | EE | Lihong Li,
Vadim Bulitko,
Russell Greiner:
Focus of Attention in Reinforcement Learning.
J. UCS 13(9): 1246-1269 (2007) |
2006 |
6 | EE | Alexander L. Strehl,
Lihong Li,
Eric Wiewiora,
John Langford,
Michael L. Littman:
PAC model-free reinforcement learning.
ICML 2006: 881-888 |
5 | EE | Alexander L. Strehl,
Lihong Li,
Michael L. Littman:
Incremental Model-based Learners With Formal Learning-Time Guarantees.
UAI 2006 |
2005 |
4 | | Lihong Li,
Michael L. Littman:
Lazy Approximation for Solving Continuous Finite-Horizon MDPs.
AAAI 2005: 1175-1180 |
2004 |
3 | EE | Lihong Li,
Vadim Bulitko,
Russell Greiner:
Batch Reinforcement Learning with State Importance.
ECML 2004: 566-568 |
2003 |
2 | EE | Ilya Levner,
Vadim Bulitko,
Lihong Li,
Greg Lee,
Russell Greiner:
Towards Automated Creation of Image Interpretation Systems.
Australian Conference on Artificial Intelligence 2003: 653-665 |
1 | | Vadim Bulitko,
Lihong Li,
Russell Greiner,
Ilya Levner:
Lookahead Pathologies for Single Agent Search.
IJCAI 2003: 1531-1533 |