2006 |
5 | EE | Vladislav Tadic:
Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes.
Machine Learning 63(2): 107-133 (2006) |
2002 |
4 | EE | Sumetpal Singh,
Vladislav Tadic,
Arnaud Doucet:
A policy gradient method for SMDPs with application to call admission control.
ICARCV 2002: 1268-1274 |
2001 |
3 | | Vladislav Tadic:
On the Convergence of Temporal-Difference Learning with Linear Function Approximation.
Machine Learning 42(3): 241-267 (2001) |
1999 |
2 | EE | Vladislav Tadic:
Convergence Analysis of Temporal-Difference Learning Algorithms with Linear Function Approximation.
COLT 1999: 193-202 |
1 | EE | Vladislav Tadic:
On the Asymptotic Behaviour of a Constant Stepsize Temporal-Difference Learning Algorithm.
EuroCOLT 1999: 126-137 |