2001 | ||
---|---|---|
2 | Frédérick Garcia, Florent Serre: From Q(lambda) to Average Q-learning: Efficient Implementation of an Asymptotic Approximation. IJCAI 2001: 959-964 | |
2000 | ||
1 | Frédérick Garcia, Florent Serre: Efficient Asymptotic Approximation in Temporal Difference Learning. ECAI 2000: 296-300 | |
1 | Frédérick Garcia | [1] [2] |